Monday Exercise 2.2: Explore condor_status¶
The goal of this exercise is try out some of the most common options to the
condor_status command, so that you can view slots effectively.
The main part of this exercise should take just a few minutes, but if you have more time later, come back and work on the extension ideas at the end to become a
condor_status program has many options for selecting which slots are listed. You've already learned the basic
condor_status and the
condor_status -compact variation (which you may wish to retry now, before proceeding).
Another convenient option is to list only those slots that are available now:
%UCL_PROMPT_SHORT% <strong>condor_status -avail</strong>
Of course, the individual execute machines only report their slots to the collector at certain time intervals, so this list will not reflect the up-to-the-second reality of all slots. But this limitation is true of all
condor_status output, not just with the
condor_q, you can limit the slots that are listed in two easy ways. To list just the slots on a specific machine:
%UCL_PROMPT_SHORT% <strong>condor_status <em><i>HOSTNAME</i></em></strong>
For example, if you want to see the slots on
e242.chtc.wisc.edu (in the CHTC pool):
%UCL_PROMPT_SHORT% <strong>condor_status e242.chtc.wisc.edu</strong>
To list a specific slot on a machine:
%UCL_PROMPT_SHORT% <strong>condor_status <em><i>SLOT</i></em>@<em><i>HOSTNAME</i></em></strong>
For example, to see the “first” slot on the machine above:
%UCL_PROMPT_SHORT% <strong>condor_status [email protected]</strong>
Note: You can name more than one hostname, slot, or combination thereof on the command line, in which case slots for all of the named hostnames and/or slots are listed.
Let’s get some practice using
- List all slots in the pool — how many are there total?
- Practice using all forms of
condor_statusthat you have learned:
- List the available slots
- List the slots on a specific machine (e.g.,
- List a specific slot from that machine
- Try listing the slots from a few (but not all) machines at once
- Try using a mix of hostnames and slot IDs at once
Viewing a Slot ClassAd¶
Just as with
condor_q, you can use
condor_status to view the complete ClassAd for a given slot (often confusingly called the “machine” ad):
%UCL_PROMPT_SHORT% <strong>condor_status -long <em><i>SLOT</i></em>@<em><i>HOSTNAME</i></em></strong>
Because slot ClassAds may have 150–200 attributes (or more), it probably makes the most sense to show the ClassAd for a single slot at a time, as shown above.
Here are some examples of common, interesting attributes taken directly from
OpSys = "LINUX" DetectedCpus = 24 OpSysAndVer = "SL6" MyType = "Machine" LoadAvg = 0.99 TotalDisk = 798098404 OSIssue = "Scientific Linux release 6.6 (Carbon)" TotalMemory = 24016 Machine = "e242.chtc.wisc.edu" CondorVersion = "$CondorVersion: 8.5.5 May 03 2016 BuildID: 366162 $" Memory = 1024
As you may be able to tell, there is a mix of attributes about the machine as a whole (hence the name “machine ad”) and about the slot in particular.
Go ahead and examine a machine ClassAd now. I suggest looking at one of the slots on, say,
c010.chtc.wisc.edu because of its relatively simple configuration.
Viewing Slots by ClassAd Expression¶
Often, it is helpful to view slots that meet some particular criteria. For example, if you know that your job needs a lot of memory to run, you may want to see how many high-memory slots there are and whether they are busy. You can filter the list of slots like this using the
-constraint option and a ClassAd expression.
For example, suppose we want to list all slots that are running Scientific Linux 6 (operating system) and have at least 16 GB memory available. Note that memory is reported in units of Megabytes. The command is:
%UCL_PROMPT_SHORT% <strong>condor_status -constraint 'OpSysAndVer == "SL6" && Memory >= 64000'</strong>
Note: Be very careful with using quote characters appropriately in these commands. In the example above, the single quotes (
') are for the shell, so that the entire expression is passed to
condor_status untouched, and the double quotes (
") surround a string value within the expression itself.
Currently on CHTC, there are only a few slots that meet these criteria.
If you are interested in learning more about writing ClassAd expressions, look at section 4.1 and especially 4.1.4 of the HTCondor Manual. This is definitely advanced material, so if you do not want to read it, that is fine. But if you do, take some time to practice writing expressions for the
condor_status -constraint command.
condor_q command accepts the
-constraint option as well! As you might expect, the option allows you to limit the jobs that are listed based on a ClassAd expression.
Formatting Output (Optional)¶
condor_status command accepts the same
-af) options that
condor_q accepts, and the options have the same meanings in both commands. Of course, the attributes available in machine ads may differ from the ones that are available in job ads. Use the HTCondor Manual or look at individual slot ClassAds to get a better idea of what attributes are available.
For example, I was curious about the Windows slot listed in the
condor_status summary output. Here are two commands that show the full hostnames and major version information for the Windows slots:
%UCL_PROMPT_SHORT% <strong>condor_status -format '%30s ' Machine -format '%s\n' OpSysAndVer -constraint 'OpSys == "WINDOWS"'</strong> %UCL_PROMPT_SHORT% <strong>condor_status -af Machine -af OpSysAndVer -constraint 'OpSys == "WINDOWS"'</strong>
If you like, spend a few minutes now or later experimenting with
As suggested above, if you want to learn more about
condor_q, you can do some reading:
- Read the
condor_statusman page or HTCondor Manual section (same text) to learn about more options
- Read about ClassAd attributes in Appendix A of the HTCondor Manual
- Read about ClassAd expressions in section 4.1.4 of the HTCondor Manual