SLURM Reference Manual
SINFO Node States
In SINFO reports, the strings below are the only possible values of the
STATE column, indicating the current status of a node, a set of nodes,
or a node partition.
STATE codes with * appended indicate that a reported node is not
responding (SLURM does not allocate new work to such nodes, which
eventually enter the DOWN state).
The default width of the STATE column is 5characters,
but SINFO option -o (--format) can change this and the reported
STATE codes will lengthen or shorten to fill the allowed spaces.
SINFO option -t (--states) will limit your report to only those nodes
with the (comma-delimited list of) state(s) that you specify.
A separate section covers SQUEUE's job
- means that this node (or set, or partition)
has already been assigned to one or more jobs.
- means that job(s) assigned to this node are
COMPLETING disappears when all of the job's processes as well as
the SLURM epilog program (if any) have terminated.
See the slurm.conf MAN page for details.
- means that this node is unvailable for jobs.
SLRUM automatically declares nodes DOWN if some failure occurs.
Also, system administrators may declare a node DOWN.
If a node resumes normal operation, SLRUM can automatically return it
See ReturnToService and SlurmdTimeout descriptions in the slurm.conf
MAN page for more details.
- means that this node has been declared unavailable
by a system administrator using SCONTROL's UPDATE command.
- [DRNG] means that this node is currently running a job,
but it will not be allocated to additional jobs.
The node state changes to DRAINED when the last job on it completes.
System administrators put nodes in this state by
using SCONTROL's UPDATE command.
- means that this node is not currently assigned to any jobs
and it available for use.
- means that the SLRUM controller has just started
and hence this node's real status has not yet been determined.
Navigation Links: [
Document List ] [
HPC Home ]