/v1/operator
The /operator
endpoint provides cluster-level tools for Nomad operators, such
as interacting with the Raft subsystem.
Use this interface with extreme caution, as improper use could lead to a Nomad outage and even loss of data.
See the Outage Recovery guide for some examples of how
these capabilities are used. For a CLI to perform these operations manually,
please see the documentation for the
nomad operator
command.
Read Raft Configuration
This endpoint queries the status of a client node registered with Nomad.
Method | Path | Produces |
---|---|---|
GET | /v1/operator/raft/configuration | application/json |
The table below shows this endpoint's support for blocking queries and required ACLs.
Blocking Queries | ACL Required |
---|---|
NO | management |
Parameters
stale
- Specifies if the cluster should respond without an active leader. This is specified as a query string parameter.
Sample Request
Sample Response
Field Reference
Index
(int)
- TheIndex
value is the Raft corresponding to this configuration. The latest configuration may not yet be committed if changes are in flight.Servers
(array: Server)
- The returnedServers
array has information about the servers in the Raft peer configuration.ID
(string)
- The ID of the server. This is the same as theAddress
but may be upgraded to a GUID in a future version of Nomad.Node
(string)
- The node name of the server, as known to Nomad, or"(unknown)"
if the node is stale and not known.Address
(string)
- Theip:port
for the server.Leader
(bool)
- is either "true" or "false" depending on the server's role in the Raft configuration.Voter
(bool)
- is "true" or "false", indicating if the server has a vote in the Raft configuration. Future versions of Nomad may add support for non-voting servers.
Remove Raft Peer
This endpoint removes a Nomad server with given address from the Raft configuration. The return code signifies success or failure.
Method | Path | Produces |
---|---|---|
DELETE | /v1/operator/raft/peer | application/json |
The table below shows this endpoint's support for blocking queries and required ACLs.
Blocking Queries | ACL Required |
---|---|
NO | management |
Parameters
address
(string: <optional>)
- Specifies the server to remove asip:port
. This cannot be provided along with theid
parameter.id
(string: <optional>)
- Specifies the server to remove asid
. This cannot be provided along with theaddress
parameter.
Sample Request
Read Autopilot Configuration
This endpoint retrieves its latest Autopilot configuration.
Method | Path | Produces |
---|---|---|
GET | /v1/operator/autopilot/configuration | application/json |
The table below shows this endpoint's support for blocking queries and required ACLs.
Blocking Queries | ACL Required |
---|---|
NO | operator:read |
Sample Request
Sample Response
For more information about the Autopilot configuration options, see the agent configuration section.
Update Autopilot Configuration
This endpoint updates the Autopilot configuration of the cluster.
Method | Path | Produces |
---|---|---|
PUT | /v1/operator/autopilot/configuration | application/json |
The table below shows this endpoint's support for blocking queries and required ACLs.
Blocking Queries | ACL Required |
---|---|
NO | operator:write |
Parameters
cas
(int: 0)
- Specifies to use a Check-And-Set operation. The update will only happen if the given index matches theModifyIndex
of the configuration at the time of writing.
Sample Payload
CleanupDeadServers
(bool: true)
- Specifies automatic removal of dead server nodes periodically and whenever a new server is added to the cluster.LastContactThreshold
(string: "200ms")
- Specifies the maximum amount of time a server can go without contact from the leader before being considered unhealthy. Must be a duration value such as10s
.MaxTrailingLogs
(int: 250)
specifies the maximum number of log entries that a server can trail the leader by before being considered unhealthy.ServerStabilizationTime
(string: "10s")
- Specifies the minimum amount of time a server must be stable in the 'healthy' state before being added to the cluster. Only takes effect if all servers are running Raft protocol version 3 or higher. Must be a duration value such as30s
.EnableRedundancyZones
(bool: false)
- (Enterprise-only) Specifies whether to enable redundancy zones.DisableUpgradeMigration
(bool: false)
- (Enterprise-only) Disables Autopilot's upgrade migration strategy in Nomad Enterprise of waiting until enough newer-versioned servers have been added to the cluster before promoting any of them to voters.EnableCustomUpgrades
(bool: false)
- (Enterprise-only) Specifies whether to enable using custom upgrade versions when performing migrations.
Read Health
This endpoint queries the health of the autopilot status.
Method | Path | Produces |
---|---|---|
GET | /v1/operator/autopilot/health | application/json |
The table below shows this endpoint's support for blocking queries and required ACLs.
Blocking Queries | ACL Required |
---|---|
NO | operator:read |
Sample Request
Sample response
Healthy
is whether all the servers are currently healthy.FailureTolerance
is the number of redundant healthy servers that could be fail without causing an outage (this would be 2 in a healthy cluster of 5 servers).Servers
holds detailed health information on each server:ID
is the Raft ID of the server.Name
is the node name of the server.Address
is the address of the server.SerfStatus
is the SerfHealth check status for the server.Version
is the Nomad version of the server.Leader
is whether this server is currently the leader.LastContact
is the time elapsed since this server's last contact with the leader.LastTerm
is the server's last known Raft leader term.LastIndex
is the index of the server's last committed Raft log entry.Healthy
is whether the server is healthy according to the current Autopilot configuration.Voter
is whether the server is a voting member of the Raft cluster.StableSince
is the time this server has been in its currentHealthy
state.
The HTTP status code will indicate the health of the cluster. If
Healthy
is true, then a status of 200 will be returned. IfHealthy
is false, then a status of 429 will be returned.
Read Scheduler Configuration
This endpoint retrieves the latest Scheduler configuration. This API was introduced in Nomad 0.9 and currently supports enabling/disabling preemption. More options may be added in the future.
Method | Path | Produces |
---|---|---|
GET | /v1/operator/scheduler/configuration | application/json |
The table below shows this endpoint's support for blocking queries and required ACLs.
Blocking Queries | ACL Required |
---|---|
NO | operator:read |
Sample Request
Sample Response
Field Reference
Index
(int)
- TheIndex
value is the Raft commit index corresponding to this configuration.SchedulerConfig
(SchedulerConfig)
- The returnedSchedulerConfig
object has configuration settings mentioned below.SchedulerAlgorithm
(string: "binpack")
- Specifies whether scheduler binpacks or spreads allocations on available nodes.PreemptionConfig
(PreemptionConfig)
- Options to enable preemption for various schedulers.SystemSchedulerEnabled
(bool: true)
- Specifies whether preemption for system jobs is enabled. Note that this defaults to true.BatchSchedulerEnabled
(bool: false)
(Enterprise Only) - Specifies whether preemption for batch jobs is enabled. Note that this defaults to false and must be explicitly enabled.ServiceSchedulerEnabled
(bool: false)
(Enterprise Only) - Specifies whether preemption for service jobs is enabled. Note that this defaults to false and must be explicitly enabled.
CreateIndex
- The Raft index at which the config was created.ModifyIndex
- The Raft index at which the config was modified.
Update Scheduler Configuration
This endpoint updates the scheduler configuration of the cluster.
Method | Path | Produces |
---|---|---|
PUT , POST | /v1/operator/scheduler/configuration | application/json |
The table below shows this endpoint's support for blocking queries and required ACLs.
Blocking Queries | ACL Required |
---|---|
NO | operator:write |
Bootstrap Configuration Element
The default_scheduler_config
attribute of the server stanza will provide a
starting value for this configuration. Once bootstrapped, the value in the
server state is authoritative.
Parameters
cas
(int: 0)
- Specifies to use a Check-And-Set operation. The update will only happen if the given index matches theModifyIndex
of the configuration at the time of writing.
Sample Payload
SchedulerAlgorithm
(string: "binpack")
- Specifies whether scheduler binpacks or spreads allocations on available nodes. Possible values are"binpack"
and"spread"
PreemptionConfig
(PreemptionConfig)
- Options to enable preemption for various schedulers.SystemSchedulerEnabled
(bool: true)
- Specifies whether preemption for system jobs is enabled. Note that if this is set to true, then system jobs can preempt any other jobs.BatchSchedulerEnabled
(bool: false)
(Enterprise Only) - Specifies whether preemption for batch jobs is enabled. Note that if this is set to true, then batch jobs can preempt any other jobs.ServiceSchedulerEnabled
(bool: false)
(Enterprise Only) - Specifies whether preemption for service jobs is enabled. Note that if this is set to true, then service jobs can preempt any other jobs.
Sample Response