Queries the health of cluster for monitoring
Returns the health of the cluster as assessed by the supervision (agency)
Queries the health of the cluster for monitoring purposes. The response is a JSON object, containing the standard
errorMessage fields as appropriate. The endpoint-specific fields are as follows:
ClusterId: A UUID string identifying the cluster
Health: An object containing a descriptive sub-object for each node in the cluster. Each entry in
Healthwill be keyed by the node ID and contain the the following attributes:
Endpoint: A string representing the network endpoint of the server.
Role: The role the server plays. Possible values are
CanBeDeleted: Boolean representing whether the node can safely be removed from the cluster.
Version: Version String of ArangoDB used by that node.
Engine: Storage Engine used by that node.
Status: A string indicating the health of the node as assessed by the supervision (agency). This should be considered primary source of truth for coordinator and dbservers node health. If the node is responding normally to requests, it is
"GOOD". If it has missed one heartbeat, it is
"BAD". If it has been declared failed by the supervision, which occurs after missing heartbeats for about 15 seconds, it will be marked
Additionally it will also have the following attributes for
- Coordinators and DBServer:
SyncStatus: The last sync status reported by the node. This value is primarily used to determine the value of
Status. Possible values include
LastAckedTime: ISO 8601 timestamp specifying the last heartbeat received.
ShortName: A string representing the shortname of the server, e.g.
Timestamp: ISO 8601 timestamp specifying the last heartbeat received. (deprecated)
Host: An optional string, specifying the host machine if known.
- Only Coordinators:
AdvertisedEndpoint: A string representing the advertised endpoint, if set. (e.g. external IP address or load balancer, optional)
- "Leader": ID of the agent this node regards as leader.
- "Leading": Whether this agent is the leader (true) or not (false).
- "LastAckedTime": Time since last
- 200: is returned when everything went well.