Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

If you want to decrease the number of nodes, what happens depends on your chose COE. If you're running k8s, magnum will try to find a node with no running containers and delete them. If no empty nodes are found, magnum will warn you, and delete a node at random. With docker swarm, magnum have no logic to discover an empty node, and will just delete nodes at random.

Troubleshooting

Debugging an uhealthy cluster

To check status of a cluster

Code Block
$ heat stack-list -n
# Get the id of the cluster
$ openstack stack failure list <id of the cluster>
# OR
$ openstack coe cluster list
# Get the id of the cluster, NB, it's shorter than the heat ID
$ openstack coe cluster show <cluster ID>

Debugging a part of the cluster

Use heat to find the id's of the cluster

Code Block
$ heat stack-list -n
<snip output>
$ heat resource-list <id from list above>
<snip output>

Run a check of the cluster

Code Block
$ openstack stack check <ID from heat stack-list -n>

Scaling down cluster when status is unhealthy

When scaling up the cluster beyond quota limit, the openstack coe cluster update command doesn't work. But resize does

Code Block
$ openstack coe cluster resize <your-cluster>  <N>

Upgrading

TBA