[StarCluster] Spot instances not removed from the queue when terminated

Mircea Cimpoi mircea at magicpony.technology
Tue Nov 24 07:01:19 EST 2015


Hi All,

We’ve started using StarCluster to run experiments on AWS and have
encountered some problems with the load balancer when using spot instances.

Specifically, the nodes are not removed from the SGE execution host list
(qconf -sel) after the spot instances are terminated, and they still appear
in the qhost output. Their jobs also still appear to be running after
termination, which seems to stop the load balancer from adding new nodes as
we would expect.
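
A minimal sketch of one way to spot such stale nodes, assuming qhost prints
"-" in the LOAD column for hosts whose execution daemon no longer reports.
The function name and the sample text are hypothetical, made up for
illustration and not captured from a real cluster:

```python
# Sketch only: SAMPLE is illustrative text in the usual qhost layout,
# not output from a real cluster. Column sets can differ between
# Grid Engine versions, so the LOAD column is located from the header.
SAMPLE = """\
HOSTNAME                ARCH         NCPU  LOAD  MEMTOT  MEMUSE  SWAPTO  SWAPUS
-------------------------------------------------------------------------------
global                  -               -     -       -       -       -       -
master                  linux-x64       8  0.05   14.7G    1.1G     0.0     0.0
node001                 linux-x64       8     -   14.7G       -     0.0       -
node002                 linux-x64       8  7.90   14.7G    3.2G     0.0     0.0
"""


def unresponsive_hosts(qhost_text):
    """Return the host names whose LOAD column is '-' in qhost-style text."""
    lines = [ln for ln in qhost_text.splitlines() if ln.strip()]
    header = lines[0].split()
    load_idx = header.index("LOAD")

    stale = []
    for ln in lines[1:]:
        fields = ln.split()
        if len(fields) <= load_idx:
            continue  # separator line or malformed row
        name = fields[0]
        if name == "global":
            continue  # pseudo-host row, never reports a load
        if fields[load_idx] == "-":
            stale.append(name)
    return stale


if __name__ == "__main__":
    for host in unresponsive_hosts(SAMPLE):
        print(host)
```

On a live cluster the text would come from running qhost on the master. A
host reported this way could then be compared against the instances that
are actually still running before anything is removed with qconf.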

We are using g2.2xlarge instances, and requesting 8 slots per submitted job.

Has anyone experienced similar issues?

Mircea