[StarCluster] problem adding nodes. Node003 doesnt exist so I crash

Ramon Ramirez-Linan rlinan at navteca.com
Fri Nov 14 10:41:02 EST 2014


I was trying to add 97 nodes to y 3 nodes cluster.

I try last evening without success because AWS didnt have enought EC2 of
this type on my region (c3.4xlarge US-EAST)

I tried this morning and it looks like it was adding them, but for some
reason some nodes did not get added and StarCluster crashed since it
couldnt find those nodes to configure them

This is the error

ec2-user at ip-172-31-1-249 ~]$ time starcluster addnode -n 97 aes-300
/usr/lib64/python2.6/site-packages/Crypto/Util/number.py:57:
PowmInsecureWarning: Not using mpz_powm_sec.  You should rebuild using
libgmp >= 5 to avoid timing attack vulnerability.
  _warn("Not using mpz_powm_sec.  You should rebuild using libgmp >= 5 to
avoid timing attack vulnerability.", PowmInsecureWarning)
StarCluster - (http://star.mit.edu/cluster) (v. 0.95.5)
Software Tools for Academics and Researchers (STAR)
Please submit bug reports to starcluster at mit.edu

>>> Launching node(s): node003, node004, node005, node006, node007,
node008, node009, node010, node011, node012, node013, node014, node015,
node016, node017, node018, node019, node020, node021, node022, node023,
node024, node025, node026, node027, node028, node029, node030, node031,
node032, node033, node034, node035, node036, node037, node038, node039,
node040, node041, node042, node043, node044, node045, node046, node047,
node048, node049, node050, node051, node052, node053, node054, node055,
node056, node057, node058, node059, node060, node061, node062, node063,
node064, node065, node066, node067, node068, node069, node070, node071,
node072, node073, node074, node075, node076, node077, node078, node079,
node080, node081, node082, node083, node084, node085, node086, node087,
node088, node089, node090, node091, node092, node093, node094, node095,
node096, node097, node098, node099
Reservation:r-2fe2d905
>>> Waiting for instances to propagate...
97/97 ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
100%
>>> Waiting for node(s) to come up... (updating every 30s)
>>> Waiting for all nodes to be in a 'running' state...
38/38 ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
100%
>>> Waiting for SSH to come up on all nodes...
38/38 ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
100%
>>> Waiting for cluster to come up took 1.882 mins
!!! ERROR - node 'node003' does not exist

real    2m4.162s
user    0m4.868s
sys     0m0.332s
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/pipermail/starcluster/attachments/20141114/d8d30346/attachment.htm


More information about the StarCluster mailing list