<div dir="ltr"><div><div><div>Hey Starcluster,<br><br></div>I'm getting the same error as this guy:<br><a href="http://star.mit.edu/cluster/mlarchives/1592.html">http://star.mit.edu/cluster/mlarchives/1592.html</a><br>
<br></div><div>Briefly:<br></div>When I go to use addnode, a spot request opens on amazon (i'm starting a spot cluster, so addnode bids). But starcluster proceeds to try to install ssh without waiting for the node to come up.<br>
<br>>>> Launching node(s): node002<br>SpotInstanceRequest:sir-b35acc5e<br>>>> Waiting for spot requests to propagate... <br>>>> Waiting for node(s) to come up... (updating every 30s)<br>>>> Waiting for all nodes to be in a 'running' state...<br>
2/2 |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 100% <br>>>> Waiting for SSH to come up on all nodes...<br>2/2 |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 100% <br>
>>> Waiting for cluster to come up took 0.020 mins<br>!!! ERROR - node 'node002' does not exist<br><br></div><div>Morever, this only happens when addnode tried to bid (either by defualt becuase im running a spot cluster or by inline directive)<br>
</div><div><br></div><div>I don't know what to try next tho. Do you guys have any ideas where to start?<br></div><br>thanks<br>Yoshi<br></div>