<div dir="ltr">I attach a crash report.<div style>I think this may be an error in mapping a node name to an internet name.</div><div style>The host <a href="http://ec2-23-22-72-123.compute-1.amazonaws.com">ec2-23-22-72-123.compute-1.amazonaws.com</a> was not actually node004 which I was trying to remove, it was node003.</div>
<div style>Do you think the github version will be better than the released version at the moment? I do have the latest release.</div><div style>Dan</div><div style><br></div><div><div><br></div><div>>>> Removing node004 from SGE</div>
<div>!!! ERROR - command 'source /etc/profile && qconf -de node004' failed with status 1</div><div>!!! ERROR - command 'pkill -9 sge_execd' failed with status 1</div><div>>>> Updating SGE parallel environment 'orte'</div>
<div>4/4 |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 100%</div><div>error occurred in job (id=139906233857792): failed to connect to host <a href="http://ec2-23-22-72-123.compute-1.amazonaws.com">ec2-23-22-72-123.compute-1.amazonaws.com</a> on port 22</div>
<div>Traceback (most recent call last):</div><div> File "/opt/lib/python2.6/site-packages/StarCluster-0.93.3-py2.6.egg/starcluster/threadpool.py", line 31, in run</div><div> job.run()</div><div> File "/opt/lib/python2.6/site-packages/StarCluster-0.93.3-py2.6.egg/starcluster/threadpool.py", line 58, in run</div>
<div> r = self.method(*self.args, **self.kwargs)</div><div> File "/opt/lib/python2.6/site-packages/StarCluster-0.93.3-py2.6.egg/starcluster/plugins/sge.py", line 50, in <lambda></div><div> num_processors = sum(self.pool.map(lambda n: n.num_processors, nodes))</div>
<div> File "/opt/lib/python2.6/site-packages/StarCluster-0.93.3-py2.6.egg/starcluster/node.py", line 169, in num_processors</div><div> 'cat /proc/cpuinfo | grep processor | wc -l')[0])</div><div> File "/opt/lib/python2.6/site-packages/StarCluster-0.93.3-py2.6.egg/starcluster/sshutils/__init__.py", line 519, in execute</div>
<div> channel = self.transport.open_session()</div><div> File "/opt/lib/python2.6/site-packages/StarCluster-0.93.3-py2.6.egg/starcluster/sshutils/__init__.py", line 136, in transport</div><div> port=self._port, timeout=self._timeout)</div>
<div> File "/opt/lib/python2.6/site-packages/StarCluster-0.93.3-py2.6.egg/starcluster/sshutils/__init__.py", line 103, in connect</div><div> raise exception.SSHConnectionError(host, port)</div><div>SSHConnectionError: failed to connect to host <a href="http://ec2-23-22-72-123.compute-1.amazonaws.com">ec2-23-22-72-123.compute-1.amazonaws.com</a> on port 22</div>
</div><div><br></div></div>