[Starcluster] Error while ">>> Installing Sun Grid Engine..."

Justin Riley justin.t.riley at gmail.com
Sat Mar 6 12:37:18 EST 2010


Nicolas,

The IOError in your traceback is raised inside paramiko's SFTP client. Which
version of paramiko are you using? A couple of other users have reported a
similar problem, but I have not been able to reproduce it myself to track it
down. If you aren't already, make sure you're using the latest paramiko version.

My only real guess as to why people experience this is that there's a loss of 
an ssh connection somehow. The point in my code that causes this is when I 
request a remote file from paramiko (/opt/sge6/ec2_sge.conf). I'm not doing 
anything fancy there. It's either a connection problem or a paramiko bug. If 
you can reproduce it, let me know.

Perhaps I should wrap all remote_file calls in try/except, retry a few times
if they fail, and error out completely if it still can't create the file.
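
Something along these lines is what I have in mind. This is only a rough
sketch, not what StarCluster does today: the helper name retry_remote_file,
the attempts and delay parameters, and their defaults are all made up for
illustration. It takes whatever callable opens the remote file (for example
mconn.remote_file) and retries on IOError, which is the exception paramiko
raised in the traceback below.

```python
import time


def retry_remote_file(open_remote, path, attempts=3, delay=2.0):
    """Call open_remote(path), retrying a few times on IOError.

    Hypothetical helper: open_remote is any callable that opens a remote
    file, attempts is the total number of tries, and delay is the pause
    in seconds between tries.
    """
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            return open_remote(path)
        except IOError as err:
            last_error = err
            # Pause before the next try, but not after the final failure.
            if attempt < attempts:
                time.sleep(delay)
    # Every attempt failed: give up and report the last error seen.
    raise IOError("could not open %s after %d attempts: %s"
                  % (path, attempts, last_error))
```

The failing call in setup_sge would then become something like
retry_remote_file(mconn.remote_file, "/opt/sge6/ec2_sge.conf"). If the
problem really is a dropped ssh connection, a plain retry may not be enough
and the connection itself might need to be reopened first.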

Does this happen repeatedly?

~Justin

On Friday 05 March 2010 8:48:10 pm Nicolas Pinto wrote:
> Hey Justin,
> 
> I'm getting the following error when launching a cluster using 0.90.1
> (I changed the user to be 'thor' instead of 'sgeadmin')
> 
> >>> Installing Sun Grid Engine...
> 
> Traceback (most recent call last):
>   File "/usr/local/bin/starcluster", line 100, in <module>
>     main()
>   File "/usr/local/bin/starcluster", line 71, in main
>     ec2utils.start_cluster(create=create)
>   File "/usr/local/lib/python2.6/dist-packages/starcluster/ec2utils.py",
> line 27, in wrapper
>     res = func(*arg, **kargs)
>   File "/usr/local/lib/python2.6/dist-packages/starcluster/ec2utils.py",
> line 321, in start_cluster
>     cluster_setup.main(get_nodes())
>   File
>  "/usr/local/lib/python2.6/dist-packages/starcluster/cluster_setup.py",
>  line 227, in main
>     setup_sge(nodes)
>   File
>  "/usr/local/lib/python2.6/dist-packages/starcluster/cluster_setup.py",
>  line 189, in setup_sge
>     ec2_sge_conf = mconn.remote_file("/opt/sge6/ec2_sge.conf")
>   File "/usr/local/lib/python2.6/dist-packages/starcluster/ssh.py",
> line 109, in remote_file
>     rfile = self._sftp.open(file, mode)
>   File "/usr/local/lib/python2.6/dist-packages/paramiko/sftp_client.py",
> line 245, in open
>     t, msg = self._request(CMD_OPEN, filename, imode, attrblock)
>   File "/usr/local/lib/python2.6/dist-packages/paramiko/sftp_client.py",
> line 628, in _request
>     return self._read_response(num)
>   File "/usr/local/lib/python2.6/dist-packages/paramiko/sftp_client.py",
> line 675, in _read_response
>     self._convert_status(msg)
>   File "/usr/local/lib/python2.6/dist-packages/paramiko/sftp_client.py",
> line 701, in _convert_status
>     raise IOError(errno.ENOENT, text)
> IOError: [Errno 2] No such file
> 
> Any idea what's wrong?
> 
> Best,
> 


