Hi Justin,<br><br>thanks for the quick respond.<br><br>I've download the development version and I've notice many new changes/features<br>compared to version 0.91.<br><br>However, I've find 2 bugs.<br><br>Bug-1:<br>
If there is no volume is been specified to a cluster in the configure file, the <br>starcluster will crash with:<br><br><blockquote style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;" class="gmail_quote">
TypeError: 'NoneType' object is not iterable<br></blockquote><br>Cause:<br>After tracking the problem: the "self.VOLUMES" in "cluster.py" <br>have a value of None. And, there are two functions(setup_ebs, setup_nfs) in <br>
"clustersetup.py" are tying to for-loop it.<br><br>A Quick-Fix:<br>Cluster.py( line: 154): I've added the following after line 154:<br><br><blockquote style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;" class="gmail_quote">
if not volumes:<br> self.VOLUMES = []<br></blockquote><div> </div> I'm sure there is a better way to fix this.<br><br><br><br>Bug-2:<br>Start a cluster and tagged as "bug2", then stop the cluster.<br>
Then, start another cluster and tagged as "bug2", the startcluster will crash.<br><br>Cause:<br>Starcluster is trying to SSH (ssh.py:49) the terminated instances from the <br>first cluster where the terminated instances have no hostname.<br>
<br>Quck-fix:<br>In cluster.py(self.nodes): I've add the following after line 278 to filter out the<br> terminated instances:<br><br><blockquote style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;" class="gmail_quote">
if node.state == 'terminated':<br> continue <br></blockquote><br><br>Again, thanks for your great effort<br>Nasser<br><br><br><div class="gmail_quote">On Tue, Mar 30, 2010 at 5:39 AM, Justin Riley <span dir="ltr"><<a href="mailto:jtriley@mit.edu">jtriley@mit.edu</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">-----BEGIN PGP SIGNED MESSAGE-----<br>
Hash: SHA1<br>
<br>
Hi Nasser<br>
<br>
I've cc'd the starcluster mailing list, hope you don't mind.<br>
<br>
BTW, I'd like to invite you to join the starcluster mailing list. It's a<br>
good place to keep up with things and submit issues<br>
such as these. You can join the list here:<br>
<br>
<a href="http://mailman.mit.edu/mailman/listinfo/starcluster" target="_blank">http://mailman.mit.edu/mailman/listinfo/starcluster</a><br>
<br>
Thanks for reporting this issue. I've made a quick-fix change in the<br>
development version of the code on github by bumping the timeout to 5<br>
sec. This still might not help you if the latency is really bad.<br>
<br>
My current thinking on this is to 'throttle' the timeout time the longer<br>
it takes for the cluster to appear to be up. So, at first it would<br>
attempt a 5 second timeout, and then incrementally raise it up to 15<br>
seconds as necessary. After a maximum of 15 seconds and enough retries,<br>
it would likely just error out.<br>
<br>
This is on my list for the next version.<br>
<br>
Thanks for reporting!<br>
<br>
~Justin<br>
<br>
<br>
<br>
> Problem:<br>
> I've installed & configured StartCluster correctly. However, when I<br>
try to start it with "startcluster -s", everything goes fine until it<br>
reach the line ">>> Waiting for cluster to start..." and that when it<br>
run forever(infinite loop). Even after all the instances are in<br>
"running" state.<br>
><br>
> Solution:<br>
> After debugging, I found out that the value of socket's timeout(0.25) in:<br>
><br>
> File: starcluster/ec2utils.py<br>
> Function: is_ssh_up()<br>
> Line: s.settimeout(0.25)<br>
><br>
> is too small for my connection; due to a latency issue.<br>
><br>
> So I've commented, as a quick fix, that line and everything work fine.<br>
><br>
> A bigger value would solve this.<br>
><br>
> Thanks for your great work and keep it up<br>
> Nasser<br>
-----BEGIN PGP SIGNATURE-----<br>
Version: GnuPG v2.0.14 (GNU/Linux)<br>
Comment: Using GnuPG with Mozilla - <a href="http://enigmail.mozdev.org/" target="_blank">http://enigmail.mozdev.org/</a><br>
<br>
iEYEARECAAYFAkuw8/YACgkQ4llAkMfDcrmlBwCePfX/zZoQjqlh9dQS7xo4geQm<br>
wn4AoJHE0/AdvRbAMB4EIz5yvompZsRt<br>
=kjHp<br>
-----END PGP SIGNATURE-----<br>
</blockquote></div><br>