<div dir="ltr">I actually joined the mailing list today because of this issue with spot instances dying. I stumbled across this fork that promises to handle it.<div><br></div><div><a href="https://github.com/datacratic/StarCluster">https://github.com/datacratic/StarCluster</a><br></div><div><br></div><div>Anyone have experience using it?</div><div class="gmail_extra"><br><div class="gmail_quote">On Mon, Mar 16, 2015 at 4:39 PM, Steve Darnell <span dir="ltr"><<a href="mailto:darnells@dnastar.com" target="_blank">darnells@dnastar.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div lang="EN-US" link="blue" vlink="purple">
<div>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d">Hi Raj,<u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"><u></u> <u></u></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d">Thanks for the reply. Manual clean-up is indeed required to deal with these rogue instances. It would be really convenient if the load balancer resolved this scenario
automatically once an hour. One can dream (or implement)…<u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"><u></u> <u></u></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d">Best regards,<u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d">Steve<u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"><u></u> <u></u></span></p>
<p class="MsoNormal"><b><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">From:</span></b><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif""> <a href="mailto:rqbanerjee@gmail.com" target="_blank">rqbanerjee@gmail.com</a> [mailto:<a href="mailto:rqbanerjee@gmail.com" target="_blank">rqbanerjee@gmail.com</a>]
<b>On Behalf Of </b>Rajat Banerjee<br>
<b>Sent:</b> Monday, March 16, 2015 2:04 PM<br>
<b>To:</b> Steve Darnell<br>
<b>Cc:</b> Eduardo Gurgel Valente; Nicholas Chammas; <a href="mailto:starcluster@mit.edu" target="_blank">starcluster@mit.edu</a></span></p><div><div class="h5"><br>
<b>Subject:</b> Re: [StarCluster] How does StarCluster track the clusters it's managing?<u></u><u></u></div></div><p></p><div><div class="h5">
<p class="MsoNormal"><u></u> <u></u></p>
<div>
<div>
<div>
<div>
<div>
<div>
<div>
<p class="MsoNormal" style="margin-bottom:12.0pt">Sorry for the super-slow response.<u></u><u></u></p>
</div>
<p class="MsoNormal" style="margin-bottom:12.0pt">The elastic load balancer parses the output of 'qhost' on the cluster:<br>
<br>
<a href="https://github.com/jtriley/StarCluster/blob/develop/starcluster/balancers/sge/__init__.py#L59" target="_blank">https://github.com/jtriley/StarCluster/blob/develop/starcluster/balancers/sge/__init__.py#L59</a><u></u><u></u></p>
</div>
<p class="MsoNormal">I don't remember the exact reason for using that instead of the same logic as 'listclusters' above, but here's my guess a few years after the fact:<u></u><u></u></p>
</div>
<p class="MsoNormal">- Avoids another remote API call to AWS' tagging service to retrieve the tags for all instances within an account. This needs to be called every minute, so a speedy call to your cluster instead of to a remote API is beneficial<u></u><u></u></p>
</div>
<p class="MsoNormal" style="margin-bottom:12.0pt">- qhost outputs the number of machines correctly configured and able to process work. If a machine shows up in 'listclusters' but not in 'qhost', it's likely not usable for processing jobs and would probably need
manual cleanup.<u></u><u></u></p>
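<p class="MsoNormal">A minimal Python sketch of the second point, for illustration only: it parses plain-text 'qhost' output and reports nodes that the instance listing knows about but SGE does not. The sample output, its column layout, and the names parse_qhost and missing_from_sge are assumptions made for this sketch; they are not StarCluster's actual code, which is at the link above.</p>
<pre>
```python
# Hypothetical sample of plain-text `qhost` output (layout assumed for this sketch).
SAMPLE_QHOST = """
HOSTNAME                ARCH         NCPU  LOAD  MEMTOT  MEMUSE  SWAPTO  SWAPUS
-------------------------------------------------------------------------------
global                  -               -     -       -       -       -       -
master                  linux-x64       1  0.01  590.8M   54.3M     0.0     0.0
node001                 linux-x64       1  0.00  590.8M   36.4M     0.0     0.0
"""


def parse_qhost(text):
    """Return a dict mapping each host name to its remaining qhost columns."""
    hosts = {}
    for line in text.splitlines():
        fields = line.split()
        # Skip blank lines, the header row, the dashed rule,
        # and the "global" pseudo-host that qhost always prints.
        if (not fields
                or fields[0] in ("HOSTNAME", "global")
                or set(line.strip()) == {"-"}):
            continue
        hosts[fields[0]] = fields[1:]
    return hosts


def missing_from_sge(listed_nodes, qhost_text):
    """Return nodes that appear in the instance listing but not in qhost."""
    known = parse_qhost(qhost_text)
    return sorted(node for node in listed_nodes if node not in known)


if __name__ == "__main__":
    # node002 stands in for an instance that is running but never joined SGE.
    listed = ["master", "node001", "node002"]
    print(missing_from_sge(listed, SAMPLE_QHOST))
```
</pre>
<p class="MsoNormal">Nodes returned by missing_from_sge would be the candidates for manual cleanup described above.</p>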
</div>
<p class="MsoNormal">HTH<u></u><u></u></p>
</div>
<p class="MsoNormal">Raj<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
<div>
<p class="MsoNormal">On Tue, Mar 10, 2015 at 4:04 PM, Steve Darnell <<a href="mailto:darnells@dnastar.com" target="_blank">darnells@dnastar.com</a>> wrote:<u></u><u></u></p>
<div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d">On a related topic, does anyone know how the load balancing feature tracks the cluster and its compute
nodes? I have gotten into situations where listclusters correctly reports that a cluster and its nodes are running (I can ssh into master and the nodes, etc.); however, loadbalance reports that the cluster is not running and refuses to balance the cluster.</span><u></u><u></u></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"> </span><u></u><u></u></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d">Best regards,</span><u></u><u></u></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d">Steve</span><u></u><u></u></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"> </span><u></u><u></u></p>
<p class="MsoNormal"><b><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">From:</span></b><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">
<a href="mailto:starcluster-bounces@mit.edu" target="_blank">starcluster-bounces@mit.edu</a> [mailto:<a href="mailto:starcluster-bounces@mit.edu" target="_blank">starcluster-bounces@mit.edu</a>]
<b>On Behalf Of </b>Eduardo Gurgel Valente<br>
<b>Sent:</b> Tuesday, March 10, 2015 2:08 PM<br>
<b>To:</b> Nicholas Chammas<br>
<b>Cc:</b> <a href="mailto:starcluster@mit.edu" target="_blank">starcluster@mit.edu</a><br>
<b>Subject:</b> Re: [StarCluster] How does StarCluster track the clusters it's managing?</span><u></u><u></u></p>
<div>
<div>
<p class="MsoNormal"> <u></u><u></u></p>
<div>
<div>
<div>
<p class="MsoNormal">Hi Nick,<u></u><u></u></p>
</div>
<p class="MsoNormal"> Look at the security group it creates. It follows a naming convention. In addition there are tags with encrypted information at play.<u></u><u></u></p>
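<p class="MsoNormal">As a rough Python illustration of the naming convention: if every cluster's security group shares a fixed prefix, the cluster names can be recovered from the group names alone. The "@sc-" prefix and the function cluster_names are assumptions made for this sketch; check StarCluster's source for the real values.</p>
<pre>
```python
# Assumed prefix for cluster security groups; verify against StarCluster's source.
SC_PREFIX = "@sc-"


def cluster_names(group_names, prefix=SC_PREFIX):
    """Return the cluster names implied by prefixed security group names."""
    return sorted(
        name[len(prefix):]
        for name in group_names
        if name.startswith(prefix)
    )


if __name__ == "__main__":
    # Hypothetical security group names as they might come back from EC2.
    groups = ["default", "@sc-mycluster", "web", "@sc-test"]
    print(cluster_names(groups))
```
</pre>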
</div>
<p class="MsoNormal">Eduardo<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"> <u></u><u></u></p>
<div>
<p class="MsoNormal">On Mon, Mar 9, 2015 at 11:16 PM, Nicholas Chammas <<a href="mailto:nicholas.chammas@gmail.com" target="_blank">nicholas.chammas@gmail.com</a>> wrote:<u></u><u></u></p>
<div>
<div>
<p style="margin:0px!important">Howdy!<u></u><u></u></p>
<p style="margin:0px!important">At <a href="http://youtu.be/vC3lJcPq1FY?t=7m20s" target="_blank">
this point in the StarCluster demo video</a>, the presenter runs the following command to list all the clusters being managed by StarCluster:<u></u><u></u></p>
<pre style="margin-right:1.8pt;margin-bottom:0in;margin-left:1.8pt;margin-bottom:.0001pt;line-height:14.4pt"><code><span style="font-family:Consolas;color:#333333;border:solid #cccccc 1.0pt;padding:6.0pt;background:ghostwhite">starcluster listclusters</span></code><u></u><u></u></pre>
<p style="margin:0px!important">How does StarCluster track all the clusters it’s managing? Is it through the use of EC2 instance tags? A pointer to the relevant code would also be helpful.<u></u><u></u></p>
<p style="margin:0px!important">I’m looking to implement a feature similar to <code>
<span style="font-size:10.0pt;font-family:Consolas;border:solid #eaeaea 1.0pt;padding:0in;background:#f8f8f8">listclusters</span></code> but for
<a href="http://spark.apache.org/docs/1.2.1/ec2-scripts.html" target="_blank">spark-ec2</a>. Tagging seems like the way to go to do that, but
<a href="https://issues.apache.org/jira/browse/SPARK-3332" target="_blank">we had some issues with it</a> when we used it with spark-ec2.<u></u><u></u></p>
<p style="margin:0px!important">So I’m curious to know how StarCluster did things.<u></u><u></u></p>
<p style="margin:0px!important">Nick<u></u><u></u></p>
</div>
</div>
<p class="MsoNormal" style="margin-bottom:12.0pt"><br>
_______________________________________________<br>
StarCluster mailing list<br>
<a href="mailto:StarCluster@mit.edu" target="_blank">StarCluster@mit.edu</a><br>
<a href="http://mailman.mit.edu/mailman/listinfo/starcluster" target="_blank">http://mailman.mit.edu/mailman/listinfo/starcluster</a><u></u><u></u></p>
</div>
<p class="MsoNormal"> <u></u><u></u></p>
</div>
</div>
</div>
</div>
</div>
</div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
</div></div></div>
</div>
<br></blockquote></div><br><br clear="all"><div><br></div><br>
</div></div>