[StarCluster] Starcluster stuck during setup

Dmitry Serenbrennikov dmitry at adchemy.com
Tue Mar 25 18:25:20 EDT 2014


I'll add another voice confirming that you can add spot nodes using addnode, even without the special options.

The settings in your cluster template will be used when running addnode. The extra options come in handy to use other values if needed, but otherwise just setting the spot bid in the cluster template will ensure that all nodes added later use that bid. This is what we do regularly, so I know that this works. Also, this is critical to allow the loadbalancer to auto-scale your cluster with spot instances, which is a very useful feature as well!
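
For illustration, a cluster template along these lines is roughly what I mean. This is only a sketch: the template name, key name, AMI ID, instance type, cluster size, and bid value are all placeholders, and the setting names should be checked against the configuration reference for your StarCluster version.

```ini
[cluster sparecluster]
# Placeholder values throughout; substitute your own.
KEYNAME = mykey
CLUSTER_SIZE = 8
NODE_IMAGE_ID = ami-xxxxxxxx
NODE_INSTANCE_TYPE = m1.large
# Maximum spot price to bid, in USD per hour (assumed value)
SPOT_BID = 0.50
```

With a bid set in the template like this, a plain "starcluster addnode mycluster" should request the new node as a spot instance at that bid, with no extra options needed.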


Best regards!

-Dmitry



________________________________
From: starcluster-bounces at mit.edu <starcluster-bounces at mit.edu> on behalf of Steve Darnell <darnells at dnastar.com>
Sent: Tuesday, March 25, 2014 3:14 PM
To: Cory Dolphin
Cc: starcluster at mit.edu
Subject: Re: [StarCluster] Starcluster stuck during setup

Hi Cory,

Rayson offered the following advice last year: http://star.mit.edu/cluster/mlarchives/1742.html

The "right" way is to first boot a cluster of say, 8-10 nodes, submit
jobs, and then use the StarCluster addnode command to grow your
cluster.

I do not see why you cannot add spot instances using addnode. The documentation says it is supported: http://star.mit.edu/cluster/docs/latest/manual/addremovenode.html

The addnode command has additional options for customizing the new node's instance type, AMI, spot bid, and more.
See the help menu for a detailed list of all available options:
$ starcluster addnode --help

Best regards,
Steve

--
Steve Darnell
DNASTAR, Inc.
Madison, WI USA

From: starcluster-bounces at mit.edu [mailto:starcluster-bounces at mit.edu] On Behalf Of Cory Dolphin
Sent: Tuesday, March 25, 2014 4:16 PM
To: Butson, Christopher
Cc: starcluster at mit.edu
Subject: Re: [StarCluster] Starcluster stuck during setup

I have had similar issues starting large (30+ node) clusters. Has anyone else found a good pattern for doing so? Sadly, I cannot add nodes incrementally since I need spot instances.

On Tue, Mar 25, 2014 at 8:04 AM, Butson, Christopher <cbutson at mcw.edu> wrote:
Interesting: I let it go and it eventually continued, but it spent over an hour on the "Configuring passwordless ssh for root" step. I am still waiting for the cluster to finish starting up...

Christopher R. Butson, Ph.D.
Associate Professor
Biotechnology & Bioengineering Center
Departments of Neurology, Neurosurgery, Psychiatry & Behavioral Medicine
Medical College of Wisconsin
(414) 955-2678
cbutson at mcw.edu

From: Butson, Christopher <cbutson at mcw.edu>
Date: Tuesday, March 25, 2014 12:13 PM
To: "starcluster at mit.edu" <starcluster at mit.edu>
Subject: Starcluster stuck during setup

I'm on a slow internet connection overseas, trying to initiate a cluster using StarCluster. Once I type "starcluster start mycluster" everything seems to go ok but it gets stuck at the following point and never seems to get past it:
>>> Mounting all NFS export path(s) on 79 worker node(s)
79/79 |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 100%
>>> Setting up NFS took 2.777 mins
>>> Configuring passwordless ssh for root

Any idea why this might occur? Thanks,
Chris

Christopher R. Butson, Ph.D.
Associate Professor
Biotechnology & Bioengineering Center
Departments of Neurology, Neurosurgery, Psychiatry & Behavioral Medicine
Medical College of Wisconsin
(414) 955-2678
cbutson at mcw.edu


_______________________________________________
StarCluster mailing list
StarCluster at mit.edu
http://mailman.mit.edu/mailman/listinfo/starcluster
