[StarCluster] Starcluster stuck during setup

Rayson Ho raysonlogin at gmail.com
Tue Mar 25 19:42:03 EDT 2014


If you really have a slow connection, you may consider bootstrapping
StarCluster on AWS - ie. configure an m1.small (or even t1.micro) and
install StarCluster on that node. In fact, there's a CloudFormation
template for that:
http://aws.typepad.com/aws/2012/06/ec2-spot-instance-updates-auto-scaling-and-cloudformation-integration-new-sample-app-1.html
. On the other hand, it's way easier to do it by hand and just launch
an instance from the standard Ubuntu AMI, and then install StarCluster
on that instance.

And like others mentioned, most large StarClusters are launched by
first starting a small cluster, and then grow it dynamically. You
should be able to run the addnode command from your qmaster node
provided that you have StarCluster setup there (note that your AWS key
will be on the EC2 instance so it is slightly more risky if security
is the main concern).

Rayson

==================================================
Open Grid Scheduler - The Official Open Source Grid Engine
http://gridscheduler.sourceforge.net/
http://gridscheduler.sourceforge.net/GridEngine/GridEngineCloud.html


On Tue, Mar 25, 2014 at 8:04 AM, Butson, Christopher <cbutson at mcw.edu> wrote:
> Interesting: I let it go and it eventually continued but it took over an hour to Configuring passwordless ssh for root. Still waiting for the cluster to finish startup...
>
> Christopher R. Butson, Ph.D.
> Associate Professor
> Biotechnology & Bioengineering Center
> Departments of Neurology, Neurosurgery, Psychiatry & Behavioral Medicine
> Medical College of Wisconsin
> (414) 955-2678
> cbutson at mcw.edu<mailto:cbutson at mcw.edu>
>
>
> From: <Butson>, Christopher Butson <cbutson at mcw.edu<mailto:cbutson at mcw.edu>>
> Date: Tuesday, March 25, 2014 12:13 PM
> To: "starcluster at mit.edu<mailto:starcluster at mit.edu>" <starcluster at mit.edu<mailto:starcluster at mit.edu>>
> Subject: Starcluster stuck during setup
>
> I'm on a slow internet connection overseas, trying to initiate a cluster using StarCluster. Once I type "starcluster start mycluster" everything seems to go ok but it gets stuck at the following point and never seems to get past it:
>>>> Mounting all NFS export path(s) on 79 worker node(s)
> 79/79 |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 100%
>>>> Setting up NFS took 2.777 mins
>>>> Configuring passwordless ssh for root
>
> Any idea why this might occur? Thanks,
> Chris
>
> Christopher R. Butson, Ph.D.
> Associate Professor
> Biotechnology & Bioengineering Center
> Departments of Neurology, Neurosurgery, Psychiatry & Behavioral Medicine
> Medical College of Wisconsin
> (414) 955-2678
> cbutson at mcw.edu<mailto:cbutson at mcw.edu>
>
>
> _______________________________________________
> StarCluster mailing list
> StarCluster at mit.edu
> http://mailman.mit.edu/mailman/listinfo/starcluster


More information about the StarCluster mailing list