[StarCluster] master node not in qstat
David Erickson
derickso at stanford.edu
Mon Mar 19 15:08:27 EDT 2012
On 3/9/2012 3:04 PM, Justin Riley wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> Hi Adam,
>
>> Setup a cluster today (0.93.2) and suddenly noticed that the
>> 'master' node was not being reported in a "qstat -f" command and
>> was not accepting run jobs from the queue . . . i.e., with 12 nodes
>> x 8 cpus each (96), when 96 jobs are submitted, only 88 run (nodes
>> 1-11) while 8 remain in the queue waiting.
>>
>> I tried restarting the cluster using the 'sge' plugin to manually
>> ensure that master_is_exec_host was set to 'True'. But the result
>> was the same: 88 running - 8 waiting.
> Thanks for testing and reporting. I can confirm this bug and have
> created an issue on github[1]. I should have a hotfix release 0.93.3
> out tonight specifically to fix this.
Hi just wanted to follow up on this, I did an easy install update today
and still got 93.2, any eta on 93.3 with this fix?
Thanks,
David
More information about the StarCluster
mailing list