[StarCluster] starcluster plugin status code 127

Wei Tao wei.tao at tsibiocomputing.com
Wed Dec 21 03:33:25 EST 2011


Hi Don,

The plugin picked up the queue_to_config (all.q) as evidenced in the error
message:

!!! ERROR - command 'qconf -mattr queue load_thresholds np_load_avg=1.5 *
all.q*' failed with status 127

My intention is to config the SGE at the cluster boot up time using the
plugin. Since I executed "starcluster runplugin" after the cluster already
booted up, it apparently is not an issue of plugin execution timing.

The only reason I run the plugin or the plugin command after cluster
already booted up is for debugging purposes.

It's just very strange to me that as root I can execute the exact same
command on the master node without any issue, but running as starcluster
plugin would fail.

Also, what is status 127 anyway??

Thanks!

-Wei


On Wed, Dec 21, 2011 at 1:42 AM, Don MacMillen <macd at nimbic.com> wrote:

> The only difference that I can see is that I have not used arguments to
> the plugin.  I guess you did remember to set the argument "queue_to_config"
> in your config file?
>
> Another possible issue is if you are trying to reconfig a cluster that is
> just
> in the process of coming up.  If you try that command early on, it will
> fail because
> sge has not been installed yet.  Why do you want to config the cluster
> afterwards
> rather than just on the initial bring up?  HTH and let us know what you
> find out.
> Regards.
>
> Don
>
>
> On Tue, Dec 20, 2011 at 10:02 PM, Wei Tao <wei.tao at tsibiocomputing.com>wrote:
>
>> Hi all,
>>
>> I tried to implement the queue configuration suggested by Don MacMillen
>> a while ago. Here is my plugin code:
>>
>> from starcluster.clustersetup import ClusterSetup
>>
>> class SgeConfig(ClusterSetup):
>>     def __init__(self, queue_to_config):
>>         self.queue_to_config = queue_to_config
>>
>>     def run(self, nodes, master, user, user_shell, volumes):
>>         cmd_strg = 'qconf -mattr queue load_thresholds np_load_avg=1.5
>> %s' %self.queue_to_config
>>         output = master.ssh.execute(cmd_strg)
>>
>> When I execute "starcluster runplugin <myplugin> <mycluster>", I got:
>>
>> >>> Running plugin <myplugin>
>> !!! ERROR - command 'qconf -mattr queue load_thresholds np_load_avg=1.5
>> all.q' failed with status 127
>>
>> If I sshmaster and run the command directly as this:
>>
>> root at master:~# qconf -mattr queue load_thresholds np_load_avg=1.5 all.q
>> root at master modified "all.q" in cluster queue list
>>
>> It works fine. Could someone please point out why the plugin would have a
>> status code 127 when direct execution of the command apparently works fine?
>>
>> Thanks for the help!
>>
>>
>> -Wei
>> _______________________________________________
>> StarCluster mailing list
>> StarCluster at mit.edu
>> http://mailman.mit.edu/mailman/listinfo/starcluster
>>
>>
>


-- 
Wei Tao, Ph.D.
TSI Biocomputing LLC
617-564-0934
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/pipermail/starcluster/attachments/20111221/0ea304f4/attachment-0001.htm


More information about the StarCluster mailing list