[Starcluster] Load Balancer Problems

Rajat Banerjee rbanerj at fas.harvard.edu
Mon Aug 2 14:38:31 EDT 2010


Hey Amaro,
Cool thanks. I called Brian and got info regarding the array of jobs.
I checked in some test code that works fine on my (simple) cluster
with qsub -t 1-20:1. I'd appreciate it if you'd test and let me know
how it goes. Just committed to github:
http://github.com/rqbanerjee/StarCluster/commit/17998a68feab3d1440aa5d9edc2e74697e43ef54

Making requests during a business day has its rewards :)

Regarding the host that had been inactive for a short time:
1. If the "tasks" field was properly recognized , as it is now, the
queue should be recognized as full, and that node probably wouldn't
have been killed.
2. What is your preference for how long a job should stay idle before
being killed? I usually don't check how long it has been idle. If it
is idle now and the queue is empty then kill it. I could add code to
check how long it has been idle, if it seems useful. Is there a use
case?

Thanks,
Rajat



More information about the StarCluster mailing list