<div dir="ltr"><div><div><div>Hi Ryan,<br>Sorry wrong qacct command. I think i may know
what's happening. Are your jobs really long running? I think the
'lookback window' for checking the job history may be too short for you?
You could try it with being at least twice the duration of one of your
qsub'd tasks. See how every other line says ">>> Loading full
job history" That comes up because jobstats are empty, 'qacct -j -b
<some time' is coming back empty.<br>
<br></div><div>Trying to reproduce the behavior from:<br><a href="https://github.com/jtriley/StarCluster/blob/develop/starcluster/balancers/sge/__init__.py#L504" target="_blank">https://github.com/jtriley/StarCluster/blob/develop/starcluster/balancers/sge/__init__.py#L504</a><br>
<br></div>Could you send the output from this:<br></div>make a date of when you started your cluster, approximately, in this format: <br><pre>MMDDhhmm Months, Days, hours, minutes</pre>qacct -j -b <put that date format><br>
<br></div><div>And please paste that qacct output here. That should
always have a history of all jobs. Then try the same with the date
format being only 3 hours ago. You can try toying with the lookback
windows. The default is 3 hours and you can feed a new one in on the
command line:<br>
<br><strong>Lookback window</strong> (-l LOOKBACK_WIN, –lookback_window=LOOKBACK_WIN) - How
long, in hours, to look back for past job history<br><br></div><div>Justin Riley, can you please update the doc on this site?<br><a href="http://star.mit.edu/cluster/docs/0.93.3/manual/load_balancer.html" target="_blank">http://star.mit.edu/cluster/docs/0.93.3/manual/load_balancer.html</a><br>
<br></div>It says the window is in minutes but it's in fact in hours.<br><br>Thanks,<br>Raj</div>