Greetings !<br><br>I created a star cluster on EC2 and use qsub to submit jobs. It used to work well. From this afternoon, after I requested for additional EC2 instance from Amazon, the issue comes out.<br><br>Only the jobs submitted to the master node are executed. Other jobs disappeared just in no time. Some diagonosis is as below. Any helps are appreciated !<br>
<br>Happy New Year !<br><br><br>root@master:/# qacct -j 23<br>==============================================================<br>qname all.q <br>hostname node006 <br>group root <br>
owner root <br>project NONE <br>department defaultdepartment <br>jobname single.sh out 3 <br>jobnumber 23 <br>taskid undefined<br>account sge <br>
priority 0 <br>qsub_time Sat Dec 31 01:38:32 2011<br><span style="background-color:rgb(255,255,0)">start_time Sat Dec 31 01:38:39 2011</span><br style="background-color:rgb(255,255,0)"><span style="background-color:rgb(255,255,0)">end_time Sat Dec 31 01:38:39 2011</span><br style="background-color:rgb(255,255,0)">
granted_pe NONE <br>slots 1 <br>failed 0 <br>exit_status 0 <br>ru_wallclock 0 <br>ru_utime 0.010 <br>ru_stime 0.010 <br>
ru_maxrss 2276 <br>ru_ixrss 0 <br>ru_ismrss 0 <br>ru_idrss 0 <br>ru_isrss 0 <br>ru_minflt 2648 <br>
ru_majflt 0 <br>ru_nswap 0 <br>ru_inblock 0 <br>ru_oublock 272 <br>ru_msgsnd 0 <br>ru_msgrcv 0 <br>
ru_nsignals 0 <br>ru_nvcsw 12 <br>ru_nivcsw 3 <br>cpu 0.020 <br>mem 0.000 <br>io 0.000 <br>iow 0.000 <br>
maxvmem 0.000<br>arid undefined<br><br>=========================<br><br>Thanks,<br>-Liang<br>