<div dir="ltr">Hi Folks,<div><br></div><div>I am using 0.94.2. I am experimenting w/scaling. I had started a cluster w/two nodes initially, using default names of master and node001. I added another node, node002, then did a removenode of node001. When I attempted to add another node of the c3.8xlarge type (supported by Rayson's mod--thanks, Rayson) using alias of -a c3.8xlarge. Everything went fine until it attempted to install OGS. At that point, it tried to reference the node being added as node001, instead of as the alias:</div>
<div><br></div><div><div>Gerner:.starcluster mary$ sc addnode e1d -a c3.8xlarge -I c3.8xlarge</div><div>StarCluster - (<a href="http://star.mit.edu/cluster">http://star.mit.edu/cluster</a>) (v. 0.94.2)</div><div>Software Tools for Academics and Researchers (STAR)</div>
<div>Please submit bug reports to <a href="mailto:starcluster@mit.edu">starcluster@mit.edu</a></div><div><br></div><div>>>> Launching node(s): c3.8xlarge</div><div>Reservation:r-7ad16218</div><div>>>> Waiting for instances to propagate... </div>
<div>>>> Waiting for node(s) to come up... (updating every 30s)</div><div>>>> Waiting for all nodes to be in a 'running' state...</div><div>3/3 |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 100% </div>
<div>>>> Waiting for SSH to come up on all nodes...</div><div>3/3 |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 100% </div><div>>>> Waiting for cluster to come up took 2.089 mins</div>
<div>>>> Running plugin starcluster.clustersetup.DefaultClusterSetup</div><div>>>> Configuring hostnames...</div><div>1/1 |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 100% </div>
<div>>>> Configuring /etc/hosts on each node</div><div>3/3 |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 100% </div><div>>>> Configuring NFS exports path(s):</div><div>/home /usr/share/jobs/</div>
<div>>>> Mounting all NFS export path(s) on 1 worker node(s)</div><div>1/1 |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 100% </div><div>>>> Setting up NFS took 0.166 mins</div><div>
1/1 |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 100% </div><div>>>> Configuring scratch space for user(s): sgeadmin</div><div>1/1 |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 100% </div>
<div>>>> Configuring passwordless ssh for root</div><div>>>> Configuring passwordless ssh for sgeadmin</div><div>>>> Running plugin starcluster.plugins.sge.SGEPlugin</div><div>>>> Adding c3.8xlarge to SGE</div>
<div>>>> Configuring NFS exports path(s):</div><div>/opt/sge6</div><div>>>> Mounting all NFS export path(s) on 1 worker node(s)</div><div>1/1 |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 100% </div>
<div>>>> Setting up NFS took 0.128 mins</div><div>!!! ERROR - Error occured while running plugin 'starcluster.plugins.sge.SGEPlugin':</div><div>!!! ERROR - remote command 'source /etc/profile && cd /opt/sge6 &&</div>
<div>!!! ERROR - TERM=rxvt ./inst_sge -x -noremote -auto ./ec2_sge.conf'</div><div>!!! ERROR - failed with status 1:</div><div>!!! ERROR - Reading configuration from file ./ec2_sge.conf</div><div>!!! ERROR - [H[2J</div>
<div>!!! ERROR - error resolving host "node001": can't resolve host name</div><div>!!! ERROR - (h_errno = HOST_NOT_FOUND)</div><div>!!! ERROR - error resolving host "node001": can't resolve host name</div>
<div>!!! ERROR - (h_errno = HOST_NOT_FOUND)</div><div>!!! ERROR - error resolving host "node001": can't resolve host name</div><div>!!! ERROR - (h_errno = HOST_NOT_FOUND)</div><div>!!! ERROR - error resolving host "node001": can't resolve host name</div>
<div>!!! ERROR - (h_errno = HOST_NOT_FOUND)</div><div>!!! ERROR - error resolving host "node001": can't resolve host name</div><div>!!! ERROR - (h_errno = HOST_NOT_FOUND)</div><div>!!! ERROR - error resolving host "node001": can't resolve host name</div>
<div>!!! ERROR - (h_errno = HOST_NOT_FOUND)</div><div>Gerner:.starcluster mary$ sc sm e1d</div></div><div><br></div><div style>I don't plan on using this approach routinely, but thought you'd want to know about the error.</div>
<div style><br></div><div style>Thanks,</div><div style>Lyn</div></div>