<div dir="ltr">Hi Steve,<div><br></div><div>Sorry for not responding sooner - I didn't see this in my email until now. I have never encountered those errors, and not knowing what you did to produce them, I'm not sure how helpful I can be. Here's a stab at it:</div>
<div><br></div><div>Based on the error, it seems that when you deleted the nodes, but you didn't delete the security group. You should delete the security group in the aws console in addition to terminating the nodes.</div>
<div><br></div><div>To the larger problem of why you got that error in the first place, here are some questions: </div><div> - The VPC code you need to use is here: <a href="https://github.com/jtriley/StarCluster/pull/236">https://github.com/jtriley/StarCluster/pull/236</a></div>
<div> - did you install in correctly?</div><div> - Did you modify the code base to get this to work? (You don't need to).</div><div> - Did you attempt to start the cluster from inside the vpc? If you don't, the initialization should hang (but should be recoverable)</div>
<div> - If you still have problems, have you tried spinning up non-spot (flat rate) instances in vpc? I know for sure that those work. There are more things that can go wrong with spot instances, so I'd try a non-spot cluster first.</div>
<div><br></div><div>I just spun up a spot instance cluster with no problems, so it should work for you. If you get things working, I'd love to know what you did to fix it.</div><div><br></div><div>Alex</div></div><div class="gmail_extra">
<br><br><div class="gmail_quote">On Mon, Sep 16, 2013 at 3:22 PM, Steve Heistand <span dir="ltr"><<a href="mailto:steve.heistand@nasa.gov" target="_blank">steve.heistand@nasa.gov</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
-----BEGIN PGP SIGNED MESSAGE-----<br>
Hash: SHA1<br>
<br>
hi folks,<br>
<br>
so Im playing around with the VPC version of starcluster and in the process have<br>
gotten things a little wedged. Errors pop up in the start process that I fix<br>
but at some point a cluster got up enough to be "alive" but not really working.<br>
I had to clean things up via the AWS console and now I cant clean up what<br>
starcluster thinks is going on.:<br>
<br>
# starcluster listclusters<br>
StarCluster - (<a href="http://star.mit.edu/cluster" target="_blank">http://star.mit.edu/cluster</a>) (v. 0.94)<br>
Software Tools for Academics and Researchers (STAR)<br>
Please submit bug reports to <a href="mailto:starcluster@mit.edu">starcluster@mit.edu</a><br>
<br>
*** WARNING - Setting 'EC2_PRIVATE_KEY' from environment...<br>
*** WARNING - Setting 'EC2_CERT' from environment...<br>
- ----------------------------<br>
hos (security group: sc-hos)<br>
- ----------------------------<br>
Launch time: N/A<br>
Uptime: N/A<br>
Zone: N/A<br>
Keypair: N/A<br>
EBS volumes: N/A<br>
!!! ERROR - InvalidPermission.NotFound: The specified rule does not exist in this<br>
security group.<br>
Traceback (most recent call last):<br>
File<br>
"/usr/local/lib/python2.7/dist-packages/StarCluster-0.94-py2.7.egg/starcluster/cli.py",<br>
line 274, in main<br>
sc.execute(args)<br>
File<br>
"/usr/local/lib/python2.7/dist-packages/StarCluster-0.94-py2.7.egg/starcluster/commands/listclusters.py",<br>
line 36, in execute<br>
show_ssh_status=self.opts.show_ssh_status)<br>
File<br>
"/usr/local/lib/python2.7/dist-packages/StarCluster-0.94-py2.7.egg/starcluster/cluster.py",<br>
line 331, in list_clusters<br>
spot_reqs = cl.spot_requests<br>
File<br>
"/usr/local/lib/python2.7/dist-packages/StarCluster-0.94-py2.7.egg/starcluster/cluster.py",<br>
line 777, in spot_requests<br>
filters = {'launch.group-id': <a href="http://self.cluster_group.id" target="_blank">self.cluster_group.id</a>,<br>
File<br>
"/usr/local/lib/python2.7/dist-packages/StarCluster-0.94-py2.7.egg/starcluster/cluster.py",<br>
line 684, in cluster_group<br>
static.WORLD_CIDRIP)<br>
File<br>
"/usr/local/lib/python2.7/dist-packages/boto-2.9.9-py2.7.egg/boto/ec2/securitygroup.py",<br>
line 222, in revoke<br>
src_group_group_id)<br>
File<br>
"/usr/local/lib/python2.7/dist-packages/boto-2.9.9-py2.7.egg/boto/ec2/connection.py",<br>
line 2634, in revoke_security_group<br>
params, verb='POST')<br>
File "/usr/local/lib/python2.7/dist-packages/boto-2.9.9-py2.7.egg/boto/connection.py",<br>
line 1115, in get_status<br>
raise self.ResponseError(response.status, response.reason, body)<br>
EC2ResponseError: EC2ResponseError: 400 Bad Request<br>
<?xml version="1.0" encoding="UTF-8"?><br>
<Response><Errors><Error><Code>InvalidPermission.NotFound</Code><Message>The specified<br>
rule does not exist in this security<br>
group.</Message></Error></Errors><RequestID>d4d51a6a-cb7d-471f-b1aa-54a4ceb35ede</RequestID></Response><br>
<br>
<br>
is there a database that can be modified to clean up the exists of this bad cluster?<br>
or some such method of cleaning things up?<br>
<br>
the startcluster terminate -f cluster_name also fails in various and exciting ways.<br>
<br>
thanks<br>
<br>
s<br>
<br>
- --<br>
************************************************************************<br>
Steve Heistand NASA Ames Research Center<br>
email: <a href="mailto:steve.heistand@nasa.gov">steve.heistand@nasa.gov</a> Steve Heistand/Mail Stop 258-6<br>
ph: <a href="tel:%28650%29%20604-4369" value="+16506044369">(650) 604-4369</a> Bldg. 258, Rm. 232-5<br>
Scientific & HPC Application P.O. Box 1<br>
Development/Optimization Moffett Field, CA 94035-0001<br>
************************************************************************<br>
"Any opinions expressed are those of our alien overlords, not my own."<br>
<br>
# For Remedy #<br>
#Action: Resolve #<br>
#Resolution: Resolved #<br>
#Reason: No Further Action Required #<br>
#Tier1: User Code #<br>
#Tier2: Other #<br>
#Tier3: Assistance #<br>
#Notification: None #<br>
-----BEGIN PGP SIGNATURE-----<br>
Version: GnuPG v2.0.14 (GNU/Linux)<br>
<br>
iEYEARECAAYFAlI3WoEACgkQoBCTJSAkVrF0gACg4uiExQ5N4hJfFP+RfyvVRbRB<br>
BxsAoIAHX33knuzEkEeat7JeL7pTNPYC<br>
=yNt6<br>
-----END PGP SIGNATURE-----<br>
_______________________________________________<br>
StarCluster mailing list<br>
<a href="mailto:StarCluster@mit.edu">StarCluster@mit.edu</a><br>
<a href="http://mailman.mit.edu/mailman/listinfo/starcluster" target="_blank">http://mailman.mit.edu/mailman/listinfo/starcluster</a><br>
</blockquote></div><br></div>