[StarCluster] post wedge cleanup

Steve Heistand steve.heistand at nasa.gov
Mon Sep 16 15:22:41 EDT 2013


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

hi folks,

So I'm playing around with the VPC version of StarCluster and in the process have
gotten things a little wedged. Errors popped up during the start process, which I fixed,
but at some point a cluster came up far enough to be "alive" without really working.
I had to clean things up via the AWS console, and now I can't clean up what
StarCluster thinks is going on:

# starcluster listclusters
StarCluster - (http://star.mit.edu/cluster) (v. 0.94)
Software Tools for Academics and Researchers (STAR)
Please submit bug reports to starcluster at mit.edu

*** WARNING - Setting 'EC2_PRIVATE_KEY' from environment...
*** WARNING - Setting 'EC2_CERT' from environment...
- ----------------------------
hos (security group: sc-hos)
- ----------------------------
Launch time: N/A
Uptime: N/A
Zone: N/A
Keypair: N/A
EBS volumes: N/A
!!! ERROR - InvalidPermission.NotFound: The specified rule does not exist in this security group.
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/StarCluster-0.94-py2.7.egg/starcluster/cli.py", line 274, in main
    sc.execute(args)
  File "/usr/local/lib/python2.7/dist-packages/StarCluster-0.94-py2.7.egg/starcluster/commands/listclusters.py", line 36, in execute
    show_ssh_status=self.opts.show_ssh_status)
  File "/usr/local/lib/python2.7/dist-packages/StarCluster-0.94-py2.7.egg/starcluster/cluster.py", line 331, in list_clusters
    spot_reqs = cl.spot_requests
  File "/usr/local/lib/python2.7/dist-packages/StarCluster-0.94-py2.7.egg/starcluster/cluster.py", line 777, in spot_requests
    filters = {'launch.group-id': self.cluster_group.id,
  File "/usr/local/lib/python2.7/dist-packages/StarCluster-0.94-py2.7.egg/starcluster/cluster.py", line 684, in cluster_group
    static.WORLD_CIDRIP)
  File "/usr/local/lib/python2.7/dist-packages/boto-2.9.9-py2.7.egg/boto/ec2/securitygroup.py", line 222, in revoke
    src_group_group_id)
  File "/usr/local/lib/python2.7/dist-packages/boto-2.9.9-py2.7.egg/boto/ec2/connection.py", line 2634, in revoke_security_group
    params, verb='POST')
  File "/usr/local/lib/python2.7/dist-packages/boto-2.9.9-py2.7.egg/boto/connection.py", line 1115, in get_status
    raise self.ResponseError(response.status, response.reason, body)
EC2ResponseError: EC2ResponseError: 400 Bad Request
<?xml version="1.0" encoding="UTF-8"?>
<Response><Errors><Error><Code>InvalidPermission.NotFound</Code><Message>The specified rule does not exist in this security group.</Message></Error></Errors><RequestID>d4d51a6a-cb7d-471f-b1aa-54a4ceb35ede</RequestID></Response>
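
The response body at the end of the traceback above is plain XML, so the error
code can be pulled out and checked before deciding whether a failure is safe to
ignore. A minimal sketch using only the Python standard library; the function
name parse_ec2_error is hypothetical and is not part of StarCluster or boto:

```python
import xml.etree.ElementTree as ET


def parse_ec2_error(body):
    """Return (code, message) from an EC2 error response body.

    Hypothetical helper for illustration only. Returns (None, None)
    when the body has no <Errors><Error> element.
    """
    root = ET.fromstring(body)          # root is the <Response> element
    err = root.find("Errors/Error")     # first reported error, if any
    if err is None:
        return (None, None)
    code = err.findtext("Code")
    message = err.findtext("Message")
    if message is not None:
        # Collapse line wrapping introduced by mail clients or terminals.
        message = " ".join(message.split())
    return (code, message)
```

A caller could compare the returned code against "InvalidPermission.NotFound"
to tell this case (a rule that is already gone) apart from other EC2 failures.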


Is there a database that can be modified to remove this bad cluster's entry,
or some other way of cleaning things up?

Running starcluster terminate -f cluster_name also fails in various and exciting ways.
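
One way to check what is left over, as a rough sketch only: the listing above
ties the cluster to a security group (sc-hos), so this assumes the cluster list
is derived from EC2 security groups rather than from a local database, which the
output suggests but does not confirm. The helper names, the "@sc-" prefix, and
the region are assumptions; get_all_security_groups and delete_security_group
are existing boto 2 EC2Connection methods.

```python
def find_cluster_groups(conn, cluster_name, prefixes=("@sc-", "sc-")):
    """Return security groups that look like they belong to cluster_name.

    conn is expected to be a boto 2 EC2Connection (anything with a
    get_all_security_groups() method works). The prefixes are assumptions:
    "sc-" is taken from the listing above, "@sc-" is a guess at the
    non-VPC naming.
    """
    wanted = set(prefix + cluster_name for prefix in prefixes)
    return [g for g in conn.get_all_security_groups() if g.name in wanted]


def describe_group(group):
    """One-line summary of a security group: name, id, and rule count."""
    return "%s (%s): %d rule(s)" % (group.name, group.id, len(group.rules))


# Usage against a real account (needs AWS credentials, so left commented out):
#
# import boto.ec2
# conn = boto.ec2.connect_to_region("us-east-1")  # region is an assumption
# for group in find_cluster_groups(conn, "hos"):
#     print(describe_group(group))
#     # Only once nothing is still using the group:
#     # conn.delete_security_group(group_id=group.id)
```

The lookup is kept separate from the delete so the leftover group can be
inspected first; deleting a group that instances still reference will fail.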

thanks

s

- -- 
************************************************************************
 Steve Heistand                          NASA Ames Research Center
 email: steve.heistand at nasa.gov          Steve Heistand/Mail Stop 258-6
 ph: (650) 604-4369                      Bldg. 258, Rm. 232-5
 Scientific & HPC Application            P.O. Box 1
 Development/Optimization                Moffett Field, CA 94035-0001
************************************************************************
 "Any opinions expressed are those of our alien overlords, not my own."

# For Remedy                        #
#Action: Resolve                    #	
#Resolution: Resolved               #
#Reason: No Further Action Required #
#Tier1:	User Code                   #
#Tier2:	Other                       #
#Tier3:	Assistance                  #
#Notification: None                 #
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.14 (GNU/Linux)

iEYEARECAAYFAlI3WoEACgkQoBCTJSAkVrF0gACg4uiExQ5N4hJfFP+RfyvVRbRB
BxsAoIAHX33knuzEkEeat7JeL7pTNPYC
=yNt6
-----END PGP SIGNATURE-----

