From aldavis at wi.mit.edu Tue Mar 22 17:14:03 2005
From: aldavis at wi.mit.edu (Al Davis)
Date: Tue, 22 Mar 2005 17:14:03 -0500
Subject: [CSBi-HPC] Whitehead cluster reboot
Message-ID: <424098AB.9000506@wi.mit.edu>

Most of the Whitehead cluster nodes have been up for several months, and
the kernel and LoadLeveler appear to be confused about several things.
I'd like to schedule a system-wide reboot for Thursday morning at 8 AM to
clear things up. I'll also reconfigure some of the network connections,
which will allow us to take better advantage of the direct links between
the two clusters. This is a necessary step before we can finish merging
the clusters into one functioning system.

The system will be down for approximately 30 minutes, and I can schedule
the reboot for another day if anyone has critical jobs running this week.
Let me know ASAP if this schedule doesn't work for you and I'll push it
out until later in the week.

thanks,
al

--
Al Davis
aldavis at wi.mit.edu | aldavis at mit.edu
Systems Manager
617.324.0519
CSBi & WI/MIT BioImaging Center
NE47 Rm 311 (500 Technology Sq)


From aldavis at wi.mit.edu Wed Mar 23 14:37:19 2005
From: aldavis at wi.mit.edu (Al Davis)
Date: Wed, 23 Mar 2005 14:37:19 -0500
Subject: [CSBi-HPC] access to Whitehead part of the IBM cluster is not available
Message-ID: <4241C56F.5020807@wi.mit.edu>

Whitehead suffered a security breach, and they have turned off all access
from outside their firewall until they get it under control. I'll let you
know when access to the cluster is available again.
al


From aldavis at wi.mit.edu Thu Mar 24 14:13:57 2005
From: aldavis at wi.mit.edu (Al Davis)
Date: Thu, 24 Mar 2005 14:13:57 -0500
Subject: [CSBi-HPC] CSBi cluster access is back up
Message-ID: <42431175.5040804@wi.mit.edu>

It looks like we've fixed the problems on both firewalls, so you should
be able to ssh into the wi cluster again. LoadLeveler is also back up and
able to submit jobs to all the nodes. I'm working on a plan to finish
integrating both clusters into one system, so stay tuned for updates on
that.

Let me know if you have a problem logging into the cluster.

al