[Linerva-announce] Linerva reboot today

Geoffrey Thomas geofft at MIT.EDU
Fri Jul 29 09:45:05 EDT 2011


Hi,

Linerva crashed this morning at around 9 AM and has been restarted. The 
crash is the same as we saw on Sunday; there are also indications that 
this may have been caused of an Athena-wide network blip. We've restarted 
Linerva and will look into exactly what happened here.

Apologies for the inconvenience; as always feel free to contact us at 
linerva at mit.edu if you have any questions.

-- 
Geoffrey Thomas
SIPB Linerva team
linerva at mit.edu

On Sun, 24 Jul 2011, Geoffrey Thomas wrote:

> Hi,
>
> From approximately 1 PM to 5 PM today, Linerva was offline due to an AFS 
> crash. This outage was much longer than usual because we upgraded to a newer 
> prerelease (pre7) of OpenAFS 1.6, which should bring several stability 
> improvements.
>
> We were planning to schedule downtime for Linerva later this week to upgrade 
> to OpenAFS 1.6.0pre7. In addition to having wanted to do so for some time, 
> this was made more urgent by increasing AFS instability problems over the 
> past week. (This reboot should resolve any "Connection timed out" errors you 
> may have seen in normal operation on Linerva; if you still see those, please 
> let us know.) Unfortunately, the AFS instability issues caused a kernel crash 
> at 1 PM today which we could not recover from, and we decided for the sake of 
> future stability to continue with the plan of building and upgrading OpenAFS 
> 1.6.0pre7 for Linerva, instead of rebooting quickly and scheduling further 
> downtime for later in the week.
>
> Linerva is back online now and should be stable. We apologize for the 
> inconvenience and the extended outage; if you have any questions, feel free 
> to contact us at linerva at mit.edu.
>
> -- 
> Geoffrey Thomas
> SIPB Linerva team
> linerva at mit.edu
>



More information about the Linerva-announce mailing list