Current Articles | RSS Feed RSS Feed

An explosion ate our servers - how cool is that?

Posted by Andy Singleton on Mon, Jun 02, 2008 @ 12:18 PM
Digg digg it | Reddit reddit | del.icio.us del.icio.us | StumbleUpon StumbleUpon 

Our site started misbehaving on Saturday when we got this message from our datacenter: "...electrical gear shorted, creating an explosion and fire that knocked down three walls surrounding our electrical equipment room ... This is a significant outage, impacting approximately 9,000 servers"

 

An explosion. How cool is that? It's way cooler than having a guy accidentally dig up your wiring with a backhoe, or run it over with a truck. It's an outright privilege compared with the squirrel that ate the fiber cable. Imagine spending nights and weekends rallying your troops around a disaster recovery plan, all to do battle with a squirrel. That would be humiliating. My wall of flame incinerates your squirrel.

 

The flame walled us off from three servers out of the approximately 8 that we need to run Assembla.com at full power. In theory, the rest of the system, including the www.assembla.com site itself, should have continued to work properly. In practice, we found that processes on the remaining servers would hang while waiting for responses from the missing servers, and users would get “Unavailable” errors on www.assembla.com or find themselves unable to access trac.

 

It's better now. The missing servers are back online or replaced. We'll be moving to improved virtual server topologies that will hold up under explosive attack.

Tags: 

COMMENTS

If only the explosion was recorded, good thing no one was hurt... also good news nothing was lost and everything seems to be working again!
Cheers guys!

posted @ Monday, June 02, 2008 1:36 PM by Dan2k3k4


Wow. Glad to see it's all back up, though.

posted @ Monday, June 02, 2008 4:19 PM by Taylor Sullivan


Uh, I taught It was the end of the world. It's great to be on-line again.
An explosion, heh, how often is that :).

posted @ Monday, June 02, 2008 5:17 PM by nemke


Unbelievable. <---!!!
Tis a fair tale, all the same.

posted @ Monday, June 02, 2008 10:34 PM by Lt. Squeeky Nuts


Insane! Sounds like truth *is* stranger than fiction.
As a slight side note, great job on getting the site back online so fast. I didn't even notice!

posted @ Monday, June 02, 2008 11:04 PM by Peter


Congrats on the spectacular outage, Andy. :)

posted @ Tuesday, June 03, 2008 3:04 PM by Seth Bienek


I am having problems using the assembla chat. After I click the link "Click here to chat with a Web client", I will get an error saying "Authorization failed."
I also tried using other jabber clients and could not log in either.

posted @ Friday, June 06, 2008 1:39 AM by cannot login to chat


Didn't even feel a thing. Probably cause I've been in the dark Southern tip of Africa for five hundred years!
Hello guys!! I is Webfarmer. So weird. A friend of mine is pointing me through all of this stuff - assember, ruby on rails... the works... I was literally trying to invent all this cactus in PHP - and it's all been done already... It's like I've been starving on an island for years and coming to land! I actually want to cry. mind if I ask questions on things that may sound completely stupid at the time - but I really have been in the dark...?

posted @ Friday, June 06, 2008 9:53 PM by Paul


I noticed it went down and thought "oh no they haven't gone bust have they!"
Great to see that wasn't the case and reassuring to see that none of our data was lost...even after an explosion!!! :-)

posted @ Saturday, June 07, 2008 11:54 PM by Mark


Trying to browse assembla.com it responds with a 503 Service Unavailable. Is it offline or is it me?
Thanks!

posted @ Monday, June 09, 2008 3:36 PM by ASanchez


Yeah, looks like it is down again, hopefully not another fire.

posted @ Monday, June 09, 2008 3:50 PM by deehoc


Thanks deehoc!
And damn!!! I needed to review a few tickets before a meeting.

posted @ Monday, June 09, 2008 3:51 PM by ASanchez


It's down now for a long time...hope every thing will end soon

posted @ Monday, June 09, 2008 4:26 PM by hsalem


cheers, i quess you did great recovery.

posted @ Monday, June 09, 2008 5:14 PM by Andy


Please let us know when there is a geographically redundant failover svn server. We really need our svn service to be available at all times.

posted @ Wednesday, June 11, 2008 1:20 PM by Samuel Kennedy


Is assembla down again today?

posted @ Saturday, June 21, 2008 6:35 PM by down again


Post Comment
Name
 *
Email
 *
Website (optional)
Comment
 *

Allowed tags: <a> link, <b> bold, <i> italics

Receive email when someone replies.

Blog Navigator

Navigate By : 
[Article Index]

Subscribe by Email

Your email:

About This Blog

Accelerating Software Development with Agile, open-source style processes, distributed teams, on-demand teams, new product launches, Web 2.0 strategies, startups.  Author Andy Singleton builds new products fast.

About Us

Assembla offers services for building software with agile, distributed teams.