foobar on computers, software and the rest of the world


Amazon S3 outage: Almost 7 hours and counting

, posted: 21-Jul-2008 10:44

Today has been a bad day for many web-sites and startups across the Internet. The reason? An Amazon S3 outage, which started some 7 hours ago (around 9 am US-west-coast time), and which is still not fully resolved. According to Amazon's status dashboard, they have finally managed to get the European service back online again, but are still working on the US service.

For those of you who are not familiar with Amazon S3: It's a cloud-computing service from Amazon (yes, THE Amazon that we normally only know as an online book and music store). It allows for cheap, and usually reliable on-demand online storage. It's one of several offerings from Amazon in that space, which also includes Amazon EC2 (on-demand computing) and others.

These cloud-services are popular, because they allow you to build massively scalable architectures without having to spend a dime on hardware. You only pay for the storage or the computing capacity you use in their data centres. For example, in EC2 you can bring up new instances of virtual GNU/Linux servers when you need them during times of high demand, and shut them down when the load has lessened. You pay for uptime of instances (a few cents per hour), stored gigabytes per month (also just a few cents) and transmitted data (few cents per gigabyte).

One of the most high-profile victims of the current S3 outage is Twitter: Images, such as avatars of users, are currently not being served, because they are all stored on S3.

I have been following Amazon's services for a while now and had a chance to experiment with them. I am impressed by how powerful the concept is and how well it normally works. I also believe that Amazon learns from glitches like this and manages to improve their system as a result of it. But nevertheless, it is still relatively young, and so apparently not all issues are sorted out yet.

Needless to say, a 6 or 7 hour outage means a lot of egg on Amazon's face. That's not the kind of publicity they want. Still, though, if you can think of strategies to soften the impact of outages of individual components in your architecture, I would still recommend the Amazon services if you are a startup in search for cheap and scalable computing and storage resources.

Update: Just as I posted this, I notice that the avatars on Twitter have started to appear again. Slowly, though, and only some of them so far. But it appears as if finally they are finishing off with the restauration of the US service as well. At 4 pm US west-coast-time, 11 am NZ time, Amazon advises that they estimate the service to be fully restored within one hour. That would make for a total outage duration of 8 hours. We will see.

Update 2:
At 5 pm US-west-coast-time (12 noon, NZ time), it appears as if S3 is fully operational again. Twitter got its avatars back, and many startups and web-based companies are finally back in business.

One of the comments on Twitter said it perfectly: "S3 sneezes and the cloud catches a cold." I really like that one. :-)


Tag(s):               


Other related posts:
More Apple madness (follow up)
The GPU, your personal desktop super computer
A truly light-weight OS: Written in ASM, with GUI, networking and apps








Comment by Jon Travis, on 21-Jul-2008 12:46

CloudStatus was able to detect this pretty quickly this morning. We put our writeup on the blog:

http://www.hyperic.com/blog/hyperic/2008/07/20/amazon-ec2-s3-sqs-outage-0720/


Comment by Mike, on 22-Jul-2008 05:17

"S3 sneezes and the cloud catches a cold." I'm getting tired of this, why doesn't S3 take the right medicine? Outages are too frequent with S3, I might switch over to a better cloud, maybe Nirvanix, they never get sick.


Author's note by foobar, on 22-Jul-2008 05:58

@Mike: But Nirvanix has 66% higher storage prices (for the 99.9% SLA), 80% higher upload prices and download prices that are 6% to 80% higher (depending on download volume). They do offer higher SLAs, but the upload and storage prices are going up greatly then.

So it appears that for the same level of service you pay significantly less at Amazon. And the fact that Nirvanix offers 99.9% and 99.99% SLAs seems to indicate that they DO go down, at least occasionally.

I just hope that Amazon is learning from whatever happened yesterday, because their prices are attractive ... at least as long as their service is attractive as well.


Add a comment

Please note: comments that are inappropriate or promotional in nature will be deleted. E-mail addresses are not displayed, but you must enter a valid e-mail address to confirm your comments.

Are you a registered Geekzone user? Login to have the fields below automatically filled in for you and to enable links in comments. If you have (or qualify to have) a Geekzone Blog then your comment will be automatically confirmed and placed in the moderation queue for the blog owner's approval.

Your name:

Your e-mail:

Your webpage:

foobar's profile

 
New Zealand


  • Who I am: Software developer and consultant.
  • What I do: System level programming, Linux/Unix. C, C++, Java, Python, and a long time ago even Assembler.
  • What I like: I'm a big fan of free and open source software. I'm Windows-free, running Ubuntu on my laptop. To a somewhat lesser degree, I also follow the SaaS industry.
  • Where I have been: Here and there, all over the place.




Google Search


Recent posts

Attack on net neutrality right...
Munich already saved millions ...
Iceland's public administratio...
More Apple madness (follow up)...
Apple demonstrates: With great...
Smooth sailing with the Karmic...
Censorship in New Zealand: Wid...
Image roll-over effects withou...
How about: Three strikes and Y...
UK government supports open so...


Top 10

How to write a Linux virus in ...
(11-Feb-2009 06:33, 395091 views)
Follow up: How to write a Linu...
(12-Feb-2009 08:10, 55537 views)
A truly light-weight OS: Writt...
(3-Feb-2009 10:39, 43229 views)
The 'Verified by Visa' fiasco ...
(20-Jun-2008 09:59, 20676 views)
EEE PC with XP is cheaper than...
(9-May-2008 06:50, 18784 views)
11 reasons to switch to Linux...
(4-Feb-2009 09:24, 18575 views)
Would you use Google App Engin...
(8-Apr-2008 20:02, 17042 views)
Censorship in New Zealand: Wid...
(16-Jul-2009 12:11, 16316 views)
Django Plugables: Tons of plug...
(11-Apr-2008 03:24, 15703 views)
Slow file copy bug in Vista: A...
(21-Dec-2007 12:18, 14585 views)