Amazon investigating problem after S3 suffers 8-hour outage

By Tim Conneally | Published July 21, 2008, 5:40 PM

Amazon's Simple Storage Service (S3) was down for more than eight hours over the weekend, affecting many prominent sites, and the company is still investigating the cause of the problem.

Cloud-based services such as those offered by Amazon provide cost-effective solutions for computing and storage. However, the oft-cited drawback of relying on such offerings is that customers are left with little or no control if something goes wrong. The only option is to wait -- and in cases like this, wait more than eight hours.

Amazon's Simple Storage Service (S3), which was introduced in 2006, is part of the Amazon Web Services (AWS) suite, which also includes the Elastic Compute Cloud (EC2) and SimpleDB services.

On July 20, the S3 component of AWS was down for more than eight hours, affecting sites like SmugMug, Twitter, Centernetworks, and many of Amazon's own sites. The Amazon Web Service Health Dashboard shows that the Simple Storage Service and Simple Queue Service experienced a "service disruption."

In a communication with the company, GigaOM's Om Malik received a rather general explanation as to why the service was down: "As a distributed system, the different components of S3 need to be aware of the state of each other. For example, this awareness makes it possible for the system to decide which redundant physical storage server to route a request to."

"We experienced a problem with those internal system communications, leaving the components unable to interact properly, and customers unable to successfully process requests. After exploring several alternatives, the team determined it had to take the service offline to restore proper communication and then bring service online again."

"These are sophisticated systems and it generally takes a while to get to root cause in such a situation -- we will be providing our customers with more information when we've fully investigated the incident," the company added.

Many companies use AWS, so a loss of functionality has the potential to affect a huge number of services. Both Red Hat and Sun use EC2, which has also experienced various outages. Consumer-aimed services like HP's Upline have faced numerous outages as well.

Comments

I noticed the outage as a jungledisk user, but I've got to be honest S3 is the fastest, cheapest, most awesome online backup solution out there right now. 8 hours out of the 2 months I've been using it wasn't a problem. Now for me it's a backup solution, I don't host data I need to access immediately up there (unless of course my drives fail and then I would :).

