Email this article   Print article 


Microsoft Cloud Outage Blamed On Faulty Update

By Ken Presti
March 14, 2013    1:45 PM ET

Microsoft is apologizing for this week's service interruptions involving Hotmail, Outlook.com and SkyDrive. The outage began at 4:35 p.m. Eastern time on Tuesday, but service was not fully restored until approximately 8:45 a.m. Eastern time on Wednesday morning.

According to a blog post by a Microsoft vice president Arthur de Haan, an attempted firmware update on the afternoon of the 12th was the source of the problem.

"This is an update that had been done successfully previously, but failed in this specific instance in an unexpected way," he wrote. "This failure resulted in a rapid and substantial temperature spike in the datacenter. This spike was significant enough … that it caused our safeguards to come in to place for a large number of servers in this part of the datacenter.

[Related: The 100 Coolest Cloud Computing Vendors of 2013]

"These safeguards prevented access to mailboxes housed on these servers and also prevented any other pieces of our infrastructure to automatically failover and allow continued access," he continued. "This area of the datacenter houses parts of the Hotmail.com, Outlook.com, and SkyDrive infrastructure, and so some people trying to access those services were impacted."

The executive went on to explain that the team was immediately alerted of the outage, and begin working to restore service. "Based on the failure scenario, there was a mix of infrastructure software and human intervention that was needed to bring the core infrastructure back online," he said. "Requiring this kind of human intervention is not the norm for our services and added significant time to the restoration."

Service was restored on an incremental basis from Tuesday night through Wednesday morning, with the majority of mailboxes restored before midnight.

This was the second major outage suffered by Microsoft in recent weeks. On February 23 a glitch involving an expired SSL certificate impacted more than 50 services delivered through Microsoft Azure. Impacted customers have been promised a refund in the Azure incident.

PUBLISHED MARCH 14, 2013

To continue reading this article, please download the free CRN Tech News app for your iPad or Windows 8 device.
Related: Videos | Slide Shows | Comments

SHARE THIS ARTICLE

More Cloud

Recent Articles

10 Intriguing Product Updates From Google I/O 2013

CRN takes a look at some of the key ways Google intends to influence the way we do business and enjoy our free time. A number of product rollouts and updates were made at I/O 2013. Here are the most intriguing.

8 Tips For Successful Cloud Migrations

Successful cloud migrations don't merely focus on changes in technology; they are also focused on the comfort levels of both people who are familiar with the new technology as well as those who might be slightly apprehensive about the forthcoming changes.

9 Key Concerns That Block Cloud Sales

The benefits of the cloud are heavily touted by cloud providers and the various types of channel partners with which they work. But a number of stumbling blocks still remain. Channel partners outlined for CRN some of the objectives they hear most often.

  More Slide Shows




Related Videos
Loading...