The 10 Biggest Cloud Outages Of 2022 (So Far)
Wade Tyler Millward
Cloud outages so far this year have affected companies including Apple, Microsoft and Google.
June Microsoft Azure And M365 Online Outages
On June 7, customers had trouble connecting to resources hosted in the East US 2 region, located in Virginia, according to Microsoft. The issue lasted for about 12 hours and should not have affected customers with always-available or zone-redundant services.
The Redmond, Wash.-based tech giant blamed the outage on “an unplanned power oscillation in one of our datacenters within one of our Availability Zones in the East US 2 region,” according to a Microsoft report.
It continued: “Components of our redundant power system created unexpected electrical transients, which resulted in the Air Handling Units (AHUs) detecting a potential fault, and therefore shutting themselves down pending a manual reset.”
The outage affected Application Insights, Log Analytics, Managed Identity Service, Media Services and NetApp Files, according to the report.
Microsoft is working to “improve our tooling and processes to flag anomalies more quickly” and is “fine-tuning our alerting to inform onsite datacenter operators more comprehensively,” according to the report.
The company is also “developing a plan for fault injection testing relevant critical environment systems, in partnership with our industry partners, to be even more proactive in identifying and remediating potential risks” and “expanding how many Azure services support Availability Zones, so that customers can opt for automatic replication and/or architect their own resiliency across services.”
On June 21, Microsoft tweeted that it was investigating delays and connection issues with Exchange Online. About two hours later, the company tweeted that it “determined multiple Microsoft 365 services are experiencing delays, connection and search issues” and that it was rerouting traffic in response.
About nine hours later, Microsoft tweeted that “rerouting traffic combined with targeted infrastructure restarts has successfully restored service access and functionality.”