Cloud Wars

Microsoft Tackles Top Challenge: Boosting Azure’s Reliability

By Bob Evans, January 28, 2020 (6 min read)

With Azure now the centerpiece of Microsoft’s future, the company is making huge investments in machine learning (ML) and other technologies to boost Azure’s reliability as more global corporations bet their businesses on it.

Several months ago, I wrote about earlier steps Microsoft was taking to address Azure’s reliability. Titled “After 3 Cloud Failures in 12 Months, Microsoft Fortifies Azure,” the piece drew on a blog post by Azure CTO Mark Russinovich, who described a range of approaches Microsoft is taking to boost reliability without disrupting customers’ operations.

In light of those cloud failures during a time when many large global corporations are moving mission-critical workloads to the Azure cloud, I pegged Azure reliability as Microsoft’s #1 challenge for the coming year. 

Cloud Wars: Outlook 2020

The Top 10’s Biggest Challenges

1. Microsoft — Can it sustain a reputation for reliability for the Azure cloud?
2. Amazon — Can it win vs. Oracle Autonomous DB? AND vs. Microsoft Azure?
3. Salesforce — Can Marc Benioff win the battle to redefine CRM?
4. SAP — Can it sell the marketplace on Experience Management / HXM?
5. Oracle — Larry Ellison is talking a big talk—can Oracle back it up?
6. Google — Can it outflank Amazon through software skills and $$$?
7. IBM — Can it catch up with the rest of the Top 10 in growth rate?
8. Workday — Can it hold or expand its lead among Fortune 100?
9. ServiceNow — Can it live up to McDermott’s sky-high ambitions?
10. TBD

Earlier this month, Russinovich posted an update on Microsoft’s ongoing efforts. In this round, there’s no question that Microsoft is banking on ML to deliver new improvements and capabilities. (Russinovich wrote the opening portion of the post, titled “Advancing no-impact and low-impact maintenance technologies,” and a few leaders from his team wrote the rest.)

In a section called “The future of Azure maintenance,” the post outlines Microsoft’s big bet on machine learning: “We are investing heavily in machine learning-based insights and automation to maintain availability and reliability. Eventually, this ‘AI Operations’ model will carry out preventative maintenance, initiate automated mitigations, and identify contributing factors and dependencies during incidents more effectively than our human engineers can.” 

But in the meantime, Microsoft hopes to “minimize customer impact when updating the fleet. Today, the vast majority of updates to the host operating system are deployed in place with absolute transparency and zero customer impact using hot patching. In infrequent cases in which the update cannot be hot patched, we typically utilize low-impact memory preserving update technologies to roll out the update… Thanks to continued investments in this space, we are at a point where the vast majority of host maintenance activities do not impact the VMs hosted on the affected infrastructure.”

Here are some additional highlights from this new post, which Russinovich said is intended to describe “several initiatives underway to keep improving platform availability, as part of our commitment to provide a trusted set of cloud services.”

The post outlines 4 approaches: Plan A, which involves Hot Patching; Plan B, which centers on Memory-Preserving Maintenance; Plan C, which is about Self-Service Maintenance; and Plan D, which is Live Migration. Here’s a look at each:

Plan A: Hot Patching.

This technique allows Microsoft to “make targeted changes to running code without incurring any downtime for customer VMs.” It’s considered a “no-impact update technology” because it directs “all new invocations of a function on the host to an updated version of that function.” Microsoft said it has been using hot patching wherever possible when applying updates, to avoid “any impact to the VMs running on that host.” Microsoft began using hot patching in Azure in 2017 and expects to extend its use from the hypervisor to firmware hot patches sometime down the road. And since “some large host updates contain changes that cannot be applied using function-level hot patching…we endeavor to use memory-preserving maintenance.”
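To make the idea of function-level hot patching concrete, here is a minimal Python sketch. It is an analogy only, not Azure’s implementation: the `HotPatchableHost` class, its methods, and the `handle_io` functions are hypothetical, and real hot patching works on running host code, not on a Python dispatch table. The sketch shows just the property the post describes, namely that new invocations are directed to the updated function while the host keeps running.

```python
from typing import Callable, Dict


class HotPatchableHost:
    """Toy host whose functions are called through a dispatch table (hypothetical)."""

    def __init__(self) -> None:
        self._functions: Dict[str, Callable[..., str]] = {}

    def register(self, name: str, func: Callable[..., str]) -> None:
        self._functions[name] = func

    def hot_patch(self, name: str, new_func: Callable[..., str]) -> None:
        # Swap the table entry in place; nothing is stopped or restarted.
        if name not in self._functions:
            raise KeyError(f"cannot patch unknown function: {name}")
        self._functions[name] = new_func

    def invoke(self, name: str, *args) -> str:
        # Every new invocation looks up whichever version is current.
        return self._functions[name](*args)


def handle_io_v1(vm_id: str) -> str:
    return f"v1 handled I/O for {vm_id}"


def handle_io_v2(vm_id: str) -> str:
    return f"v2 handled I/O for {vm_id}"


host = HotPatchableHost()
host.register("handle_io", handle_io_v1)
before = host.invoke("handle_io", "vm-1")
host.hot_patch("handle_io", handle_io_v2)  # applied while the host keeps running
after = host.invoke("handle_io", "vm-1")
```

The caller never changes how it invokes `handle_io`; only the target behind the name changes, which is why this style of update can be invisible to the guest VMs.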

Plan B: Memory-preserving maintenance.

This approach “involves ‘pausing’ the guest VMs (while preserving their memory in RAM), updating the host server, then resuming the VMs and automatically synchronizing their clocks,” the post says. Since memory-preserving maintenance made its debut in Azure in 2018, three important improvements have been made. First, host reboots are not always required. Second, the length of the “customer pause” has been reduced. And finally, the technology is now available on more types of VMs.
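The pause, update, resume, and clock-sync sequence quoted above can be sketched as a small simulation. This is an illustration under stated assumptions, not Microsoft’s code: the `GuestVM` and `Host` classes, the `memory_preserving_update` function, and the 3-second update duration are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class GuestVM:
    name: str
    memory: Dict[str, bytes] = field(default_factory=dict)
    running: bool = True
    clock: float = 0.0  # the guest's view of time, in seconds


@dataclass
class Host:
    version: str
    vms: List[GuestVM] = field(default_factory=list)
    wall_clock: float = 0.0  # the host's view of time, in seconds


def memory_preserving_update(host: Host, new_version: str, update_seconds: float) -> None:
    # 1. Pause every guest; its memory is left untouched in RAM.
    for vm in host.vms:
        vm.running = False
    # 2. Update the host while guests are paused; only host time advances.
    host.version = new_version
    host.wall_clock += update_seconds
    # 3. Resume the guests and synchronize their clocks with the host.
    for vm in host.vms:
        vm.running = True
        vm.clock = host.wall_clock


vm = GuestVM("vm-1", memory={"page0": b"customer data"}, clock=100.0)
host = Host(version="1.0", vms=[vm], wall_clock=100.0)
memory_preserving_update(host, "1.1", update_seconds=3.0)  # duration is arbitrary
```

After the call, the guest is running again with its memory intact and its clock moved forward to match the host, which is why the clock synchronization step matters: without it, the guest would believe no time had passed during the pause.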

Plan C: Self-service maintenance.

This option gives customers a specific window of time “within which they can choose when to initiate impactful maintenance on their VM(s). This initial self-service phase typically lasts around a month and empowers organizations to perform the maintenance on their own schedules so it has no or minimal disruption to users.” Afterward, Azure begins to perform maintenance automatically. The post also describes “rebootless updates” that result in “pauses” of only a few seconds. This approach is “useful for VMs running ultra-sensitive workloads which can’t sustain any interruption even if it lasts just for a few seconds.”
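The two-phase timeline described here, a customer-controlled window followed by automatic maintenance, can be expressed as a short date check. This is a sketch only: the `maintenance_phase` helper and its phase names are hypothetical rather than values from any Azure API, and the 30-day constant is an assumption based on the post’s “around a month.”

```python
from datetime import date, timedelta

SELF_SERVICE_DAYS = 30  # assumption: the post says "around a month"


def maintenance_phase(today: date, window_start: date, already_done: bool) -> str:
    """Return which maintenance phase a VM is in on a given day (hypothetical helper)."""
    if already_done:
        return "complete"
    if today < window_start:
        return "not-started"
    if today < window_start + timedelta(days=SELF_SERVICE_DAYS):
        return "self-service"  # the customer chooses when to start maintenance
    return "scheduled"  # the platform performs maintenance automatically


start = date(2020, 2, 1)
phase_early = maintenance_phase(date(2020, 2, 10), start, already_done=False)
phase_late = maintenance_phase(date(2020, 3, 15), start, already_done=False)
phase_done = maintenance_phase(date(2020, 3, 15), start, already_done=True)
```

A customer who initiates maintenance during the self-service phase moves straight to `complete` and is skipped when the automatic phase begins.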

Plan D: Live migration.

This entails “moving a running customer VM from one ‘source’ host to another ‘destination’ host… Once most of the local state is moved, the guest VM experiences a short pause usually lasting five seconds or less. After that pause, the VM resumes running on the destination host… Today, when Azure Machine Learning algorithms predict an impending hardware failure, live migration can be used to move guest VMs onto different hosts preemptively.”
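The pattern of moving most of the state while the VM runs and then pausing briefly for the remainder resembles the general “pre-copy” approach to live migration. The sketch below simulates that generic approach; the post does not confirm that Azure works this way. The `live_migrate` function, the page counts, the pause threshold, and the one-in-ten re-dirtying rate are all hypothetical.

```python
from typing import Dict, List, Tuple


def live_migrate(
    source: Dict[int, str], pause_threshold: int = 8
) -> Tuple[Dict[int, str], List[int], int]:
    """Copy pages in rounds while the VM runs, then pause for the small remainder.

    Returns (destination memory, pages copied per live round, pages copied while paused).
    Note: `source` is mutated to simulate the running VM writing to its memory.
    """
    destination: Dict[int, str] = {}
    dirty = sorted(source)  # initially every page must be copied
    rounds: List[int] = []

    while len(dirty) > pause_threshold:
        for page in dirty:
            destination[page] = source[page]
        rounds.append(len(dirty))
        # Assumption: the running VM rewrites 1 in 10 of the pages just copied.
        redirtied = dirty[: len(dirty) // 10]
        for page in redirtied:
            source[page] = source[page] + "'"
        dirty = redirtied

    # Short pause: the VM is stopped, so the last pages cannot change mid-copy.
    for page in dirty:
        destination[page] = source[page]
    return destination, rounds, len(dirty)


memory = {page: f"data-{page}" for page in range(1000)}
dest, live_rounds, paused_pages = live_migrate(memory)
```

In this simulation each live round copies far fewer pages than the last, so only a single page remains to be copied during the pause. That shrinking remainder is what keeps the pause short in a pre-copy design.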

RECOMMENDED READING

Amazon’s Worst Nightmare Comes True: Microsoft Azure #1 among CIOs

Why Microsoft and Not Amazon Is #1 in Cloud: Migrations Are 5X Cheaper

Microsoft’s $50-Billion Moonshot: #1 Cloud Vendor Lays Out New Growth Plans

Satya Nadella Admits: Microsoft Cloud Business Is Bigger than Amazon’s 

Why #1 Microsoft Is Top Cloud Vendor: Cloud Now Drives 35% of Revenue 

#1 Microsoft Puts Amazon and Google on Notice: We’re Just Getting Started 

Microsoft Torches Google and Amazon on Big-Data Benchmarks, Says Microsoft


Bob Evans

Founder
Cloud Wars

Areas of Expertise
  • AI
  • Cloud
  • Digital Business
  • Innovation
  • Leadership

Cloud Wars Founder Bob Evans actively analyzes the Cloud and AI categories through video reports, in-depth analyses, and interviews with the Cloud and AI market’s leaders and innovators. He’s also the creator of the Cloud Wars Top 10, a ranking and ongoing analysis of the world's most influential tech companies driving digital business and the digital economy. Bob is recognized as a world-class strategic communicator focused on emerging business strategy, disruptive innovation, and forward-looking leadership.
