SlideShare a Scribd company logo
1 of 16
Applied Math at Microsoft
Azure
What to expect
• I will talk about two interesting use cases of Applied Math in Azure.
• Unfortunately, I can’t go into details of Azure or the numbers but I’m
hoping the gist will be clear.
What is Azure?
• Azure is a cloud service.
• Competitor to AWS
• Basic Architecture
Topic 1: Dirichlet Entropy for anomaly
detection
Contributors:
• Rohit Pandey
• Gil Lapid Shafriri
Background
• At Azure, we keep track of various causes and components associated
with downtimes of customer VMs (categorical histograms).
• We use this data to prioritize fixes for top downtime reasons and
components.
• But what about patterns that manage to stay out of sight?
• There is a tendency to confuse “small” with “ambient”. And
over a large timeframe, “small” becomes “large”.
• Ambient noise should be like a fair dice.
• Truly ambient noise won’t unduly favor any component (Ex:
Rack).
• We need one measure for how “skewed” our histogram is and
trend that over time.
Background (continued)
Approach
• Categorical histograms are like rolls of a dice and the canonical
distribution for the parameters of a dice is the Dirichlet.
• A great metric for determining skewness is Entropy (for a random
variable 𝑋: 𝑓𝑋(𝑥)).
H 𝑥 = 𝐸[log(1/𝑓𝑋(𝑥))]
0
5
10
15
20
25
30
35
1 2 3 4 5 6
0
1
2
3
4
5
6
7
8
1 2 3 4 5 6
Low Entropy High Entropy
Implementation and Results
• Set up a portal that shows
list of categorical histograms
descending by Entropy.
• Caught multiple instances of
rack failures.
• Nodes stuck in reboot loop
due to incorrect
configuration.
• And more..
Topic 2: To reboot or not to reboot
Contributors:
• Rohit Pandey
• Durmus Karatay
• Gil Lapid Shafriri
• Randolph Yao
The Problem
• Machines in Azure can be in various “states”. For example, “Healthy”
and “Unwell”.
• When a machine becomes unwell, we wait a certain amount of time
(𝜏0) to give it a chance to organically recover.
• How do we optimize this 𝜏0 so as to minimize the downtime.
Toy Transition Diagram
Unwell
Healthy
Rebooting
State 1
State 2
Transition Matrices
• Transition probabilities matrix (𝑷 = {𝒑𝒊𝒋})
• Transition times matrix (𝑳 = {tij})
Formulation
𝐸 𝑇 = P 𝑋 ≤ 𝜏 × 𝐸 𝑋 𝑋 ≤ 𝜏
+ P X > 𝜏 × (𝜏 + 𝑌)
=
0
𝜏
𝑥𝑓𝑋 𝑥 𝑑𝑥 + (𝜏 + 𝑌)
𝜏
∞
𝑓𝑋 𝑥 𝑑𝑥
• In our estimate of Y, we consider both the happy and the sad paths.
• We can find the threshold ( 𝜏) that minimizes the expected downtime by
setting
𝜕𝐸[𝑇]
𝜕𝜏
= 0.
𝐻 𝑋 𝜏 =
𝑓 𝑋 𝜏
𝑃(𝑋 > 𝜏)
=
1
𝑌
Unwell
Healthy
Rebooting 𝒀
𝑿: 𝒇 𝑿(𝒙)
𝝉
Choice of X
• Considered 7-8 distributions and settled on Lomax because it can model
extreme values the best.
𝐻 𝑋 𝜏 =
𝑐1
1 + 𝑐2. 𝜏
=
1
𝑌
⇒ 𝜏 =
𝑐1 𝑌 − 1
𝑐2
• To estimate the parameters –
• All samples that we saw from Unwell to Ready
• The instances of Unwell to Rebooting which were all cases where it took more than
𝜏0 for sure.
𝐿𝐿 𝑐1, 𝑐2 𝑥1, 𝑥2, … , 𝑥 𝑛, 𝑚 =
1
𝑛
log(𝑓𝑋 𝑥𝑖; 𝑐1, 𝑐2 ) + 𝑚. log(𝑃(𝑋 > 𝜏0))
Choice of Y
• We think of “Healthy” as the absorbing state, others as transient.
• We denote by 𝑥𝑖 the time taken to get to the absorbing state from
transient state 𝑖.
𝑥𝑖 =
1
𝑛
𝑝𝑖𝑗(𝑡𝑖𝑗 + 𝑥𝑗)
⇒ 𝐼 − 𝑄 . 𝑥 = 𝑃 𝑜 𝐿 . 𝟏
Result
𝒀
𝑬[𝑿]
𝑬[𝑻]
𝝉𝝉𝝉 𝟎
Savings

More Related Content

Similar to Applied Math at Microsoft Azure - Rohit Pandey

Market Basket Analysis in SQL Server Machine Learning Services
Market Basket Analysis in SQL Server Machine Learning ServicesMarket Basket Analysis in SQL Server Machine Learning Services
Market Basket Analysis in SQL Server Machine Learning ServicesLuca Zavarella
 
Visualizing Systems with Statemaps
Visualizing Systems with StatemapsVisualizing Systems with Statemaps
Visualizing Systems with Statemapsbcantrill
 
Foundations: Artificial Neural Networks
Foundations: Artificial Neural NetworksFoundations: Artificial Neural Networks
Foundations: Artificial Neural Networksananth
 
Performance OR Capacity #CMGimPACt2016
Performance OR Capacity #CMGimPACt2016 Performance OR Capacity #CMGimPACt2016
Performance OR Capacity #CMGimPACt2016 Alex Gilgur
 
c++ Data Types and Selection
c++ Data Types and Selectionc++ Data Types and Selection
c++ Data Types and SelectionAhmed Nobi
 
It Probably Works - QCon 2015
It Probably Works - QCon 2015It Probably Works - QCon 2015
It Probably Works - QCon 2015Fastly
 
DA lecture 3.pptx
DA lecture 3.pptxDA lecture 3.pptx
DA lecture 3.pptxSayanSen36
 
Fuzzy logic
Fuzzy logicFuzzy logic
Fuzzy logicAdPatel5
 
Algorithm analysis insertion sort and asymptotic notations
Algorithm analysis insertion sort and  asymptotic notationsAlgorithm analysis insertion sort and  asymptotic notations
Algorithm analysis insertion sort and asymptotic notationsAmit Kumar Rathi
 
Using Apache Pulsar to Provide Real-Time IoT Analytics on the Edge
Using Apache Pulsar to Provide Real-Time IoT Analytics on the EdgeUsing Apache Pulsar to Provide Real-Time IoT Analytics on the Edge
Using Apache Pulsar to Provide Real-Time IoT Analytics on the EdgeDataWorks Summit
 
Abductive commonsense reasoning
Abductive commonsense reasoningAbductive commonsense reasoning
Abductive commonsense reasoningSan Kim
 
On the Configuration-LP of the Restricted Assignment Problem
On the Configuration-LP of the Restricted Assignment ProblemOn the Configuration-LP of the Restricted Assignment Problem
On the Configuration-LP of the Restricted Assignment ProblemArash Pourdamghani
 

Similar to Applied Math at Microsoft Azure - Rohit Pandey (20)

Poster presentation
Poster presentationPoster presentation
Poster presentation
 
Market Basket Analysis in SQL Server Machine Learning Services
Market Basket Analysis in SQL Server Machine Learning ServicesMarket Basket Analysis in SQL Server Machine Learning Services
Market Basket Analysis in SQL Server Machine Learning Services
 
Big o notation
Big o notationBig o notation
Big o notation
 
Big o notation
Big o notationBig o notation
Big o notation
 
Visualizing Systems with Statemaps
Visualizing Systems with StatemapsVisualizing Systems with Statemaps
Visualizing Systems with Statemaps
 
Foundations: Artificial Neural Networks
Foundations: Artificial Neural NetworksFoundations: Artificial Neural Networks
Foundations: Artificial Neural Networks
 
Performance OR Capacity #CMGimPACt2016
Performance OR Capacity #CMGimPACt2016 Performance OR Capacity #CMGimPACt2016
Performance OR Capacity #CMGimPACt2016
 
c++ Data Types and Selection
c++ Data Types and Selectionc++ Data Types and Selection
c++ Data Types and Selection
 
It Probably Works - QCon 2015
It Probably Works - QCon 2015It Probably Works - QCon 2015
It Probably Works - QCon 2015
 
DA lecture 3.pptx
DA lecture 3.pptxDA lecture 3.pptx
DA lecture 3.pptx
 
Fuzzy logic
Fuzzy logicFuzzy logic
Fuzzy logic
 
daa unit 1.pptx
daa unit 1.pptxdaa unit 1.pptx
daa unit 1.pptx
 
5954987.ppt
5954987.ppt5954987.ppt
5954987.ppt
 
Searching Algorithms
Searching AlgorithmsSearching Algorithms
Searching Algorithms
 
Algorithm analysis insertion sort and asymptotic notations
Algorithm analysis insertion sort and  asymptotic notationsAlgorithm analysis insertion sort and  asymptotic notations
Algorithm analysis insertion sort and asymptotic notations
 
Using Apache Pulsar to Provide Real-Time IoT Analytics on the Edge
Using Apache Pulsar to Provide Real-Time IoT Analytics on the EdgeUsing Apache Pulsar to Provide Real-Time IoT Analytics on the Edge
Using Apache Pulsar to Provide Real-Time IoT Analytics on the Edge
 
Abductive commonsense reasoning
Abductive commonsense reasoningAbductive commonsense reasoning
Abductive commonsense reasoning
 
On the Configuration-LP of the Restricted Assignment Problem
On the Configuration-LP of the Restricted Assignment ProblemOn the Configuration-LP of the Restricted Assignment Problem
On the Configuration-LP of the Restricted Assignment Problem
 
Analysis of algorithms
Analysis of algorithmsAnalysis of algorithms
Analysis of algorithms
 
SVD.ppt
SVD.pptSVD.ppt
SVD.ppt
 

More from WithTheBest

Recreating history in virtual reality
Recreating history in virtual realityRecreating history in virtual reality
Recreating history in virtual realityWithTheBest
 
Engaging and sharing your VR experience
Engaging and sharing your VR experienceEngaging and sharing your VR experience
Engaging and sharing your VR experienceWithTheBest
 
How to survive the early days of VR as an Indie Studio
How to survive the early days of VR as an Indie StudioHow to survive the early days of VR as an Indie Studio
How to survive the early days of VR as an Indie StudioWithTheBest
 
Mixed reality 101
Mixed reality 101 Mixed reality 101
Mixed reality 101 WithTheBest
 
Unlocking Human Potential with Immersive Technology
Unlocking Human Potential with Immersive TechnologyUnlocking Human Potential with Immersive Technology
Unlocking Human Potential with Immersive TechnologyWithTheBest
 
Building your own video devices
Building your own video devicesBuilding your own video devices
Building your own video devicesWithTheBest
 
Maximizing performance of 3 d user generated assets in unity
Maximizing performance of 3 d user generated assets in unityMaximizing performance of 3 d user generated assets in unity
Maximizing performance of 3 d user generated assets in unityWithTheBest
 
Haptics & amp; null space vr
Haptics & amp; null space vrHaptics & amp; null space vr
Haptics & amp; null space vrWithTheBest
 
How we use vr to break the laws of physics
How we use vr to break the laws of physicsHow we use vr to break the laws of physics
How we use vr to break the laws of physicsWithTheBest
 
The Virtual Self
The Virtual Self The Virtual Self
The Virtual Self WithTheBest
 
You dont have to be mad to do VR and AR ... but it helps
You dont have to be mad to do VR and AR ... but it helpsYou dont have to be mad to do VR and AR ... but it helps
You dont have to be mad to do VR and AR ... but it helpsWithTheBest
 
Omnivirt overview
Omnivirt overviewOmnivirt overview
Omnivirt overviewWithTheBest
 
VR Interactions - Jason Jerald
VR Interactions - Jason JeraldVR Interactions - Jason Jerald
VR Interactions - Jason JeraldWithTheBest
 
Japheth Funding your startup - dating the devil
Japheth  Funding your startup - dating the devilJapheth  Funding your startup - dating the devil
Japheth Funding your startup - dating the devilWithTheBest
 
Transported vr the virtual reality platform for real estate
Transported vr the virtual reality platform for real estateTransported vr the virtual reality platform for real estate
Transported vr the virtual reality platform for real estateWithTheBest
 
Measuring Behavior in VR - Rob Merki Cognitive VR
Measuring Behavior in VR - Rob Merki Cognitive VRMeasuring Behavior in VR - Rob Merki Cognitive VR
Measuring Behavior in VR - Rob Merki Cognitive VRWithTheBest
 
Global demand for Mixed Realty (VR/AR) content is about to explode.
Global demand for Mixed Realty (VR/AR) content is about to explode. Global demand for Mixed Realty (VR/AR) content is about to explode.
Global demand for Mixed Realty (VR/AR) content is about to explode. WithTheBest
 
VR, a new technology over 40,000 years old
VR, a new technology over 40,000 years oldVR, a new technology over 40,000 years old
VR, a new technology over 40,000 years oldWithTheBest
 
Getting Started with Intelligent IoT Messaging - Ken Herron, Emy Carlan, and ...
Getting Started with Intelligent IoT Messaging - Ken Herron, Emy Carlan, and ...Getting Started with Intelligent IoT Messaging - Ken Herron, Emy Carlan, and ...
Getting Started with Intelligent IoT Messaging - Ken Herron, Emy Carlan, and ...WithTheBest
 

More from WithTheBest (20)

Recreating history in virtual reality
Recreating history in virtual realityRecreating history in virtual reality
Recreating history in virtual reality
 
Engaging and sharing your VR experience
Engaging and sharing your VR experienceEngaging and sharing your VR experience
Engaging and sharing your VR experience
 
How to survive the early days of VR as an Indie Studio
How to survive the early days of VR as an Indie StudioHow to survive the early days of VR as an Indie Studio
How to survive the early days of VR as an Indie Studio
 
Mixed reality 101
Mixed reality 101 Mixed reality 101
Mixed reality 101
 
Unlocking Human Potential with Immersive Technology
Unlocking Human Potential with Immersive TechnologyUnlocking Human Potential with Immersive Technology
Unlocking Human Potential with Immersive Technology
 
Building your own video devices
Building your own video devicesBuilding your own video devices
Building your own video devices
 
Maximizing performance of 3 d user generated assets in unity
Maximizing performance of 3 d user generated assets in unityMaximizing performance of 3 d user generated assets in unity
Maximizing performance of 3 d user generated assets in unity
 
Wizdish rovr
Wizdish rovrWizdish rovr
Wizdish rovr
 
Haptics & amp; null space vr
Haptics & amp; null space vrHaptics & amp; null space vr
Haptics & amp; null space vr
 
How we use vr to break the laws of physics
How we use vr to break the laws of physicsHow we use vr to break the laws of physics
How we use vr to break the laws of physics
 
The Virtual Self
The Virtual Self The Virtual Self
The Virtual Self
 
You dont have to be mad to do VR and AR ... but it helps
You dont have to be mad to do VR and AR ... but it helpsYou dont have to be mad to do VR and AR ... but it helps
You dont have to be mad to do VR and AR ... but it helps
 
Omnivirt overview
Omnivirt overviewOmnivirt overview
Omnivirt overview
 
VR Interactions - Jason Jerald
VR Interactions - Jason JeraldVR Interactions - Jason Jerald
VR Interactions - Jason Jerald
 
Japheth Funding your startup - dating the devil
Japheth  Funding your startup - dating the devilJapheth  Funding your startup - dating the devil
Japheth Funding your startup - dating the devil
 
Transported vr the virtual reality platform for real estate
Transported vr the virtual reality platform for real estateTransported vr the virtual reality platform for real estate
Transported vr the virtual reality platform for real estate
 
Measuring Behavior in VR - Rob Merki Cognitive VR
Measuring Behavior in VR - Rob Merki Cognitive VRMeasuring Behavior in VR - Rob Merki Cognitive VR
Measuring Behavior in VR - Rob Merki Cognitive VR
 
Global demand for Mixed Realty (VR/AR) content is about to explode.
Global demand for Mixed Realty (VR/AR) content is about to explode. Global demand for Mixed Realty (VR/AR) content is about to explode.
Global demand for Mixed Realty (VR/AR) content is about to explode.
 
VR, a new technology over 40,000 years old
VR, a new technology over 40,000 years oldVR, a new technology over 40,000 years old
VR, a new technology over 40,000 years old
 
Getting Started with Intelligent IoT Messaging - Ken Herron, Emy Carlan, and ...
Getting Started with Intelligent IoT Messaging - Ken Herron, Emy Carlan, and ...Getting Started with Intelligent IoT Messaging - Ken Herron, Emy Carlan, and ...
Getting Started with Intelligent IoT Messaging - Ken Herron, Emy Carlan, and ...
 

Recently uploaded

Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 

Recently uploaded (20)

Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 

Applied Math at Microsoft Azure - Rohit Pandey

  • 1. Applied Math at Microsoft Azure
  • 2. What to expect • I will talk about two interesting use cases of Applied Math in Azure. • Unfortunately, I can’t go into details of Azure or the numbers but I’m hoping the gist will be clear.
  • 3. What is Azure? • Azure is a cloud service. • Competitor to AWS • Basic Architecture
  • 4. Topic 1: Dirichlet Entropy for anomaly detection Contributors: • Rohit Pandey • Gil Lapid Shafriri
  • 5. Background • At Azure, we keep track of various causes and components associated with downtimes of customer VMs (categorical histograms). • We use this data to prioritize fixes for top downtime reasons and components. • But what about patterns that manage to stay out of sight?
  • 6. • There is a tendency to confuse “small” with “ambient”. And over a large timeframe, “small” becomes “large”. • Ambient noise should be like a fair dice. • Truly ambient noise won’t unduly favor any component (Ex: Rack). • We need one measure for how “skewed” our histogram is and trend that over time. Background (continued)
  • 7. Approach • Categorical histograms are like rolls of a dice and the canonical distribution for the parameters of a dice is the Dirichlet. • A great metric for determining skewness is Entropy (for a random variable 𝑋: 𝑓𝑋(𝑥)). H 𝑥 = 𝐸[log(1/𝑓𝑋(𝑥))] 0 5 10 15 20 25 30 35 1 2 3 4 5 6 0 1 2 3 4 5 6 7 8 1 2 3 4 5 6 Low Entropy High Entropy
  • 8. Implementation and Results • Set up a portal that shows list of categorical histograms descending by Entropy. • Caught multiple instances of rack failures. • Nodes stuck in reboot loop due to incorrect configuration. • And more..
  • 9. Topic 2: To reboot or not to reboot Contributors: • Rohit Pandey • Durmus Karatay • Gil Lapid Shafriri • Randolph Yao
  • 10. The Problem • Machines in Azure can be in various “states”. For example, “Healthy” and “Unwell”. • When a machine becomes unwell, we wait a certain amount of time (𝜏0) to give it a chance to organically recover. • How do we optimize this 𝜏0 so as to minimize the downtime.
  • 12. Transition Matrices • Transition probabilities matrix (𝑷 = {𝒑𝒊𝒋}) • Transition times matrix (𝑳 = {tij})
  • 13. Formulation 𝐸 𝑇 = P 𝑋 ≤ 𝜏 × 𝐸 𝑋 𝑋 ≤ 𝜏 + P X > 𝜏 × (𝜏 + 𝑌) = 0 𝜏 𝑥𝑓𝑋 𝑥 𝑑𝑥 + (𝜏 + 𝑌) 𝜏 ∞ 𝑓𝑋 𝑥 𝑑𝑥 • In our estimate of Y, we consider both the happy and the sad paths. • We can find the threshold ( 𝜏) that minimizes the expected downtime by setting 𝜕𝐸[𝑇] 𝜕𝜏 = 0. 𝐻 𝑋 𝜏 = 𝑓 𝑋 𝜏 𝑃(𝑋 > 𝜏) = 1 𝑌 Unwell Healthy Rebooting 𝒀 𝑿: 𝒇 𝑿(𝒙) 𝝉
  • 14. Choice of X • Considered 7-8 distributions and settled on Lomax because it can model extreme values the best. 𝐻 𝑋 𝜏 = 𝑐1 1 + 𝑐2. 𝜏 = 1 𝑌 ⇒ 𝜏 = 𝑐1 𝑌 − 1 𝑐2 • To estimate the parameters – • All samples that we saw from Unwell to Ready • The instances of Unwell to Rebooting which were all cases where it took more than 𝜏0 for sure. 𝐿𝐿 𝑐1, 𝑐2 𝑥1, 𝑥2, … , 𝑥 𝑛, 𝑚 = 1 𝑛 log(𝑓𝑋 𝑥𝑖; 𝑐1, 𝑐2 ) + 𝑚. log(𝑃(𝑋 > 𝜏0))
  • 15. Choice of Y • We think of “Healthy” as the absorbing state, others as transient. • We denote by 𝑥𝑖 the time taken to get to the absorbing state from transient state 𝑖. 𝑥𝑖 = 1 𝑛 𝑝𝑖𝑗(𝑡𝑖𝑗 + 𝑥𝑗) ⇒ 𝐼 − 𝑄 . 𝑥 = 𝑃 𝑜 𝐿 . 𝟏