SlideShare a Scribd company logo
1 of 31
Download to read offline
Brainstorming Failure
Bio - Jeff Smith
• Manager, Site Reliability Engineering
at Grubhub
• Yes, we are also hiring.
• Yes, there is free food. Yes, it's totally
awesome to work here.
Email: jsmith@grubhub.com
Twitter: @DarkAndNerdy
Blog: http://www.allthingsdork.com
Systems Metaphor
First Class Luxuries
Top of the Line Internals
The Cockpit
WAT?
The Cockpit
The cockpit you're expecting
FEEDBACK!
What Do You
Measure?
FMEAFailure Mode Effects Analysis is a step-by-step approach for
identifying the possible ways a process, product or service
might fail. The process is commonly leveraged in quality
organizations across a wide range of industries.
FMEA in Software Engineering
We can use FMEA in a number of ways in software to help us
brainstorm, rank and prioritize different actionable bits about
the system. The process will help us
• Identify key metrics that need tracking
• Identify monitoring and or alerts that need to be created
• Identify necessary feedback loops
The Case for Cross-Functional
Teams
The Process
1. Examine the process
2. Brainstorm potential failures
3. List potential effects of failure
4. Identify Your Scale
5. Assign Severity ranking
6. Assign Occurrence ranking
7. Assign Detection ranking
Examine the Process
Brainstorm Potential Failures
• Brainstorming should be fluid. Everything goes
• Cross-Functional teams should be involved. (Business,
development, operations, design)
List Potential Effects of Failure
Think through the impact of failure. The impact might be
something process related, reputation related or technical, just
to name a few. Examples:
• Degraded customer experience
• Order not fulfilled
• Delay in payment to accounts receivable
Agree on Risk Level Scales
Technology Industry
• Low severity could be degraded performance
• High severity could be complete site outage
Airline Industry
• Low severity could be departure delay
• High severity could be customer death
Assign Severity Ranking
Rank the severity on a scale between 1-10.
• 1 being the severity is inconsequential
• 10 being a catastrophic failure
In some organizations, 9 and 10 are reserved for personal
injury and death.
If a failure mode has more than one effect, select only the most
severe of the effects
Assign Occurrence Ranking
Rank the likelihood that this condition will occur.
• 1 being extremely unlikely
• 10 being inevitable.
Assign Detection Ranking
Rank the likelihood that this condition would be detected if it
occurred. A scenario is only considered "detected" if it is found
before it would impact a customer or user.
• 1 means the control would absolutely be detected
• 10 means the control is certain to not detect the failure.
Calculate the Risk Priority
Number
The Risk Priority Number is a value that is calculated to rank a
particular failure mode. The higher the RPN the sooner the
failure mode should be addressed
RPN = S * O * D
Develop an Action Plan
Evaluate the list and develop an action plan to eliminate or
mitigate the items with the highest RPN value first.
• Prioritize solutions that are self-healing and exist within the
system under consideration.
• Develop metrics that help to track the health surrounding a
failure item
• The goal is to reduce the RPN by lowering Severity,
Occurrence or Detection scores
Ensuring You Have a Feedback
Loop
The feedback loop is a constant evaluation of these
measurements and indicators. The feedback loop should give
a strong indicator that the system is working as expected, while
at the same time exposing trends in the environment.
Leading and Lagging Indicators
Leading Indicator - A measurable factor that changes before
the system enters a particular state of failure. (Metrics)
Lagging Indicator - A measurable factor that changes after
the system enters a particular state of failure. (Logs/Reporting)
Recap
• Examine your process, and assemble a cross-functional
team with different views of the system
• Brainstorm all your potential failure modes
• Calculate your RPN
• Develop action plans to reduce risk. Ensure the system is
providing feedback loops to be able to identify the current
state of the system
• Profit
Resources
• Quality One FMEA Writeup
• Purdue University FMEA Presentation
• iSixSigma
• Google Docs FMEA Template
• Brainstorming Tools Mind Node, FreeMind, XMind
Thanks!
Brainstorming failure

More Related Content

What's hot

Project Management Uncertainty, Presented by upul chanaka from Sri Lanka
Project Management Uncertainty, Presented by upul chanaka from Sri Lanka Project Management Uncertainty, Presented by upul chanaka from Sri Lanka
Project Management Uncertainty, Presented by upul chanaka from Sri Lanka Upul Chanaka
 
FMEA training for Healthcare - Sample
FMEA training for Healthcare - SampleFMEA training for Healthcare - Sample
FMEA training for Healthcare - SampleMark H. Davis
 
Failure Mode & Effect Analysis
Failure Mode & Effect AnalysisFailure Mode & Effect Analysis
Failure Mode & Effect AnalysisECC International
 
Patch and Vulnerability Management
Patch and Vulnerability ManagementPatch and Vulnerability Management
Patch and Vulnerability ManagementMarcelo Martins
 
Failure Mode and Effects Analysis WithAdrian™ FMEA 2013 Adrian Beale
Failure Mode and Effects Analysis WithAdrian™ FMEA 2013 Adrian BealeFailure Mode and Effects Analysis WithAdrian™ FMEA 2013 Adrian Beale
Failure Mode and Effects Analysis WithAdrian™ FMEA 2013 Adrian BealeAdrian Beale
 
FMEA and FMECA
FMEA and FMECAFMEA and FMECA
FMEA and FMECASjur
 
Risk Analysis PowerPoint Presentation Slides
Risk Analysis PowerPoint Presentation Slides Risk Analysis PowerPoint Presentation Slides
Risk Analysis PowerPoint Presentation Slides SlideTeam
 
Risk management using FMEA in pharma
Risk management using FMEA in pharmaRisk management using FMEA in pharma
Risk management using FMEA in pharmaSandeep Naik
 
Are my RCM Analysts effective?
Are my RCM Analysts effective?Are my RCM Analysts effective?
Are my RCM Analysts effective?Daryl Mather
 
How to implement an effective fmea process
How to implement an effective fmea processHow to implement an effective fmea process
How to implement an effective fmea processASQ Reliability Division
 
Failure Mode Effects & Criticality Analysis Workshop by Tetrahedron
Failure Mode Effects & Criticality Analysis Workshop by TetrahedronFailure Mode Effects & Criticality Analysis Workshop by Tetrahedron
Failure Mode Effects & Criticality Analysis Workshop by TetrahedronSagar Sangam Sahu
 
Good Risk Software and Why a Spreadsheet will not DO IT!
Good Risk Software and Why a Spreadsheet will not DO IT!Good Risk Software and Why a Spreadsheet will not DO IT!
Good Risk Software and Why a Spreadsheet will not DO IT!Andrew Ferguson
 
Failure mode
Failure modeFailure mode
Failure modeGia Lara
 
Automating the application lifecycle.pptx
Automating the application lifecycle.pptxAutomating the application lifecycle.pptx
Automating the application lifecycle.pptxDevan Stormont
 
Effective Vulnerability Management
Effective Vulnerability ManagementEffective Vulnerability Management
Effective Vulnerability ManagementVicky Ames
 
Dependable Systems - System Dependability Evaluation (8/16)
Dependable Systems - System Dependability Evaluation (8/16)Dependable Systems - System Dependability Evaluation (8/16)
Dependable Systems - System Dependability Evaluation (8/16)Peter Tröger
 
Building a Control Plan
Building a Control PlanBuilding a Control Plan
Building a Control PlanMatt Hansen
 

What's hot (20)

Project Management Uncertainty, Presented by upul chanaka from Sri Lanka
Project Management Uncertainty, Presented by upul chanaka from Sri Lanka Project Management Uncertainty, Presented by upul chanaka from Sri Lanka
Project Management Uncertainty, Presented by upul chanaka from Sri Lanka
 
Advanced Pfmea
Advanced PfmeaAdvanced Pfmea
Advanced Pfmea
 
FMEA training for Healthcare - Sample
FMEA training for Healthcare - SampleFMEA training for Healthcare - Sample
FMEA training for Healthcare - Sample
 
Failure Mode & Effect Analysis
Failure Mode & Effect AnalysisFailure Mode & Effect Analysis
Failure Mode & Effect Analysis
 
Patch and Vulnerability Management
Patch and Vulnerability ManagementPatch and Vulnerability Management
Patch and Vulnerability Management
 
Failure Mode and Effects Analysis WithAdrian™ FMEA 2013 Adrian Beale
Failure Mode and Effects Analysis WithAdrian™ FMEA 2013 Adrian BealeFailure Mode and Effects Analysis WithAdrian™ FMEA 2013 Adrian Beale
Failure Mode and Effects Analysis WithAdrian™ FMEA 2013 Adrian Beale
 
FMEA and FMECA
FMEA and FMECAFMEA and FMECA
FMEA and FMECA
 
Fracas - Failure Scene Investigation
Fracas - Failure Scene InvestigationFracas - Failure Scene Investigation
Fracas - Failure Scene Investigation
 
Risk Analysis PowerPoint Presentation Slides
Risk Analysis PowerPoint Presentation Slides Risk Analysis PowerPoint Presentation Slides
Risk Analysis PowerPoint Presentation Slides
 
Risk management using FMEA in pharma
Risk management using FMEA in pharmaRisk management using FMEA in pharma
Risk management using FMEA in pharma
 
Are my RCM Analysts effective?
Are my RCM Analysts effective?Are my RCM Analysts effective?
Are my RCM Analysts effective?
 
How to implement an effective fmea process
How to implement an effective fmea processHow to implement an effective fmea process
How to implement an effective fmea process
 
Failure Mode Effects & Criticality Analysis Workshop by Tetrahedron
Failure Mode Effects & Criticality Analysis Workshop by TetrahedronFailure Mode Effects & Criticality Analysis Workshop by Tetrahedron
Failure Mode Effects & Criticality Analysis Workshop by Tetrahedron
 
Good Risk Software and Why a Spreadsheet will not DO IT!
Good Risk Software and Why a Spreadsheet will not DO IT!Good Risk Software and Why a Spreadsheet will not DO IT!
Good Risk Software and Why a Spreadsheet will not DO IT!
 
Failure mode
Failure modeFailure mode
Failure mode
 
Automating the application lifecycle.pptx
Automating the application lifecycle.pptxAutomating the application lifecycle.pptx
Automating the application lifecycle.pptx
 
Effective Vulnerability Management
Effective Vulnerability ManagementEffective Vulnerability Management
Effective Vulnerability Management
 
Failure Reporting Process Map
Failure Reporting Process MapFailure Reporting Process Map
Failure Reporting Process Map
 
Dependable Systems - System Dependability Evaluation (8/16)
Dependable Systems - System Dependability Evaluation (8/16)Dependable Systems - System Dependability Evaluation (8/16)
Dependable Systems - System Dependability Evaluation (8/16)
 
Building a Control Plan
Building a Control PlanBuilding a Control Plan
Building a Control Plan
 

Similar to Brainstorming failure

11. faliure mode & effect analysis
11. faliure mode & effect analysis11. faliure mode & effect analysis
11. faliure mode & effect analysisHakeem-Ur- Rehman
 
FAILURE MODE EFFECT ANALYSIS
FAILURE MODE EFFECT ANALYSISFAILURE MODE EFFECT ANALYSIS
FAILURE MODE EFFECT ANALYSISANOOPA NARAYANAN
 
Failure mode and effects analysis
Failure mode and effects analysisFailure mode and effects analysis
Failure mode and effects analysisDeep parmar
 
IT 381_Chap_7.ppt
IT 381_Chap_7.pptIT 381_Chap_7.ppt
IT 381_Chap_7.pptRajendran C
 
Failure mode effects analysis, Computer Integrated Manufacturing, Quality fun...
Failure mode effects analysis, Computer Integrated Manufacturing, Quality fun...Failure mode effects analysis, Computer Integrated Manufacturing, Quality fun...
Failure mode effects analysis, Computer Integrated Manufacturing, Quality fun...SUNDHARAVADIVELR1
 
FMEA presentation for research and development
FMEA presentation for research and developmentFMEA presentation for research and development
FMEA presentation for research and developmentbasant11731
 
FMEA análise e odos de falhas e seus efeitos.ppt
FMEA análise e odos de falhas e seus efeitos.pptFMEA análise e odos de falhas e seus efeitos.ppt
FMEA análise e odos de falhas e seus efeitos.pptGleibsonHenrique
 
IT 381_Chap hubybybybybybybytggyguh7h7_7.ppt
IT 381_Chap hubybybybybybybytggyguh7h7_7.pptIT 381_Chap hubybybybybybybytggyguh7h7_7.ppt
IT 381_Chap hubybybybybybybytggyguh7h7_7.pptksujith0034
 
fmea-130116034507-phpapp01.pdf
fmea-130116034507-phpapp01.pdffmea-130116034507-phpapp01.pdf
fmea-130116034507-phpapp01.pdfRajendran C
 
Failure Mode & Effect Analysis
Failure Mode & Effect AnalysisFailure Mode & Effect Analysis
Failure Mode & Effect AnalysisNafis Ahmad
 
Six Sigma Green Belt Training Part 7
Six Sigma Green Belt Training Part 7Six Sigma Green Belt Training Part 7
Six Sigma Green Belt Training Part 7Skillogic Solutions
 
An analytical approach to effective risk based test planning
An analytical approach to effective risk based test planning An analytical approach to effective risk based test planning
An analytical approach to effective risk based test planning Joe Kevens
 
The on-call survival guide - how to be confident on-call
The on-call survival guide - how to be confident on-call The on-call survival guide - how to be confident on-call
The on-call survival guide - how to be confident on-call Raygun
 
Proven Methods to Abnormality Management and Error Proofing
Proven Methods to Abnormality Management and Error ProofingProven Methods to Abnormality Management and Error Proofing
Proven Methods to Abnormality Management and Error ProofingSafetyChain Software
 

Similar to Brainstorming failure (20)

FMEA
FMEAFMEA
FMEA
 
11. faliure mode & effect analysis
11. faliure mode & effect analysis11. faliure mode & effect analysis
11. faliure mode & effect analysis
 
FAILURE MODE EFFECT ANALYSIS
FAILURE MODE EFFECT ANALYSISFAILURE MODE EFFECT ANALYSIS
FAILURE MODE EFFECT ANALYSIS
 
Failure mode and effects analysis
Failure mode and effects analysisFailure mode and effects analysis
Failure mode and effects analysis
 
Dfmea
DfmeaDfmea
Dfmea
 
FMEA
FMEAFMEA
FMEA
 
IT 381_Chap_7.ppt
IT 381_Chap_7.pptIT 381_Chap_7.ppt
IT 381_Chap_7.ppt
 
Failure mode effects analysis, Computer Integrated Manufacturing, Quality fun...
Failure mode effects analysis, Computer Integrated Manufacturing, Quality fun...Failure mode effects analysis, Computer Integrated Manufacturing, Quality fun...
Failure mode effects analysis, Computer Integrated Manufacturing, Quality fun...
 
FMEA presentation for research and development
FMEA presentation for research and developmentFMEA presentation for research and development
FMEA presentation for research and development
 
FMEA análise e odos de falhas e seus efeitos.ppt
FMEA análise e odos de falhas e seus efeitos.pptFMEA análise e odos de falhas e seus efeitos.ppt
FMEA análise e odos de falhas e seus efeitos.ppt
 
IT 381_Chap hubybybybybybybytggyguh7h7_7.ppt
IT 381_Chap hubybybybybybybytggyguh7h7_7.pptIT 381_Chap hubybybybybybybytggyguh7h7_7.ppt
IT 381_Chap hubybybybybybybytggyguh7h7_7.ppt
 
FMEA.pptx
FMEA.pptxFMEA.pptx
FMEA.pptx
 
fmea-130116034507-phpapp01.pdf
fmea-130116034507-phpapp01.pdffmea-130116034507-phpapp01.pdf
fmea-130116034507-phpapp01.pdf
 
Failure Mode & Effect Analysis
Failure Mode & Effect AnalysisFailure Mode & Effect Analysis
Failure Mode & Effect Analysis
 
Six Sigma Green Belt Training Part 7
Six Sigma Green Belt Training Part 7Six Sigma Green Belt Training Part 7
Six Sigma Green Belt Training Part 7
 
FMEA Presentation V1.1
FMEA Presentation V1.1FMEA Presentation V1.1
FMEA Presentation V1.1
 
An analytical approach to effective risk based test planning
An analytical approach to effective risk based test planning An analytical approach to effective risk based test planning
An analytical approach to effective risk based test planning
 
The on-call survival guide - how to be confident on-call
The on-call survival guide - how to be confident on-call The on-call survival guide - how to be confident on-call
The on-call survival guide - how to be confident on-call
 
Cyber Security # Lec 3
Cyber Security # Lec 3 Cyber Security # Lec 3
Cyber Security # Lec 3
 
Proven Methods to Abnormality Management and Error Proofing
Proven Methods to Abnormality Management and Error ProofingProven Methods to Abnormality Management and Error Proofing
Proven Methods to Abnormality Management and Error Proofing
 

More from Jeffery Smith

Cutting Costs in COVID-19
Cutting Costs in COVID-19Cutting Costs in COVID-19
Cutting Costs in COVID-19Jeffery Smith
 
Moving from ops to dev ops
Moving from ops to dev opsMoving from ops to dev ops
Moving from ops to dev opsJeffery Smith
 
Making On-Call More Humane - Ignite Version
Making On-Call More Humane - Ignite VersionMaking On-Call More Humane - Ignite Version
Making On-Call More Humane - Ignite VersionJeffery Smith
 
Elevate Your Career as an Ops Engineer
Elevate Your Career as an Ops EngineerElevate Your Career as an Ops Engineer
Elevate Your Career as an Ops EngineerJeffery Smith
 
Dungeons and dragons and dev ops
Dungeons and dragons and dev opsDungeons and dragons and dev ops
Dungeons and dragons and dev opsJeffery Smith
 
DevOps: What's Buried in the Fine Print
DevOps: What's Buried in the Fine PrintDevOps: What's Buried in the Fine Print
DevOps: What's Buried in the Fine PrintJeffery Smith
 
It Sounded Good on Paper - Lessons Learned with Puppet
It Sounded Good on Paper - Lessons Learned with PuppetIt Sounded Good on Paper - Lessons Learned with Puppet
It Sounded Good on Paper - Lessons Learned with PuppetJeffery Smith
 
Puppet Development Workflow
Puppet Development WorkflowPuppet Development Workflow
Puppet Development WorkflowJeffery Smith
 

More from Jeffery Smith (9)

Cutting Costs in COVID-19
Cutting Costs in COVID-19Cutting Costs in COVID-19
Cutting Costs in COVID-19
 
Moving from ops to dev ops
Moving from ops to dev opsMoving from ops to dev ops
Moving from ops to dev ops
 
Making On-Call More Humane - Ignite Version
Making On-Call More Humane - Ignite VersionMaking On-Call More Humane - Ignite Version
Making On-Call More Humane - Ignite Version
 
Elevate Your Career as an Ops Engineer
Elevate Your Career as an Ops EngineerElevate Your Career as an Ops Engineer
Elevate Your Career as an Ops Engineer
 
Dungeons and dragons and dev ops
Dungeons and dragons and dev opsDungeons and dragons and dev ops
Dungeons and dragons and dev ops
 
DevOps: What's Buried in the Fine Print
DevOps: What's Buried in the Fine PrintDevOps: What's Buried in the Fine Print
DevOps: What's Buried in the Fine Print
 
Starting with c
Starting with cStarting with c
Starting with c
 
It Sounded Good on Paper - Lessons Learned with Puppet
It Sounded Good on Paper - Lessons Learned with PuppetIt Sounded Good on Paper - Lessons Learned with Puppet
It Sounded Good on Paper - Lessons Learned with Puppet
 
Puppet Development Workflow
Puppet Development WorkflowPuppet Development Workflow
Puppet Development Workflow
 

Recently uploaded

Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsAndrey Dotsenko
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 

Recently uploaded (20)

DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 

Brainstorming failure

  • 2. Bio - Jeff Smith • Manager, Site Reliability Engineering at Grubhub • Yes, we are also hiring. • Yes, there is free food. Yes, it's totally awesome to work here. Email: jsmith@grubhub.com Twitter: @DarkAndNerdy Blog: http://www.allthingsdork.com
  • 5. Top of the Line Internals
  • 9. The cockpit you're expecting
  • 12. FMEAFailure Mode Effects Analysis is a step-by-step approach for identifying the possible ways a process, product or service might fail. The process is commonly leveraged in quality organizations across a wide range of industries.
  • 13. FMEA in Software Engineering We can use FMEA in a number of ways in software to help us brainstorm, rank and prioritize different actionable bits about the system. The process will help us • Identify key metrics that need tracking • Identify monitoring and or alerts that need to be created • Identify necessary feedback loops
  • 14. The Case for Cross-Functional Teams
  • 15. The Process 1. Examine the process 2. Brainstorm potential failures 3. List potential effects of failure 4. Identify Your Scale 5. Assign Severity ranking 6. Assign Occurrence ranking 7. Assign Detection ranking
  • 17. Brainstorm Potential Failures • Brainstorming should be fluid. Everything goes • Cross-Functional teams should be involved. (Business, development, operations, design)
  • 18. List Potential Effects of Failure Think through the impact of failure. The impact might be something process related, reputation related or technical, just to name a few. Examples: • Degraded customer experience • Order not fulfilled • Delay in payment to accounts receivable
  • 19. Agree on Risk Level Scales Technology Industry • Low severity could be degraded performance • High severity could be complete site outage Airline Industry • Low severity could be departure delay • High severity could be customer death
  • 20. Assign Severity Ranking Rank the severity on a scale between 1-10. • 1 being the severity is inconsequential • 10 being a catastrophic failure In some organizations, 9 and 10 are reserved for personal injury and death. If a failure mode has more than one effect, select only the most severe of the effects
  • 21. Assign Occurrence Ranking Rank the likelihood that this condition will occur. • 1 being extremely unlikely • 10 being inevitable.
  • 22. Assign Detection Ranking Rank the likelihood that this condition would be detected if it occurred. A scenario is only considered "detected" if it is found before it would impact a customer or user. • 1 means the control would absolutely be detected • 10 means the control is certain to not detect the failure.
  • 23. Calculate the Risk Priority Number The Risk Priority Number is a value that is calculated to rank a particular failure mode. The higher the RPN the sooner the failure mode should be addressed
  • 24. RPN = S * O * D
  • 25. Develop an Action Plan Evaluate the list and develop an action plan to eliminate or mitigate the items with the highest RPN value first. • Prioritize solutions that are self-healing and exist within the system under consideration. • Develop metrics that help to track the health surrounding a failure item • The goal is to reduce the RPN by lowering Severity, Occurrence or Detection scores
  • 26. Ensuring You Have a Feedback Loop The feedback loop is a constant evaluation of these measurements and indicators. The feedback loop should give a strong indicator that the system is working as expected, while at the same time exposing trends in the environment.
  • 27. Leading and Lagging Indicators Leading Indicator - A measurable factor that changes before the system enters a particular state of failure. (Metrics) Lagging Indicator - A measurable factor that changes after the system enters a particular state of failure. (Logs/Reporting)
  • 28. Recap • Examine your process, and assemble a cross-functional team with different views of the system • Brainstorm all your potential failure modes • Calculate your RPN • Develop action plans to reduce risk. Ensure the system is providing feedback loops to be able to identify the current state of the system • Profit
  • 29. Resources • Quality One FMEA Writeup • Purdue University FMEA Presentation • iSixSigma • Google Docs FMEA Template • Brainstorming Tools Mind Node, FreeMind, XMind