Fault, Errors, and Promise Theory


Mark Burgess

How to think about systems and their problems

FAULTS AND ERRORS
Promise Theory
Mark Burgess

From components to interactions
• Promiser (component) -> promisee(s) / stakeholder(s)
• Quantitative delivery and qualitative interpretation (perceptions)
• Error/fault -> Measured deviations from expected behaviour (probability)
• Incident -> Promise not kept (overlap of intent)
• Ticket -> Diagnostics (graph causation)
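The bullets above treat a fault as a measured deviation and an incident as a promise not kept. A minimal sketch of that distinction, assuming a hypothetical latency promise; the names `Promise`, `kept_fraction`, `is_incident` and the `tolerance` parameter are illustrative and do not come from the slides:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Promise:
    """A quantified promise from a promiser to a promisee (hypothetical shape)."""
    promiser: str
    promisee: str
    max_latency_ms: float  # the promised quantity, assumed for illustration


def kept_fraction(promise, observed_ms):
    """Fraction of observations that fall within the promised bound."""
    if not observed_ms:
        raise ValueError("no observations to assess")
    kept = sum(1 for x in observed_ms if x <= promise.max_latency_ms)
    return kept / len(observed_ms)


def is_incident(promise, observed_ms, tolerance):
    """The promisee's own assessment: an incident is declared when the
    kept fraction drops below the level this promisee accepts."""
    return kept_fraction(promise, observed_ms) < tolerance
```

With observations of 120, 180, 250 and 190 ms against a 200 ms promise, the kept fraction is 0.75. A promisee with a tolerance of 0.9 would call that an incident, while one with a tolerance of 0.7 would not: the same quantitative delivery, interpreted differently.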

Agents and their promises
• Agents can be humans or machines
• Promises are quantifiable (not just yes/no)
• Scalable theory (agents inside agents)
• Different agents promise different capabilities
• Different agents perceive differently (context and capability)

Fault/Error modes
• Agents can be humans or machines
• Promises are quantifiable
• Scalable theory
• Flawed communication
• Correctly intended/sent AND correctly perceived/received?
• Flawed / missing promises
• Flawed / missing cooperation (agreement) or interpretation
• Just wrong mindset / intuition
• Byzantine failures

Issues relating to faulty cooperation
• Dependency (makes faults travel)
• Amplification (makes faults worse)
• Redundancy (helps to absorb faults)
• Repair (make faults disappear before they are noticed)
• Tolerance (keep working in spite of faults)
Diverge
Converge

Redundancy
• Serial:
  • Humans: “Are you sure you want to do X?” (Self-confirm - AND = circuit breaker)
  • Clients: failover to server X if server Y is not available (Self-repair - XOR Backup)
• Parallel:
  • Humans: “Insert both your keys to confirm” (Average/voting - AND circuit breaker)
  • Clients: query all sources for quorum (minimum acceptable confirmation - vote)
Converge/Confirm
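The two client cases above, serial failover and parallel quorum, can be sketched as follows. This is an illustration under assumptions, not the author's implementation: each source is modelled as a plain callable that either returns an answer or raises, and the names `failover`, `quorum` and `minimum` are hypothetical.

```python
from collections import Counter


def failover(sources, request):
    """Serial redundancy: try each source in order and return the first
    answer; later sources are only backups for earlier ones."""
    errors = []
    for source in sources:
        try:
            return source(request)
        except Exception as exc:  # any failure counts as "not available"
            errors.append(exc)
    raise RuntimeError(f"all {len(sources)} sources failed: {errors}")


def quorum(sources, request, minimum):
    """Parallel redundancy: ask every source and accept the most common
    answer only if at least `minimum` sources agree on it.
    Answers must be hashable so they can be counted."""
    answers = []
    for source in sources:
        try:
            answers.append(source(request))
        except Exception:
            continue  # a silent source simply casts no vote
    if not answers:
        raise RuntimeError("no source answered")
    value, votes = Counter(answers).most_common(1)[0]
    if votes < minimum:
        raise RuntimeError(f"only {votes} votes, need {minimum}")
    return value
```

Failover absorbs a fault by substitution, so one working source is enough. Quorum absorbs it by agreement, so it also guards against a source that answers wrongly, at the cost of querying all of them.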

Repair and tolerance
• There can be MANY ways to break a component
• It is MORE efficient to detect and repair quickly than to try to prevent failure
• It is MOST efficient to tolerate errors and failures at all stages
Converge
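One way to see why repair speed matters as much as prevention is the standard steady-state availability formula, MTBF / (MTBF + MTTR). The slides do not cite this formula; it is brought in here as an assumption, and the function name and the numbers are illustrative only.

```python
def availability(mtbf_hours, mttr_hours):
    """Steady-state availability: the long-run fraction of time a
    component is working, given mean time between failures (MTBF)
    and mean time to repair (MTTR)."""
    if mtbf_hours <= 0 or mttr_hours < 0:
        raise ValueError("MTBF must be positive and MTTR non-negative")
    return mtbf_hours / (mtbf_hours + mttr_hours)


baseline = availability(100.0, 1.0)        # fails every 100 h, 1 h to repair
fewer_failures = availability(200.0, 1.0)  # prevention: failures half as often
faster_repair = availability(100.0, 0.5)   # repair: recovery twice as fast
```

In this model, halving the repair time gives the same availability as halving the failure rate. The formula says nothing about cost; the slide's argument is that, with many ways to break a component, speeding up repair is usually the cheaper of the two.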

Statistics suggest general strategies for reliability

Too late = broken promise
• Separate concerns by timescales, not by features
• Management is about balance
• Error correction as fast as error generation
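The last bullet can be illustrated with a toy backlog model: if errors are generated faster than they are corrected, the number of uncorrected errors grows without bound. This is a deliberately simple sketch with constant rates per time step; the function and parameter names are hypothetical.

```python
def backlog_over_time(generated_per_tick, corrected_per_tick, ticks):
    """Track uncorrected errors over time when a fixed number of errors
    is generated and a fixed number can be corrected in each tick."""
    backlog = 0
    history = []
    for _ in range(ticks):
        # The backlog can never go below zero: there is nothing to
        # correct once all outstanding errors are fixed.
        backlog = max(0, backlog + generated_per_tick - corrected_per_tick)
        history.append(backlog)
    return history
```

With 3 errors generated and 2 corrected per tick, the backlog after four ticks is `[1, 2, 3, 4]` and keeps climbing. With correction at least as fast as generation, it stays at zero, which is the balance the slide asks for.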

Example products and their timescales
• Workloads
• Governance
• Constraints

Separation of timescales: governance vs workloads

Details, details, details ….
Context, context, context ….
Interpretation and performance


