SlideShare a Scribd company logo
1 of 20
4. engineering to
avoid a crisis
ProblemManagementFoundation
Objectives
• Redundancy
• Resilience
• Fail-over
• Documentation
ProblemManagementFoundation
Avoidance elements
• Resilience - can deal with errors
• Redundancy - can deal with failures
• Fail-over - can deal with loss
• Documentation cannot be created in a crisis. Needs to be available
in advance
• Correct implementation
ProblemManagementFoundation
Factors that determine redundancy
Redundancy requires alternative component to be available
• Complexity: Degree of complexity based on the number of items and
interconnects required to provide service
• Hardware age: Measured against the stated lifecycle from the vendor
• Software age: Based on number of items including versions count
from current release
• Supportability: Supportability factor based on in house capability (key
man dependencies), reliance on 3rd party and contractual
arrangements
ProblemManagementFoundation
Factors that determine redundancy
(cont.)
• Single points of failure: Based on number of infrastructure single
points of failure
• Disaster recovery time: Based on time to implement full DR plan
• Capacity/Performance: Utilised capacity at tightest bottle neck
• Environmental: Factor based on risk to virus attack, user breakage,
physical damage, power failure etc.
ProblemManagementFoundation
Example for redundancy
• Spare tyre in car
• If a type punctures it is possible to stop the car, replace the type with a spare
from the boot and continue the journey.
• A full working alternative of the failed component is available.
ProblemManagementFoundation
Factors that determine resilience
Resilience is the ability for a component to continue to operate even
though a failure has occurred.
• Factors that determine resilience are similar to redundancy
• Budget required for resilience will be different to that of redundancy
and failover.
ProblemManagementFoundation
Example of resilience
• A BMW car with run flats
• A MTB with gel that self seals a hole – commonly known as
sludge or slime
• The radials in tubeless tyres
ProblemManagementFoundation
Resilience: The Swiss Cheese experiment
1
2
3
Internet
Peering partners
JINX/CINX
Transits
S
P
S
P
Tiered SPs
Gateways
Peering distribution
SiSi SiSi
Primary data centre Secondary data centre
Core Core
Caches
Caches
Network Management Systems
PDSN
PPPOE
PDSN
PPPOE
OSS/BSS systems Overview of an ISP
Fibre rings
RFrings
Satellite
RF high sites
Customer CPEs
Value Added ServicesValue Added ServicesTelkom IPC
ADSL
Fixed line
Mobile
Interconnects
VVVV VVVV
MetroethernetMetroethernet
ProblemManagementFoundation
Factors that determine fail-over
Component is able to swop over to another component without
interruption.
• Similar to redundancy and resilience
• Budget required would be different to resilience and redundancy
ProblemManagementFoundation
Example of fail-over
• Trucks with multiple wheels per hub
• Electrical supply via utility with ATS switch to generator which
auto starts.
ProblemManagementFoundation
Documentation
• Why do you need documentation?
• Advantages of having documentation before a crisis
• Types of documentation required
• Impact of lack of documentation
• Keeping documents updated
• Documentation standards
ProblemManagementFoundation
Documentation
• Advantages of having documentation before a crisis
• When a crisis occurs it is time consuming to start the diagnosis if
documentation of the systems is not available
• An understanding of the system needs to be created and there should be
a set of up-to-date, fully documented procedures and processes that are
available and easy to implement
• New staff members require a reference for processes. A process can only
be as good as its documentation. Correctly used processes avoid errors.
In the event of a crisis, uncertainty is reduced and time to resolution
increased.
• When a failure occurs, processes and documentation need to be
changed to avoid a re-occurrence. It could be as simple as a more
detailed sanity check before running that process that nukes some part
of the system.
ProblemManagementFoundation
Documentation
• Types of documentation required
• Inventory listings
• Rack and floor plan capacity
• Rack layout diagrams
• Patch panel connections
• Network switch connections
• Power strip connections
• Network diagrams
• Storage diagrams
• Domain diagrams
• Capacity reports
• Change audit trails
ProblemManagementFoundation
Documentation
• Impact of lack of documentation
ProblemManagementFoundation
Documentation
• Keeping documents updated
ProblemManagementFoundation
Documentation
• Documentation standards
ProblemManagementFoundation
Correct implementation
• Use a structured approach and plan, no “fly by the seat of your
pants”.
• Understand the deliverables and measure/gauge progress to the
target.
• David Allen, a productivity guru, frequently asserts that anything
that takes more than two steps and two minutes to accomplish is
a project.
• David Ruiz, director of IT at DIC Entertainment Corp, states that
nine out of 10 times taking the extra time to create a plan will
save you time and money.
• Refer to Appendix for project management resources.
ProblemManagementFoundation
Review:
Bottom line
In order to avoid a crisis, ensure you
have redundancy and resilience
implemented correctly, supported by
appropriate documentation and
measurements.
A crisis can be mitigated if systems have
been engineered with foresight with
how failures are handled

More Related Content

Similar to Problem management foundation - Engineering

The Website Resiliency Imperative
The Website Resiliency ImperativeThe Website Resiliency Imperative
The Website Resiliency ImperativeDistil Networks
 
Cloud native defined
Cloud native definedCloud native defined
Cloud native definedKim Clark
 
Infrastructure Strategy
Infrastructure StrategyInfrastructure Strategy
Infrastructure StrategyRobert Jones
 
Dave Davis: Infrastructure Projects – What Makes then Different and Difficult?
Dave Davis: Infrastructure Projects – What Makes then Different and Difficult?Dave Davis: Infrastructure Projects – What Makes then Different and Difficult?
Dave Davis: Infrastructure Projects – What Makes then Different and Difficult?Edunomica
 
Benefits of a Guidewire-FileNet Integration
Benefits of a Guidewire-FileNet IntegrationBenefits of a Guidewire-FileNet Integration
Benefits of a Guidewire-FileNet Integrationcmartin11
 
New Tech for Project Managers
New Tech for Project ManagersNew Tech for Project Managers
New Tech for Project ManagersPratip Mallik
 
Visualizing Your Network Health - Know your Network
Visualizing Your Network Health - Know your NetworkVisualizing Your Network Health - Know your Network
Visualizing Your Network Health - Know your NetworkDellNMS
 
Barbara_Aichinger_Server_Forum_2014
Barbara_Aichinger_Server_Forum_2014Barbara_Aichinger_Server_Forum_2014
Barbara_Aichinger_Server_Forum_2014Barbara Aichinger
 
The Business Case for Hosting JD Edwards in the Cloud
The Business Case for Hosting JD Edwards in the CloudThe Business Case for Hosting JD Edwards in the Cloud
The Business Case for Hosting JD Edwards in the CloudNERUG
 
Top Down Network Design - ebrahma.com
Top Down Network Design - ebrahma.comTop Down Network Design - ebrahma.com
Top Down Network Design - ebrahma.comPawan Sharma
 
Icinga Camp Bangalore - Enterprise exceptions
Icinga Camp Bangalore - Enterprise exceptions Icinga Camp Bangalore - Enterprise exceptions
Icinga Camp Bangalore - Enterprise exceptions Icinga
 
Dave Davis: Infrastructure Projects – What Makes then Different and Difficult...
Dave Davis: Infrastructure Projects – What Makes then Different and Difficult...Dave Davis: Infrastructure Projects – What Makes then Different and Difficult...
Dave Davis: Infrastructure Projects – What Makes then Different and Difficult...Lviv Startup Club
 
VMworld 2015: vRealize Operations Insight: Manage vSphere and Your Entire Dat...
VMworld 2015: vRealize Operations Insight: Manage vSphere and Your Entire Dat...VMworld 2015: vRealize Operations Insight: Manage vSphere and Your Entire Dat...
VMworld 2015: vRealize Operations Insight: Manage vSphere and Your Entire Dat...VMworld
 
CQRS + Event Sourcing
CQRS + Event SourcingCQRS + Event Sourcing
CQRS + Event SourcingMike Bild
 
SEC Presentation V2
SEC Presentation V2SEC Presentation V2
SEC Presentation V2Salim Sheikh
 
Microservices at Scale: How to Reduce Overhead and Increase Developer Product...
Microservices at Scale: How to Reduce Overhead and Increase Developer Product...Microservices at Scale: How to Reduce Overhead and Increase Developer Product...
Microservices at Scale: How to Reduce Overhead and Increase Developer Product...DevOps.com
 
VMworld 2013: Building a Validation Factory for VMware Partners
VMworld 2013: Building a Validation Factory for VMware Partners VMworld 2013: Building a Validation Factory for VMware Partners
VMworld 2013: Building a Validation Factory for VMware Partners VMworld
 
Moving Applications to the Cloud
Moving Applications to the CloudMoving Applications to the Cloud
Moving Applications to the CloudGary Irwin
 
Webinar: What's Wrong with DRaaS and How to Fix it
Webinar: What's Wrong with DRaaS and How to Fix itWebinar: What's Wrong with DRaaS and How to Fix it
Webinar: What's Wrong with DRaaS and How to Fix itStorage Switzerland
 

Similar to Problem management foundation - Engineering (20)

The Website Resiliency Imperative
The Website Resiliency ImperativeThe Website Resiliency Imperative
The Website Resiliency Imperative
 
Cloud native defined
Cloud native definedCloud native defined
Cloud native defined
 
Infrastructure Strategy
Infrastructure StrategyInfrastructure Strategy
Infrastructure Strategy
 
Dave Davis: Infrastructure Projects – What Makes then Different and Difficult?
Dave Davis: Infrastructure Projects – What Makes then Different and Difficult?Dave Davis: Infrastructure Projects – What Makes then Different and Difficult?
Dave Davis: Infrastructure Projects – What Makes then Different and Difficult?
 
Benefits of a Guidewire-FileNet Integration
Benefits of a Guidewire-FileNet IntegrationBenefits of a Guidewire-FileNet Integration
Benefits of a Guidewire-FileNet Integration
 
New Tech for Project Managers
New Tech for Project ManagersNew Tech for Project Managers
New Tech for Project Managers
 
Visualizing Your Network Health - Know your Network
Visualizing Your Network Health - Know your NetworkVisualizing Your Network Health - Know your Network
Visualizing Your Network Health - Know your Network
 
Barbara_Aichinger_Server_Forum_2014
Barbara_Aichinger_Server_Forum_2014Barbara_Aichinger_Server_Forum_2014
Barbara_Aichinger_Server_Forum_2014
 
The Business Case for Hosting JD Edwards in the Cloud
The Business Case for Hosting JD Edwards in the CloudThe Business Case for Hosting JD Edwards in the Cloud
The Business Case for Hosting JD Edwards in the Cloud
 
Top Down Network Design - ebrahma.com
Top Down Network Design - ebrahma.comTop Down Network Design - ebrahma.com
Top Down Network Design - ebrahma.com
 
Icinga Camp Bangalore - Enterprise exceptions
Icinga Camp Bangalore - Enterprise exceptions Icinga Camp Bangalore - Enterprise exceptions
Icinga Camp Bangalore - Enterprise exceptions
 
Dave Davis: Infrastructure Projects – What Makes then Different and Difficult...
Dave Davis: Infrastructure Projects – What Makes then Different and Difficult...Dave Davis: Infrastructure Projects – What Makes then Different and Difficult...
Dave Davis: Infrastructure Projects – What Makes then Different and Difficult...
 
VMworld 2015: vRealize Operations Insight: Manage vSphere and Your Entire Dat...
VMworld 2015: vRealize Operations Insight: Manage vSphere and Your Entire Dat...VMworld 2015: vRealize Operations Insight: Manage vSphere and Your Entire Dat...
VMworld 2015: vRealize Operations Insight: Manage vSphere and Your Entire Dat...
 
CQRS + Event Sourcing
CQRS + Event SourcingCQRS + Event Sourcing
CQRS + Event Sourcing
 
SEC Presentation V2
SEC Presentation V2SEC Presentation V2
SEC Presentation V2
 
Univa Presentation at DAC 2020
Univa Presentation at DAC 2020 Univa Presentation at DAC 2020
Univa Presentation at DAC 2020
 
Microservices at Scale: How to Reduce Overhead and Increase Developer Product...
Microservices at Scale: How to Reduce Overhead and Increase Developer Product...Microservices at Scale: How to Reduce Overhead and Increase Developer Product...
Microservices at Scale: How to Reduce Overhead and Increase Developer Product...
 
VMworld 2013: Building a Validation Factory for VMware Partners
VMworld 2013: Building a Validation Factory for VMware Partners VMworld 2013: Building a Validation Factory for VMware Partners
VMworld 2013: Building a Validation Factory for VMware Partners
 
Moving Applications to the Cloud
Moving Applications to the CloudMoving Applications to the Cloud
Moving Applications to the Cloud
 
Webinar: What's Wrong with DRaaS and How to Fix it
Webinar: What's Wrong with DRaaS and How to Fix itWebinar: What's Wrong with DRaaS and How to Fix it
Webinar: What's Wrong with DRaaS and How to Fix it
 

More from Ronald Bartels

Implementing a modern Fusion Centre
Implementing a modern Fusion Centre Implementing a modern Fusion Centre
Implementing a modern Fusion Centre Ronald Bartels
 
NSA advisory about state sponsored cybersecurity threats
NSA advisory about state sponsored cybersecurity threatsNSA advisory about state sponsored cybersecurity threats
NSA advisory about state sponsored cybersecurity threatsRonald Bartels
 
The reasons why your business cannot afford to be offline
The reasons why your business cannot afford to be offlineThe reasons why your business cannot afford to be offline
The reasons why your business cannot afford to be offlineRonald Bartels
 
RADWIN, software defined wide area network, Press Release
RADWIN, software defined wide area network, Press ReleaseRADWIN, software defined wide area network, Press Release
RADWIN, software defined wide area network, Press ReleaseRonald Bartels
 
Infrastructure management presented to GPNOG (Updated)
Infrastructure management presented to GPNOG (Updated)Infrastructure management presented to GPNOG (Updated)
Infrastructure management presented to GPNOG (Updated)Ronald Bartels
 
Infrastructure management using a VPN Concentrator
Infrastructure management using a VPN ConcentratorInfrastructure management using a VPN Concentrator
Infrastructure management using a VPN ConcentratorRonald Bartels
 
Problem management foundation - Introduction
Problem management foundation - IntroductionProblem management foundation - Introduction
Problem management foundation - IntroductionRonald Bartels
 
Problem management foundation - Overview
Problem management foundation - OverviewProblem management foundation - Overview
Problem management foundation - OverviewRonald Bartels
 
Problem management foundation - Perceptions
Problem management foundation - PerceptionsProblem management foundation - Perceptions
Problem management foundation - PerceptionsRonald Bartels
 
Problem management foundation - Tiger teams
Problem management foundation - Tiger teamsProblem management foundation - Tiger teams
Problem management foundation - Tiger teamsRonald Bartels
 
Problem management foundation - Lifecycle
Problem management foundation - Lifecycle Problem management foundation - Lifecycle
Problem management foundation - Lifecycle Ronald Bartels
 
Problem management foundation - Tools
Problem management foundation - ToolsProblem management foundation - Tools
Problem management foundation - ToolsRonald Bartels
 
Problem management foundation - Analysing
Problem management foundation - AnalysingProblem management foundation - Analysing
Problem management foundation - AnalysingRonald Bartels
 
Problem management foundation Simulation
Problem management foundation SimulationProblem management foundation Simulation
Problem management foundation SimulationRonald Bartels
 
Problem management foundation - IT risk
Problem management foundation - IT riskProblem management foundation - IT risk
Problem management foundation - IT riskRonald Bartels
 
Problem management foundation - Continious improvement
Problem management foundation - Continious improvementProblem management foundation - Continious improvement
Problem management foundation - Continious improvementRonald Bartels
 
Problem management foundation - Mission control
Problem management foundation - Mission controlProblem management foundation - Mission control
Problem management foundation - Mission controlRonald Bartels
 
Problem management foundation - Significant havoc in technology
Problem management foundation - Significant havoc in technologyProblem management foundation - Significant havoc in technology
Problem management foundation - Significant havoc in technologyRonald Bartels
 
Problem management foundation Budget
Problem management foundation BudgetProblem management foundation Budget
Problem management foundation BudgetRonald Bartels
 
Problem management foundation Communications
Problem management foundation CommunicationsProblem management foundation Communications
Problem management foundation CommunicationsRonald Bartels
 

More from Ronald Bartels (20)

Implementing a modern Fusion Centre
Implementing a modern Fusion Centre Implementing a modern Fusion Centre
Implementing a modern Fusion Centre
 
NSA advisory about state sponsored cybersecurity threats
NSA advisory about state sponsored cybersecurity threatsNSA advisory about state sponsored cybersecurity threats
NSA advisory about state sponsored cybersecurity threats
 
The reasons why your business cannot afford to be offline
The reasons why your business cannot afford to be offlineThe reasons why your business cannot afford to be offline
The reasons why your business cannot afford to be offline
 
RADWIN, software defined wide area network, Press Release
RADWIN, software defined wide area network, Press ReleaseRADWIN, software defined wide area network, Press Release
RADWIN, software defined wide area network, Press Release
 
Infrastructure management presented to GPNOG (Updated)
Infrastructure management presented to GPNOG (Updated)Infrastructure management presented to GPNOG (Updated)
Infrastructure management presented to GPNOG (Updated)
 
Infrastructure management using a VPN Concentrator
Infrastructure management using a VPN ConcentratorInfrastructure management using a VPN Concentrator
Infrastructure management using a VPN Concentrator
 
Problem management foundation - Introduction
Problem management foundation - IntroductionProblem management foundation - Introduction
Problem management foundation - Introduction
 
Problem management foundation - Overview
Problem management foundation - OverviewProblem management foundation - Overview
Problem management foundation - Overview
 
Problem management foundation - Perceptions
Problem management foundation - PerceptionsProblem management foundation - Perceptions
Problem management foundation - Perceptions
 
Problem management foundation - Tiger teams
Problem management foundation - Tiger teamsProblem management foundation - Tiger teams
Problem management foundation - Tiger teams
 
Problem management foundation - Lifecycle
Problem management foundation - Lifecycle Problem management foundation - Lifecycle
Problem management foundation - Lifecycle
 
Problem management foundation - Tools
Problem management foundation - ToolsProblem management foundation - Tools
Problem management foundation - Tools
 
Problem management foundation - Analysing
Problem management foundation - AnalysingProblem management foundation - Analysing
Problem management foundation - Analysing
 
Problem management foundation Simulation
Problem management foundation SimulationProblem management foundation Simulation
Problem management foundation Simulation
 
Problem management foundation - IT risk
Problem management foundation - IT riskProblem management foundation - IT risk
Problem management foundation - IT risk
 
Problem management foundation - Continious improvement
Problem management foundation - Continious improvementProblem management foundation - Continious improvement
Problem management foundation - Continious improvement
 
Problem management foundation - Mission control
Problem management foundation - Mission controlProblem management foundation - Mission control
Problem management foundation - Mission control
 
Problem management foundation - Significant havoc in technology
Problem management foundation - Significant havoc in technologyProblem management foundation - Significant havoc in technology
Problem management foundation - Significant havoc in technology
 
Problem management foundation Budget
Problem management foundation BudgetProblem management foundation Budget
Problem management foundation Budget
 
Problem management foundation Communications
Problem management foundation CommunicationsProblem management foundation Communications
Problem management foundation Communications
 

Recently uploaded

Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...
Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...
Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...Pooja Nehwal
 
VIP 7001035870 Find & Meet Hyderabad Call Girls Ameerpet high-profile Call Girl
VIP 7001035870 Find & Meet Hyderabad Call Girls Ameerpet high-profile Call GirlVIP 7001035870 Find & Meet Hyderabad Call Girls Ameerpet high-profile Call Girl
VIP 7001035870 Find & Meet Hyderabad Call Girls Ameerpet high-profile Call Girladitipandeya
 
CALL ON ➥8923113531 🔝Call Girls Charbagh Lucknow best sexual service
CALL ON ➥8923113531 🔝Call Girls Charbagh Lucknow best sexual serviceCALL ON ➥8923113531 🔝Call Girls Charbagh Lucknow best sexual service
CALL ON ➥8923113531 🔝Call Girls Charbagh Lucknow best sexual serviceanilsa9823
 
Call now : 9892124323 Nalasopara Beautiful Call Girls Vasai virar Best Call G...
Call now : 9892124323 Nalasopara Beautiful Call Girls Vasai virar Best Call G...Call now : 9892124323 Nalasopara Beautiful Call Girls Vasai virar Best Call G...
Call now : 9892124323 Nalasopara Beautiful Call Girls Vasai virar Best Call G...Pooja Nehwal
 
internal analysis on strategic management
internal analysis on strategic managementinternal analysis on strategic management
internal analysis on strategic managementharfimakarim
 
Day 0- Bootcamp Roadmap for PLC Bootcamp
Day 0- Bootcamp Roadmap for PLC BootcampDay 0- Bootcamp Roadmap for PLC Bootcamp
Day 0- Bootcamp Roadmap for PLC BootcampPLCLeadershipDevelop
 
GENUINE Babe,Call Girls IN Baderpur Delhi | +91-8377087607
GENUINE Babe,Call Girls IN Baderpur  Delhi | +91-8377087607GENUINE Babe,Call Girls IN Baderpur  Delhi | +91-8377087607
GENUINE Babe,Call Girls IN Baderpur Delhi | +91-8377087607dollysharma2066
 
operational plan ppt.pptx nursing management
operational plan ppt.pptx nursing managementoperational plan ppt.pptx nursing management
operational plan ppt.pptx nursing managementTulsiDhidhi1
 

Recently uploaded (20)

Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...
Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...
Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...
 
VIP 7001035870 Find & Meet Hyderabad Call Girls Ameerpet high-profile Call Girl
VIP 7001035870 Find & Meet Hyderabad Call Girls Ameerpet high-profile Call GirlVIP 7001035870 Find & Meet Hyderabad Call Girls Ameerpet high-profile Call Girl
VIP 7001035870 Find & Meet Hyderabad Call Girls Ameerpet high-profile Call Girl
 
Peak Performance & Resilience - Dr Dorian Dugmore
Peak Performance & Resilience - Dr Dorian DugmorePeak Performance & Resilience - Dr Dorian Dugmore
Peak Performance & Resilience - Dr Dorian Dugmore
 
Rohini Sector 16 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 16 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 16 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 16 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
Empowering Local Government Frontline Services - Mo Baines.pdf
Empowering Local Government Frontline Services - Mo Baines.pdfEmpowering Local Government Frontline Services - Mo Baines.pdf
Empowering Local Government Frontline Services - Mo Baines.pdf
 
CALL ON ➥8923113531 🔝Call Girls Charbagh Lucknow best sexual service
CALL ON ➥8923113531 🔝Call Girls Charbagh Lucknow best sexual serviceCALL ON ➥8923113531 🔝Call Girls Charbagh Lucknow best sexual service
CALL ON ➥8923113531 🔝Call Girls Charbagh Lucknow best sexual service
 
Call now : 9892124323 Nalasopara Beautiful Call Girls Vasai virar Best Call G...
Call now : 9892124323 Nalasopara Beautiful Call Girls Vasai virar Best Call G...Call now : 9892124323 Nalasopara Beautiful Call Girls Vasai virar Best Call G...
Call now : 9892124323 Nalasopara Beautiful Call Girls Vasai virar Best Call G...
 
Imagine - Creating Healthy Workplaces - Anthony Montgomery.pdf
Imagine - Creating Healthy Workplaces - Anthony Montgomery.pdfImagine - Creating Healthy Workplaces - Anthony Montgomery.pdf
Imagine - Creating Healthy Workplaces - Anthony Montgomery.pdf
 
internal analysis on strategic management
internal analysis on strategic managementinternal analysis on strategic management
internal analysis on strategic management
 
Discover -CQ Master Class - Rikita Wadhwa.pdf
Discover -CQ Master Class - Rikita Wadhwa.pdfDiscover -CQ Master Class - Rikita Wadhwa.pdf
Discover -CQ Master Class - Rikita Wadhwa.pdf
 
LoveLocalGov - Chris Twigg, Inner Circle
LoveLocalGov - Chris Twigg, Inner CircleLoveLocalGov - Chris Twigg, Inner Circle
LoveLocalGov - Chris Twigg, Inner Circle
 
Day 0- Bootcamp Roadmap for PLC Bootcamp
Day 0- Bootcamp Roadmap for PLC BootcampDay 0- Bootcamp Roadmap for PLC Bootcamp
Day 0- Bootcamp Roadmap for PLC Bootcamp
 
Imagine - HR; are handling the 'bad banter' - Stella Chandler.pdf
Imagine - HR; are handling the 'bad banter' - Stella Chandler.pdfImagine - HR; are handling the 'bad banter' - Stella Chandler.pdf
Imagine - HR; are handling the 'bad banter' - Stella Chandler.pdf
 
Unlocking the Future - Dr Max Blumberg, Founder of Blumberg Partnership
Unlocking the Future - Dr Max Blumberg, Founder of Blumberg PartnershipUnlocking the Future - Dr Max Blumberg, Founder of Blumberg Partnership
Unlocking the Future - Dr Max Blumberg, Founder of Blumberg Partnership
 
Leadership in Crisis - Helio Vogas, Risk & Leadership Keynote Speaker
Leadership in Crisis - Helio Vogas, Risk & Leadership Keynote SpeakerLeadership in Crisis - Helio Vogas, Risk & Leadership Keynote Speaker
Leadership in Crisis - Helio Vogas, Risk & Leadership Keynote Speaker
 
Call Girls Service Tilak Nagar @9999965857 Delhi 🫦 No Advance VVIP 🍎 SERVICE
Call Girls Service Tilak Nagar @9999965857 Delhi 🫦 No Advance  VVIP 🍎 SERVICECall Girls Service Tilak Nagar @9999965857 Delhi 🫦 No Advance  VVIP 🍎 SERVICE
Call Girls Service Tilak Nagar @9999965857 Delhi 🫦 No Advance VVIP 🍎 SERVICE
 
Becoming an Inclusive Leader - Bernadette Thompson
Becoming an Inclusive Leader - Bernadette ThompsonBecoming an Inclusive Leader - Bernadette Thompson
Becoming an Inclusive Leader - Bernadette Thompson
 
GENUINE Babe,Call Girls IN Baderpur Delhi | +91-8377087607
GENUINE Babe,Call Girls IN Baderpur  Delhi | +91-8377087607GENUINE Babe,Call Girls IN Baderpur  Delhi | +91-8377087607
GENUINE Babe,Call Girls IN Baderpur Delhi | +91-8377087607
 
Disrupt or be Disrupted - Kirk Vallis.pdf
Disrupt or be Disrupted - Kirk Vallis.pdfDisrupt or be Disrupted - Kirk Vallis.pdf
Disrupt or be Disrupted - Kirk Vallis.pdf
 
operational plan ppt.pptx nursing management
operational plan ppt.pptx nursing managementoperational plan ppt.pptx nursing management
operational plan ppt.pptx nursing management
 

Problem management foundation - Engineering

  • 3. ProblemManagementFoundation Avoidance elements • Resilience - can deal with errors • Redundancy - can deal with failures • Fail-over - can deal with loss • Documentation cannot be created in a crisis. Needs to be available in advance • Correct implementation
  • 4. ProblemManagementFoundation Factors that determine redundancy Redundancy requires alternative component to be available • Complexity: Degree of complexity based on the number of items and interconnects required to provide service • Hardware age: Measured against the stated lifecycle from the vendor • Software age: Based on number of items including versions count from current release • Supportability: Supportability factor based on in house capability (key man dependencies), reliance on 3rd party and contractual arrangements
  • 5. ProblemManagementFoundation Factors that determine redundancy (cont.) • Single points of failure: Based on number of infrastructure single points of failure • Disaster recovery time: Based on time to implement full DR plan • Capacity/Performance: Utilised capacity at tightest bottle neck • Environmental: Factor based on risk to virus attack, user breakage, physical damage, power failure etc.
  • 6. ProblemManagementFoundation Example for redundancy • Spare tyre in car • If a type punctures it is possible to stop the car, replace the type with a spare from the boot and continue the journey. • A full working alternative of the failed component is available.
  • 7. ProblemManagementFoundation Factors that determine resilience Resilience is the ability for a component to continue to operate even though a failure has occurred. • Factors that determine resilience are similar to redundancy • Budget required for resilience will be different to that of redundancy and failover.
  • 8. ProblemManagementFoundation Example of resilience • A BMW car with run flats • A MTB with gel that self seals a hole – commonly known as sludge or slime • The radials in tubeless tyres
  • 10. Internet Peering partners JINX/CINX Transits S P S P Tiered SPs Gateways Peering distribution SiSi SiSi Primary data centre Secondary data centre Core Core Caches Caches Network Management Systems PDSN PPPOE PDSN PPPOE OSS/BSS systems Overview of an ISP Fibre rings RFrings Satellite RF high sites Customer CPEs Value Added ServicesValue Added ServicesTelkom IPC ADSL Fixed line Mobile Interconnects VVVV VVVV MetroethernetMetroethernet
  • 11. ProblemManagementFoundation Factors that determine fail-over Component is able to swop over to another component without interruption. • Similar to redundancy and resilience • Budget required would be different to resilience and redundancy
  • 12. ProblemManagementFoundation Example of fail-over • Trucks with multiple wheels per hub • Electrical supply via utility with ATS switch to generator which auto starts.
  • 13. ProblemManagementFoundation Documentation • Why do you need documentation? • Advantages of having documentation before a crisis • Types of documentation required • Impact of lack of documentation • Keeping documents updated • Documentation standards
  • 14. ProblemManagementFoundation Documentation • Advantages of having documentation before a crisis • When a crisis occurs it is time consuming to start the diagnosis if documentation of the systems is not available • An understanding of the system needs to be created and there should be a set of up-to-date, fully documented procedures and processes that are available and easy to implement • New staff members require a reference for processes. A process can only be as good as its documentation. Correctly used processes avoid errors. In the event of a crisis, uncertainty is reduced and time to resolution increased. • When a failure occurs, processes and documentation need to be changed to avoid a re-occurrence. It could be as simple as a more detailed sanity check before running that process that nukes some part of the system.
  • 15. ProblemManagementFoundation Documentation • Types of documentation required • Inventory listings • Rack and floor plan capacity • Rack layout diagrams • Patch panel connections • Network switch connections • Power strip connections • Network diagrams • Storage diagrams • Domain diagrams • Capacity reports • Change audit trails
  • 19. ProblemManagementFoundation Correct implementation • Use a structured approach and plan, no “fly by the seat of your pants”. • Understand the deliverables and measure/gauge progress to the target. • David Allen, a productivity guru, frequently asserts that anything that takes more than two steps and two minutes to accomplish is a project. • David Ruiz, director of IT at DIC Entertainment Corp, states that nine out of 10 times taking the extra time to create a plan will save you time and money. • Refer to Appendix for project management resources.
  • 20. ProblemManagementFoundation Review: Bottom line In order to avoid a crisis, ensure you have redundancy and resilience implemented correctly, supported by appropriate documentation and measurements. A crisis can be mitigated if systems have been engineered with foresight with how failures are handled

Editor's Notes

  1. Crisis engineering
  2. Objectives <Insert notes>
  3. Crisis engineering Component Failure Impact Analysis - http://www.itsmsolutions.com/newsletters/DITYvol1iss4.htm ITIL suggests “Component Failure Impact Analysis” aka ‘single point of failure’ analysis
  4. Complexity: Degree of complexity based on the number of items and interconnects required to provide service Hardware age: Measured against the stated lifecycle from the vendor Software age: Based on number of items including versions count from current release Supportability: Supportability factor based on in house capability (key man dependencies), reliance on 3rd party and contractual arrangements Single points of failure: Based on number of infrastructure single points of failure Disaster recovery time: Based on time to implement full DR plan Capacity/Performance Utilised capacity at tightest bottle neck Environmental: Factor based on risk to virus attack, user breakage, physical damage, power failure etc.
  5. Complexity: Degree of complexity based on the number of items and interconnects required to provide service Hardware age: Measured against the stated lifecycle from the vendor Software age: Based on number of items including versions count from current release Supportability: Supportability factor based on in house capability (key man dependencies), reliance on 3rd party and contractual arrangements Single points of failure: Based on number of infrastructure single points of failure Disaster recovery time: Based on time to implement full DR plan Capacity/Performance Utilised capacity at tightest bottle neck Environmental: Factor based on risk to virus attack, user breakage, physical damage, power failure etc.
  6. Example of redundancy
  7. Complexity: Degree of complexity based on the number of items and interconnects required to provide service Hardware age: Measured against the stated lifecycle from the vendor Software age: Based on number of items including versions count from current release Supportability: Supportability factor based on in house capability (key man dependencies), reliance on 3rd party and contractual arrangements Single points of failure: Based on number of infrastructure single points of failure Disaster recovery time: Based on time to implement full DR plan Capacity/Performance Utilised capacity at tightest bottle neck Environmental: Factor based on risk to virus attack, user breakage, physical damage, power failure etc.
  8. Demonstrate The Swiss cheese experiment Three pieces of paper and punching a random hole in the paper with a pen. Each paper is a system with multiple components. This illustrates the principle of resilience i.e if you line the systems up together, the dots wont align, and until they do, you wont have services that fail. This demonstrates the concept of resilience in systems.
  9. Best practice network design for a service provider The diagram is an example of engineering a solution for redundancy, resilience and failover.
  10. Complexity: Degree of complexity based on the number of items and interconnects required to provide service Hardware age: Measured against the stated lifecycle from the vendor Software age: Based on number of items including versions count from current release Supportability: Supportability factor based on in house capability (key man dependencies), reliance on 3rd party and contractual arrangements Single points of failure: Based on number of infrastructure single points of failure Disaster recovery time: Based on time to implement full DR plan Capacity/Performance Utilised capacity at tightest bottle neck Environmental: Factor based on risk to virus attack, user breakage, physical damage, power failure etc.
  11. Examples of fail-over
  12. Naming conventions Similar documents help to create easier understanding of systems when they're documented in a consistent manner
  13. Naming conventions Similar documents help to create easier understanding of systems when they're documented in a consistent manner <Ask Lee to add notes and information>
  14. Naming conventions Similar documents help to create easier understanding of systems when they're documented in a consistent manner <Ask Lee to add notes and information>
  15. Naming conventions Similar documents help to create easier understanding of systems when they're documented in a consistent manner <Ask Lee to add notes and information>
  16. Naming conventions Similar documents help to create easier understanding of systems when they're documented in a consistent manner <Ask Lee to add notes and information>
  17. Naming conventions Similar documents help to create easier understanding of systems when they're documented in a consistent manner <Ask Lee to add notes and information>
  18. Basic project management, can within reason, be applied to anything. The alternative method would be to "fly by the seat of your pants" and then "crash and burn.“ Eskom’s Medupi implementation. The methods of approaching large projects are well documented but that is not what we are talking about. Often a person is tasked to complete a set of deliverables in a few days. So there needs to be a method to approach these aspects of your work that are short in nature in a methodological manner. A lightweight version of doing projects. David Allen, a productivity guru, frequently asserts that anything that takes more than two steps and two minutes to accomplish is a project. David Ruiz, director of IT at DIC Entertainment Corp, states that nine out of 10 times taking the extra time to create a plan will save you time and money
  19. Basic project management, can within reason, be applied to anything. The alternative method would be to "fly by the seat of your pants" and then "crash and burn.“ Eskom’s Medupi implementation. The methods of approaching large projects are well documented but that is not what we are talking about. Often a person is tasked to complete a set of deliverables in a few days. So there needs to be a method to approach these aspects of your work that are short in nature in a methodological manner. A lightweight version of doing projects. David Allen, a productivity guru, frequently asserts that anything that takes more than two steps and two minutes to accomplish is a project. David Ruiz, director of IT at DIC Entertainment Corp, states that nine out of 10 times taking the extra time to create a plan will save you time and money