Automating the
Ops in DevOps
James Moore
Principal Offering Manager
Charlene O’Hanlon
Managing Editor, Devops.com
Speakers
James Moore
Principal Offering Manager, IBM
Digital transformation provides
the opportunity to disrupt
industries.
However, it requires a
transformation in how innovative
software is delivered to succeed.
Think 2018 / 8143 / March 22, 2018 / © 2018 IBM Corporation
To remain competitive, businesses
must continuously reinvent themselves
by creating:
– more compelling customer
experiences
– innovative new business models
– fast and flexible operational
processes
Companies are leveraging new
technologies to help them accelerate.
Think 2018 / 8143 / March 22, 2018 / © 2018 IBM Corporation
Today’s innovation pace demands a new management paradigm
#Agile
#Cognitive
#Collaborative
#Flexible
#DevOps
#Dynamic
#Efficient
#SRE
#Cloud
#12FactorApps
Ops Goal: Fewer problem
tickets, faster resolution
Dev Goal: Faster time to market,
reduce disruptions
Shift LeftShift Right
Dev Test Stage ProdStop
DevOps accelerates value for the Business
Business needs:
• Agile Application Delivery
• End user experience &reliability
• Lean Operations Management
Business needs:
• Agile Application Delivery
• End user experience &reliability
• Lean Operations Management
Dev Ops
Dev
Ops
Mean time
to Identify
Mean time
to Verify
Mean time
to Know
Mean time
to Fix
Continuous Delivery requires self-sufficiency AND collaboration
7
Management reality in today’s IT landscapes
Issues that go beyond Level 1 support often
involve at least 3-4 people taking up an
average of 5-7 total man hours
Mean Time to Repair for application-related
problems is often 3-6 hours
User calls to applications support are the
enterprise’s “heads up” for an application problem
35% of the time
58% of survey participants said their
enterprise had between 6 and 40 different
monitoring tools
Managing today’s applications is difficult because symptoms may not be visible
Increasingly
complex hybrid
multi cloud
environments
R
a
t
e
O
f
C
h
a
n
g
e
Breadth & Diversity of Workload, cross IT, Network, Storage, etc…
D
p
e
n
d
e
n
c
y
Increasingly
Dynamic
Applications with
flexible
architectures
Source: IDC FutureScape: Worldwide Cloud 2016 Predictions - Mastering the Raw Material of Digital Transformation, IDC, Nov. 3, 2015 Gartner Predicts 2016: Cloud Computing to Drive Digital Business 12/8/15 G00292047
Poll Question #1
Where does your company’s operations
management happen?
A. All in a central IT Ops group
B. All outside a central IT Ops group
C. Mix across central IT and other groups
D. External service provider
Shift left operations
Visibility, control and automation
to manage apps and
infrastructure anywhere
Reactive
Real-time analytics
Shift left to adaptively sense and respond
to dynamic environments
Proactive
Predictive insights
Business
Agility
Adaptive
Cognitively enhanced
workflow
Structure and Efficiency Starts with Automated Event & Incident Management:
Dynamic lifecycles amidst constant
change:
• Application launches & changes
• Virtualized infrastructure movement
• Digital transformation
• Organizational changes
• Business initiatives
• External: weather, disaster, cyber
attacks
Control of the source not required to detect and resolve incidents as they happen
Incident Management: restore normal service operations as quickly as possible to ensure the best levels
of service quality and availability.
Analyze operational activity to learn and improve:
Augmented and automated restoration activity fuels proactive improvement
AI Ops: applying analysis and machine learning to operational activity to understand, improve and predict.
Analyze
Learn
Improve
Adjust
Predict
Polling Question #2
What best describes your organization’s
operational methodologies?
A. Manual Only
B. Mostly manual with some automated
activity
C. Prevalent amount of automated activity
Route 2
Automated Operations – Event & Incident Correlation, Prioritization
14Think 2018 / DOC ID / Month XX, 2018 / © 2018 IBM Corporation
My App is
Unresponsive!
Service x is
impacting our
Biz Critical
App!Route 1
App owner
Operator
Assign
Correlation, Consolidation, Prioritization
Chat
push
Automated Operations – Notification
15Think 2018 / DOC ID / Month XX, 2018 / © 2018 IBM Corporation
Notification
Incident
My App is
Unresponsive!
I’m getting a
coffee.Route
Policy / Calendar
App owner
Operator
Chat
SM
S
Chat
e-mail
Automated Operations – Escalation
16Think 2018 / DOC ID / Month XX, 2018 / © 2018 IBM Corporation
Notification
Incident
Escalate
Looks like Brock
is on it
Ouch. Olivia’s
app is
unresponsive
Re-route
Escalation Policy
App owner
Operator
App owner
Senior
Operator
Chat
SM
S
Chat
e-mail
Automated Operations- Collaboration
17Think 2018 / DOC ID / Month XX, 2018 / © 2018 IBM Corporation
Notification
Incident
Chat
Escalate
What’s
Going on?
Looks like a
bad config.
I’ll roll it back
App owner
Operator
App owner
Senior
Operator
Chat
SM
S
Chat
e-mail
Automated Operations - Remediation
18Think 2018 / DOC ID / Month XX, 2018 / © 2018 IBM Corporation
Notification
Incident
Chat
Run Books
Run
Rollback
That’s done.
Has the app
recovered?
Escalate
App owner
Operator
App owner
Senior
Operator
Chat
Resolved
SM
S
Chat
e-mail
Automated Operations - Validation
19Think 2018 / DOC ID / Month XX, 2018 / © 2018 IBM Corporation
Notification
Incident
Chat
Run Books
Run
Rollback
Run App
Check
All
good!
Escalate
App owner
Operator
App owner
Senior
Operator
Chat
SM
S
Chat
e-mail
Automated Operations – Efficiency Improvement
20Think 2018 / DOC ID / Month XX, 2018 / © 2018 IBM Corporation
Notification
Incident
Chat
Run Books
Run
Rollback
Run App
Check
Escalate
Define
I’ll automate
this for next time
App owner
Operator
App owner
Senior
Operator
Automated Operations – Complete Automation
21Think 2018 / DOC ID / Month XX, 2018 / © 2018 IBM Corporation
Incident Resolved
Run Books
Run rollback
Run App Check
Polling Question #3
How important is automation to your DevOps
methodology?
A. Highly critical and prioritized
B. High importance but not a current priority
C. Somewhat important but low priority
D. Low priority and unlikely to be adopted
IBM’s Approach to Cloud Event Management
23
üExpedite problem determination
üRestore service quickly
üBridge the gap between Dev and Ops
üAnalyze and improve operational health
Consolidated operational event and incident correlation, prioritization and resolution in the cloud
• 95% faster time to acknowledge incidents
• 75% less time performing event investigation
• 50% fewer repeating events
• 50% fewer generated incidents
IBM’s Operations Management track record sets the bar:
• 30% faster incident resolution
• 75% fewer events presented to operators
Check out IBM Cloud Event Management here: https://www.ibm.com/us-en/marketplace/cloud-event-management#product-header-top
Thank You!!!
24Group Name / DOC ID / Month XX, 2017 / © 2017 IBM Corporation

Building Ops Automation in DevOps

  • 1.
    Automating the Ops inDevOps James Moore Principal Offering Manager
  • 2.
    Charlene O’Hanlon Managing Editor,Devops.com Speakers James Moore Principal Offering Manager, IBM
  • 3.
    Digital transformation provides theopportunity to disrupt industries. However, it requires a transformation in how innovative software is delivered to succeed. Think 2018 / 8143 / March 22, 2018 / © 2018 IBM Corporation
  • 4.
    To remain competitive,businesses must continuously reinvent themselves by creating: – more compelling customer experiences – innovative new business models – fast and flexible operational processes Companies are leveraging new technologies to help them accelerate. Think 2018 / 8143 / March 22, 2018 / © 2018 IBM Corporation
  • 5.
    Today’s innovation pacedemands a new management paradigm #Agile #Cognitive #Collaborative #Flexible #DevOps #Dynamic #Efficient #SRE #Cloud #12FactorApps
  • 6.
    Ops Goal: Fewerproblem tickets, faster resolution Dev Goal: Faster time to market, reduce disruptions Shift LeftShift Right Dev Test Stage ProdStop DevOps accelerates value for the Business Business needs: • Agile Application Delivery • End user experience &reliability • Lean Operations Management Business needs: • Agile Application Delivery • End user experience &reliability • Lean Operations Management Dev Ops Dev Ops Mean time to Identify Mean time to Verify Mean time to Know Mean time to Fix Continuous Delivery requires self-sufficiency AND collaboration
  • 7.
    7 Management reality intoday’s IT landscapes Issues that go beyond Level 1 support often involve at least 3-4 people taking up an average of 5-7 total man hours Mean Time to Repair for application-related problems is often 3-6 hours User calls to applications support are the enterprise’s “heads up” for an application problem 35% of the time 58% of survey participants said their enterprise had between 6 and 40 different monitoring tools Managing today’s applications is difficult because symptoms may not be visible Increasingly complex hybrid multi cloud environments R a t e O f C h a n g e Breadth & Diversity of Workload, cross IT, Network, Storage, etc… D p e n d e n c y Increasingly Dynamic Applications with flexible architectures Source: IDC FutureScape: Worldwide Cloud 2016 Predictions - Mastering the Raw Material of Digital Transformation, IDC, Nov. 3, 2015 Gartner Predicts 2016: Cloud Computing to Drive Digital Business 12/8/15 G00292047
  • 8.
    Poll Question #1 Wheredoes your company’s operations management happen? A. All in a central IT Ops group B. All outside a central IT Ops group C. Mix across central IT and other groups D. External service provider
  • 9.
    Shift left operations Visibility,control and automation to manage apps and infrastructure anywhere
  • 10.
    Reactive Real-time analytics Shift leftto adaptively sense and respond to dynamic environments Proactive Predictive insights Business Agility Adaptive Cognitively enhanced workflow
  • 11.
    Structure and EfficiencyStarts with Automated Event & Incident Management: Dynamic lifecycles amidst constant change: • Application launches & changes • Virtualized infrastructure movement • Digital transformation • Organizational changes • Business initiatives • External: weather, disaster, cyber attacks Control of the source not required to detect and resolve incidents as they happen Incident Management: restore normal service operations as quickly as possible to ensure the best levels of service quality and availability.
  • 12.
    Analyze operational activityto learn and improve: Augmented and automated restoration activity fuels proactive improvement AI Ops: applying analysis and machine learning to operational activity to understand, improve and predict. Analyze Learn Improve Adjust Predict
  • 13.
    Polling Question #2 Whatbest describes your organization’s operational methodologies? A. Manual Only B. Mostly manual with some automated activity C. Prevalent amount of automated activity
  • 14.
    Route 2 Automated Operations– Event & Incident Correlation, Prioritization 14Think 2018 / DOC ID / Month XX, 2018 / © 2018 IBM Corporation My App is Unresponsive! Service x is impacting our Biz Critical App!Route 1 App owner Operator Assign Correlation, Consolidation, Prioritization
  • 15.
    Chat push Automated Operations –Notification 15Think 2018 / DOC ID / Month XX, 2018 / © 2018 IBM Corporation Notification Incident My App is Unresponsive! I’m getting a coffee.Route Policy / Calendar App owner Operator
  • 16.
    Chat SM S Chat e-mail Automated Operations –Escalation 16Think 2018 / DOC ID / Month XX, 2018 / © 2018 IBM Corporation Notification Incident Escalate Looks like Brock is on it Ouch. Olivia’s app is unresponsive Re-route Escalation Policy App owner Operator App owner Senior Operator
  • 17.
    Chat SM S Chat e-mail Automated Operations- Collaboration 17Think2018 / DOC ID / Month XX, 2018 / © 2018 IBM Corporation Notification Incident Chat Escalate What’s Going on? Looks like a bad config. I’ll roll it back App owner Operator App owner Senior Operator
  • 18.
    Chat SM S Chat e-mail Automated Operations -Remediation 18Think 2018 / DOC ID / Month XX, 2018 / © 2018 IBM Corporation Notification Incident Chat Run Books Run Rollback That’s done. Has the app recovered? Escalate App owner Operator App owner Senior Operator
  • 19.
    Chat Resolved SM S Chat e-mail Automated Operations -Validation 19Think 2018 / DOC ID / Month XX, 2018 / © 2018 IBM Corporation Notification Incident Chat Run Books Run Rollback Run App Check All good! Escalate App owner Operator App owner Senior Operator
  • 20.
    Chat SM S Chat e-mail Automated Operations –Efficiency Improvement 20Think 2018 / DOC ID / Month XX, 2018 / © 2018 IBM Corporation Notification Incident Chat Run Books Run Rollback Run App Check Escalate Define I’ll automate this for next time App owner Operator App owner Senior Operator
  • 21.
    Automated Operations –Complete Automation 21Think 2018 / DOC ID / Month XX, 2018 / © 2018 IBM Corporation Incident Resolved Run Books Run rollback Run App Check
  • 22.
    Polling Question #3 Howimportant is automation to your DevOps methodology? A. Highly critical and prioritized B. High importance but not a current priority C. Somewhat important but low priority D. Low priority and unlikely to be adopted
  • 23.
    IBM’s Approach toCloud Event Management 23 üExpedite problem determination üRestore service quickly üBridge the gap between Dev and Ops üAnalyze and improve operational health Consolidated operational event and incident correlation, prioritization and resolution in the cloud • 95% faster time to acknowledge incidents • 75% less time performing event investigation • 50% fewer repeating events • 50% fewer generated incidents IBM’s Operations Management track record sets the bar: • 30% faster incident resolution • 75% fewer events presented to operators Check out IBM Cloud Event Management here: https://www.ibm.com/us-en/marketplace/cloud-event-management#product-header-top
  • 24.
    Thank You!!! 24Group Name/ DOC ID / Month XX, 2017 / © 2017 IBM Corporation