Downtime is stressful. Don't let a lack of communication add extra weight to your issues.
Join Scott Klein, co-founder and head of product for StatusPage, as he shares best practices for communicating effectively during an incident. He'll offer tips for increasing communication efficiency, reducing ticket load, and how to use StatusPage alongside the rest of your Atlassian toolset. Finally, Scott will discuss the expert ingredients of an effective incident response team: pre-incident preparation and post-incident followup with affected users.
Scott Klein, Head of Product - StatusPage, Atlassian
8. Agenda
The life of an incident
Discovering the missing ingredient
Incident communication best practices
StatusPage tour
Managing incidents the Atlassian way
9. THE LIFE OF AN INCIDENT: WHAT YOU SEE
Something’s
wrong!
10. THE LIFE OF AN INCIDENT: WHAT YOU SEE
Something’s
wrong!
Monitoring tool
alerts and escalates
11. THE LIFE OF AN INCIDENT: WHAT YOU SEE
Something’s
wrong!
Monitoring tool
alerts and escalates
On-call responds
and leverages team
12. THE LIFE OF AN INCIDENT: WHAT YOU SEE
Something’s
wrong!
Monitoring tool
alerts and escalates
On-call responds
and leverages team
Whole team swarms
to fix the problem
13. THE LIFE OF AN INCIDENT: WHAT YOU SEE
Something’s
wrong!
Monitoring tool
alerts and escalates
On-call responds
and leverages team
Whole team swarms
to fix the problem
Back to normal!
14. THE LIFE OF AN INCIDENT: WHAT YOU SEE
Something’s
wrong!
Monitoring tool
alerts and escalates
On-call responds
and leverages team
Whole team swarms
to fix the problem
Back to normal!
Document what
failed, remediate
35. Meanwhile, over in customer land
3
99+
57
Angry customers
Support tickets
Phone calls, live chat, tweets
36. Meanwhile, over in customer land
5
99+
57
Angry customers
Support tickets
Phone calls, live chat, tweets
37. Meanwhile, over in customer land
7
99+
57
Angry customers
Support tickets
Phone calls, live chat, tweets
38. Meanwhile, over in customer land
11
99+
57
Angry customers
Support tickets
Phone calls, live chat, tweets
39. Meanwhile, over in customer land
13
99+
57
Angry customers
Support tickets
Phone calls, live chat, tweets
40. Meanwhile, over in customer land
17
99+
57
Angry customers
Support tickets
Phone calls, live chat, tweets
41. Meanwhile, over in customer land
19
99+
57
Angry customers
Support tickets
Phone calls, live chat, tweets
42. Meanwhile, over in customer land
23
99+
57
Angry customers
Support tickets
Phone calls, live chat, tweets
43. Meanwhile, over in customer land
27
99+
57
Angry customers
Support tickets
Phone calls, live chat, tweets
44. Meanwhile, over in customer land
29
99+
57
Angry customers
Support tickets
Phone calls, live chat, tweets
45. Meanwhile, over in customer land
29
99+
57
Angry customers
Support tickets
Phone calls, live chat, tweets
46. Meanwhile, over in customer land
29
99+
57
Angry customers
Support tickets
Phone calls, live chat, tweets
47. THE LIFE OF AN INCIDENT: WHAT YOUR CUSTOMERS SEE
Something’s
wrong!
48. THE LIFE OF AN INCIDENT: WHAT YOUR CUSTOMERS SEE
Something’s
wrong!
What’s going on?!
Time to panic.
49. THE LIFE OF AN INCIDENT: WHAT YOUR CUSTOMERS SEE
Something’s
wrong!
What’s going on?!
Time to panic.
Emails
Support Tickets
50. THE LIFE OF AN INCIDENT: WHAT YOUR CUSTOMERS SEE
Something’s
wrong!
What’s going on?!
Time to panic.
Emails
Support Tickets
Phone Calls
Tweets
51. THE LIFE OF AN INCIDENT: WHAT YOUR CUSTOMERS SEE
Something’s
wrong!
What’s going on?!
Time to panic.
Emails
Support Tickets
Phone Calls
Tweets
Assume the
worst, be angry!
52. THE LIFE OF AN INCIDENT: WHAT YOUR CUSTOMERS SEE
Something’s
wrong!
What’s going on?!
Time to panic.
Emails
Support Tickets
Phone Calls
Tweets
Assume the
worst, be angry!
53. Agenda
Incident communication best practices
StatusPage tour
Managing incidents the Atlassian way
The life of an incident
Discovering the missing ingredient
54. THE LIFE OF AN INCIDENT: NOW WITH MORE WORDS
Something’s
wrong!
Monitoring tool
alerts and escalates
On-call responds
and leverages team
Whole team swarms
to fix the problem
Back to normal!
Document what
failed, remediate
55. THE LIFE OF AN INCIDENT: WHAT YOU SEE
Something’s
wrong!
Monitoring tool
alerts and escalates
On-call responds
and leverages team
Whole team swarms
to fix the problem
Back to normal!
Document what
failed, remediate
56. THE LIFE OF AN INCIDENT: WHAT YOU SEE
Something’s
wrong!
Monitoring tool
alerts and escalates
On-call responds
and leverages team
Whole team swarms
to fix the problem
Back to normal!
Document what
failed, remediate
57. THE LIFE OF AN INCIDENT: WHAT YOU SEE
Something’s
wrong!
Monitoring tool
alerts and escalates
On-call responds
and leverages team
Whole team swarms
to fix the problem
Back to normal!
Document what
failed, remediate
58. THE LIFE OF AN INCIDENT: WHAT YOU SEE
Something’s
wrong!
Monitoring tool
alerts and escalates
On-call responds
and leverages team
Whole team swarms
to fix the problem
Back to normal!
Document what
failed, remediate
59. THE LIFE OF AN INCIDENT: WHAT YOU SEE
Something’s
wrong!
Monitoring tool
alerts and escalates
On-call responds
and leverages team
Whole team swarms
to fix the problem
Back to normal!
Document what
failed, remediate
60. THE LIFE OF AN INCIDENT: WHAT YOU SEE
Something’s
wrong!
Monitoring tool
alerts and escalates
On-call responds
and leverages team
Whole team swarms
to fix the problem
Back to normal!
Document what
failed, remediate
62. Agenda
Incident communication best practices
Managing incidents the Atlassian way
The life of an incident
Discovering the missing ingredient
StatusPage tour
115. Agenda
Managing incidents the Atlassian way
The life of an incident
Discovering the missing ingredient
StatusPage tour
Incident communication best practices
117. Team
composition
Commander
Communicator
Set up communication channels
And make sure the right people are using them :)
Delegate ALL information/repair
Not involved in doing any of the actual work
Drive post-incident review and work
Make sure this incident doesn’t happen again
118. Team
composition
Communicator
Commander Post all customer-facing communications
StatusPage, Twitter, high-value customers, etc
Notify IC of new customers affected
Ensure commander knows scope of issues
Relay postmortem to customers
Build trust with customers through beautiful followup
126. Postmortem
golden rules
Say sorry
Don’t name names
Be personal
Details inspire
confidence
Shape usage
Please only use circles, rectangles, and
rounded rectangles to call attention to a
particular part of a screenshot, for the
sake of consistency.
Close the loop
127. Postmortem
golden rules
Say sorry
Don’t name names
Be personal
Details inspire
confidence
Shape usage
Please only use circles, rectangles, and
rounded rectangles to call attention to a
particular part of a screenshot, for the
sake of consistency.
Close the loop
129. Postmortem
golden rules
Say sorry
Don’t name names
Be personal
Details inspire
confidence
Shape usage
Please only use circles, rectangles, and
rounded rectangles to call attention to a
particular part of a screenshot, for the
sake of consistency.
Close the loop
130. Postmortem
golden rules
Say sorry
Don’t name names
Be personal
Details inspire
confidence
Shape usage
Please only use circles, rectangles, and
rounded rectangles to call attention to a
particular part of a screenshot, for the
sake of consistency.
Close the loop
131. Agenda
The life of an incident
Discovering the missing ingredient
StatusPage tour
Incident communication best practices
Managing incidents the Atlassian way
137. Detail every step
Reduce mental overhead in a
stressful situation
Communicate early
It’s okay if you don’t know what went
wrong yet—just show you’re on it
138. Detail every step
Reduce mental overhead in a
stressful situation
Communicate often
As you learn more about the incident,
provide frequent and detailed updates
139. SWARM WITH YOUR TEAM IN
Plugins to keep all
of the responders
in sync together
Reconstruct timelines
for postmortems and
forensic report
140. Detail every step
Reduce mental overhead in a
stressful situation
Use JIRA and HipChat together
to get to resolution, faster
Create a HipChat incident room directly from your JIRA
issue (which automatically tags watchers), and see
updates on the issue in the room