SlideShare a Scribd company logo
Greg Dostatni
Team Lead, Application Hosting
Splunk at the University of
Alberta
Copyright © 2015 Splunk Inc.
2
• At U of A since 2007
• Responsible for 10-person
team managing applications
and databases university-wide
• Splunk user since 2013
• I’ve eaten BBQ chicken
intestines on a stick. Yummy.
• splunk> take the sh out of IT
3
The University of Alberta
• Public research university based in
Edmonton and founded in 1908
• 39,000+ students and 18,000
employees
• 5 campuses and 18 faculties
• One of the top 100 universities
worldwide
4
IT at the University of Alberta
Central IT group for authentication,
wireless and core services
Independent IT groups for most
faculties and departments
University-wide initiative to
consolidate more of IT
Need to standardize IT operations
and tame diverse technology stacks
4
5
Application Hosting Objectives
• Centralize more of IT
• Build and manage shared
environments
• Develop custom services as
needed
• Roll out/upgrade applications
• Investigate performance
problems
IT
Libraries
LMS
Public website
+ CMS
Ticketing
Billing
systems
Research group
serversOther applications
and databases
6
Challenges after Restructuring IT
• More interdependencies
among teams
• Massive volume of data,
housed in silos
• “Running blind” – no
understanding of the data
• Time-consuming to gather
data for incidents
7
Splunk Timeline
• Funding to
rebuild Splunk
environment
• New hardware,
clustering with
dedicated
storage
• 400 data sources
• 133 sourcetypes
April 2015
• Management
notification of
syslog data loss
• Incidents
escalated
• Splunk in
production?
Sept. 2014
• Data loss
concerns from
restarting
Splunk
• Management
relying on
Splunk reports
• Splunk not in
production
March 2014
• Pilot deployed
• Splunk as syslog
target
• Log aggregation
test; no need
for backup
Sept. 2013
8
Splunk at the University of Alberta
Infrastructure
Applications
(mail, authentication)
Networking
and Security
(switches, IPS)
Application
Hosting
(apps, databases)
9
Example: Troubleshooting Authentication Systems
Before
• 12GB/day, 20 machines
• No aggregation
• Reactive issue response
based on user feedback
• Manual investigations
• Delay in getting data
After
• Centralized data
• ½ hour to troubleshoot
• Proactive alerts for issues
• Easy access to
infrastructure data
• Real-time reporting
10
Example: Performance Monitoring
Track and correlate request response times to gauge user satisfaction
11
Example: First Responders App
Dashboards for initial incident review
12
Example: Proactive Alerts
Trigger alerts on both the count and percentage of messages
13
Example: Executive Dashboards
14
Splunk Deployment Takeaways
Successes
• Visibility cutting through team
boundaries
• More advanced initial incident
investigation
• Openness - signed standard IT
agreement for access to Splunk
data
• Management loves reports
• Defusing situations with rapid
access to facts
Challenges
• Accepting syslog data directly
• Log standardization
• Figuring out what to look at in the
logs to understand “good” system
behavior
15
Aha! Moments
Transactions
• End-to-end monitoring
of 4M+ email messages
per day (greylisting 
spam filtering 
Google)
• Used transactions to
combine logs across
systems into single,
message-centric log
• Ability to easily search
for anomalies
Generic Alerts
• Created alert to catch
errors across systems in
real time
• Used existing alert and
removed host
specification to create
the generic alert
• Catches errors that were
not in Splunk at the
moment the alert was
created
10-second Query
• 10-second window =
~35,000 events
• Statistics to rank likely
events triggering issues
• New Splunk window to
analyze unusual messages
• Ability to examine small
slice of time in detail
while running statistics
over longer period of time
16
“Splunk allows us to erase these lines and
any analyst can see all the data from
anywhere and investigate a problem from
end to end.”
Thank you

More Related Content

What's hot

SplunkLive! Customer Presentation – Availity
SplunkLive! Customer Presentation – AvailitySplunkLive! Customer Presentation – Availity
SplunkLive! Customer Presentation – Availity
Splunk
 
SplunkLive! Customer Presentation - SSA
SplunkLive! Customer Presentation - SSASplunkLive! Customer Presentation - SSA
SplunkLive! Customer Presentation - SSA
Splunk
 
Splunk for ITOA Breakout Session
Splunk for ITOA Breakout SessionSplunk for ITOA Breakout Session
Splunk for ITOA Breakout Session
Splunk
 
Customer Presentation
Customer PresentationCustomer Presentation
Customer Presentation
Splunk
 
Splunk at Weill Cornell Medical College
Splunk at Weill Cornell Medical CollegeSplunk at Weill Cornell Medical College
Splunk at Weill Cornell Medical College
Splunk
 
SplunkLive! Customer Presentation - Satcom Direct
SplunkLive! Customer Presentation - Satcom DirectSplunkLive! Customer Presentation - Satcom Direct
SplunkLive! Customer Presentation - Satcom Direct
Splunk
 
SplunkLive! Customer Presentation – athenahealth
SplunkLive! Customer Presentation – athenahealthSplunkLive! Customer Presentation – athenahealth
SplunkLive! Customer Presentation – athenahealth
Splunk
 
SplunkLive! Customer Presentation - FINRA
SplunkLive! Customer Presentation - FINRASplunkLive! Customer Presentation - FINRA
SplunkLive! Customer Presentation - FINRA
Splunk
 
Splunk for IT Operations Breakout Session
Splunk for IT Operations Breakout SessionSplunk for IT Operations Breakout Session
Splunk for IT Operations Breakout Session
Splunk
 
Data Onboarding Breakout Session
Data Onboarding Breakout SessionData Onboarding Breakout Session
Data Onboarding Breakout Session
Splunk
 
Splunk at Aaron's Inc
Splunk at Aaron's IncSplunk at Aaron's Inc
Splunk at Aaron's Inc
Splunk
 
Splunk @ Adobe
Splunk @ AdobeSplunk @ Adobe
Splunk @ Adobe
Splunk
 
Customer Presentation
Customer PresentationCustomer Presentation
Customer Presentation
Splunk
 
SplunkLive! Warsaw 2016 - Cisco
SplunkLive! Warsaw 2016 - Cisco SplunkLive! Warsaw 2016 - Cisco
SplunkLive! Warsaw 2016 - Cisco
Splunk
 
SplunkLive! Customer Presentation - ExxonMobil
SplunkLive! Customer Presentation - ExxonMobilSplunkLive! Customer Presentation - ExxonMobil
SplunkLive! Customer Presentation - ExxonMobil
Splunk
 
Getting Started with Splunk Enterprise
Getting Started with Splunk EnterpriseGetting Started with Splunk Enterprise
Getting Started with Splunk Enterprise
Splunk
 
Splunk Discovery: Warsaw 2018 - Legacy SIEM to Splunk, How to Conquer Migrati...
Splunk Discovery: Warsaw 2018 - Legacy SIEM to Splunk, How to Conquer Migrati...Splunk Discovery: Warsaw 2018 - Legacy SIEM to Splunk, How to Conquer Migrati...
Splunk Discovery: Warsaw 2018 - Legacy SIEM to Splunk, How to Conquer Migrati...
Splunk
 
Travis Perkins: Building a 'Lean SOC' over 'Legacy SOC'
Travis Perkins: Building a 'Lean SOC' over 'Legacy SOC'Travis Perkins: Building a 'Lean SOC' over 'Legacy SOC'
Travis Perkins: Building a 'Lean SOC' over 'Legacy SOC'
Splunk
 
Customer Presentation - Financial Services Organization
Customer Presentation - Financial Services OrganizationCustomer Presentation - Financial Services Organization
Customer Presentation - Financial Services Organization
Splunk
 
Splunk and Cisco UCS Breakout Session
Splunk and Cisco UCS Breakout SessionSplunk and Cisco UCS Breakout Session
Splunk and Cisco UCS Breakout Session
Splunk
 

What's hot (20)

SplunkLive! Customer Presentation – Availity
SplunkLive! Customer Presentation – AvailitySplunkLive! Customer Presentation – Availity
SplunkLive! Customer Presentation – Availity
 
SplunkLive! Customer Presentation - SSA
SplunkLive! Customer Presentation - SSASplunkLive! Customer Presentation - SSA
SplunkLive! Customer Presentation - SSA
 
Splunk for ITOA Breakout Session
Splunk for ITOA Breakout SessionSplunk for ITOA Breakout Session
Splunk for ITOA Breakout Session
 
Customer Presentation
Customer PresentationCustomer Presentation
Customer Presentation
 
Splunk at Weill Cornell Medical College
Splunk at Weill Cornell Medical CollegeSplunk at Weill Cornell Medical College
Splunk at Weill Cornell Medical College
 
SplunkLive! Customer Presentation - Satcom Direct
SplunkLive! Customer Presentation - Satcom DirectSplunkLive! Customer Presentation - Satcom Direct
SplunkLive! Customer Presentation - Satcom Direct
 
SplunkLive! Customer Presentation – athenahealth
SplunkLive! Customer Presentation – athenahealthSplunkLive! Customer Presentation – athenahealth
SplunkLive! Customer Presentation – athenahealth
 
SplunkLive! Customer Presentation - FINRA
SplunkLive! Customer Presentation - FINRASplunkLive! Customer Presentation - FINRA
SplunkLive! Customer Presentation - FINRA
 
Splunk for IT Operations Breakout Session
Splunk for IT Operations Breakout SessionSplunk for IT Operations Breakout Session
Splunk for IT Operations Breakout Session
 
Data Onboarding Breakout Session
Data Onboarding Breakout SessionData Onboarding Breakout Session
Data Onboarding Breakout Session
 
Splunk at Aaron's Inc
Splunk at Aaron's IncSplunk at Aaron's Inc
Splunk at Aaron's Inc
 
Splunk @ Adobe
Splunk @ AdobeSplunk @ Adobe
Splunk @ Adobe
 
Customer Presentation
Customer PresentationCustomer Presentation
Customer Presentation
 
SplunkLive! Warsaw 2016 - Cisco
SplunkLive! Warsaw 2016 - Cisco SplunkLive! Warsaw 2016 - Cisco
SplunkLive! Warsaw 2016 - Cisco
 
SplunkLive! Customer Presentation - ExxonMobil
SplunkLive! Customer Presentation - ExxonMobilSplunkLive! Customer Presentation - ExxonMobil
SplunkLive! Customer Presentation - ExxonMobil
 
Getting Started with Splunk Enterprise
Getting Started with Splunk EnterpriseGetting Started with Splunk Enterprise
Getting Started with Splunk Enterprise
 
Splunk Discovery: Warsaw 2018 - Legacy SIEM to Splunk, How to Conquer Migrati...
Splunk Discovery: Warsaw 2018 - Legacy SIEM to Splunk, How to Conquer Migrati...Splunk Discovery: Warsaw 2018 - Legacy SIEM to Splunk, How to Conquer Migrati...
Splunk Discovery: Warsaw 2018 - Legacy SIEM to Splunk, How to Conquer Migrati...
 
Travis Perkins: Building a 'Lean SOC' over 'Legacy SOC'
Travis Perkins: Building a 'Lean SOC' over 'Legacy SOC'Travis Perkins: Building a 'Lean SOC' over 'Legacy SOC'
Travis Perkins: Building a 'Lean SOC' over 'Legacy SOC'
 
Customer Presentation - Financial Services Organization
Customer Presentation - Financial Services OrganizationCustomer Presentation - Financial Services Organization
Customer Presentation - Financial Services Organization
 
Splunk and Cisco UCS Breakout Session
Splunk and Cisco UCS Breakout SessionSplunk and Cisco UCS Breakout Session
Splunk and Cisco UCS Breakout Session
 

Similar to University of Alberta Customer Presentation

SplunkLive! Customer Presentation – UMCP
SplunkLive! Customer Presentation – UMCPSplunkLive! Customer Presentation – UMCP
SplunkLive! Customer Presentation – UMCP
Splunk
 
Lecture 1-big data engineering (Introduction).pdf
Lecture 1-big data engineering (Introduction).pdfLecture 1-big data engineering (Introduction).pdf
Lecture 1-big data engineering (Introduction).pdf
ahmedibrahimghnnam01
 
Transformative experience of implementing a next-generation library system - ...
Transformative experience of implementing a next-generation library system - ...Transformative experience of implementing a next-generation library system - ...
Transformative experience of implementing a next-generation library system - ...
CONUL Conference
 
Alsup gd13 final
Alsup gd13   finalAlsup gd13   final
Alsup gd13 final
Mike Alsup
 
Splunk Fundamentals: Investigations with Core Splunk - Splunk Tech Day
Splunk Fundamentals: Investigations with Core Splunk - Splunk Tech DaySplunk Fundamentals: Investigations with Core Splunk - Splunk Tech Day
Splunk Fundamentals: Investigations with Core Splunk - Splunk Tech Day
Zivaro Inc
 
SharePoint Saturday Helsinki 2019 - Collaboration Governance and Adoption Bes...
SharePoint Saturday Helsinki 2019 - Collaboration Governance and Adoption Bes...SharePoint Saturday Helsinki 2019 - Collaboration Governance and Adoption Bes...
SharePoint Saturday Helsinki 2019 - Collaboration Governance and Adoption Bes...
Jasper Oosterveld
 
5 Tips to Optimize SharePoint While Preparing for Hybrid
5 Tips to Optimize SharePoint While Preparing for Hybrid5 Tips to Optimize SharePoint While Preparing for Hybrid
5 Tips to Optimize SharePoint While Preparing for Hybrid
Adam Levithan
 
11.online library management system
11.online library management system11.online library management system
11.online library management system
Pvrtechnologies Nellore
 
Top 3 Mistakes when Building
Top 3 Mistakes when BuildingTop 3 Mistakes when Building
Top 3 Mistakes when Building
Talbott Crowell
 
Accelerating SDLC for Large Public Sector Enterprise Applications
Accelerating SDLC for Large Public Sector Enterprise ApplicationsAccelerating SDLC for Large Public Sector Enterprise Applications
Accelerating SDLC for Large Public Sector Enterprise Applications
Splunk
 
CHIME LEAD New York 2014 "Case Studies from the Field: Putting Cyber Security...
CHIME LEAD New York 2014 "Case Studies from the Field: Putting Cyber Security...CHIME LEAD New York 2014 "Case Studies from the Field: Putting Cyber Security...
CHIME LEAD New York 2014 "Case Studies from the Field: Putting Cyber Security...
Health IT Conference – iHT2
 
EMC InfoArchive Overview: Offered by Sigma
EMC InfoArchive Overview: Offered by SigmaEMC InfoArchive Overview: Offered by Sigma
EMC InfoArchive Overview: Offered by Sigma
Jonathan Simpson
 
RDM@Edinburgh
RDM@EdinburghRDM@Edinburgh
RDM@Edinburgh
RDM@EdinburghRDM@Edinburgh
Measuring impact
Measuring impactMeasuring impact
Measuring impact
Stephen Emmott
 
Webinar: Office 365 for Beginners
Webinar: Office 365 for BeginnersWebinar: Office 365 for Beginners
Webinar: Office 365 for Beginners
Cliff Ashcroft
 
Ecm implementation planning_workshop_hospital_sample
Ecm implementation planning_workshop_hospital_sampleEcm implementation planning_workshop_hospital_sample
Ecm implementation planning_workshop_hospital_sample
Christopher Wynder
 
Cloud Computing for Not-for-Profits
Cloud Computing for Not-for-ProfitsCloud Computing for Not-for-Profits
Cloud Computing for Not-for-Profits
rgtechnologies
 
(ATS6-APP05) Deploying Contur ELN to large organizations
(ATS6-APP05) Deploying Contur ELN to large organizations(ATS6-APP05) Deploying Contur ELN to large organizations
(ATS6-APP05) Deploying Contur ELN to large organizations
BIOVIA
 
How Elns can galvanise research data management.
How Elns can galvanise research data management. How Elns can galvanise research data management.
How Elns can galvanise research data management.
rmacneil88
 

Similar to University of Alberta Customer Presentation (20)

SplunkLive! Customer Presentation – UMCP
SplunkLive! Customer Presentation – UMCPSplunkLive! Customer Presentation – UMCP
SplunkLive! Customer Presentation – UMCP
 
Lecture 1-big data engineering (Introduction).pdf
Lecture 1-big data engineering (Introduction).pdfLecture 1-big data engineering (Introduction).pdf
Lecture 1-big data engineering (Introduction).pdf
 
Transformative experience of implementing a next-generation library system - ...
Transformative experience of implementing a next-generation library system - ...Transformative experience of implementing a next-generation library system - ...
Transformative experience of implementing a next-generation library system - ...
 
Alsup gd13 final
Alsup gd13   finalAlsup gd13   final
Alsup gd13 final
 
Splunk Fundamentals: Investigations with Core Splunk - Splunk Tech Day
Splunk Fundamentals: Investigations with Core Splunk - Splunk Tech DaySplunk Fundamentals: Investigations with Core Splunk - Splunk Tech Day
Splunk Fundamentals: Investigations with Core Splunk - Splunk Tech Day
 
SharePoint Saturday Helsinki 2019 - Collaboration Governance and Adoption Bes...
SharePoint Saturday Helsinki 2019 - Collaboration Governance and Adoption Bes...SharePoint Saturday Helsinki 2019 - Collaboration Governance and Adoption Bes...
SharePoint Saturday Helsinki 2019 - Collaboration Governance and Adoption Bes...
 
5 Tips to Optimize SharePoint While Preparing for Hybrid
5 Tips to Optimize SharePoint While Preparing for Hybrid5 Tips to Optimize SharePoint While Preparing for Hybrid
5 Tips to Optimize SharePoint While Preparing for Hybrid
 
11.online library management system
11.online library management system11.online library management system
11.online library management system
 
Top 3 Mistakes when Building
Top 3 Mistakes when BuildingTop 3 Mistakes when Building
Top 3 Mistakes when Building
 
Accelerating SDLC for Large Public Sector Enterprise Applications
Accelerating SDLC for Large Public Sector Enterprise ApplicationsAccelerating SDLC for Large Public Sector Enterprise Applications
Accelerating SDLC for Large Public Sector Enterprise Applications
 
CHIME LEAD New York 2014 "Case Studies from the Field: Putting Cyber Security...
CHIME LEAD New York 2014 "Case Studies from the Field: Putting Cyber Security...CHIME LEAD New York 2014 "Case Studies from the Field: Putting Cyber Security...
CHIME LEAD New York 2014 "Case Studies from the Field: Putting Cyber Security...
 
EMC InfoArchive Overview: Offered by Sigma
EMC InfoArchive Overview: Offered by SigmaEMC InfoArchive Overview: Offered by Sigma
EMC InfoArchive Overview: Offered by Sigma
 
RDM@Edinburgh
RDM@EdinburghRDM@Edinburgh
RDM@Edinburgh
 
RDM@Edinburgh
RDM@EdinburghRDM@Edinburgh
RDM@Edinburgh
 
Measuring impact
Measuring impactMeasuring impact
Measuring impact
 
Webinar: Office 365 for Beginners
Webinar: Office 365 for BeginnersWebinar: Office 365 for Beginners
Webinar: Office 365 for Beginners
 
Ecm implementation planning_workshop_hospital_sample
Ecm implementation planning_workshop_hospital_sampleEcm implementation planning_workshop_hospital_sample
Ecm implementation planning_workshop_hospital_sample
 
Cloud Computing for Not-for-Profits
Cloud Computing for Not-for-ProfitsCloud Computing for Not-for-Profits
Cloud Computing for Not-for-Profits
 
(ATS6-APP05) Deploying Contur ELN to large organizations
(ATS6-APP05) Deploying Contur ELN to large organizations(ATS6-APP05) Deploying Contur ELN to large organizations
(ATS6-APP05) Deploying Contur ELN to large organizations
 
How Elns can galvanise research data management.
How Elns can galvanise research data management. How Elns can galvanise research data management.
How Elns can galvanise research data management.
 

More from Splunk

.conf Go 2023 - Data analysis as a routine
.conf Go 2023 - Data analysis as a routine.conf Go 2023 - Data analysis as a routine
.conf Go 2023 - Data analysis as a routine
Splunk
 
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
Splunk
 
.conf Go 2023 - Navegando la normativa SOX (Telefónica)
.conf Go 2023 - Navegando la normativa SOX (Telefónica).conf Go 2023 - Navegando la normativa SOX (Telefónica)
.conf Go 2023 - Navegando la normativa SOX (Telefónica)
Splunk
 
.conf Go 2023 - Raiffeisen Bank International
.conf Go 2023 - Raiffeisen Bank International.conf Go 2023 - Raiffeisen Bank International
.conf Go 2023 - Raiffeisen Bank International
Splunk
 
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett .conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
Splunk
 
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär).conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
Splunk
 
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu....conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
Splunk
 
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever....conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
Splunk
 
.conf go 2023 - De NOC a CSIRT (Cellnex)
.conf go 2023 - De NOC a CSIRT (Cellnex).conf go 2023 - De NOC a CSIRT (Cellnex)
.conf go 2023 - De NOC a CSIRT (Cellnex)
Splunk
 
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)conf go 2023 - El camino hacia la ciberseguridad (ABANCA)
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)
Splunk
 
Splunk - BMW connects business and IT with data driven operations SRE and O11y
Splunk - BMW connects business and IT with data driven operations SRE and O11ySplunk - BMW connects business and IT with data driven operations SRE and O11y
Splunk - BMW connects business and IT with data driven operations SRE and O11y
Splunk
 
Splunk x Freenet - .conf Go Köln
Splunk x Freenet - .conf Go KölnSplunk x Freenet - .conf Go Köln
Splunk x Freenet - .conf Go Köln
Splunk
 
Splunk Security Session - .conf Go Köln
Splunk Security Session - .conf Go KölnSplunk Security Session - .conf Go Köln
Splunk Security Session - .conf Go Köln
Splunk
 
Data foundations building success, at city scale – Imperial College London
 Data foundations building success, at city scale – Imperial College London Data foundations building success, at city scale – Imperial College London
Data foundations building success, at city scale – Imperial College London
Splunk
 
Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...
Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...
Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...
Splunk
 
SOC, Amore Mio! | Security Webinar
SOC, Amore Mio! | Security WebinarSOC, Amore Mio! | Security Webinar
SOC, Amore Mio! | Security Webinar
Splunk
 
.conf Go 2022 - Observability Session
.conf Go 2022 - Observability Session.conf Go 2022 - Observability Session
.conf Go 2022 - Observability Session
Splunk
 
.conf Go Zurich 2022 - Keynote
.conf Go Zurich 2022 - Keynote.conf Go Zurich 2022 - Keynote
.conf Go Zurich 2022 - Keynote
Splunk
 
.conf Go Zurich 2022 - Platform Session
.conf Go Zurich 2022 - Platform Session.conf Go Zurich 2022 - Platform Session
.conf Go Zurich 2022 - Platform Session
Splunk
 
.conf Go Zurich 2022 - Security Session
.conf Go Zurich 2022 - Security Session.conf Go Zurich 2022 - Security Session
.conf Go Zurich 2022 - Security Session
Splunk
 

More from Splunk (20)

.conf Go 2023 - Data analysis as a routine
.conf Go 2023 - Data analysis as a routine.conf Go 2023 - Data analysis as a routine
.conf Go 2023 - Data analysis as a routine
 
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
 
.conf Go 2023 - Navegando la normativa SOX (Telefónica)
.conf Go 2023 - Navegando la normativa SOX (Telefónica).conf Go 2023 - Navegando la normativa SOX (Telefónica)
.conf Go 2023 - Navegando la normativa SOX (Telefónica)
 
.conf Go 2023 - Raiffeisen Bank International
.conf Go 2023 - Raiffeisen Bank International.conf Go 2023 - Raiffeisen Bank International
.conf Go 2023 - Raiffeisen Bank International
 
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett .conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
 
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär).conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
 
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu....conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
 
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever....conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
 
.conf go 2023 - De NOC a CSIRT (Cellnex)
.conf go 2023 - De NOC a CSIRT (Cellnex).conf go 2023 - De NOC a CSIRT (Cellnex)
.conf go 2023 - De NOC a CSIRT (Cellnex)
 
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)conf go 2023 - El camino hacia la ciberseguridad (ABANCA)
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)
 
Splunk - BMW connects business and IT with data driven operations SRE and O11y
Splunk - BMW connects business and IT with data driven operations SRE and O11ySplunk - BMW connects business and IT with data driven operations SRE and O11y
Splunk - BMW connects business and IT with data driven operations SRE and O11y
 
Splunk x Freenet - .conf Go Köln
Splunk x Freenet - .conf Go KölnSplunk x Freenet - .conf Go Köln
Splunk x Freenet - .conf Go Köln
 
Splunk Security Session - .conf Go Köln
Splunk Security Session - .conf Go KölnSplunk Security Session - .conf Go Köln
Splunk Security Session - .conf Go Köln
 
Data foundations building success, at city scale – Imperial College London
 Data foundations building success, at city scale – Imperial College London Data foundations building success, at city scale – Imperial College London
Data foundations building success, at city scale – Imperial College London
 
Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...
Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...
Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...
 
SOC, Amore Mio! | Security Webinar
SOC, Amore Mio! | Security WebinarSOC, Amore Mio! | Security Webinar
SOC, Amore Mio! | Security Webinar
 
.conf Go 2022 - Observability Session
.conf Go 2022 - Observability Session.conf Go 2022 - Observability Session
.conf Go 2022 - Observability Session
 
.conf Go Zurich 2022 - Keynote
.conf Go Zurich 2022 - Keynote.conf Go Zurich 2022 - Keynote
.conf Go Zurich 2022 - Keynote
 
.conf Go Zurich 2022 - Platform Session
.conf Go Zurich 2022 - Platform Session.conf Go Zurich 2022 - Platform Session
.conf Go Zurich 2022 - Platform Session
 
.conf Go Zurich 2022 - Security Session
.conf Go Zurich 2022 - Security Session.conf Go Zurich 2022 - Security Session
.conf Go Zurich 2022 - Security Session
 

Recently uploaded

Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Wask
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Jeffrey Haguewood
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
Wouter Lemaire
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
Dinusha Kumarasiri
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
Hiike
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
alexjohnson7307
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Trusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process MiningTrusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process Mining
LucaBarbaro3
 
dbms calicut university B. sc Cs 4th sem.pdf
dbms  calicut university B. sc Cs 4th sem.pdfdbms  calicut university B. sc Cs 4th sem.pdf
dbms calicut university B. sc Cs 4th sem.pdf
Shinana2
 

Recently uploaded (20)

Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Trusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process MiningTrusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process Mining
 
dbms calicut university B. sc Cs 4th sem.pdf
dbms  calicut university B. sc Cs 4th sem.pdfdbms  calicut university B. sc Cs 4th sem.pdf
dbms calicut university B. sc Cs 4th sem.pdf
 

University of Alberta Customer Presentation

  • 1. Greg Dostatni Team Lead, Application Hosting Splunk at the University of Alberta Copyright © 2015 Splunk Inc.
  • 2. 2 • At U of A since 2007 • Responsible for 10-person team managing applications and databases university-wide • Splunk user since 2013 • I’ve eaten BBQ chicken intestines on a stick. Yummy. • splunk> take the sh out of IT
  • 3. 3 The University of Alberta • Public research university based in Edmonton and founded in 1908 • 39,000+ students and 18,000 employees • 5 campuses and 18 faculties • One of the top 100 universities worldwide
  • 4. 4 IT at the University of Alberta Central IT group for authentication, wireless and core services Independent IT groups for most faculties and departments University-wide initiative to consolidate more of IT Need to standardize IT operations and tame diverse technology stacks 4
  • 5. 5 Application Hosting Objectives • Centralize more of IT • Build and manage shared environments • Develop custom services as needed • Roll out/upgrade applications • Investigate performance problems IT Libraries LMS Public website + CMS Ticketing Billing systems Research group serversOther applications and databases
  • 6. 6 Challenges after Restructuring IT • More interdependencies among teams • Massive volume of data, housed in silos • “Running blind” – no understanding of the data • Time-consuming to gather data for incidents
  • 7. 7 Splunk Timeline • Funding to rebuild Splunk environment • New hardware, clustering with dedicated storage • 400 data sources • 133 sourcetypes April 2015 • Management notification of syslog data loss • Incidents escalated • Splunk in production? Sept. 2014 • Data loss concerns from restarting Splunk • Management relying on Splunk reports • Splunk not in production March 2014 • Pilot deployed • Splunk as syslog target • Log aggregation test; no need for backup Sept. 2013
  • 8. 8 Splunk at the University of Alberta Infrastructure Applications (mail, authentication) Networking and Security (switches, IPS) Application Hosting (apps, databases)
  • 9. 9 Example: Troubleshooting Authentication Systems Before • 12GB/day, 20 machines • No aggregation • Reactive issue response based on user feedback • Manual investigations • Delay in getting data After • Centralized data • ½ hour to troubleshoot • Proactive alerts for issues • Easy access to infrastructure data • Real-time reporting
  • 10. 10 Example: Performance Monitoring Track and correlate request response times to gauge user satisfaction
  • 11. 11 Example: First Responders App Dashboards for initial incident review
  • 12. 12 Example: Proactive Alerts Trigger alerts on both the count and percentage of messages
  • 14. 14 Splunk Deployment Takeaways Successes • Visibility cutting through team boundaries • More advanced initial incident investigation • Openness - signed standard IT agreement for access to Splunk data • Management loves reports • Defusing situations with rapid access to facts Challenges • Accepting syslog data directly • Log standardization • Figuring out what to look at in the logs to understand “good” system behavior
  • 15. 15 Aha! Moments Transactions • End-to-end monitoring of 4M+ email messages per day (greylisting  spam filtering  Google) • Used transactions to combine logs across systems into single, message-centric log • Ability to easily search for anomalies Generic Alerts • Created alert to catch errors across systems in real time • Used existing alert and removed host specification to create the generic alert • Catches errors that were not in Splunk at the moment the alert was created 10-second Query • 10-second window = ~35,000 events • Statistics to rank likely events triggering issues • New Splunk window to analyze unusual messages • Ability to examine small slice of time in detail while running statistics over longer period of time
  • 16. 16 “Splunk allows us to erase these lines and any analyst can see all the data from anywhere and investigate a problem from end to end.”

Editor's Notes

  1. Good morning everyone and welcome to SplunkLive Calgary Thanks so much for having me at your SplunkLive today
  2. Graduated from CS at University of Alberta in 2002, worked for various research projects and did some contract development for a few years. Joined university again in 2007.
  3. University of Alberta – friendly neighbor about 300 KM that away (for main campus) 18 faculties includes some big ones like Science, Medicine and Engineering The University ranking changes every year as well as different rankings get published. This seemed like a safe enough statement to make without going into half a page of small print.
  4. Our starting point about 4 years ago. Over the last 4 years or so we’ve been consolidating about 300+ individual groups IT into central department. Originally this was envisioned as a 10 year plan, so we still have a ways to go. At this point we are supporting a lot of different institutional needs, lots of different technologies and life is very exciting.
  5. Application Hosting. We manage the applications and databases for a lot of clients across campus. There are some big applications that are managed by others (LMS, Peoplesoft), but we make up for it with the number of different applications we support. I’ve stopped being amazed at the number of needs an institution of this size has. We have an application for printing out labels to put on file folders, tracking project time, billing and invoicing, databases supporting libraries, ticketing systems, wikis, departmental pages. Typically there is a piece of software behind a lot of business processes and all of that needs to be monitored, patched, upgraded. As our consolidation effort continues we will be using Splunk to look into how an application is used in order to determine how it could be consolidated with other applications of similar function. There is an amazing amount of information about usage patterns, what gets accessed and how often and who does the accessing.
  6. We’ve re-organized ourselves along functional lines. OS Support, Networking, Application Hosting, etc. What that means is that some investigations spanning multiple teams become very time consuming and expensive (two people looking at logs). Some of that is unavoidable, and even desirable, but for a large number of errors we’re just missing that one piece of crucial information that “solves” the problem. That could be a log line from the VM host indicating physical hardware problem, log from authentication system detailing why connection was rejected, etc. etc. Splunk allows us access to that information. Here is where I need to bring up a big warning flag. Having access to the logs does not mean you can understand the logs. There are some errors where the team running a system is required to correctly interpret the logs, but in general having more eyes is a good thing. Some of the expertise can be developed over time, some more through developing dashboards and applications within Splunk.
  7. I get a kick of these timelines, so I wanted to add our own. I’ve seen a few at a Splunk conference and they typically go something similar. An organization gets Splunk in limited capacity, something happens (systems get hacked, phishing attack, etc) and next year they are running 10x the license. We’ve had a bit of a different experience, where the Splunk importance snuck up on us. In September 2013 we’ve deployed Splunk as a pilot. Some of the conversations at that point were along the lines of “backups are not important as all the systems keep their own logs” Splunk is just a view into the logs, there is little to no new information contained. Let’s just send logs to see what we can make out of it. By March we were becoming concerned every time we needed to restart Splunk, since that meant data loss. Our installation was configured as a syslog target for networking devices and IMS logs so if Splunk was not there to receive the events, they went nowhere. By September (1 year later) we needed to notify management of Splunk outages because of the data loss. In April / May this year we are rebuilding the environment in clustered configuration with dedicated syslog servers. I don’t know if I can think of any one moment where Splunk suddenly became critical production environment, but it definitely is one now.
  8. What follows is a gross generalization. I obviously understand the challenges within my team the best, so take this with a grain of salt. Although other groups do use and send logs to Splunk, the three main groups are Infrastructure Applications, Networking and Application Hosting. It’s interesting that we all have different needs and different use cases. Infrastructure has a small number of highly critical applications that they need to understand end to end (mail, central authentication, etc). Networking has a few data sources that are fairly similar (switches, firewalls, etc). Security looks at a few different data sources like VPN and IPS as well as authentication logs from a number of systems. Application Hosting currently has a relatively few number of sources, but a lot of them are different and unique. There is probably 5 different ways apache is configured to log access requests, we run every major database type and version. Postgresql, MySQL, MSSQL, Oracle. In addition we support applications themselves or interact with software vendors on behalf of clients. On my last monthly report there were 374 different relational databases in environments supported by my team.
  9. An example from our Infrastructure Applications team. Our authentication system (approximately 180,000 accounts) generates around 12 GB of logs / day. Logs were stored on each individual node of a cluster in a text file. Trying to find logs related to a specific login id required signing on onto the 20 of so systems and using “grep” to identify individual log lines. That was not a quick task and required generating IO load on the servers. Doing anything more advanced than that was nearly impossible. After. We have summary indexes and reporting indexes on this data to quickly answer specific questions we know will be asked. We can correlate with data from other systems and alert in real-time for specific events. Users are no longer our main issue detection method.
  10. This is something we’ve used with great effect in my team for a few performance investigations so far. We break down the web traffic by percentiles and return size. It allows us to pinpoint problems (in some cases) as well as provide an instant report to the client how their application is really performing. This query is both complex enough to be useful, yet simple enough that I can explain it to non technical clients.
  11. This is a new initiative coming out of my team. It is called a First Responders App (even though it is currently a dashboard, I know it will end up an app eventually). First Responders is meant as the place to go at a start of an incident. It’s meant to put a lot of infrastructure information at analyst’s fingertips and it spans information from all of the operational teams. It allows an analyst to verify backup status, check logs, check tickets / change calendar, check monitoring system, who has logged in into the system last, etc. As we’re still rolling it out we do not have all the information we would perhaps wish, but the reception so far has been very positive.
  12. Also part of First Responders we have a holistic look at the logs. Not only do we look at the logs from the system, we also look at logs about the system. If our IPS detects activity against the host, that will appear in the window at top left. Same if out authentication suddenly starts throwing a lot of errors or messages about the host. Sparklines are great for very quickly identifying patterns and seeing if something is unusual. Lastly we have a query I’ve been playing around for a some months. This query tries to generate a statistical baseline of events from a system, and then compare last full hour against that baseline to highlight issues. In this screenshot I relaxed my alerting thresholds, that’s why the z scores are so low, it does illustrate what the output looks like. I’m working a next generation of this type of query that will log all deviations from the entire environment every hour. Wish me luck.
  13. This was probably our management’s first introduction to splunk, monthly reports for critical systems. We are still tweaking what goes on the reports, and will continue to do so. This particular dashboard shows our Adobe Connect, including a histogram of meeting sizes, which did include classes in the 240 – 245 participant range as well as an AppDex score. Eventually I’d like to standardize all application monthly reports and have them send automatically to each client / department.
  14. Things that worked and things that did not work so well. The main things that were successes: You can really defuse a situation by being able to rapidly provide facts. If you’re able to provide a list of users who accessed a specific file in the first 15 minutes of a breach investigation, that really brings down the stress level of everyone involved and situation rapidly de-escalates. Similarly for performance investigations. Everyone in the system can see all logs, we try to have the system as open as possible to all IT. Challenges: There is so many different way of configuring logging. That makes getting consistent reports a challenge. Knowing what is “normal”. Having the ability to rapidly generate graphs of values and have data that goes back a few months (at least) is highly beneficial. Syslog. Where possible use universal forwarders, where not possible have a syslog collector.
  15. Some of the “AHA moments” Using transactions to create message centric logs. It was nothing short of magical, especially when compared to the good old “grep” command across multiple systems. Generic alerts. Ability to create alerts that work for systems that are not in the system yet. The ability to look at the entire environment as a single event stream is incredibly powerful. An extension of the previous. While investigating a hiccup of some sort I performed a time constrained query across the entire environment. I wanted to see whether the error was limited to this system, or whether it appeared anywhere else. Using two windows I was able to run simple queries on specific messages and determine “normal event” or “not”. It was an amazing, and humbling, look at our environment. So many things happened within that 10 second window.