SlideShare a Scribd company logo
1 of 9
Data Warehouse | July 13, 2013
Kevin Jesse
Data Warehouse Team | University IT
David Andruczyk
Web Services Team | University IT
Data Warehouse | July 13, 2013
Diagnostic monitoring refers to collecting
ALL (or as many as possible) known
system metrics at periodic intervals over
time.
The information given allows you to see
fluctuations in areas of the system that
may or may not impact operational use.
This information also allows for detailed
system metrics which can be used for
further tuning.
Data Warehouse | July 13, 2013
Operational monitoring refers to
collecting KEY system metrics at
periodic intervals over time.
The information given allows you to
refine that initial configuration to be
more tailored to your requirements.
The information also prepares you to
address new problems that might
appear on their own or following
upgrades, increases in volumes, or new
deployments.
Data Warehouse | July 13, 2013
Apache Server Status
OK 0.031554 seconds response time. Idle 29, busy 1, open slots 470
WARNING 0.029917 seconds response time. Idle 27, busy 353, open slots 120
Open Files
OK: Open files is 9028 of 819200
System Core Files
OK - 0 Core(s) found
Java JVM Threads
JMX OK - ThreadCount=352
JMX WARNING - ThreadCount=683
Total Number of Processes
PROCS CRITICAL: 770 processes
Data Warehouse | July 13, 2013
Apache HTTP/HTTPS
HTTP OK: HTTP/1.1 200 OK - 245 bytes in 0.032 second response time
System CPU
24 CPU, average load 3.2% < 50% : OK
System Disk Usage
DISK OK - free space: / 6717 MB (92% inode=99%)
System Memory
OK - 79444M free
System Interfaces
OK: host 'localhost', interfaces up: 7, down: 0, dormant: 0
Data Warehouse | July 13, 2013
Benefits
Helps identify key operational
metrics
Helps with holistic view of a system
Performing poorly vs. down
Gives additional insight in to system
Allows for quicker understanding of
a failure based on data
Proactive monitoring of services
which can forecast impending
system failure
Allows SME’s to have more visibility
Enables vendors access to additional
data for troubleshooting
Risks or Downside
Over use or redundant monitoring
Initial implementation can have a
high technical cost with SME
Overwhelming amount of data to
analyze
Alert overload from
misconfiguration
Two systems to maintain (diagnostic
and operational)
Data Warehouse | July 13, 2013
Trend or Prediction Analysis
Identification of Overall Performance Metrics
Misconfigurations in Larger System
Can Help to Identify and Pinpoint System Abuse
Early detection via warning signals that an abnormality is
occurring helps avoid the “shock/panic” factor
Early detection of abnormalities vs. “System Down”
Allow more time for analysis, assisting with scenario /what-if
planning
Insight into enhancements that would otherwise go un-noticed
Data Warehouse | July 13, 2013
Nagios
Cacti
AWStats
Logwatch
Up.Time
SCOM
Tripwire
Solar Winds
Zabbix
Munin
Groundworks
Big Brother
Nfsen
MRTG
Hyperic HQ
Tivoli
http://en.wikipedia.org/wiki/Comparison_of_network_monitoring_systems
Data Warehouse | July 13, 2013
Diagnostic monitoring is something that SME’s
specialize in along with their other skills.
Many SME’s prefer to add a monitoring station
as an individual component of a larger cluster
or platform system. This helps an
administrator focus on tuning vs. being
impacted by other alerts or misconfigurations
in the monitoring station.
Smaller systems with less overall metrics may
not warrant standing up a unique monitoring
station. These systems would benefit most
form a collaborative and centralize diagnostic
monitoring station.

More Related Content

What's hot

VMware: Nástroje na správu a efektívne riadenie fyzickej a virtuálnej infrašt...
VMware: Nástroje na správu a efektívne riadenie fyzickej a virtuálnej infrašt...VMware: Nástroje na správu a efektívne riadenie fyzickej a virtuálnej infrašt...
VMware: Nástroje na správu a efektívne riadenie fyzickej a virtuálnej infrašt...
ASBIS SK
 

What's hot (6)

Kontrolmatik - Operational intelligence
Kontrolmatik - Operational intelligenceKontrolmatik - Operational intelligence
Kontrolmatik - Operational intelligence
 
VMware: Nástroje na správu a efektívne riadenie fyzickej a virtuálnej infrašt...
VMware: Nástroje na správu a efektívne riadenie fyzickej a virtuálnej infrašt...VMware: Nástroje na správu a efektívne riadenie fyzickej a virtuálnej infrašt...
VMware: Nástroje na správu a efektívne riadenie fyzickej a virtuálnej infrašt...
 
NERC
NERCNERC
NERC
 
Replistor Resume
Replistor ResumeReplistor Resume
Replistor Resume
 
Compliance favors only the prepared plan
Compliance favors only the prepared planCompliance favors only the prepared plan
Compliance favors only the prepared plan
 
Integrating Medical Devices with Health System EHRs
Integrating Medical Devices with Health System EHRsIntegrating Medical Devices with Health System EHRs
Integrating Medical Devices with Health System EHRs
 

Similar to Diagnostic System Monitoring

7 deadly sins of backup and recovery
7 deadly sins of backup and recovery7 deadly sins of backup and recovery
7 deadly sins of backup and recovery
geekmodeboy
 
Scalable scheduling of updates in streaming data warehouses
Scalable scheduling of updates in streaming data warehousesScalable scheduling of updates in streaming data warehouses
Scalable scheduling of updates in streaming data warehouses
Finalyear Projects
 
Failure analysis buisness impact-backup-archive
Failure analysis buisness impact-backup-archiveFailure analysis buisness impact-backup-archive
Failure analysis buisness impact-backup-archive
Davin Abraham
 
Hrm database-management-java-project
Hrm database-management-java-projectHrm database-management-java-project
Hrm database-management-java-project
chetanmbhimewal
 
First Things First
First Things FirstFirst Things First
First Things First
MB Software & Consulting, Inc.
 
First Things First
First Things FirstFirst Things First
First Things First
MB Software & Consulting, Inc.
 
Top ten database_threats
Top ten database_threatsTop ten database_threats
Top ten database_threats
FITSFSd
 

Similar to Diagnostic System Monitoring (20)

7 deadly sins of backup and recovery
7 deadly sins of backup and recovery7 deadly sins of backup and recovery
7 deadly sins of backup and recovery
 
Attivio Active Security Technical Brief
Attivio Active Security Technical BriefAttivio Active Security Technical Brief
Attivio Active Security Technical Brief
 
Scalable scheduling of updates in streaming data warehouses
Scalable scheduling of updates in streaming data warehousesScalable scheduling of updates in streaming data warehouses
Scalable scheduling of updates in streaming data warehouses
 
REAL TIME PROJECTS IEEE BASED PROJECTS EMBEDDED SYSTEMS PAPER PUBLICATIONS M...
REAL TIME PROJECTS  IEEE BASED PROJECTS EMBEDDED SYSTEMS PAPER PUBLICATIONS M...REAL TIME PROJECTS  IEEE BASED PROJECTS EMBEDDED SYSTEMS PAPER PUBLICATIONS M...
REAL TIME PROJECTS IEEE BASED PROJECTS EMBEDDED SYSTEMS PAPER PUBLICATIONS M...
 
Failure analysis buisness impact-backup-archive
Failure analysis buisness impact-backup-archiveFailure analysis buisness impact-backup-archive
Failure analysis buisness impact-backup-archive
 
Data warehouse testing
Data warehouse testingData warehouse testing
Data warehouse testing
 
Web based Prison management system
Web based Prison management systemWeb based Prison management system
Web based Prison management system
 
A1802030104
A1802030104A1802030104
A1802030104
 
Quals
QualsQuals
Quals
 
Using metadata in filtered logs for prevention of database intrusion through ...
Using metadata in filtered logs for prevention of database intrusion through ...Using metadata in filtered logs for prevention of database intrusion through ...
Using metadata in filtered logs for prevention of database intrusion through ...
 
Data Warehouses & Deployment By Ankita dubey
Data Warehouses & Deployment By Ankita dubeyData Warehouses & Deployment By Ankita dubey
Data Warehouses & Deployment By Ankita dubey
 
Hrm database-management-java-project
Hrm database-management-java-projectHrm database-management-java-project
Hrm database-management-java-project
 
Ch24 system administration
Ch24 system administration Ch24 system administration
Ch24 system administration
 
Ch24
Ch24Ch24
Ch24
 
Alexis leon erp
Alexis leon erpAlexis leon erp
Alexis leon erp
 
First Things First
First Things FirstFirst Things First
First Things First
 
First Things First
First Things FirstFirst Things First
First Things First
 
Ch10 Conducting Audits
Ch10 Conducting AuditsCh10 Conducting Audits
Ch10 Conducting Audits
 
Differentiating data collection for cloud environment monitoring
Differentiating data collection for cloud environment monitoringDifferentiating data collection for cloud environment monitoring
Differentiating data collection for cloud environment monitoring
 
Top ten database_threats
Top ten database_threatsTop ten database_threats
Top ten database_threats
 

Recently uploaded

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Recently uploaded (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 

Diagnostic System Monitoring

  • 1. Data Warehouse | July 13, 2013 Kevin Jesse Data Warehouse Team | University IT David Andruczyk Web Services Team | University IT
  • 2. Data Warehouse | July 13, 2013 Diagnostic monitoring refers to collecting ALL (or as many as possible) known system metrics at periodic intervals over time. The information given allows you to see fluctuations in areas of the system that may or may not impact operational use. This information also allows for detailed system metrics which can be used for further tuning.
  • 3. Data Warehouse | July 13, 2013 Operational monitoring refers to collecting KEY system metrics at periodic intervals over time. The information given allows you to refine that initial configuration to be more tailored to your requirements. The information also prepares you to address new problems that might appear on their own or following upgrades, increases in volumes, or new deployments.
  • 4. Data Warehouse | July 13, 2013 Apache Server Status OK 0.031554 seconds response time. Idle 29, busy 1, open slots 470 WARNING 0.029917 seconds response time. Idle 27, busy 353, open slots 120 Open Files OK: Open files is 9028 of 819200 System Core Files OK - 0 Core(s) found Java JVM Threads JMX OK - ThreadCount=352 JMX WARNING - ThreadCount=683 Total Number of Processes PROCS CRITICAL: 770 processes
  • 5. Data Warehouse | July 13, 2013 Apache HTTP/HTTPS HTTP OK: HTTP/1.1 200 OK - 245 bytes in 0.032 second response time System CPU 24 CPU, average load 3.2% < 50% : OK System Disk Usage DISK OK - free space: / 6717 MB (92% inode=99%) System Memory OK - 79444M free System Interfaces OK: host 'localhost', interfaces up: 7, down: 0, dormant: 0
  • 6. Data Warehouse | July 13, 2013 Benefits Helps identify key operational metrics Helps with holistic view of a system Performing poorly vs. down Gives additional insight in to system Allows for quicker understanding of a failure based on data Proactive monitoring of services which can forecast impending system failure Allows SME’s to have more visibility Enables vendors access to additional data for troubleshooting Risks or Downside Over use or redundant monitoring Initial implementation can have a high technical cost with SME Overwhelming amount of data to analyze Alert overload from misconfiguration Two systems to maintain (diagnostic and operational)
  • 7. Data Warehouse | July 13, 2013 Trend or Prediction Analysis Identification of Overall Performance Metrics Misconfigurations in Larger System Can Help to Identify and Pinpoint System Abuse Early detection via warning signals that an abnormality is occurring helps avoid the “shock/panic” factor Early detection of abnormalities vs. “System Down” Allow more time for analysis, assisting with scenario /what-if planning Insight into enhancements that would otherwise go un-noticed
  • 8. Data Warehouse | July 13, 2013 Nagios Cacti AWStats Logwatch Up.Time SCOM Tripwire Solar Winds Zabbix Munin Groundworks Big Brother Nfsen MRTG Hyperic HQ Tivoli http://en.wikipedia.org/wiki/Comparison_of_network_monitoring_systems
  • 9. Data Warehouse | July 13, 2013 Diagnostic monitoring is something that SME’s specialize in along with their other skills. Many SME’s prefer to add a monitoring station as an individual component of a larger cluster or platform system. This helps an administrator focus on tuning vs. being impacted by other alerts or misconfigurations in the monitoring station. Smaller systems with less overall metrics may not warrant standing up a unique monitoring station. These systems would benefit most form a collaborative and centralize diagnostic monitoring station.