Understanding costly infrastructure problems and how to fix them

Learn how to keep tabs on business-critical processes and services, and how to manage physical and virtual setups proactively.

Published in: Technology

  1. Understanding costly infrastructure problems and how to fix them
  2. Agenda
     1. Manage compute and storage resources in server and virtual infrastructure
     2. Keep tabs on business-critical processes and services
     3. Tackling hardware problems contributing to server downtime
     4. Proactive resource planning to manage physical and virtual setups
  3. Server infrastructure: key parameters to focus on
     • Availability
     • CPU
     • Memory
     • Network
     • Processes
     • Hardware
     • Disk
  4. Supported servers
  5. Supported virtual servers
  6. Discovery & mapping
     1. Manage compute and storage resources in server & virtual infrastructure
     Today's example: Windows server & VMware ESXi/Hyper-V
  7. Windows server: CPU utilization
     • Total CPU utilization of server
     • Breakdown of CPU utilization
  8. Windows server CPU core details
     • User time
     • System time
     • I/O wait time
     • Idle time
     • CPU utilization
     • Interrupts/sec
  9. CPU utilization history
     • Hour of the day
     • Day of the week
     • Statistical report
     • Heat chart
  10. Windows server: Memory utilization
      • Total memory utilization of server
      • Breakdown of memory utilization
  11. Memory utilization statistical analysis
      Daily, weekly, monthly:
      • Statistical reports
      • Average values
      • Min-max average values
  12. Windows server: Disk utilization
      • Total disk utilization of server
      • Breakdown of disk utilization
  13. Disk I/O
      • Transfers/sec
      • Writes/sec
      • Reads/sec
      • Busy time
      • Avg queue length
  14. Network details
      • Input traffic
      • Output traffic
      • Network speed
      • Errors
  15. VMware ESXi server overview
  16. ESXi: CPU cores & Memory
  17. ESXi: Datastore, Network & Disk I/O
      • ESX datastore capacity and utilization
      • Network utilization in the hypervisor
      • Disk I/O utilization for data read/write
  18. Hyper-V server overview
  19. Hyper-V system services
  20. Hyper-V virtual server details
  21. Discovery & mapping
      2. Keep tabs on business-critical processes and services
  22. Process details
      Add or edit server processes within the console and track details such as:
      • Number of instances of the configured process
      • CPU usage of individual processes
      • Memory usage of individual processes
  23. Process split-up
      Commands and arguments of individual processes, broken out along with their CPU and memory consumption
  24. Process templates
  25. Service details
      Start/Restart/Stop/Manage/Unmanage services running on your server
  26. Service templates
  27. Windows event logs
      Analyze server events using the event logs captured for each individual server.
  28. Windows event log rules
      • System
      • File replication service
      • DNS server
      • Directory service
      • Application
      • Security
  29. Windows event log rules
      Analyze a specific event log by creating and applying event log rules.
  30. Configuring guest OS for VMs
      Configure the guest OS for virtual machines to monitor processes, services, and event logs.
  31. Scheduled tasks in Windows
      • Last run time
      • Last run result
      • Next run time
      • Missed runs
  32. Discovery & mapping
      3. Tackling hardware problems contributing to server downtime
      Requirements:
      • Dell or HP machine
      • Dell: OpenManage Server Administrator
      • HP: Systems Insight Manager (SIM v6.2 or above)
  33. Hardware performance details
  34. Discovery & mapping
      4. Resource planning to manage physical and virtual setups
      • Fault management to resolve red flags
      • Automated remedial actions for known server and virtual server issues
      • Save troubleshooting time
      • Prevent loss of service through automated resource planning
      • Predict future growth and utilization
      • Plan capacity and resource allocation
  35. Threshold configuration
  36. Windows service actions
  37. Virtual machine actions
  38. Execute script action
  39. All actions (notification actions and corrective actions)
      • Email
      • SMS
      • Start/Stop/Restart VMs
      • Start/Stop/Restart Windows Service
      • Execute corrective programs: Batch/Shell/Perl scripts
      • Java heap/thread dump, garbage collection
      • Execute MBeans
      • Start/Stop/Restart Docker container instances
      • Start/Stop/Restart EC2 instance
      • Log a ticket to: ServiceDesk Plus, ServiceDesk Plus MSP, ServiceNow
  40. Capacity planning reports for servers
      • Identify undersized, oversized, and idle servers based on memory and disk utilization.
      • Decide whether it's time to upgrade your infrastructure capacity or whether a simple load redistribution will suffice.
  41. Similarly, for virtual servers
  42. Forecast reports
      Machine-learning-powered predictive analysis based on historical growth and utilization trends
  43. Contact: eval-apm@manageengine.com
      Online demo: demo.appmanager.com
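
The capacity planning slide describes sorting servers into undersized, oversized, and idle groups based on memory and disk utilization. A minimal Python sketch of that idea follows. The cut-off percentages, the `ServerUsage` type, the `classify` function, and the sample server names are all illustrative assumptions; none of them come from the slides or from the product's actual reports.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class ServerUsage:
    """Hypothetical container for one server's utilization samples."""
    name: str
    memory_pct: list  # sampled memory utilization, 0-100
    disk_pct: list    # sampled disk utilization, 0-100


# Assumed cut-offs for illustration only; tune them to your environment.
IDLE_BELOW = 5.0
OVERSIZED_BELOW = 30.0
UNDERSIZED_ABOVE = 80.0


def classify(server: ServerUsage) -> str:
    """Label a server from its average memory and disk utilization."""
    # Use the busier of the two resources as the deciding figure.
    peak = max(mean(server.memory_pct), mean(server.disk_pct))
    if peak < IDLE_BELOW:
        return "idle"
    if peak < OVERSIZED_BELOW:
        return "oversized"
    if peak > UNDERSIZED_ABOVE:
        return "undersized"
    return "right-sized"


# Made-up sample data.
servers = [
    ServerUsage("web-01", [85, 90, 88], [40, 42, 41]),
    ServerUsage("batch-02", [12, 15, 10], [20, 22, 21]),
    ServerUsage("spare-03", [2, 3, 2], [1, 1, 1]),
]
report = {s.name: classify(s) for s in servers}
print(report)
```

Taking the busier of the two resources as the deciding figure is one possible design choice: a server with a nearly full disk but little memory pressure is still flagged as undersized, which errs on the side of catching capacity problems early.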
