• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Role of OpManager in event and fault management
 

Role of OpManager in event and fault management

on

  • 1,918 views

Managing Event and Fault are not new to any IT managers. However if not implemented properly, this could be the most daunting of network monitoring and network management tasks....

Managing Event and Fault are not new to any IT managers. However if not implemented properly, this could be the most daunting of network monitoring and network management tasks.

Check out this presentation, to understand

# The basics of Event and Fault Management &
# How ManageEngine OpManager helps in effective Fault Management

Statistics

Views

Total Views
1,918
Views on SlideShare
1,918
Embed Views
0

Actions

Likes
0
Downloads
146
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • Is Fault Management all about detecting the events?
  • Detect events, Isolate faults, Inform or notify admins and Resolve or aid faster resolution
  • Other e.g. of Active polling are monitoring through SNMP, WMI, Telnet, SSH, Custom scripts, Remote query & more…
  • Other e.g. of Passive monitoring are SNMP TRAPS, Syslog, NetFlow, Packet forwarding & more …
  • ManageEngine OpManager is comprehensive, easy-to-use network monitoring & management software. For free trial visit - www.opmanager.com / For product demos - mail us at [email_address] / Call at +1 888 720 9500
  • ManageEngine is the only IT Management vendor focused on bringing a complete IT Management portfolio to the mid-sized enterprise. Trusted by over 45,000 customers including 3 out of every 5 fortune 500 companies. More at www.manageengine.com
  • opmanager@ manageengine .com The network monitoring and network management software from ManageEngine www.manageengine.com/network-monitoring/

Role of OpManager in event and fault management Role of OpManager in event and fault management Presentation Transcript

  • The Role of OpManager in Event and Fault Management Team OpManager www.opmanager.com
  • Agenda
    • Brushing up Fault management
      • Reactive Vs. Pro-active
    • The four processes and OpManager’s role
      • Detect
      • Isolate
      • Inform
      • Resolve
  • Reactive Fault Management
    • Firefighting in nature
    • Troubleshooting starts after business is impacted
    • Higher resolution time
    • Least preferred by both IT admins & End users
    User IT Admin It is not working!
  • Proactive Fault Management
    • Alerts on an impending fault
    • Resolution time reduced drastically
    • Reduced operation cost
    NMS has reported a problem & I’m working on it User IT Admin
  • What is Fault and Event Management?
    • Detecting events
    • Make sense of them
    • Present only actionable events
    *An event can be informational, a cleared event, warning, trouble or even a critical problem
  • The four processes
  • The four processes explained
    • Active Monitoring
    • Passive Monitoring
    • De-duplication
    • Correlation
    • Automation
    • Visual representation
    • Ticketing
    • Alerting
    • Automatic correction
    • Troubleshooting tools
  • Detect – Capture events
    • Active Polling/ Probing/ Query monitoring
    Active Monitoring: e.g. SNMP Polling Other e.g. of Active polling are monitoring through SNMP, WMI, Telnet, SSH, Custom scripts, Remote query & more…
  • Detect – Capture events
    • Passive or Event-based Monitoring
    Passive Monitoring e.g. SNMP TRAP Other e.g. of Passive monitoring are SNMP TRAPS, Syslog, NetFlow, Packet forwarding & more …
  • Isolate – Present actionable faults
    • Helps identify the root cause of the problem quickly; reduces Mean-Time-To-Resolve (MTTR)
    • Includes tasks to
      • Understand event source
      • Filters-out redundant or known events
      • Projects only actionable faults
    • *Network Management System’s Fault management engine plays a vital role
    • De-duplication
    • Drops recurrent events from displaying
    • Build them as event history
    Isolate – Present actionable faults
    • De-duplication
    • OpManager Alarms view – Showing unique alerts for every device and type of alarms
    • Detailed alarm history page with list of alarm actions
    Isolate – Present actionable faults
    • Correlation
    • Relates previous events and interdependency
    • Projects only the root cause of the problem
    Isolate – Present actionable faults
    • Correlation
    • OpManager has automated and custom network maps that lets you identify the root cause much quickly.
    • Lets you configure device dependencies to project only the root of the problem
    Isolate – Present actionable faults
    • Automation
    • Ignore incidental events
    • Remove cleared faults
    • Suppress known alarms (Automated/ Manual Suppression)
    Isolate – Present actionable faults
    • Automation
    • Threshold configuration – Consecutive Times and Rearm Value
    • Suppress known alarms – Downtime Scheduler
    Isolate – Present actionable faults
    • Automation
    • Suppress known alarms - Manual suppression for devices and interfaces
    Isolate – Present actionable faults
    • Visual representation of faults to facilitate NOC admins
    • Ticketing and Alert remote admins
    Inform – Notify admins
  • Inform – Notify admins
    • Alarms color coding
    • Web Alarms and Dashboards
    • Dynamic network or custom maps showing the network and device status
  • Inform – Notify admins
    • Trouble ticketing
    • Through Email for other helpdesk products
    • Automatic ticket creation with ManageEngine ServiceDesk plus, through integration
  • Inform – Notify admins
    • Alert remote admins – Email, SMS, RSS feeds, Twitter Alerts, iPhone/ Smartphone GUI
    Email RSS Twitter DM Smart Phone UI SMS
  • Resolve – Aid faster resolution
    • Needs proprietary knowledge of your IT infrastructure, policies & agreed SLAs.
    • NMS should help
      • Execute such automation logics (Communicate execution faults, if any)
      • Back manual troubleshooting with set of IT tools
  • Resolve – Aid faster resolution
    • Automated Fault resolution
    • Run a command or Run a program on a remote machine with options to append error messages
    • Restart Windows service or the server, if the service is found to be down
  • Resolve – Aid faster resolution
    • Server Troubleshooting Tools
    • Remote Process Diagnostics
    • Device Tools: Ping, Trace route, Tools to remotely connect to the server – Web console, Telnet/ SSH, MS terminal server
  • Resolve – Aid faster resolution
    • Network Troubleshooting Tools
    • Switch Port Mapper
    • Network Traffic Analysis
    • Switch port disabling option
  • Resolve – Aid faster resolution
    • Network Troubleshooting Tools
    • WAN link hop-wise latency count graph
    • Network Change and Configuration Management (NCCM)
  • Resolve – Aid faster resolution
    • Other Troubleshooting Tools
    • Real-time performance graphs
    • MIB Browser and Syslog viewer
  • Tons of features that we’ve not talked about
    • Automatic network discovery
    • Device and Interface monitoring templates
    • Network Maps/ Custom Maps
    • WAN RTT and VoIP Monitoring
    • Network Traffic Analysis
    • Network Change and Configuration Mgmt.
    • Server Monitoring (Windows/ Linux/ UNIX flavor OSes)
    • ESX VMware Monitoring
    • MS Exchange, SQL and Active Directory Monitoring
    • Service Monitoring, Website monitoring, Process and File/ Folder monitoring
    • Processing SNMP TRAPs, Syslogs & Event Log
    • Monitors any pingable and SNMP enabled device
    ManageEngine OpManager is comprehensive, easy-to-use network monitoring & management software. For free trial visit - www.opmanager.com For product demos - Mail us at [email_address] Call at +1 888 720 9500
  • About ManageEngine ManageEngine is the only IT Management vendor focused on bringing a complete IT Management portfolio to the mid-sized enterprise. Trusted by over 45,000 customers including 3 out of every 5 fortune 500 companies. More at www.manageengine.com
  • Summary Fault and Event Management Proactive and Reactive approach Four processes of Fault Management : Detect: Active and Passive Monitoring Isolate: De-duplication, Correlation, Automation Inform: Visual fault representation, ticketing and alerting Resolve: Automated Scripts and Tools to aid manual troubleshooting In each process OpManager’s role in Fault and Event management About ManageEngine and its various IT management products
  • Questions ? Thank you opmanager@ manageengine .com