WHITEPAPER: LEVERAGING AUTOMATION FOR ADVANCED NETWORK TROUBLESHOOTING | 1
Table of Contents
1. Executive Summary
2. Why is Network Troubleshooting So Hard?
   Causes of Network Outages
   The Cost of Network Outages
   Finding a Needle in a Haystack: Troubleshooting with Limited Visibility
3. Divide & Conquer with Network Automation
   A Network Map to Define the Scope of the Problem
   Analyzing Network Performance
   Analyzing Recent Changes
   Diagnosing Network Segments in Parallel
4. Case Study: Dimension Data Accelerates Troubleshooting on Customer Networks
1. Executive Summary
When the network goes down, every minute counts. Data from a 2013 CDW
survey suggests that network outages cost enterprises over $1.7B in lost
revenue over the previous year. Much of this loss could have been avoided if
network teams were able to discover the source of problems more quickly.
Many enterprises have already deployed network monitoring systems to help
them react to incidents faster, but it’s not enough. It’s equally important to
improve mean-time-to-repair (MTTR) by accelerating troubleshooting times.
In this paper, we’ll examine why network troubleshooting is so challenging and
look at opportunities to improve incident response times with a divide and
conquer strategy. We’ll address how automation can be applied to a traditional
troubleshooting methodology for isolating the problem, gathering information,
and automating the analysis of critical data.
2. Why is Network Troubleshooting So Hard?
Effective troubleshooting requires both experience and an intimate knowledge
of the network’s design. Even when a network engineer possesses both,
diagnosing network symptoms still involves a lot of manual data collection
and analysis.
Causes of Network Outages
There’s a lot of hype and media coverage around network hacking and DDoS
attacks, but far more network outages are actually caused by mistakes made by
an organization’s own people. A recent Gartner study estimated that people
and process issues will cause 80% of outages impacting mission-critical services
through 2015. Of that number, more than 50% will be the result of a network
upgrade or configuration change.
The Cost of Network Outages
Early in 2014, both Xbox LIVE and Facebook suffered well-publicized network
outages, both caused by configuration errors during scheduled maintenance.
For Xbox LIVE the untimely outage crippled the launch of one of their biggest
online games. For Facebook, 30 minutes of downtime cost an estimated
$500,000 in lost ad revenue. Of course, the cost to a business’ reputation may
be far higher if customers are impacted.
Top Causes of Network Outages*
o 23% from router/switch failure (including DoS attacks)
o 32% from a link failure (fiber cuts, network congestion)
o 36% from a network change (upgrade, config change)
*Data from a 2013 Cisco Study
Finding a Needle in a Haystack: Troubleshooting with Limited
Visibility
Network visibility is increasingly sought-after in the network industry, because
better visualization of the network leads to better decision-making and faster
problem resolution. Despite dozens of tools that claim to improve visibility,
the most common window a troubleshooter has into the network is the
command-line interface (CLI). Unfortunately, the CLI provides a narrow field of
vision: troubleshooters can gather information only as fast as they can issue
and interpret commands, one device at a time.
When diagnosing a network problem, it’s estimated that engineers spend 80%
of their time manually gathering data, and only 20% analyzing it. This time
spent ‘data mining’ represents an opportunity for improvement. The figure
below shows how important the task of gathering and analyzing information is
during a typical troubleshooting scenario.
Figure 1: Visibility Challenges during Troubleshooting Diagnosis
Because the CLI provides limited visibility, engineers also need access to
accurate ‘troubleshoot-ready’ network diagrams. These are diagrams that
target the problem area and omit parts of the network that aren’t related to the
problem. These maps should include design parameters such as routing
protocols, access-lists, VLANs, etc. Today, very few tools can provide
these types of maps; instead, engineers commonly rely on ‘static’ diagrams,
typically created with MS Visio.
Although both the CLI and network diagrams (if available) help
troubleshooters gather information about topology and configuration, they’re
both poor tools for understanding what’s happening on the network. During an
incident, engineers need to understand both live performance and recent
changes. Even with a performance monitoring solution deployed, engineers
often struggle due to ‘information overload’.
The last factor we’ll address in this paper is the dependence network teams
have on ‘tribal’ knowledge. This refers to the all-too-common scenario where a
network ‘hero’ needs to come in and solve a difficult problem. The reason is
that only a small percentage of team members have the troubleshooting
experience or intimate network knowledge required to solve complex
problems. The figure below summarizes the challenges associated with
visibility and how they impact an engineer’s ability to find answers to their most
critical questions.
Figure 2: Sources and Limitations of Network Visibility in an Enterprise Environment
3. Divide & Conquer with Network Automation
There’s no shortage of network monitoring tools to help engineers detect
network outages, but the steps to diagnose a detected alarm are almost always
manual. Effective troubleshooting requires a tool that can both increase
network visibility and help divide and conquer time-consuming analyses.
A Network Map to Define the Scope of the Problem
Without visual aids, the ability to understand complex networks begins to break
down. Network diagrams serve as the go-to visual aid for network engineers,
but troubleshooting is dramatically hindered if the diagrams aren’t up-to-date
and reliable.
More than a repository of updated site diagrams, what a troubleshooter needs
is a customized diagram that omits irrelevant parts of the network, which only
serve to distract. For example, if a slow application traverses three
data centers, an engineer needs a single diagram of the application flow, not
three diagrams, one for each data center. In other words, a tailored diagram is
the best asset.
A Fresh Approach: Dynamic Network Mapping
NetBrain’s unique network diagrams are dynamic in nature, which means they
are updated automatically when the network changes. NetBrain diagrams can
also be created on demand, so engineers don’t need to sort through dozens
of diagrams during an incident. Instead, they can instantly create a custom map
focused on the event.
Network engineers are frequently asked to troubleshoot poorly performing
applications, with little more to go on than a report of slowness. To tackle this
challenge, the engineer can dynamically create a custom layer-3 or layer-2 map
of the application flow by entering two IP addresses (i.e. the source IP address
and the IP of the application server). NetBrain will perform a comprehensive
analysis of the routing, access-lists, and NAT for every hop in the path. The
resulting map will show which devices are in the path of the application flow.
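To illustrate the idea of computing a path from routing data, the sketch below walks a set of routing tables hop by hop using longest-prefix match. It is a simplified illustration and not NetBrain’s implementation: the device names, the ROUTES table, and the next_hop and trace_path helpers are all hypothetical, the first-hop gateway of the source is assumed to be known, and access-lists and NAT are left out.

```python
import ipaddress

# Hypothetical routing tables: device name -> list of (prefix, next-hop device).
# A next hop of None means the prefix is directly connected.
ROUTES = {
    "branch-rtr": [("0.0.0.0/0", "wan-rtr")],
    "wan-rtr": [("10.20.0.0/16", "dc-core"), ("0.0.0.0/0", None)],
    "dc-core": [("10.20.30.0/24", None), ("10.20.0.0/16", "dc-dist")],
}


def next_hop(device, dst_ip):
    """Return the next-hop device for dst_ip using longest-prefix match."""
    dst = ipaddress.ip_address(dst_ip)
    best = None
    for prefix, hop in ROUTES.get(device, []):
        net = ipaddress.ip_network(prefix)
        if dst in net and (best is None or net.prefixlen > best[0].prefixlen):
            best = (net, hop)
    if best is None:
        raise LookupError(f"{device} has no route to {dst_ip}")
    return best[1]


def trace_path(first_hop, dst_ip, max_hops=32):
    """Walk the routing tables hop by hop and return the devices in the path."""
    path = [first_hop]
    while len(path) <= max_hops:
        hop = next_hop(path[-1], dst_ip)
        if hop is None:  # destination subnet is directly connected
            return path
        if hop in path:
            raise RuntimeError(f"routing loop via {hop}")
        path.append(hop)
    raise RuntimeError("max hops exceeded")


path = trace_path("branch-rtr", "10.20.30.40")
print(" -> ".join(path))
```

The returned list of devices is the kind of input a tailored map could be drawn from; a production tool would also have to evaluate access-lists, NAT, and equal-cost paths at each hop.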
Figure 3: A Tailored Diagram of an Application Flow (Created On-Demand with NetBrain)
Analyzing Network Performance
It’s difficult to troubleshoot performance problems without being able to see
what’s happening on the network. Many network teams have 24x7 network
monitoring systems that generate alarms when an incident occurs. Examples of
such monitoring tools include HP OpenView, IBM Tivoli, CA Spectrum, and
Solarwinds NPM.
Figure 4: Example Network Monitoring and Alerting Tools
Network monitoring tools solve only half of the puzzle; after an alarm is
generated network teams still revert to manual methods of troubleshooting. An
effective troubleshooting tool should integrate with network monitoring and
ticketing systems to improve visibility into the problem area.
Diagnostic Monitoring on a Live Map
NetBrain’s monitoring function can be turned on from any map, or even
launched from a third-party monitoring tool, to visualize the performance
characteristics of each device and interface. When troubleshooting a slow
application, engineers can quickly spot bandwidth bottlenecks on the
interfaces (highlighted in red) or CPU/Memory over-utilization on each device.
For intermittent application behavior issues, monitoring can be left to run
overnight; it will collect and plot each data point to highlight trends.
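As a rough illustration of how a bandwidth bottleneck can be spotted, the sketch below derives link utilization from two readings of an interface octet counter and flags interfaces above a threshold. The sample values, the 80% threshold, and the function names are assumptions made for this example; counter wrap-around is ignored for brevity.

```python
def utilization_pct(octets_t0, octets_t1, interval_s, speed_bps):
    """Percent utilization of a link from two octet-counter readings."""
    bits = (octets_t1 - octets_t0) * 8
    return 100.0 * bits / (interval_s * speed_bps)


def find_bottlenecks(samples, threshold_pct=80.0):
    """Return (interface, utilization) pairs at or above the threshold."""
    hot = []
    for name, (t0, t1, interval_s, speed_bps) in samples.items():
        pct = utilization_pct(t0, t1, interval_s, speed_bps)
        if pct >= threshold_pct:
            hot.append((name, round(pct, 1)))
    return sorted(hot, key=lambda item: item[1], reverse=True)


# Hypothetical readings: (octets at t0, octets at t1, seconds, link speed in bit/s)
SAMPLES = {
    "wan-rtr Gi0/0": (1_000_000, 676_000_000, 60, 100_000_000),
    "dc-core Te1/1": (5_000_000, 80_000_000, 60, 10_000_000_000),
}

print(find_bottlenecks(SAMPLES))
```

With these sample values only the 100 Mbit/s WAN interface crosses the threshold, which is the kind of interface a live map would highlight in red.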
Figure 5: Monitoring Application Performance Factors (Issues Highlighted in Red)
Analyzing Recent Changes
With over one third of network outages resulting from a network change,
visibility into what’s changed is critical. That means understanding not just
what’s changed in configuration, but understanding the impact of those
changes on routing, topology, application traffic, and more.
Automated Change Analysis
NetBrain can be configured to benchmark the network regularly so that network
teams are better equipped to understand recent changes. During every
benchmark, NetBrain collects live data and looks for changes in configuration,
routing, inventory, as well as MAC/ARP/CDP/STP tables. NetBrain also includes
comparative analysis capabilities to automatically highlight the changes side-by-
side.
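The comparison step can be pictured with a small sketch that diffs two configuration snapshots and keeps only the lines that changed. It uses Python’s standard difflib module; the two snapshots and the config_changes helper are hypothetical and are not taken from NetBrain.

```python
import difflib

# Hypothetical configuration snapshots from two benchmark runs.
BEFORE = """\
interface GigabitEthernet0/1
 description uplink to dc-core
 duplex auto
router ospf 1
 network 10.20.0.0 0.0.255.255 area 0
"""

AFTER = """\
interface GigabitEthernet0/1
 description uplink to dc-core
 duplex half
router ospf 1
 network 10.20.0.0 0.0.255.255 area 0
"""


def config_changes(before, after):
    """Return (sign, line) pairs for lines removed (-) or added (+)."""
    diff = difflib.ndiff(before.splitlines(), after.splitlines())
    return [
        (line[0], line[2:].strip())
        for line in diff
        if line.startswith(("+ ", "- "))
    ]


for sign, text in config_changes(BEFORE, AFTER):
    print(sign, text)
```

The same approach could in principle be applied to routing or ARP table snapshots, provided each snapshot is captured in a stable, line-oriented text form.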
Figure 6: NetBrain’s System Benchmark Properties
By way of example, when troubleshooting application slowness, an engineer can
‘rewind the clock’ and see how application traffic was being routed before the
problem arose. Any changes could provide valuable clues into the problem.
Figure 7: Analyzing Application Traffic from Last Week
Diagnosing Network Segments in Parallel
When engineers rely on the command line interface as their primary
troubleshooting tool, they’re forced to diagnose the network in a serialized
manner, one device at a time. That’s because the output of CLI commands is
often hard to scan, and important data points are hard to find. Finding the
‘missing pieces’ of information may take dozens of commands.
Figure 8: Serialized Troubleshooting with the CLI
[Figure steps: CLI ping and traceroute used to determine path; multiple show
level commands in multiple CLI windows; quick “performance” test results; stare
and compare to find deviations and anomalies; repeat until problem is found]
Effective troubleshooting should instead occur in parallel, meaning that
commands are issued on many devices simultaneously and only the relevant
data is parsed from the output. A network map serves as the best
troubleshooting user interface because it provides a canvas on which to
display the relevant data.
Figure 9: Diagnosing Interface Errors in Parallel (collisions and CRC errors labeled in red)
The image above shows what it may look like to diagnose the interfaces of
multiple devices, in parallel, on a live network map. Troubleshooting
automation can issue the appropriate commands on your behalf, and extract
the relevant data.
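A minimal sketch of the parallel approach follows: one command is issued to several devices at once with a thread pool, and only the error counters are parsed out of the output. The run_command function is a placeholder that returns canned text, so the example runs without network access; the device names and the output format are assumptions, and a real deployment would replace the placeholder with an SSH or API call.

```python
import re
from concurrent.futures import ThreadPoolExecutor

# Canned output standing in for a real device session (hypothetical devices).
FAKE_OUTPUT = {
    "wan-rtr": "GigabitEthernet0/0 is up\n     0 input errors, 0 CRC, 0 frame\n",
    "dc-core": "TenGigabitEthernet1/1 is up\n     412 input errors, 398 CRC, 0 frame\n",
    "dc-dist": "GigabitEthernet1/0/1 is up\n     7 input errors, 0 CRC, 0 frame\n",
}

ERRORS_RE = re.compile(r"(\d+) input errors, (\d+) CRC")


def run_command(device, command):
    """Placeholder for a remote call; returns canned text for this sketch."""
    return FAKE_OUTPUT[device]


def check_device(device):
    """Run one command on one device and keep only the error counters."""
    output = run_command(device, "show interfaces")
    match = ERRORS_RE.search(output)
    if match is None:
        return device, None
    return device, {"input_errors": int(match.group(1)), "crc": int(match.group(2))}


def check_all(devices, max_workers=8):
    """Query every device concurrently and return {device: counters}."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return dict(pool.map(check_device, devices))


results = check_all(["wan-rtr", "dc-core", "dc-dist"])
for device, counters in results.items():
    flag = "  <-- check" if counters and counters["crc"] > 0 else ""
    print(device, counters, flag)
```

Threads suit this workload because most of the time is spent waiting on devices to respond, so the total run time approaches that of the slowest device rather than the sum of all of them.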
Adaptive Network Automation – A Powerful Alternative to Scripting
Writing Perl and Python scripts to automate data collection is powerful, but the
vast majority of network engineers aren’t programmers and they struggle to
realize the benefits. NetBrain eliminates the programming requirement from
network automation with its ‘quick’ programming environment. Engineers can
literally point and click to program their own NetBrain ‘Qapps’.
As an example, the Check Interface Errors Qapp - which was written by a
NetBrain engineer in less than 10 minutes - can be run to detect incrementing
interface errors and speed/duplex mismatches.
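For readers who do script, the logic behind such a check can be sketched in a few lines of Python. This is not the Qapp itself: the counter samples, interface names, link list, and both helper functions are hypothetical, chosen only to show what “incrementing errors” and “speed/duplex mismatch” mean in practice.

```python
def incrementing_errors(first, second):
    """Return interfaces whose error counter grew between two samples."""
    return sorted(
        name for name, count in second.items()
        if name in first and count > first[name]
    )


def duplex_mismatches(links, settings):
    """Return links whose two ends report different speed or duplex."""
    return [(a, b) for a, b in links if settings[a] != settings[b]]


# Hypothetical error counters, sampled twice a few seconds apart.
SAMPLE_1 = {"rtr1 Gi0/1": 120, "sw1 Gi1/0/24": 0, "sw2 Gi1/0/1": 15}
SAMPLE_2 = {"rtr1 Gi0/1": 164, "sw1 Gi1/0/24": 0, "sw2 Gi1/0/1": 15}

# Hypothetical (speed in Mbit/s, duplex) per interface, and the links between them.
SETTINGS = {
    "rtr1 Gi0/1": (100, "full"),
    "sw1 Gi1/0/24": (100, "half"),
    "sw2 Gi1/0/1": (1000, "full"),
    "sw1 Gi1/0/1": (1000, "full"),
}
LINKS = [("rtr1 Gi0/1", "sw1 Gi1/0/24"), ("sw2 Gi1/0/1", "sw1 Gi1/0/1")]

print("incrementing:", incrementing_errors(SAMPLE_1, SAMPLE_2))
print("mismatched:", duplex_mismatches(LINKS, SETTINGS))
```

Comparing two samples matters because a large but static error count may be historical, while a counter that is still climbing points to a live problem.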
Figure 10: NetBrain’s ‘Quick’ Programming Environment
Each new Qapp becomes a new feature, and it leverages a dynamic map to
display the output. For troubleshooters, every Qapp is an executable diagnosis
which can automatically extract and analyze the CLI data which would
otherwise be collected manually. This helps network teams troubleshoot
virtually any network issue in parallel, rather than one device at a time. It also
helps network teams digitize and share their troubleshooting checklists.
4. Case Study: Dimension Data Accelerates
Troubleshooting on Customer Networks
CUSTOMER PROFILE:
Industry: Managed Services
Company: Dimension Data

CHALLENGE:
Dimension Data does not own the customer networks they manage, so they
struggle to gain and maintain intimate knowledge of those networks, which is
inherently gained through day-to-day operations.

SOLUTION:
Dimension Data utilizes NetBrain to automate diagram creation, visualize
performance issues to expedite diagnosis, and easily share information for
collaborative troubleshooting sessions.

BENEFIT:
NetBrain’s advanced network visualization and automation capabilities enable
Dimension Data to shorten typical diagnosis and repair time by as much as 50%.

Dimension Data specializes in information technology services, with
operations on every inhabited continent. Dimension Data's focus areas
include network integration, security solutions, data center solutions,
converged communications, and a range of professional, consulting, and
managed services. A major challenge the company consistently faces is the
ability to understand their customers’ networks to the extent necessary to
diagnose and troubleshoot complex issues and resolve network outages
effectively.
Dimension Data deployed NetBrain in their customer environments, in many
cases integrating the tool with the NetCool alarm system, Opsware
configuration management solution, and Vitalnet’s performance trending
solution. With these integrations, an alarm reported by HP OpenView is
instantly translated to a map inside NetBrain Workstation.
NetBrain continues to offer value to Dimension Data in three areas:
o On-demand network mapping effectively removes dependencies on
  manual network diagrams which are often inconsistent and error-prone.
o Network performance diagnosis via Dynamic Diagrams enables lower-level
  engineers to troubleshoot advanced problems.
o Engineers share information via NetBrain for collaboration.
The following are some ‘war stories’ reported by this customer:
Detecting Serious Routing Issues on the Accudyne Network
NetBrain was able to provide real-time network visibility into the
Accudyne network and help identify serious routing issues. The tool
was used to highlight the congestion points on the map and
ultimately tie the problem to equal cost routes and MPLS design
segregation.
Troubleshooting Slowness to a Server
Previously it took Dimension Data almost two and a half hours to
determine the source and destination path of an application server
inside the Accudyne network, followed by another two hours to
diagnose the problem. With NetBrain, the task to find the path took
two minutes, and another five minutes was all that was needed to
diagnose the issue.
Troubleshooting MS Outlook Slowness to Tokyo
The Tokyo office was experiencing slowness sending Outlook
attachments. Multiple tickets had been opened for this issue and
several engineers had already looked into it. NetBrain was then
applied and, within three minutes, it was determined that there was a
duplex issue on the edge WAN port.
NetBrain saves time when time is critical. As a Dimension Data Network
Integration Engineer reported, “It has changed the way I approach
troubleshooting.”
About NetBrain Technologies, Inc.
Founded in 2004, NetBrain set out to pursue a new vision: automate time-
consuming tasks associated with network documentation, design, and
troubleshooting. NetBrain’s customers are using map-driven automation to
eliminate manual network documentation, automate troubleshooting tasks,
and mitigate security risks. NetBrain is headquartered in Burlington, MA with
offices in Sacramento, CA, New York, and Beijing, China.
To learn more about NetBrain’s dynamic mapping solution, contact us at
781.221.7199 or download a free trial of NetBrain’s Enterprise Suite from
www.netbraintech.com/trial.
NetBrain Technologies, Inc.
15 Network Drive
Burlington, MA 01803
+1 800 605 7964
info@netbraintech.com
www.netbraintech.com

July 1, 2020 Vermont's Universal Recycling Law Timeline-Composting LawsE.S.G. JR. Consulting, Inc.
 
State of Working Vermont 2019 grew only 1.2% Gross State Product = GDP
State of Working Vermont 2019 grew only 1.2% Gross State Product = GDPState of Working Vermont 2019 grew only 1.2% Gross State Product = GDP
State of Working Vermont 2019 grew only 1.2% Gross State Product = GDPE.S.G. JR. Consulting, Inc.
 
Vermont Department of Human Resources Payroll Data
Vermont Department of Human Resources Payroll DataVermont Department of Human Resources Payroll Data
Vermont Department of Human Resources Payroll DataE.S.G. JR. Consulting, Inc.
 
Purchasing Power produced by Small Modular Reactors VT
Purchasing Power produced by Small Modular Reactors   VTPurchasing Power produced by Small Modular Reactors   VT
Purchasing Power produced by Small Modular Reactors VTE.S.G. JR. Consulting, Inc.
 
May 14, 2019 Voting Results Minicipal, School Budgets, Select Board
May 14, 2019 Voting Results Minicipal, School Budgets, Select BoardMay 14, 2019 Voting Results Minicipal, School Budgets, Select Board
May 14, 2019 Voting Results Minicipal, School Budgets, Select BoardE.S.G. JR. Consulting, Inc.
 
Rich States Poor States 11th Edition 15 Weighted Categories
Rich States Poor States 11th Edition 15 Weighted CategoriesRich States Poor States 11th Edition 15 Weighted Categories
Rich States Poor States 11th Edition 15 Weighted CategoriesE.S.G. JR. Consulting, Inc.
 
2018-2019 Strategic Plan and Performance Report - Final
2018-2019 Strategic Plan and Performance Report - Final2018-2019 Strategic Plan and Performance Report - Final
2018-2019 Strategic Plan and Performance Report - FinalE.S.G. JR. Consulting, Inc.
 
Building a Wall around the Welfare State, Instead of the Country
Building a Wall around the Welfare State, Instead of the CountryBuilding a Wall around the Welfare State, Instead of the Country
Building a Wall around the Welfare State, Instead of the CountryE.S.G. JR. Consulting, Inc.
 
Decarbonization Methods in Vermont Report 2019 1 to 146 pages
Decarbonization Methods in Vermont Report 2019 1 to 146 pagesDecarbonization Methods in Vermont Report 2019 1 to 146 pages
Decarbonization Methods in Vermont Report 2019 1 to 146 pagesE.S.G. JR. Consulting, Inc.
 
Vermont Opioid Related Fatalities with Suicide Statistics
Vermont Opioid Related Fatalities with Suicide StatisticsVermont Opioid Related Fatalities with Suicide Statistics
Vermont Opioid Related Fatalities with Suicide StatisticsE.S.G. JR. Consulting, Inc.
 

More from E.S.G. JR. Consulting, Inc. (20)

Vermont 2012 Emergency broadband action to May 5, 2020
Vermont 2012 Emergency broadband action to May 5, 2020Vermont 2012 Emergency broadband action to May 5, 2020
Vermont 2012 Emergency broadband action to May 5, 2020
 
July 1, 2020 Vermont's Universal Recycling Law Timeline-Composting Laws
July 1, 2020 Vermont's Universal Recycling Law Timeline-Composting LawsJuly 1, 2020 Vermont's Universal Recycling Law Timeline-Composting Laws
July 1, 2020 Vermont's Universal Recycling Law Timeline-Composting Laws
 
State of Working Vermont 2019 grew only 1.2% Gross State Product = GDP
State of Working Vermont 2019 grew only 1.2% Gross State Product = GDPState of Working Vermont 2019 grew only 1.2% Gross State Product = GDP
State of Working Vermont 2019 grew only 1.2% Gross State Product = GDP
 
Vermont Department of Human Resources Payroll Data
Vermont Department of Human Resources Payroll DataVermont Department of Human Resources Payroll Data
Vermont Department of Human Resources Payroll Data
 
Vermont 2020 Independent Candidate Signature
Vermont 2020 Independent Candidate Signature Vermont 2020 Independent Candidate Signature
Vermont 2020 Independent Candidate Signature
 
Vermont's Getting On The Ballot
Vermont's Getting On The BallotVermont's Getting On The Ballot
Vermont's Getting On The Ballot
 
Vermont House Calendar Tuesday January 7, 2020
Vermont House Calendar Tuesday January 7, 2020Vermont House Calendar Tuesday January 7, 2020
Vermont House Calendar Tuesday January 7, 2020
 
Vermont Carbon Taxes Schemes Unsustainable
Vermont Carbon Taxes Schemes UnsustainableVermont Carbon Taxes Schemes Unsustainable
Vermont Carbon Taxes Schemes Unsustainable
 
2018 ACO Quality and Financial Results by Payer
2018 ACO Quality and Financial Results by Payer2018 ACO Quality and Financial Results by Payer
2018 ACO Quality and Financial Results by Payer
 
Purchasing Power produced by Small Modular Reactors VT
Purchasing Power produced by Small Modular Reactors   VTPurchasing Power produced by Small Modular Reactors   VT
Purchasing Power produced by Small Modular Reactors VT
 
Vermont's 14 Superfund Clean up Sites
Vermont's 14 Superfund Clean up SitesVermont's 14 Superfund Clean up Sites
Vermont's 14 Superfund Clean up Sites
 
The 2019 State Student Privacy Report Card
The 2019 State Student Privacy Report CardThe 2019 State Student Privacy Report Card
The 2019 State Student Privacy Report Card
 
May 14, 2019 Voting Results Minicipal, School Budgets, Select Board
May 14, 2019 Voting Results Minicipal, School Budgets, Select BoardMay 14, 2019 Voting Results Minicipal, School Budgets, Select Board
May 14, 2019 Voting Results Minicipal, School Budgets, Select Board
 
Rich States Poor States 11th Edition 15 Weighted Categories
Rich States Poor States 11th Edition 15 Weighted CategoriesRich States Poor States 11th Edition 15 Weighted Categories
Rich States Poor States 11th Edition 15 Weighted Categories
 
2018-2019 Strategic Plan and Performance Report - Final
2018-2019 Strategic Plan and Performance Report - Final2018-2019 Strategic Plan and Performance Report - Final
2018-2019 Strategic Plan and Performance Report - Final
 
Building a Wall around the Welfare State, Instead of the Country
Building a Wall around the Welfare State, Instead of the CountryBuilding a Wall around the Welfare State, Instead of the Country
Building a Wall around the Welfare State, Instead of the Country
 
Decarbonization Methods in Vermont Report 2019 1 to 146 pages
Decarbonization Methods in Vermont Report 2019 1 to 146 pagesDecarbonization Methods in Vermont Report 2019 1 to 146 pages
Decarbonization Methods in Vermont Report 2019 1 to 146 pages
 
Vermont Opioid Related Fatalities with Suicide Statistics
Vermont Opioid Related Fatalities with Suicide StatisticsVermont Opioid Related Fatalities with Suicide Statistics
Vermont Opioid Related Fatalities with Suicide Statistics
 
Act 46 Barre City and Barre Town
Act 46 Barre City and Barre TownAct 46 Barre City and Barre Town
Act 46 Barre City and Barre Town
 
Voter'S Guide For People With Disabilities
Voter'S Guide For People With Disabilities Voter'S Guide For People With Disabilities
Voter'S Guide For People With Disabilities
 

Recently uploaded

Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 

Recently uploaded (20)

Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 

White Paper Leveraging Automation for Advanced Network Troubleshooting

WHITEPAPER: LEVERAGING AUTOMATION FOR ADVANCED NETWORK TROUBLESHOOTING

Table of Contents

1. Executive Summary
2. Why is Network Troubleshooting So Hard?
   Causes of Network Outages
   The Cost of Network Outages
   Finding a Needle in a Haystack: Troubleshooting with Limited Visibility
3. Divide & Conquer with Network Automation
   A Network Map to Define the Scope of the Problem
   Analyzing Network Performance
   Analyzing Recent Changes
   Diagnosing Network Segments in Parallel
4. Case Study: Dimension Data Accelerates Troubleshooting on Customer Networks
1. Executive Summary

When the network goes down, every minute counts. Data from a 2013 CDW survey suggests that network outages cost enterprises over $1.7B in lost revenue over the previous year. Much of this loss could have been avoided if network teams had been able to discover the source of problems more quickly. Many enterprises have already deployed network monitoring systems to help them react to incidents faster, but that is not enough. It’s equally important to improve mean-time-to-repair (MTTR) by accelerating troubleshooting.

In this paper, we’ll examine why network troubleshooting is so challenging and look at opportunities to improve incident response times with a divide and conquer strategy. We’ll address how automation can be applied to each stage of a traditional troubleshooting methodology: isolating the problem, gathering information, and analyzing critical data.

2. Why is Network Troubleshooting So Hard?

Effective troubleshooting requires a combination of experience and intimate knowledge of the network’s design. Even when a network engineer possesses both, there’s still the challenge of diagnosing network symptoms, which involves a lot of manual data collection and analysis.

Causes of Network Outages

There’s a lot of hype and media coverage around network hacking and DDoS attacks, but far more network outages are actually caused by mistakes made by an organization’s own people. A recent Gartner study estimated that people and process issues will cause 80% of outages impacting mission-critical services through 2015. Of that number, more than 50% will be the result of a network upgrade or configuration change.

The Cost of Network Outages

Early in 2014, both Xbox LIVE and Facebook suffered well-publicized network outages, both caused by configuration errors during scheduled maintenance. For Xbox LIVE, the untimely outage crippled the launch of one of its biggest online games. For Facebook, 30 minutes of downtime cost an estimated $500,000 in lost ad revenue. Of course, the cost to a business’s reputation may be far higher if customers are impacted.

Top Causes of Network Outages*
o 23% from router/switch failure (including DoS attacks)
o 32% from a link failure (fiber cuts, network congestion)
o 36% from a network change (upgrade, config change)
*Data from a 2013 Cisco Study
Finding a Needle in a Haystack: Troubleshooting with Limited Visibility

Network visibility is increasingly sought-after in the network industry, because better visualization of the network leads to better decision-making and faster problem resolution. Despite dozens of tools that claim to improve visibility, the most common window a troubleshooter has into the network is the command-line interface (CLI). Unfortunately, the CLI provides a narrow field of vision, because the information troubleshooters can gather is limited by the rate at which they can issue and interpret commands, one device at a time.

When diagnosing a network problem, it’s estimated that engineers spend 80% of their time manually gathering data, and only 20% analyzing it. This time spent ‘data mining’ represents an opportunity for improvement. The figure below shows how important the task of gathering and analyzing information is during a typical troubleshooting scenario.

Figure 1: Visibility Challenges during Troubleshooting Diagnosis

Because the CLI provides limited visibility, engineers also need access to accurate ‘troubleshoot-ready’ network diagrams. These are diagrams that target the problem area and omit parts of the network that aren’t related to the problem. These maps should include design parameters such as routing protocols, access-lists, and VLANs. Today, very few tools can provide these types of maps; instead, engineers commonly rely on ‘static’ diagrams, typically created with MS Visio.

Although both the CLI and network diagrams (if available) help troubleshooters gather information about topology and configuration, they’re both poor tools for understanding what’s happening on the network. During an incident, engineers need to understand both live performance and recent changes. Even with a performance monitoring solution deployed, engineers often struggle due to ‘information overload’.

The last factor we’ll address in this paper is the dependence network teams have on ‘tribal’ knowledge. This refers to the all-too-common scenario where a network ‘hero’ needs to come in and solve a difficult problem. The reason is that only a small percentage of team members have the troubleshooting experience or intimate network knowledge required to solve complex problems.

The figure below summarizes the challenges associated with visibility, and how they impact an engineer’s ability to find answers to their most critical questions.

Figure 2: Sources and Limitations of Network Visibility in an Enterprise Environment

3. Divide & Conquer with Network Automation

There’s no shortage of network monitoring tools to help engineers detect network outages, but the steps to diagnose a detected alarm are almost always manual. Effective troubleshooting requires a tool that can both increase network visibility and help divide and conquer time-consuming analyses.
A Network Map to Define the Scope of the Problem

Without visual aids, the ability to understand complex networks begins to break down. Network diagrams serve as the go-to visual aid for network engineers, but troubleshooting is dramatically hindered if the diagrams aren’t up-to-date and reliable. More than a repository of updated site diagrams, what a troubleshooter needs is a customized diagram that omits irrelevant parts of the network, which only serve to distract. For example, if a slow application traverses three data centers, an engineer needs a single diagram of the application flow, not three diagrams, one for each data center. In other words, a tailored diagram is the best asset.

A Fresh Approach: Dynamic Network Mapping

NetBrain’s network diagrams are dynamic in nature, which means they are updated automatically when the network changes. NetBrain diagrams can also be created on demand, so engineers don’t need to sort through dozens of diagrams during an incident. Instead, they can instantly create a custom map focused on the event.

Network engineers are frequently asked to troubleshoot poorly performing applications, with little more to go on than a report of slowness. To tackle this challenge, the engineer can dynamically create a custom layer-3 or layer-2 map of the application flow by entering two IP addresses (i.e., the source IP address and the IP address of the application server). NetBrain will perform a comprehensive analysis of the routing, access-lists, and NAT for every hop in the path. The resulting map will show which devices are in the path of the application flow.

Figure 3: A Tailored Diagram of an Application Flow (Created On-Demand with NetBrain)
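The idea of deriving a path from two IP addresses can be illustrated with a minimal Python sketch. It is not NetBrain’s implementation: the device names, the `ROUTES` table, and the helper functions are all hypothetical, and the sketch covers only longest-prefix-match routing (access-lists and NAT are left out). It assumes each device’s routes have already been collected into a simple table.

```python
import ipaddress

# Hypothetical, pre-collected routing tables: device -> [(prefix, next-hop device)].
# A next hop of None marks a directly connected network.
ROUTES = {
    "branch-rtr": [("10.1.1.0/24", None), ("0.0.0.0/0", "wan-rtr")],
    "wan-rtr": [("10.1.0.0/16", "branch-rtr"), ("10.20.0.0/16", "dc-core")],
    "dc-core": [("10.20.30.0/24", None), ("0.0.0.0/0", "wan-rtr")],
}


def best_route(device, address):
    """Return (network, next_hop) for the longest matching prefix, or None."""
    best = None
    for prefix, hop in ROUTES[device]:
        network = ipaddress.ip_network(prefix)
        if address in network and (best is None or network.prefixlen > best[0].prefixlen):
            best = (network, hop)
    return best


def gateway_for(address):
    """Find the device with the most specific directly connected network for address."""
    best = None
    for device, routes in ROUTES.items():
        for prefix, hop in routes:
            network = ipaddress.ip_network(prefix)
            if hop is None and address in network:
                if best is None or network.prefixlen > best[0]:
                    best = (network.prefixlen, device)
    if best is None:
        raise LookupError(f"no device is directly connected to {address}")
    return best[1]


def trace_path(src_ip, dst_ip, max_hops=16):
    """Walk the routing tables hop by hop from the source's gateway to the destination."""
    src = ipaddress.ip_address(src_ip)
    dst = ipaddress.ip_address(dst_ip)
    device = gateway_for(src)
    path = [device]
    for _ in range(max_hops):
        route = best_route(device, dst)
        if route is None:
            raise LookupError(f"{device} has no route to {dst}")
        next_hop = route[1]
        if next_hop is None:  # destination network is directly connected
            return path
        if next_hop in path:
            raise RuntimeError(f"routing loop detected at {next_hop}")
        path.append(next_hop)
        device = next_hop
    raise RuntimeError("hop limit exceeded")


print(trace_path("10.1.1.50", "10.20.30.10"))
```

The returned list of devices is the kind of input a map of the application flow could be drawn from; a fuller analysis would also check access-lists and NAT at each hop.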
Analyzing Network Performance

It’s difficult to troubleshoot performance problems without being able to see what’s happening on the network. Many network teams have 24x7 network monitoring systems that generate alarms when an incident occurs. Examples of such monitoring tools include HP OpenView, IBM Tivoli, CA Spectrum, and SolarWinds NPM.

Figure 4: Example Network Monitoring and Alerting Tools

Network monitoring tools solve only half of the puzzle; after an alarm is generated, network teams still revert to manual methods of troubleshooting. An effective troubleshooting tool should integrate with network monitoring and ticketing systems to improve visibility into the problem area.

Diagnostic Monitoring on a Live Map

NetBrain’s monitoring function can be turned on from any map, or even launched from a third-party monitoring tool, to visualize the performance characteristics of each device and interface. When troubleshooting a slow application, engineers can quickly spot bandwidth bottlenecks on the interfaces (highlighted in red) or CPU/memory over-utilization on each device. For intermittent application issues, monitoring can be left to run overnight; it will collect and plot each data point to highlight trends.
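As a rough illustration of how a bandwidth bottleneck can be spotted, the Python sketch below derives interface utilization from two readings of a cumulative byte counter and flags interfaces above a threshold. The `CounterSample` type, the interface names, the sample values, and the 80% threshold are all assumptions made for the example, and counter wrap-around is ignored for brevity.

```python
from dataclasses import dataclass


@dataclass
class CounterSample:
    timestamp: float  # seconds since an arbitrary reference point
    in_octets: int    # cumulative count of bytes received


def utilization_pct(prev, curr, speed_bps):
    """Average inbound utilization between two samples, as a percentage of link speed."""
    elapsed = curr.timestamp - prev.timestamp
    if elapsed <= 0 or speed_bps <= 0:
        raise ValueError("samples must be in time order and speed must be positive")
    bits = (curr.in_octets - prev.in_octets) * 8
    return 100.0 * bits / (elapsed * speed_bps)


def flag_bottlenecks(interfaces, threshold_pct=80.0):
    """Return the names of interfaces whose utilization exceeds the threshold."""
    flagged = []
    for name, (prev, curr, speed_bps) in interfaces.items():
        if utilization_pct(prev, curr, speed_bps) > threshold_pct:
            flagged.append(name)
    return flagged


# Hypothetical readings taken 60 seconds apart on two 100 Mb/s interfaces.
samples = {
    "Gi0/1": (CounterSample(0.0, 0), CounterSample(60.0, 675_000_000), 100_000_000),
    "Gi0/2": (CounterSample(0.0, 0), CounterSample(60.0, 75_000_000), 100_000_000),
}

print(flag_bottlenecks(samples))
```

Here Gi0/1 received 675,000,000 bytes in 60 seconds, which is 90% of a 100 Mb/s link, so it is flagged; Gi0/2 sits at 10% and is not.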
Figure 5: Monitoring Application Performance Factors (Issues Highlighted in Red)

Analyzing Recent Changes

With over one third of network outages resulting from a network change, visibility into what’s changed is critical. That means understanding not just what’s changed in the configuration, but also the impact of those changes on routing, topology, application traffic, and more.

Automated Change Analysis

NetBrain can be configured to benchmark the network regularly so that network teams are better equipped to understand recent changes. During every benchmark, NetBrain collects live data and looks for changes in configuration, routing, and inventory, as well as in MAC/ARP/CDP/STP tables. NetBrain also includes comparative analysis capabilities that automatically highlight the changes side by side.

Figure 6: NetBrain’s System Benchmark Properties
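The comparison step can be sketched with Python’s standard `difflib` module. This is only an illustration of diffing two saved snapshots, not a description of how NetBrain compares benchmarks; the configuration text and the `config_changes` helper are invented for the example.

```python
import difflib


def config_changes(before, after):
    """Return only the added (+) and removed (-) lines between two config snapshots."""
    diff = list(
        difflib.unified_diff(
            before.splitlines(),
            after.splitlines(),
            fromfile="benchmark-previous",
            tofile="benchmark-current",
            lineterm="",
            n=0,  # no context lines; changes only
        )
    )
    # The first two lines are the ---/+++ file headers; "@@" lines mark hunks.
    return [line for line in diff[2:] if line.startswith(("+", "-"))]


# Hypothetical snapshots of one device's configuration.
previous = "\n".join([
    "interface GigabitEthernet0/1",
    " duplex auto",
    " speed auto",
    "router ospf 1",
    " network 10.20.0.0 0.0.255.255 area 0",
])
current = "\n".join([
    "interface GigabitEthernet0/1",
    " duplex half",
    " speed auto",
    "router ospf 1",
    " network 10.20.0.0 0.0.255.255 area 0",
])

for change in config_changes(previous, current):
    print(change)
```

The same approach applies to any text captured at benchmark time, such as routing or ARP table output, as long as each snapshot is stored with its collection time.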
By way of example, when troubleshooting application slowness, an engineer can ‘rewind the clock’ and see how application traffic was being routed before the problem arose. Any changes could provide valuable clues to the problem.

Figure 7: Analyzing Application Traffic from Last Week

Diagnosing Network Segments in Parallel

When engineers rely on the command-line interface as their primary troubleshooting tool, they’re forced to diagnose the network serially, one device at a time. That’s because the output of CLI commands is often hard to scan, and important data points are hard to find. Finding the ‘missing pieces’ of information may take dozens of commands.

Figure 8: Serialized Troubleshooting with the CLI (steps shown: CLI ping and traceroute used to determine the path; multiple show-level commands in multiple CLI windows; quick “performance” test results; stare and compare to find deviations and anomalies; repeat until the problem is found)
Effective troubleshooting should instead occur in parallel, meaning that commands are issued on many devices simultaneously and only the relevant data is parsed from the output. A network map serves as the best troubleshooting user interface because it provides a canvas on which to display the relevant data.

Figure 9: Diagnosing Interface Errors in Parallel (collisions and CRC errors labeled in red)

The image above shows what it may look like to diagnose the interfaces of multiple devices, in parallel, on a live network map. Troubleshooting automation can issue the appropriate commands on your behalf and extract the relevant data.

Adaptive Network Automation – A Powerful Alternative to Scripting

Writing Perl and Python scripts to automate data collection is powerful, but the vast majority of network engineers aren’t programmers, and they struggle to realize the benefits. NetBrain eliminates the programming requirement from network automation with its ‘quick’ programming environment. Engineers can literally point and click to program their own NetBrain ‘Qapps’. As an example, the Check Interface Errors Qapp, which was written by a NetBrain engineer in less than 10 minutes, can be run to detect incrementing interface errors and speed/duplex mismatches.
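For comparison, the scripted approach to parallel diagnosis might look like the minimal Python sketch below. It is not how a Qapp works internally. The `collect` function is a hypothetical stand-in for a real CLI session and simply returns canned text, the device names are invented, and the sample output only loosely imitates the style of a `show interfaces` command.

```python
import re
from concurrent.futures import ThreadPoolExecutor

# Canned, illustrative output standing in for live CLI sessions (an assumption).
SAMPLE_OUTPUT = {
    "core-sw-1": (
        "GigabitEthernet0/1 is up, line protocol is up\n"
        "  Full-duplex, 1000Mb/s\n"
        "  0 input errors, 0 CRC, 0 frame\n"
    ),
    "edge-rtr-1": (
        "GigabitEthernet0/0 is up, line protocol is up\n"
        "  Half-duplex, 100Mb/s\n"
        "  1532 input errors, 1498 CRC, 0 frame\n"
    ),
}

ERRORS_RE = re.compile(r"(\d+) input errors, (\d+) CRC")
DUPLEX_RE = re.compile(r"(Half|Full)-duplex", re.IGNORECASE)


def collect(device):
    """Hypothetical collector; a real script would open a CLI session here."""
    return SAMPLE_OUTPUT[device]


def check_device(device):
    """Parse only the data points of interest from one device's output."""
    text = collect(device)
    errors = ERRORS_RE.search(text)
    duplex = DUPLEX_RE.search(text)
    return {
        "device": device,
        "input_errors": int(errors.group(1)) if errors else 0,
        "crc": int(errors.group(2)) if errors else 0,
        "half_duplex": bool(duplex) and duplex.group(1).lower() == "half",
    }


def check_all(devices, max_workers=8):
    """Run the check on all devices concurrently; results keep the input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(check_device, devices))


def suspects(results):
    """Devices reporting CRC errors or running at half duplex."""
    return [r["device"] for r in results if r["crc"] > 0 or r["half_duplex"]]


results = check_all(sorted(SAMPLE_OUTPUT))
print(suspects(results))
```

Threads suit this kind of work because each check spends most of its time waiting on the device. Detecting incrementing errors would require running the check twice and comparing the counters between runs.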
Figure 10: NetBrain’s ‘Quick’ Programming Environment

Each new Qapp becomes a new feature, and it leverages a dynamic map to display the output. For troubleshooters, every Qapp is an executable diagnosis which can automatically extract and analyze the CLI data which would otherwise be collected manually. This helps network teams troubleshoot virtually any network issue in parallel, rather than one device at a time. It also helps network teams digitize and share their troubleshooting checklists.
4. Case Study: Dimension Data Accelerates Troubleshooting on Customer Networks

CUSTOMER PROFILE:
Industry: Managed Services
Company: Dimension Data

CHALLENGE: Dimension Data does not own the customer networks it manages, so it struggles to gain and maintain the intimate knowledge of those networks that normally comes from day-to-day operations.

SOLUTION: Dimension Data utilizes NetBrain to automate diagram creation, visualize performance issues to expedite diagnosis, and easily share information for collaborative troubleshooting sessions.

BENEFIT: NetBrain’s advanced network visualization and automation capabilities enable Dimension Data to shorten typical diagnosis and repair time by as much as 50%.

Dimension Data specializes in information technology services, with operations on every inhabited continent. Dimension Data’s focus areas include network integration, security solutions, data center solutions, converged communications, and a range of professional, consulting, and managed services. A major challenge the company consistently faces is understanding its customers’ networks well enough to diagnose and troubleshoot complex issues and resolve network outages effectively.

Dimension Data deployed NetBrain in its customer environments, in many cases integrating the tool with the NetCool alarm system, Opsware configuration management solution, and Vitalnet’s performance trending solution. With these integrations, an alarm reported by HP OpenView is instantly translated to a map inside NetBrain Workstation.

NetBrain continues to offer value to Dimension Data in three areas:
o On-demand network mapping effectively removes dependencies on manual network diagrams, which are often inconsistent and error-prone.
o Network performance diagnosis via Dynamic Diagrams enables lower-level engineers to troubleshoot advanced problems.
o Engineers share information via NetBrain for collaboration.

The following are some ‘war stories’ reported by this customer:

Detecting Serious Routing Issues on the Accudyne Network

NetBrain was able to provide real-time network visibility into the Accudyne network and help identify serious routing issues. The tool was used to highlight the congestion points on the map and ultimately tie the problem to equal-cost routes and MPLS design segregation.

Troubleshooting Slowness to a Server

Previously it took Dimension Data almost two and a half hours to determine the path between source and destination for an application server inside the Accudyne network, followed by another two hours to diagnose the problem. With NetBrain, finding the path took two minutes, and another five minutes was all that was needed to diagnose the issue.

Troubleshooting MS Outlook Slowness to Tokyo

The Tokyo office was experiencing slowness sending Outlook attachments. Multiple tickets had been opened for this issue and several engineers had already looked into it. NetBrain was then applied and, within three minutes, it was determined that there was a duplex issue on the edge WAN port.

NetBrain saves time when time is critical. As a Dimension Data Network Integration Engineer reported, “It has changed the way I approach troubleshooting.”
About NetBrain Technologies, Inc.

Founded in 2004, NetBrain set out to pursue a new vision: automate time-consuming tasks associated with network documentation, design, and troubleshooting. NetBrain's customers are using map-driven automation to eliminate manual network documentation, automate troubleshooting tasks, and mitigate security risks. NetBrain is headquartered in Burlington, MA with offices in Sacramento, CA, New York, and Beijing, China.

To learn more about NetBrain's dynamic mapping solution, contact us at 781.221.7199 or download a free trial of NetBrain's Enterprise Suite from www.netbraintech.com/trial.

NetBrain Technologies, Inc.
15 Network Drive
Burlington, MA 01803
+1 800 605 7964
info@netbraintech.com
www.netbraintech.com