Historically, about 60-70% of all network issues are directly tied to change – not issues like fiber cuts, power outages or hardware failure.External threats or malicious attacks receive all of the publicity, however a recent Netcordia study reinforced the analyst’s estimates but also highlighted that internal, best intentioned mistakes are the biggest threat to network availability. The good news is that this issue is easy to control with the right management and visibility.
Request a changeReview changeApprovalSchedule ticketMake change/close ticket
Request a changeReview changeApprovalSchedule ticketMake change/close ticketWas it a good change?Did it impact neighboring devices?Is it within policy?Was it successful?
In dealing with the impact of change, most users are reactive in nature – if everything is green or no one complains, everything must be fine. Once the red light appears or users complain, the troubleshooting process begins.The biggest challenge is finding device configurations or changes that are suboptimal – meaning they don’t cause an immediate impact, the problem occurs hours, days or weeks later and then the time to identify and solve the problem grows exponentially as staff must wade through all of the changes that occurred and try to find the issue.
In dealing with the impact of change, most users are reactive in nature – if everything is green or no one complains, everything must be fine. Once the red light appears or users complain, the troubleshooting process begins.The biggest challenge is finding device configurations or changes that are suboptimal – meaning they don’t cause an immediate impact, the problem occurs hours, days or weeks later and then the time to identify and solve the problem grows exponentially as staff must wade through all of the changes that occurred and try to find the issue.
IT has always been asked to do more with less, but in today’s economic times, it’s harder as staff have either been reduced already or future growth is limited.However, network availability and health must stay as good or improve, so improving productivity with existing staff is becoming more of a focus.Organizations are still investing in solutions to improve performance and efficiency, but more stringent justification is needed.
Organizations understand consistent or standardized networks are inherently more stable and resilient since best practices or standards are cultivated over time.Meeting internal or external compliance requirements should improve network health and reduce the risk of poor performance over time.
NetMRI completes the lifecycle of network change and configurationComplements existing change management process and/or toolsBuilt-in automation and intelligence to manage network changesReduces risk and improves performance by closing the loop in the change processFast to deploy and delivers actionable information within an hourIndustry-leading device auto-discoveryEmbedded expertise and best practicesAutomation for configurations to reduce time to deploy and minimize risk of mistakesRemediation options and job schedulingOnly solution that ties network change, configuration and compliance with network healthSingle platform to correlate impact of change over timeNetwork scorecard highlights trending on overall network infrastructure healthCompliance/consistency view ties to change to identify violations of standards proactivelyCapability to define users and roles based on individual needAccess can include full admin, generate reports, request information, schedule job that needs approval and many moreDefined by entire network, device group or individual deviceAudit trail shows processes and verifies changesAuto-discovery provides rich view of topology – not just ping sweepsTopology view created automatically to understand relationship of device neighborsAbility to add overlays such as change, issues and policy violationsBuilt-in intelligence and analysis evaluates relationship between multiple devices, not just individual devices
IP Expo 2009 - Top 5 Biggest Network Management Mistakes - and How to Avoid Them - Presentation Transcript
Top 5 Biggest Network Management Mistakes – and how to avoid them. Yama Habibzai
In the News….
August 3, 2009 - Paypal goes down – payment services are lost for over 6 hours!
Cause: – a "back-end router" complicated by a change that took place
August 9, 2009 - Cisco’s web site goes down
Cause: A “human error”
September 1, 2009 - Gmail goes down worldwide
Cause: A change by an admin intended to improve traffic flow
Over 50% of all large businesses are unprotected against network outages
Network World Survey Sept 2009
Half of all data center are planning cuts. Nearly 12% expect more disruptions due to less people checking things.
Source: The Association for Computer Operations Management (AFCOM) Survey
What Is the Problem?
Mistake 1 Not Managing Change
Internal Change is Biggest Worry From where do you anticipate the greatest threat to network availability? “With 64% of our respondents’ votes, internal change continues to be network administrators’ greatest worry. While the vast majority of these changes are inadvertent, and probably trying to help the network, in the end they can end up hurting or crippling network performance.” Netcordia survey of more than 450 network engineers
What is the #1 Cause of Network problems? People Making Changes They Think Are Improving Network Performance
Mistake 2 Not Understanding the Impact of Change
Change Management Process Request a change Review change Make change/close ticket Receive approval Schedule ticket But what is missing from this process?
Change Management Process Request a change Successful? Review change Approval Within policy? Schedule ticket Impact neighbors? Change/ close ticket Good change?
Performance and Change Are Separate Performance Team
Track up/down status
Monitor response time
Measure usage and utilization
Verify service level agreements
Change Team
Manage network devices
Implement changes and configurations
Maintain consistency and standards
Manage ticket system
Need a Single View Into Change & Performance
Mistake 3 Manual Network Management / Lack of Automation
Manual Change Processes
Dealing with network change
Manual process
Custom programs, freeware, spreadsheets
Build scripts
Human error is biggest challenge with change
Error touch to device adds new risk
Scripts grow over time, owners move on
Today’s world – cannot waste time
Did you know?
3.6% of annual revenue is lost due to network problems. Source: Infonetics Research
Change management tools deployment save $16M+ annually. Source: EMA
Number #1 cause of network problems? People: 60-80% of the time. Source: Gartner, IDC, EMA, Forrester
Percentage of network problems associated with configuration issues? 60+%. source: EMA
Mistake 4 Access Permissions
Too Much Device Access
Most organizations have one password for devices
Either have full admin rights or none
Limits control to make changes on devices
Many organizations rely on processes or fear to reduce risk
Major exposure for compliance requirements
Mistake 5 Manual Policy Management
Manual Policy Management
Policies are handled manually
Little understanding of impact of change on policies
Creating new policies is hard. Deploying policies and managing policies across various devices even harder
Staff turnover makes policy management a nightmare
Don’t Fall Into the Same Traps as Others
Manage Change – Take control of #1 cause of network outages
Understand impact of change
Use automation to achieve goals
Control access
Automate policy management
About Netcordia In –Depth Data Collection Config | SNMP | Syslog | VoIP Built-in Intelligence and Discovery AutomatedCorrectiveActions Audit NCCM Analysis into Impact of Change Analyze Automate Built-in Compliance Reports Single View into Policy Violations
Award Winning Software Netcordia listed as a Deloitte Fast 500 company Netcordia named 30th fastest growing privately held software company in the US Netcordia named “2009 SmartCEO Future 50 Award Winner Netcordia named “One of the top 100 Privately Held Technology Companies of 2009”
When asked to identify the No. 1 cause of network p more
When asked to identify the No. 1 cause of network performance problems, IT organizations typically point to denial-of-service attacks, computer viruses, fiber cuts, power outages and hardware failures – the events that receive the most publicity and media coverage. However, studies by research firms such as Gartner Group and Enterprise Management Associates indicate that two-thirds of network issues are actually tied to seemingly simple everyday activity: the process of IT staff making network configuration changes. This session will cover the five most common network performance management mistakes and how to avoid them. less
0 comments
Post a comment