Learn how to use open source solutions for network monitoring to gain the visibility you need into the status, performance and responsiveness of your enterprise, cloud or hybrid application environments. Get a faster, easier way to start collecting data from multiple sources and to perform root-cause analysis quickly, reducing your MTTR.
3. The Move to Autonomy
Automate, Learn, Observe, Instrument
4. Information Consumers
Network Engineering Team Leader: "I need those BGP AS and MPLS Label metrics right now."
CIO: "I don't care about tools, I want to see results!"
Capacity Planner: "Where are the bottlenecks now? Where will they be next month?"
3rd-line Application Support: "Traders demand microsecond application response - I've got no patience for slow monitoring systems"
5. Key Indicators
Monitoring is all about turning lots of raw metrics into meaningful indicators
Availability: Is the resource available?
Performance: How well is it performing?
Capacity: How much are we using?
Usage: What/who is driving consumption?
Health: How are we doing? What is the overall status? Where are we at risk?
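As an illustration of turning raw metrics into indicators, here is a minimal sketch that derives an availability and a capacity figure from polled interface samples. The `Sample` shape, the sample values and the 100 Mbit/s link speed are assumptions made up for this example, and counter wrap-around is deliberately ignored.

```python
from dataclasses import dataclass


@dataclass
class Sample:
    """One poll of a network interface (hypothetical shape)."""
    timestamp: float  # seconds since the first poll
    up: bool          # True if the poll succeeded and the link was up
    in_octets: int    # cumulative inbound byte counter, similar to SNMP ifInOctets


def availability(samples):
    """Availability indicator: fraction of polls in which the resource was up."""
    return sum(s.up for s in samples) / len(samples)


def utilization(prev, curr, link_bps):
    """Capacity indicator: average inbound load between two polls,
    as a fraction of the link speed. Counter wrap is not handled."""
    bits = (curr.in_octets - prev.in_octets) * 8
    seconds = curr.timestamp - prev.timestamp
    return bits / seconds / link_bps


# Made-up raw metrics: four polls, one minute apart.
samples = [
    Sample(0.0, True, 0),
    Sample(60.0, True, 75_000_000),
    Sample(120.0, False, 75_000_000),
    Sample(180.0, True, 300_000_000),
]

print(availability(samples))                                # 3 of 4 polls were up
print(utilization(samples[0], samples[1], 100_000_000))     # share of a 100 Mbit/s link
```

The same pattern extends to the other indicators: each one is a small function over raw samples, which keeps dashboards and alerts focused on a handful of derived values.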
6. Pillars for Success and Growth
Cost Centric
• Are network resources available?
• How is the network performing (latency, error rates, bandwidth consumption)?
• Is root cause identified promptly?
• Who/what is driving consumption?
• Do I have the right tools to acquire actionable insight and prediction models?
Profit Centric
• Are network resources fully utilized (provisioned vs. used)?
• How are KPIs impacting revenue generation?
• Is the Mean Time to Repair (MTTR) impacting user experience and causing churn?
• Is there network misuse, or are there new revenue-generating use cases?
• Are mission-critical service KPIs being effectively tracked and tackled?
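For reference, MTTR is simple arithmetic over incident records: the average of repair durations. The sketch below assumes each incident is a hypothetical (detected, resolved) pair of timestamps; the dates are invented for the example.

```python
from datetime import datetime, timedelta


def mttr(incidents):
    """Mean time to repair: average of (resolved - detected) over all incidents."""
    total = sum((resolved - detected for detected, resolved in incidents), timedelta())
    return total / len(incidents)


# Made-up incidents: one repaired in 30 minutes, one in 90 minutes.
incidents = [
    (datetime(2024, 1, 1, 9, 0), datetime(2024, 1, 1, 9, 30)),
    (datetime(2024, 1, 2, 14, 0), datetime(2024, 1, 2, 15, 30)),
]

print(mttr(incidents))  # 1:00:00
```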
9. Rapidly Expanding Open Source Community
Estimated InfluxDB Monthly Growth (chart; InfluxDB announced Nov 2013; labeled value: 15,704)
~205,698 databases in use today
10. InfluxData
The Modern Engine for Metrics and Events
✓ Purpose-built for time series data
✓ Ingest of large volumes of points per second
✓ Real-time queries on large data sets
✓ Rapid eviction and transformation of data
✓ Downsampling of high-precision data
✓ Storage optimization and compression
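To make "downsampling of high-precision data" concrete, here is a minimal sketch of the general idea in plain Python: raw points are grouped into fixed time windows and each window is replaced by its mean. It illustrates the technique only and is not how InfluxDB implements it; the function name, window size and data are assumptions.

```python
from collections import defaultdict
from statistics import mean


def downsample(points, window_s):
    """Average (timestamp, value) points into fixed windows of window_s seconds.

    Each output point is (window start, mean of the values in that window).
    """
    buckets = defaultdict(list)
    for ts, value in points:
        buckets[ts - ts % window_s].append(value)
    return [(start, mean(values)) for start, values in sorted(buckets.items())]


# Made-up 10-second samples rolled up into 60-second windows.
raw = [(0, 10.0), (10, 20.0), (20, 30.0), (60, 40.0), (70, 60.0)]
rollup = downsample(raw, 60)
print(rollup)  # [(0, 20.0), (60, 50.0)]
```

Keeping only the rollup for older data trades precision for storage, which is why downsampling is usually paired with eviction of the raw points.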
11. Platform Strategy:
Be The Platform of Choice for All Metrics and Event Workloads
Common Metrics and Events Platform
• Application, Custom Logs & Traces
• Business Metrics
• Infrastructure & Application Metrics
• IoT Sensor Events
12. InfluxData Platform Features
INSTRUMENT, OBSERVE, AUTOMATE, LEARN
✓ Quickly ingest data from everywhere
✓ Efficiently store (compress) the data at scale
✓ Support real-time query, analysis and visualization of large data sets
✓ Provide time-based functions for “change over time” analysis and control
✓ Provide automation and control functions
✓ Evict and downsample data
✓ Facilitate machine learning and anomaly detection algorithms
✓ Provide streaming analytics for data in motion
METRICS, EVENTS
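As a rough idea of what "ingest data from everywhere" looks like on the wire, the sketch below formats one network metric as a line of InfluxDB line protocol (measurement, tags, fields, timestamp). The slides do not show this step, so the helper names, the measurement and the tag values are assumptions, and the escaping is simplified compared with the full specification.

```python
def _escape(text, chars):
    """Backslash-escape each character in chars (simplified escaping)."""
    for ch in chars:
        text = text.replace(ch, "\\" + ch)
    return text


def _format_field(value):
    """Render a field value: booleans, integers (i suffix), floats, quoted strings."""
    if isinstance(value, bool):  # check bool first, since bool is a subclass of int
        return "true" if value else "false"
    if isinstance(value, int):
        return f"{value}i"
    if isinstance(value, float):
        return repr(value)
    escaped = str(value).replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def to_line_protocol(measurement, tags, fields, timestamp_ns):
    """Build one line: measurement,tag=value,... field=value,... timestamp."""
    tag_part = "".join(
        f",{_escape(k, ', =')}={_escape(str(v), ', =')}"
        for k, v in sorted(tags.items())
    )
    field_part = ",".join(
        f"{_escape(k, ', =')}={_format_field(v)}" for k, v in fields.items()
    )
    return f"{_escape(measurement, ', ')}{tag_part} {field_part} {timestamp_ns}"


# Hypothetical interface metric for a router.
line = to_line_protocol(
    "interface",
    {"host": "router1", "ifName": "ge-0/0/0"},
    {"in_octets": 1500, "util": 0.25},
    1700000000000000000,
)
print(line)
# interface,host=router1,ifName=ge-0/0/0 in_octets=1500i,util=0.25 1700000000000000000
```

In practice a collector such as Telegraf or a client library would produce these lines; the sketch only shows how tags (indexed metadata) and fields (measured values) are separated.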
18. PayPal was looking for a “Host Monitoring” solution to replace their antiquated monitoring systems. They built and use these monitoring solutions to help them improve PayPal's operational efficiencies while mitigating incidents involving multiple teams.
19. RingCentral built a “DIY” metric collector solution with InfluxData for their 1,500 engineers. This fully automatic service is integrated with their deployment systems and allows their engineers to gather any metric, view it in their own dashboards, and set their own alerts.
20. “It’s very important today to deliver data with high granularity, because people want to correlate host monitoring with traffic monitoring. We believe we made a good choice jumping to InfluxDB.”
Luca Deri, Founder
21. LibreNMS is an open source network monitoring system. Until recently, they were dependent on RRDTool. They have since switched to InfluxDB so their users can store more data and build better dashboards.
22. Learn More
• https://www.influxdata.com/solutions/networkmonitoring/
• Cisco: Introducing Pipeline: A Model-Driven Telemetry Collection Service
• Juniper: Guidelines for Aggregating Junos Telemetry Interface Data
• Meraki: How to Monitor SNMP Devices with Telegraf and InfluxDB and Grafana | Meraki WAN Data
• Metamako: Introducing Telemetry: Actionable Time Series Data from Counters