Martin Moucka [Red Hat] | How Red Hat Uses gNMI, Telegraf and InfluxDB to Gain Network Visibility | InfluxDays NA 2021

InfluxData
InfluxDataInfluxData
How Red Hat Uses gNMI,
Telegraf and InfluxDB to
Gain Network Visibility
Martin Moucka - Principal Network Engineer
Red Hat
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
Agenda
• Introduction
• Scope
• Why InfluxDB?
• Architecture
• Visualizations
• Flux
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
Red Hat
The world’s leading provider
of open source enterprise IT solutions
MORE THAN
90%
of the
FORTUNE
500
RED HAT
use
PRODUCTS &
SOLUTIONS*
~13,815
EMPLOYEES
105+
OFFICES
40+
COUNTRIES
THE FIRST
$3
OPEN
SOURCE
COMPANY
IN THE WORLD
BILLION
© 2021  InfluxData Inc. All Rights Reserved.
Martin Moucka
Principal Network Engineer, Red Hat
● With company for more than 7 years
● Built a network automation around Ansible, utilizing single source of truth
● Started transition to modern monitoring connected to the network automation
● Tech lead of Network Automation & Tools team
E-mail: mmoucka@redhat.com
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
Network Monitoring
Network monitoring provides insight to
the network. It monitors the status of
network devices (switches, routers,
firewalls, etc..), network
status/performance. It provides a
graphical view of metrics (e.g. link
utilization) and/or device status (e.g. up
or down) together with alerting when
something is out of service.
Key Capabilities of Network Monitoring
Performance metric visualizations. Monitoring of the network
for performance issues, display information in a visual format
(Dashboards) - understand your network performance at a
glance.
Network alerts. Alert on any problems that occur. Discovery of
issues from monitored data, augment alert data with relevant
information helping support teams to respond quickly.
Network mapping. Visualization of complex network
landscapes in a map format including device/network health
state.
Bandwidth monitoring. Identify where network bandwidth
usage is not optimal, and drive decisions to improve utilization.
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
Scope
• Juniper, Cisco (WLC, ASA, IOS, UCS, etc...), OpenGear, F5 and Mist
• Custom probes for synthetic monitoring
• 60+ sites
• ~ 1.6k monitored devices
• ~ 14k monitored interfaces
• 5 collectors
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
Why InfluxDB?
• Open Source with Enterprise support
• Efficient data storage
• Flexibility in integrations/languages
• Modular agent Telegraf with support of JTI (Juniper Telemetry Int.)
• Support for SQL-like query language
• Flux as powerful flexible query language
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
Solution Architecture
Distributed Monitoring
Services / Storage
Network Devices
Telegraf/Kapacitor/InfluxDB
Troubleshooting
Network
Automation
Adding/Removing
device
Event
Management
Visualization
Probes
Alert
Check / Send data
Manual intervention
Event
Automation
Troubleshooting
Fix
Configure
Configure
New monitored
system/device
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
Visualizations - Immediate response
• Device detailed status
• Interface utilization (SNMP / gNMI)
• Interface errors (SNMP / gNMI)
• CPU/Memory utilization (SNMP)
• BGP neighbors status (SNMP / gNMI in progress)
• etc...
• Site View
• Data from probe (Latency, Packet loss, HTTP response time, DNS delay)
• SLI/SLO status (Kapacitor processed + Flux query)
• Internet link utilization (processed by Kapacitor)
• Top talkers (from other tool via RestAPI)
• Wireless status
• Statistics of WLC/APs and connected clients
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
14
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
Visualizations - Long-term planning
• Link capacity utilization
• Status page based on SLI/SLO
• Wireless AP (Cisco WLC) anomaly detection - Flux
• Compliance reporting
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
Flux
• Provides very flexible programmatic way of query
• Allows changing data type within a query
• Within compliance report, we connect up to 5 different
measurements
• Used for access point, poor SNR anomaly detection across regions
• Focus where it matters most
• Allows custom functions
• Median Absolute Deviation used for anomaly detection
• Well-documented at
https://www.influxdata.com/blog/anomaly-detection-with-median-abs
olute-deviation/
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
Median Absolute Deviation - Function
import "math"
import "experimental"
mad = (table=<-, threshold=3.0) => {
data = table |> group(columns: ["_time"], mode:"by")
med = data |> median(column: "_value")
diff = join(tables: {data: data, med: med}, on: ["_time"], method: "inner")
|> map(fn: (r) => ({ r with _value: math.abs(x: r._value_data - r._value_med) }))
|> drop(columns: ["_start", "_stop", "_value_med", "_value_data"])
k = 1.4826
diff_med =
diff
|> median(column: "_value")
|> map(fn: (r) => ({ r with MAD: k * r._value}))
|> filter(fn: (r) => r.MAD > 0.0)
output = join(tables: {diff: diff, diff_med: diff_med}, on: ["_time"], method: "inner")
|> map(fn: (r) => ({ r with _value: r._value_diff/r._value_diff_med}))
|> map(fn: (r) => ({ r with
level:
if r._value >= threshold then "anomaly"
else "normal"
}))
return output
}
© 2021  InfluxData Inc. All Rights Reserved.
© 2021  InfluxData Inc. All Rights Reserved.
Median Absolute Deviation - Usage
pc_duration = from(bucket: "XXXXXX")
|> range(start: v.timeRangeStart, stop: v.timeRangeStop)
|> filter(fn: (r) =>
r._measurement == "bsnAPTable" and
r._field =~ /radio1PoorSNRClients|radio1Users/ and
r.region == "${region}"
)
|> pivot(rowKey:["_time"], columnKey: ["_field"], valueColumn: "_value")
|> filter(fn: (r) =>
r.radio1PoorSNRClients > 0 and
r.radio1Users > 0
)
|> map(fn: (r) => ({ r with CNPR: float(v: r.radio1PoorSNRClients) / float(v: r.radio1Users)}))
|> stateDuration(
fn: (r) => r.CNPR >= 0.1,
column: "duration"
)
|> map(fn: (r) => ({ r with _value: float(v: r.duration) / float(v: r.CNPR)}))
|> filter(fn: (r) => r._value > 0)
|> truncateTimeColumn(unit: 1h)
|> toFloat()
pc_duration |> mad(threshold:10.0)
|> filter(fn: (r) => r.level == "anomaly")
|> group(columns: ["APName"])
|> count()
|> group()
© 2021  InfluxData Inc. All Rights Reserved.
Questions?
© 2021  InfluxData Inc. All Rights Reserved.
Thank You
1 of 22

Recommended

How Cisco Provides World-Class Technology Conference Experiences Using Automa... by
How Cisco Provides World-Class Technology Conference Experiences Using Automa...How Cisco Provides World-Class Technology Conference Experiences Using Automa...
How Cisco Provides World-Class Technology Conference Experiences Using Automa...InfluxData
597 views38 slides
Running Airflow Workflows as ETL Processes on Hadoop by
Running Airflow Workflows as ETL Processes on HadoopRunning Airflow Workflows as ETL Processes on Hadoop
Running Airflow Workflows as ETL Processes on Hadoopclairvoyantllc
2K views24 slides
How to Import JSON Using Cypher and APOC by
How to Import JSON Using Cypher and APOCHow to Import JSON Using Cypher and APOC
How to Import JSON Using Cypher and APOCNeo4j
861 views26 slides
Bare Metal Cluster with Kubernetes, Istio and Metallb | Nguyen Phuong An, Ngu... by
Bare Metal Cluster with Kubernetes, Istio and Metallb | Nguyen Phuong An, Ngu...Bare Metal Cluster with Kubernetes, Istio and Metallb | Nguyen Phuong An, Ngu...
Bare Metal Cluster with Kubernetes, Istio and Metallb | Nguyen Phuong An, Ngu...Vietnam Open Infrastructure User Group
749 views30 slides
Ingesting data at scale into elasticsearch with apache pulsar by
Ingesting data at scale into elasticsearch with apache pulsarIngesting data at scale into elasticsearch with apache pulsar
Ingesting data at scale into elasticsearch with apache pulsarTimothy Spann
1.3K views31 slides
Apache Tez - A New Chapter in Hadoop Data Processing by
Apache Tez - A New Chapter in Hadoop Data ProcessingApache Tez - A New Chapter in Hadoop Data Processing
Apache Tez - A New Chapter in Hadoop Data ProcessingDataWorks Summit
18.3K views39 slides

More Related Content

What's hot

How to Monitor DOCSIS Devices Using SNMP, InfluxDB, and Telegraf by
How to Monitor DOCSIS Devices Using SNMP, InfluxDB, and TelegrafHow to Monitor DOCSIS Devices Using SNMP, InfluxDB, and Telegraf
How to Monitor DOCSIS Devices Using SNMP, InfluxDB, and TelegrafInfluxData
242 views19 slides
Rainbird: Realtime Analytics at Twitter (Strata 2011) by
Rainbird: Realtime Analytics at Twitter (Strata 2011)Rainbird: Realtime Analytics at Twitter (Strata 2011)
Rainbird: Realtime Analytics at Twitter (Strata 2011)Kevin Weil
77K views60 slides
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ... by
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...Flink Forward
575 views34 slides
Apache Kylin – Cubes on Hadoop by
Apache Kylin – Cubes on HadoopApache Kylin – Cubes on Hadoop
Apache Kylin – Cubes on HadoopDataWorks Summit
8.5K views42 slides
Sbt baby steps by
Sbt baby stepsSbt baby steps
Sbt baby stepsMarina Sigaeva
3.6K views58 slides
Flink powered stream processing platform at Pinterest by
Flink powered stream processing platform at PinterestFlink powered stream processing platform at Pinterest
Flink powered stream processing platform at PinterestFlink Forward
223 views39 slides

What's hot(20)

How to Monitor DOCSIS Devices Using SNMP, InfluxDB, and Telegraf by InfluxData
How to Monitor DOCSIS Devices Using SNMP, InfluxDB, and TelegrafHow to Monitor DOCSIS Devices Using SNMP, InfluxDB, and Telegraf
How to Monitor DOCSIS Devices Using SNMP, InfluxDB, and Telegraf
InfluxData242 views
Rainbird: Realtime Analytics at Twitter (Strata 2011) by Kevin Weil
Rainbird: Realtime Analytics at Twitter (Strata 2011)Rainbird: Realtime Analytics at Twitter (Strata 2011)
Rainbird: Realtime Analytics at Twitter (Strata 2011)
Kevin Weil77K views
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ... by Flink Forward
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
Flink Forward575 views
Flink powered stream processing platform at Pinterest by Flink Forward
Flink powered stream processing platform at PinterestFlink powered stream processing platform at Pinterest
Flink powered stream processing platform at Pinterest
Flink Forward223 views
Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Verv... by Flink Forward
Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Verv...Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Verv...
Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Verv...
Flink Forward1.9K views
Unify Stream and Batch Processing using Dataflow, a Portable Programmable Mod... by DataWorks Summit
Unify Stream and Batch Processing using Dataflow, a Portable Programmable Mod...Unify Stream and Batch Processing using Dataflow, a Portable Programmable Mod...
Unify Stream and Batch Processing using Dataflow, a Portable Programmable Mod...
DataWorks Summit3K views
Hashicorp Terraform Open Source vs Enterprise by Stenio Ferreira
Hashicorp Terraform Open Source vs EnterpriseHashicorp Terraform Open Source vs Enterprise
Hashicorp Terraform Open Source vs Enterprise
Stenio Ferreira2.2K views
Tuning Apache Kafka Connectors for Flink.pptx by Flink Forward
Tuning Apache Kafka Connectors for Flink.pptxTuning Apache Kafka Connectors for Flink.pptx
Tuning Apache Kafka Connectors for Flink.pptx
Flink Forward428 views
Kafka + Uber- The World’s Realtime Transit Infrastructure, Aaron Schildkrout by confluent
Kafka + Uber- The World’s Realtime Transit Infrastructure, Aaron SchildkroutKafka + Uber- The World’s Realtime Transit Infrastructure, Aaron Schildkrout
Kafka + Uber- The World’s Realtime Transit Infrastructure, Aaron Schildkrout
confluent11.9K views
Kubernetes deployment strategies - CNCF Webinar by Etienne Tremel
Kubernetes deployment strategies - CNCF WebinarKubernetes deployment strategies - CNCF Webinar
Kubernetes deployment strategies - CNCF Webinar
Etienne Tremel3.1K views
The top 3 challenges running multi-tenant Flink at scale by Flink Forward
The top 3 challenges running multi-tenant Flink at scaleThe top 3 challenges running multi-tenant Flink at scale
The top 3 challenges running multi-tenant Flink at scale
Flink Forward328 views
Designing a complete ci cd pipeline using argo events, workflow and cd products by Julian Mazzitelli
Designing a complete ci cd pipeline using argo events, workflow and cd productsDesigning a complete ci cd pipeline using argo events, workflow and cd products
Designing a complete ci cd pipeline using argo events, workflow and cd products
Julian Mazzitelli11.5K views
OpenStack dans la pratique by Osones
OpenStack dans la pratiqueOpenStack dans la pratique
OpenStack dans la pratique
Osones5.6K views
How we can do Multi-Tenancy on Kubernetes by Opsta
How we can do Multi-Tenancy on KubernetesHow we can do Multi-Tenancy on Kubernetes
How we can do Multi-Tenancy on Kubernetes
Opsta336 views
Time Series Data with InfluxDB by Turi, Inc.
Time Series Data with InfluxDBTime Series Data with InfluxDB
Time Series Data with InfluxDB
Turi, Inc.6.3K views

Similar to Martin Moucka [Red Hat] | How Red Hat Uses gNMI, Telegraf and InfluxDB to Gain Network Visibility | InfluxDays NA 2021

Webofthing_WOT_vs_IOT.pptx by
Webofthing_WOT_vs_IOT.pptxWebofthing_WOT_vs_IOT.pptx
Webofthing_WOT_vs_IOT.pptxjainam bhavsar
409 views63 slides
Rethinking the Database in the IoT Era by
Rethinking the Database in the IoT EraRethinking the Database in the IoT Era
Rethinking the Database in the IoT EraInfluxData
66 views27 slides
Data Center for Cloud Computing - DC3X by
Data Center for Cloud Computing - DC3XData Center for Cloud Computing - DC3X
Data Center for Cloud Computing - DC3XRenaud Blanchette
469 views25 slides
Who Moved My Network? Mastering Hybrid WANs with ThousandEyes and Cisco by
Who Moved My Network? Mastering Hybrid WANs with ThousandEyes and CiscoWho Moved My Network? Mastering Hybrid WANs with ThousandEyes and Cisco
Who Moved My Network? Mastering Hybrid WANs with ThousandEyes and CiscoThousandEyes
505 views25 slides
A New Way of Thinking | NATS 2.0 & Connectivity by
A New Way of Thinking | NATS 2.0 & ConnectivityA New Way of Thinking | NATS 2.0 & Connectivity
A New Way of Thinking | NATS 2.0 & ConnectivityNATS
752 views57 slides
Living objects network performance_management_v2 by
Living objects network performance_management_v2Living objects network performance_management_v2
Living objects network performance_management_v2Yoan SMADJA
1.3K views16 slides

Similar to Martin Moucka [Red Hat] | How Red Hat Uses gNMI, Telegraf and InfluxDB to Gain Network Visibility | InfluxDays NA 2021(20)

Rethinking the Database in the IoT Era by InfluxData
Rethinking the Database in the IoT EraRethinking the Database in the IoT Era
Rethinking the Database in the IoT Era
InfluxData66 views
Who Moved My Network? Mastering Hybrid WANs with ThousandEyes and Cisco by ThousandEyes
Who Moved My Network? Mastering Hybrid WANs with ThousandEyes and CiscoWho Moved My Network? Mastering Hybrid WANs with ThousandEyes and Cisco
Who Moved My Network? Mastering Hybrid WANs with ThousandEyes and Cisco
ThousandEyes505 views
A New Way of Thinking | NATS 2.0 & Connectivity by NATS
A New Way of Thinking | NATS 2.0 & ConnectivityA New Way of Thinking | NATS 2.0 & Connectivity
A New Way of Thinking | NATS 2.0 & Connectivity
NATS752 views
Living objects network performance_management_v2 by Yoan SMADJA
Living objects network performance_management_v2Living objects network performance_management_v2
Living objects network performance_management_v2
Yoan SMADJA1.3K views
Building the SD-Branch using uCPE by Michelle Holley
Building the SD-Branch using uCPEBuilding the SD-Branch using uCPE
Building the SD-Branch using uCPE
Michelle Holley2.4K views
Getting Started: Intro to Telegraf - July 2021 by InfluxData
Getting Started: Intro to Telegraf - July 2021Getting Started: Intro to Telegraf - July 2021
Getting Started: Intro to Telegraf - July 2021
InfluxData536 views
SDN 101: Software Defined Networking Course - Sameh Zaghloul/IBM - 2014 by SAMeh Zaghloul
SDN 101: Software Defined Networking Course - Sameh Zaghloul/IBM - 2014SDN 101: Software Defined Networking Course - Sameh Zaghloul/IBM - 2014
SDN 101: Software Defined Networking Course - Sameh Zaghloul/IBM - 2014
SAMeh Zaghloul18.1K views
Understanding Cisco Next Generation SD-WAN Solution by Cisco Canada
Understanding Cisco Next Generation SD-WAN SolutionUnderstanding Cisco Next Generation SD-WAN Solution
Understanding Cisco Next Generation SD-WAN Solution
Cisco Canada10K views
Edge Computing Platforms and Protocols - Ph.D. thesis by Nitinder Mohan
Edge Computing Platforms and Protocols - Ph.D. thesisEdge Computing Platforms and Protocols - Ph.D. thesis
Edge Computing Platforms and Protocols - Ph.D. thesis
Nitinder Mohan541 views
Virtual training intro to InfluxDB - June 2021 by InfluxData
Virtual training  intro to InfluxDB  - June 2021Virtual training  intro to InfluxDB  - June 2021
Virtual training intro to InfluxDB - June 2021
InfluxData517 views
Understanding Cisco’s Next Generation SD-WAN Solution with Viptela by Cisco Canada
Understanding Cisco’s Next Generation SD-WAN Solution with ViptelaUnderstanding Cisco’s Next Generation SD-WAN Solution with Viptela
Understanding Cisco’s Next Generation SD-WAN Solution with Viptela
Cisco Canada14.8K views
Maximizing Real-Time Data Processing with Apache Kafka and InfluxDB: A Compre... by HostedbyConfluent
Maximizing Real-Time Data Processing with Apache Kafka and InfluxDB: A Compre...Maximizing Real-Time Data Processing with Apache Kafka and InfluxDB: A Compre...
Maximizing Real-Time Data Processing with Apache Kafka and InfluxDB: A Compre...
Understanding Cisco’ Next Generation SD-WAN Technology by Cisco Canada
Understanding Cisco’ Next Generation SD-WAN TechnologyUnderstanding Cisco’ Next Generation SD-WAN Technology
Understanding Cisco’ Next Generation SD-WAN Technology
Cisco Canada6.6K views
Ashish Resume 2 by Ashish Pal
Ashish Resume 2 Ashish Resume 2
Ashish Resume 2
Ashish Pal143 views
The Data Center Network Evolution by Cisco Canada
The Data Center Network EvolutionThe Data Center Network Evolution
The Data Center Network Evolution
Cisco Canada3K views

More from InfluxData

Announcing InfluxDB Clustered by
Announcing InfluxDB ClusteredAnnouncing InfluxDB Clustered
Announcing InfluxDB ClusteredInfluxData
100 views30 slides
Best Practices for Leveraging the Apache Arrow Ecosystem by
Best Practices for Leveraging the Apache Arrow EcosystemBest Practices for Leveraging the Apache Arrow Ecosystem
Best Practices for Leveraging the Apache Arrow EcosystemInfluxData
50 views25 slides
How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu... by
How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...
How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...InfluxData
134 views24 slides
Power Your Predictive Analytics with InfluxDB by
Power Your Predictive Analytics with InfluxDBPower Your Predictive Analytics with InfluxDB
Power Your Predictive Analytics with InfluxDBInfluxData
127 views41 slides
Build an Edge-to-Cloud Solution with the MING Stack by
Build an Edge-to-Cloud Solution with the MING StackBuild an Edge-to-Cloud Solution with the MING Stack
Build an Edge-to-Cloud Solution with the MING StackInfluxData
375 views52 slides
Meet the Founders: An Open Discussion About Rewriting Using Rust by
Meet the Founders: An Open Discussion About Rewriting Using RustMeet the Founders: An Open Discussion About Rewriting Using Rust
Meet the Founders: An Open Discussion About Rewriting Using RustInfluxData
235 views12 slides

More from InfluxData(20)

Announcing InfluxDB Clustered by InfluxData
Announcing InfluxDB ClusteredAnnouncing InfluxDB Clustered
Announcing InfluxDB Clustered
InfluxData100 views
Best Practices for Leveraging the Apache Arrow Ecosystem by InfluxData
Best Practices for Leveraging the Apache Arrow EcosystemBest Practices for Leveraging the Apache Arrow Ecosystem
Best Practices for Leveraging the Apache Arrow Ecosystem
InfluxData50 views
How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu... by InfluxData
How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...
How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...
InfluxData134 views
Power Your Predictive Analytics with InfluxDB by InfluxData
Power Your Predictive Analytics with InfluxDBPower Your Predictive Analytics with InfluxDB
Power Your Predictive Analytics with InfluxDB
InfluxData127 views
Build an Edge-to-Cloud Solution with the MING Stack by InfluxData
Build an Edge-to-Cloud Solution with the MING StackBuild an Edge-to-Cloud Solution with the MING Stack
Build an Edge-to-Cloud Solution with the MING Stack
InfluxData375 views
Meet the Founders: An Open Discussion About Rewriting Using Rust by InfluxData
Meet the Founders: An Open Discussion About Rewriting Using RustMeet the Founders: An Open Discussion About Rewriting Using Rust
Meet the Founders: An Open Discussion About Rewriting Using Rust
InfluxData235 views
Introducing InfluxDB Cloud Dedicated by InfluxData
Introducing InfluxDB Cloud DedicatedIntroducing InfluxDB Cloud Dedicated
Introducing InfluxDB Cloud Dedicated
InfluxData129 views
Gain Better Observability with OpenTelemetry and InfluxDB by InfluxData
Gain Better Observability with OpenTelemetry and InfluxDB Gain Better Observability with OpenTelemetry and InfluxDB
Gain Better Observability with OpenTelemetry and InfluxDB
InfluxData398 views
How a Heat Treating Plant Ensures Tight Process Control and Exceptional Quali... by InfluxData
How a Heat Treating Plant Ensures Tight Process Control and Exceptional Quali...How a Heat Treating Plant Ensures Tight Process Control and Exceptional Quali...
How a Heat Treating Plant Ensures Tight Process Control and Exceptional Quali...
InfluxData182 views
How Delft University's Engineering Students Make Their EV Formula-Style Race ... by InfluxData
How Delft University's Engineering Students Make Their EV Formula-Style Race ...How Delft University's Engineering Students Make Their EV Formula-Style Race ...
How Delft University's Engineering Students Make Their EV Formula-Style Race ...
InfluxData105 views
Start Automating InfluxDB Deployments at the Edge with balena by InfluxData
Start Automating InfluxDB Deployments at the Edge with balena Start Automating InfluxDB Deployments at the Edge with balena
Start Automating InfluxDB Deployments at the Edge with balena
InfluxData185 views
Understanding InfluxDB’s New Storage Engine by InfluxData
Understanding InfluxDB’s New Storage EngineUnderstanding InfluxDB’s New Storage Engine
Understanding InfluxDB’s New Storage Engine
InfluxData135 views
Streamline and Scale Out Data Pipelines with Kubernetes, Telegraf, and InfluxDB by InfluxData
Streamline and Scale Out Data Pipelines with Kubernetes, Telegraf, and InfluxDBStreamline and Scale Out Data Pipelines with Kubernetes, Telegraf, and InfluxDB
Streamline and Scale Out Data Pipelines with Kubernetes, Telegraf, and InfluxDB
InfluxData63 views
Ward Bowman [PTC] | ThingWorx Long-Term Data Storage with InfluxDB | InfluxDa... by InfluxData
Ward Bowman [PTC] | ThingWorx Long-Term Data Storage with InfluxDB | InfluxDa...Ward Bowman [PTC] | ThingWorx Long-Term Data Storage with InfluxDB | InfluxDa...
Ward Bowman [PTC] | ThingWorx Long-Term Data Storage with InfluxDB | InfluxDa...
InfluxData74 views
Scott Anderson [InfluxData] | New & Upcoming Flux Features | InfluxDays 2022 by InfluxData
Scott Anderson [InfluxData] | New & Upcoming Flux Features | InfluxDays 2022Scott Anderson [InfluxData] | New & Upcoming Flux Features | InfluxDays 2022
Scott Anderson [InfluxData] | New & Upcoming Flux Features | InfluxDays 2022
InfluxData26 views
Steinkamp, Clifford [InfluxData] | Closing Thoughts | InfluxDays 2022 by InfluxData
Steinkamp, Clifford [InfluxData] | Closing Thoughts | InfluxDays 2022Steinkamp, Clifford [InfluxData] | Closing Thoughts | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Closing Thoughts | InfluxDays 2022
InfluxData9 views
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ... by InfluxData
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
InfluxData10 views
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022 by InfluxData
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022
InfluxData5 views
Paul Dix [InfluxData] The Journey of InfluxDB | InfluxDays 2022 by InfluxData
Paul Dix [InfluxData] The Journey of InfluxDB | InfluxDays 2022Paul Dix [InfluxData] The Journey of InfluxDB | InfluxDays 2022
Paul Dix [InfluxData] The Journey of InfluxDB | InfluxDays 2022
InfluxData112 views
Jay Clifford [InfluxData] | Tips & Tricks for Analyzing IIoT in Real-Time | I... by InfluxData
Jay Clifford [InfluxData] | Tips & Tricks for Analyzing IIoT in Real-Time | I...Jay Clifford [InfluxData] | Tips & Tricks for Analyzing IIoT in Real-Time | I...
Jay Clifford [InfluxData] | Tips & Tricks for Analyzing IIoT in Real-Time | I...
InfluxData19 views

Recently uploaded

Piloting & Scaling Successfully With Microsoft Viva by
Piloting & Scaling Successfully With Microsoft VivaPiloting & Scaling Successfully With Microsoft Viva
Piloting & Scaling Successfully With Microsoft VivaRichard Harbridge
12 views160 slides
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive by
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveNetwork Automation Forum
34 views35 slides
Voice Logger - Telephony Integration Solution at Aegis by
Voice Logger - Telephony Integration Solution at AegisVoice Logger - Telephony Integration Solution at Aegis
Voice Logger - Telephony Integration Solution at AegisNirmal Sharma
39 views1 slide
Design Driven Network Assurance by
Design Driven Network AssuranceDesign Driven Network Assurance
Design Driven Network AssuranceNetwork Automation Forum
15 views42 slides
Uni Systems for Power Platform.pptx by
Uni Systems for Power Platform.pptxUni Systems for Power Platform.pptx
Uni Systems for Power Platform.pptxUni Systems S.M.S.A.
56 views21 slides
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas... by
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...Bernd Ruecker
40 views69 slides

Recently uploaded(20)

Piloting & Scaling Successfully With Microsoft Viva by Richard Harbridge
Piloting & Scaling Successfully With Microsoft VivaPiloting & Scaling Successfully With Microsoft Viva
Piloting & Scaling Successfully With Microsoft Viva
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive by Network Automation Forum
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Voice Logger - Telephony Integration Solution at Aegis by Nirmal Sharma
Voice Logger - Telephony Integration Solution at AegisVoice Logger - Telephony Integration Solution at Aegis
Voice Logger - Telephony Integration Solution at Aegis
Nirmal Sharma39 views
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas... by Bernd Ruecker
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
Bernd Ruecker40 views
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f... by TrustArc
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc11 views
Business Analyst Series 2023 - Week 3 Session 5 by DianaGray10
Business Analyst Series 2023 -  Week 3 Session 5Business Analyst Series 2023 -  Week 3 Session 5
Business Analyst Series 2023 - Week 3 Session 5
DianaGray10300 views
Powerful Google developer tools for immediate impact! (2023-24) by wesley chun
Powerful Google developer tools for immediate impact! (2023-24)Powerful Google developer tools for immediate impact! (2023-24)
Powerful Google developer tools for immediate impact! (2023-24)
wesley chun10 views
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院 by IttrainingIttraining
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院
Future of AR - Facebook Presentation by ssuserb54b561
Future of AR - Facebook PresentationFuture of AR - Facebook Presentation
Future of AR - Facebook Presentation
ssuserb54b56115 views
Special_edition_innovator_2023.pdf by WillDavies22
Special_edition_innovator_2023.pdfSpecial_edition_innovator_2023.pdf
Special_edition_innovator_2023.pdf
WillDavies2218 views
Serverless computing with Google Cloud (2023-24) by wesley chun
Serverless computing with Google Cloud (2023-24)Serverless computing with Google Cloud (2023-24)
Serverless computing with Google Cloud (2023-24)
wesley chun11 views
"Running students' code in isolation. The hard way", Yurii Holiuk by Fwdays
"Running students' code in isolation. The hard way", Yurii Holiuk "Running students' code in isolation. The hard way", Yurii Holiuk
"Running students' code in isolation. The hard way", Yurii Holiuk
Fwdays17 views
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ... by Jasper Oosterveld
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...

Martin Moucka [Red Hat] | How Red Hat Uses gNMI, Telegraf and InfluxDB to Gain Network Visibility | InfluxDays NA 2021

  • 1. How Red Hat Uses gNMI, Telegraf and InfluxDB to Gain Network Visibility Martin Moucka - Principal Network Engineer Red Hat
  • 2. © 2021  InfluxData Inc. All Rights Reserved. © 2021  InfluxData Inc. All Rights Reserved. Agenda • Introduction • Scope • Why InfluxDB? • Architecture • Visualizations • Flux
  • 3. © 2021  InfluxData Inc. All Rights Reserved. © 2021  InfluxData Inc. All Rights Reserved. Red Hat The world’s leading provider of open source enterprise IT solutions MORE THAN 90% of the FORTUNE 500 RED HAT use PRODUCTS & SOLUTIONS* ~13,815 EMPLOYEES 105+ OFFICES 40+ COUNTRIES THE FIRST $3 OPEN SOURCE COMPANY IN THE WORLD BILLION
  • 4. © 2021  InfluxData Inc. All Rights Reserved. Martin Moucka Principal Network Engineer, Red Hat ● With company for more than 7 years ● Built a network automation around Ansible, utilizing single source of truth ● Started transition to modern monitoring connected to the network automation ● Tech lead of Network Automation & Tools team E-mail: mmoucka@redhat.com
  • 5. © 2021  InfluxData Inc. All Rights Reserved. © 2021  InfluxData Inc. All Rights Reserved. Network Monitoring Network monitoring provides insight to the network. It monitors the status of network devices (switches, routers, firewalls, etc..), network status/performance. It provides a graphical view of metrics (e.g. link utilization) and/or device status (e.g. up or down) together with alerting when something is out of service. Key Capabilities of Network Monitoring Performance metric visualizations. Monitoring of the network for performance issues, display information in a visual format (Dashboards) - understand your network performance at a glance. Network alerts. Alert on any problems that occur. Discovery of issues from monitored data, augment alert data with relevant information helping support teams to respond quickly. Network mapping. Visualization of complex network landscapes in a map format including device/network health state. Bandwidth monitoring. Identify where network bandwidth usage is not optimal, and drive decisions to improve utilization.
  • 6. © 2021  InfluxData Inc. All Rights Reserved. © 2021  InfluxData Inc. All Rights Reserved. Scope • Juniper, Cisco (WLC, ASA, IOS, UCS, etc...), OpenGear, F5 and Mist • Custom probes for synthetic monitoring • 60+ sites • ~ 1.6k monitored devices • ~ 14k monitored interfaces • 5 collectors
  • 7. © 2021  InfluxData Inc. All Rights Reserved. © 2021  InfluxData Inc. All Rights Reserved. Why InfluxDB? • Open Source with Enterprise support • Efficient data storage • Flexibility in integrations/languages • Modular agent Telegraf with support of JTI (Juniper Telemetry Int.) • Support for SQL-like query language • Flux as powerful flexible query language
  • 8. © 2021  InfluxData Inc. All Rights Reserved. © 2021  InfluxData Inc. All Rights Reserved. Solution Architecture Distributed Monitoring Services / Storage Network Devices Telegraf/Kapacitor/InfluxDB Troubleshooting Network Automation Adding/Removing device Event Management Visualization Probes Alert Check / Send data Manual intervention Event Automation Troubleshooting Fix Configure Configure New monitored system/device
  • 9. © 2021  InfluxData Inc. All Rights Reserved. © 2021  InfluxData Inc. All Rights Reserved.
  • 10. © 2021  InfluxData Inc. All Rights Reserved. © 2021  InfluxData Inc. All Rights Reserved. Visualizations - Immediate response • Device detailed status • Interface utilization (SNMP / gNMI) • Interface errors (SNMP / gNMI) • CPU/Memory utilization (SNMP) • BGP neighbors status (SNMP / gNMI in progress) • etc... • Site View • Data from probe (Latency, Packet loss, HTTP response time, DNS delay) • SLI/SLO status (Kapacitor processed + Flux query) • Internet link utilization (processed by Kapacitor) • Top talkers (from other tool via RestAPI) • Wireless status • Statistics of WLC/APs and connected clients
  • 11. © 2021  InfluxData Inc. All Rights Reserved. © 2021  InfluxData Inc. All Rights Reserved.
  • 12. © 2021  InfluxData Inc. All Rights Reserved. © 2021  InfluxData Inc. All Rights Reserved.
  • 13. © 2021  InfluxData Inc. All Rights Reserved. © 2021  InfluxData Inc. All Rights Reserved.
  • 14. © 2021  InfluxData Inc. All Rights Reserved. 14 © 2021  InfluxData Inc. All Rights Reserved.
  • 15. © 2021  InfluxData Inc. All Rights Reserved. © 2021  InfluxData Inc. All Rights Reserved. Visualizations - Long-term planning • Link capacity utilization • Status page based on SLI/SLO • Wireless AP (Cisco WLC) anomaly detection - Flux • Compliance reporting
  • 16. © 2021  InfluxData Inc. All Rights Reserved. © 2021  InfluxData Inc. All Rights Reserved.
  • 17. © 2021  InfluxData Inc. All Rights Reserved. © 2021  InfluxData Inc. All Rights Reserved.
  • 18. © 2021  InfluxData Inc. All Rights Reserved. © 2021  InfluxData Inc. All Rights Reserved. Flux • Provides very flexible programmatic way of query • Allows changing data type within a query • Within compliance report, we connect up to 5 different measurements • Used for access point, poor SNR anomaly detection across regions • Focus where it matters most • Allows custom functions • Median Absolute Deviation used for anomaly detection • Well-documented at https://www.influxdata.com/blog/anomaly-detection-with-median-abs olute-deviation/
  • 19. © 2021  InfluxData Inc. All Rights Reserved. © 2021  InfluxData Inc. All Rights Reserved. Median Absolute Deviation - Function import "math" import "experimental" mad = (table=<-, threshold=3.0) => { data = table |> group(columns: ["_time"], mode:"by") med = data |> median(column: "_value") diff = join(tables: {data: data, med: med}, on: ["_time"], method: "inner") |> map(fn: (r) => ({ r with _value: math.abs(x: r._value_data - r._value_med) })) |> drop(columns: ["_start", "_stop", "_value_med", "_value_data"]) k = 1.4826 diff_med = diff |> median(column: "_value") |> map(fn: (r) => ({ r with MAD: k * r._value})) |> filter(fn: (r) => r.MAD > 0.0) output = join(tables: {diff: diff, diff_med: diff_med}, on: ["_time"], method: "inner") |> map(fn: (r) => ({ r with _value: r._value_diff/r._value_diff_med})) |> map(fn: (r) => ({ r with level: if r._value >= threshold then "anomaly" else "normal" })) return output }
  • 20. © 2021  InfluxData Inc. All Rights Reserved. © 2021  InfluxData Inc. All Rights Reserved. Median Absolute Deviation - Usage pc_duration = from(bucket: "XXXXXX") |> range(start: v.timeRangeStart, stop: v.timeRangeStop) |> filter(fn: (r) => r._measurement == "bsnAPTable" and r._field =~ /radio1PoorSNRClients|radio1Users/ and r.region == "${region}" ) |> pivot(rowKey:["_time"], columnKey: ["_field"], valueColumn: "_value") |> filter(fn: (r) => r.radio1PoorSNRClients > 0 and r.radio1Users > 0 ) |> map(fn: (r) => ({ r with CNPR: float(v: r.radio1PoorSNRClients) / float(v: r.radio1Users)})) |> stateDuration( fn: (r) => r.CNPR >= 0.1, column: "duration" ) |> map(fn: (r) => ({ r with _value: float(v: r.duration) / float(v: r.CNPR)})) |> filter(fn: (r) => r._value > 0) |> truncateTimeColumn(unit: 1h) |> toFloat() pc_duration |> mad(threshold:10.0) |> filter(fn: (r) => r.level == "anomaly") |> group(columns: ["APName"]) |> count() |> group()
  • 21. © 2021  InfluxData Inc. All Rights Reserved. Questions?
  • 22. © 2021  InfluxData Inc. All Rights Reserved. Thank You