Monitor everything from physical hardware to application functionality

Nicolas Seyvet
Nicolas SeyvetApplied SW Researcher at Ericsson
Only 4
days
SUBHEADING TEXT
Monitor everything from
physical hardware to
application functionality
Welcome to our lavish
smorgasbord offering
within IT Monitoring.
OP5 is the market leader
of IT monitoring
throughout the Nordic
region and in over 50
countries around the
world.
Passionate software developer at OP5 AB.
Particular interests are coding, cloud, software engineering and architecture,
distributed and scalable systems.
Nicolas Seyvet
The IT Monitoring
Software Solution.
From Sweden. For a Global Market. Based on Open Source.
OP5 is a Swedish company founded in 2004. The vision was to develop an IT
monitoring software solution based on the Open Source project Nagios that
would offer an unprecedented user experience. A solution that would be
easy to implement, intuitive to work with and provide unparalleled scalability
to support clients and their ever changing business needs.
Today, OP5 has grown into an International company with a presence in over
60 countries. Thousands of IT professionals across the world rely daily on
solutions from OP5 to monitor their business-critical IT services.
The OP5 product Monitor is Nagios
Based on:
- Checks
- Plugins
- BUT static infrastructure
Infrastructure:
- Increased number of devices
- Virtual
Applications:
- On-demand deployments (cloud)
- Ephemeral/moving processes
- Distributed
Monitor everything in the data center?
The three Vs of Big Data:
- Volume
- Velocity
- Variety
Dynamic, complex environment
Outpacing humans
Average DC -> ~ 20 000 servers
Monitoring
One simple
dimension:
Dynamicity
Time series
Event
source
Multiple series of timestamp, value pairs
<series name> (t0, v0) (t1, v1) (t2, v2) (t3, v3) …
metric/event
produces
Time
pod.io.read_bytes_sec
Not all sources are created equal
Time
Long lived
Virtual Infrastructure
Application layer
Medium lived
Ephemeral
Physical Infrastructure
An example
Let’s assume 20 000 servers with 4 micro-services per server:
Assume 100 metrics per instance:
Out of which:
Add dynamicity and elasticity → 0.01%/s replacement rate:
Then, add the virtual infrastructure, failures in the DC, new racks, etc.
→ 20 000 + 4 x 20 000 = 100 000 instances
→ 10 000 000 active time series
→ 2 000 000 are long lived
8 000 000 are ephemeral
→ 0.01% * 8 000 000 = 80 new time series/s
~6 900 000 new time series per day
Monitoring Monasca
Monasca (http://monasca.io/) is a open-source multi-tenant, massively scalable,
fault-tolerant monitoring-as-a-service solution.
Main features:
- An event driven architecture.
- A set of REST APIs for high-speed event processing and querying.
- A real-time streaming engine (alarms and transformations)..
- An agent (collector) with plugins.
- A push based system.
Part of the (but not limited to) OpenStack family.
Monasca
OpenStack began in 2010 as a joint project between NASA and Rackspace.
Open source software for creating private and public clouds (Infrastructure as a Service)..
Control large pools of compute, storage, and networking resources throughout a datacenter,
managed through a dashboard or via RESTful APIs.
OpenStack
Key Features
OpenStack Open Source projects
MonascaMONASCA
Monitoring
Stack What is Monasca?
The clients
Monasca API
Horizon
Dashboard
Grafana
Dashboard
Monasca
Agent
Users
GET/POST Push
Auth.
Keystone
Authentication/Authorization → Multi-tenancy
Query,
Create/define alarms and notifications
Monasca API
Data/Event Bus
Publish/
Subscribe
The core
Kafka is an OpenSource massively scalable Pub-Sub message queue:
- horizontally scalable
- fault-tolerant
- high throughput (>100K to millions of events/s)
- at least once guarantee
Monasca API
Data/Event Bus
Configuration
Persister
Streaming
Engine
Notification
Engine
Threshold
Transform
Anomaly
Subscribe SubscribePublish/
Subscribe
TSDB
Logs/Events
The backend
Threshold engine: What to monitor in real-time (alarms)
Transform engine: From raw to smart data.
The Monasca stack
Monasca API
Horizon
Dashboard
Grafana
Dashboard
Monasca
Agent
Users
GET/POST
Push
Data/Event Bus
Configuration
Persister
Streaming
Engine
Notification
Engine
Threshold
Transform
Anomaly
Subscribe SubscribePublish/
Subscribe
TSDB
Logs/Events
Auth.
Keystone
Stack
Two benefits:
Extensibility and
“what?”
Easy to extend
Data/Event Bus
My Function/App
Persister
Streaming
Engine
Notification
Engine
Event driven architecture.
Publish/
Subscribe
...
Highest level:
What to alarm on?
Domain Specific Language (DSL)
Where a sub-expression:
<sub_expression>
::= <function> '(' <metric> [',' period] ')' <operator> threshold_value ['times' periods]
Example:
<expression>
::= <subexpression> [(and | or) <subexpression>]*
avg(disk.space_used_perc{hostname=compute_node_1}) >= 99
and
count(log.error{hostname=compute_node_1,component=kafka},deterministic) >= 1
function
min
max
sum
avg
count
last
Stack In conclusion
To sum up:
- Built for self-healing and elasticity (horizontal scalability)
- Can handle billions of time-series at high throughput
- Multi-tenant
- Extensible
- DSL to monitor what matters
- Can combine different sources (metrics/events/logs)
Built on top of Kubernetes, runs on AWS, OpenStack and VMWare.
$ # Deploy in one line
$ helm install op5_monasca
OP5 Monasca
OP5 HQ
Norgegatan 2
SE-164 32 Kista
Sweden
+46 (0)8 58 83 01 00
www.OP5.com
inkedin.com/company/OP5/
facebook.com/OP5ab
twitter.com/OP5ab
Call us
Follow us
Nicolas Seyvet
Backend Engineer
Email nseyvet@op5.com
Twitter: @NicolasSeyvet
Blog: http://babounehacks.blogspot.se/
Github: https://github.com/nseyvet
https://github.com/baboune
Questions?
1 of 24

Recommended

Why the database is at the heart of DevOps success by
Why the database is at the heart of DevOps successWhy the database is at the heart of DevOps success
Why the database is at the heart of DevOps successRed Gate Software
197 views24 slides
#OSSPARIS19 - How to improve database observability - CHARLES JUDITH, Criteo by
#OSSPARIS19 - How to improve database observability - CHARLES JUDITH, Criteo#OSSPARIS19 - How to improve database observability - CHARLES JUDITH, Criteo
#OSSPARIS19 - How to improve database observability - CHARLES JUDITH, CriteoParis Open Source Summit
30 views43 slides
Monitoring modern applications using Elastic by
Monitoring modern applications using ElasticMonitoring modern applications using Elastic
Monitoring modern applications using ElasticElasticsearch
559 views52 slides
Keynote: Making search better, faster, easier by
Keynote: Making search better, faster, easierKeynote: Making search better, faster, easier
Keynote: Making search better, faster, easierElasticsearch
848 views67 slides
Advanced correlations for threat detection and more by
Advanced correlations for threat detection and moreAdvanced correlations for threat detection and more
Advanced correlations for threat detection and moreElasticsearch
3.2K views54 slides
Elasticsearch: From development to production in 15 minutes by
Elasticsearch: From development to production in 15 minutesElasticsearch: From development to production in 15 minutes
Elasticsearch: From development to production in 15 minutesElasticsearch
529 views37 slides

More Related Content

What's hot

Get involved with the security community at Elastic by
Get involved with the security community at ElasticGet involved with the security community at Elastic
Get involved with the security community at ElasticElasticsearch
171 views42 slides
Platform for the Research and Analysis of Cybernetic Threats by
Platform for the Research and Analysis of Cybernetic ThreatsPlatform for the Research and Analysis of Cybernetic Threats
Platform for the Research and Analysis of Cybernetic ThreatsDataWorks Summit
602 views25 slides
Get full visibility and find hidden security issues by
Get full visibility and find hidden security issuesGet full visibility and find hidden security issues
Get full visibility and find hidden security issuesElasticsearch
77 views153 slides
Modernizing deployment in any environment with Elastic by
Modernizing deployment in any environment with ElasticModernizing deployment in any environment with Elastic
Modernizing deployment in any environment with ElasticElasticsearch
158 views18 slides
How Zebra Technologies delivers business intelligence with Elastic on Google ... by
How Zebra Technologies delivers business intelligence with Elastic on Google ...How Zebra Technologies delivers business intelligence with Elastic on Google ...
How Zebra Technologies delivers business intelligence with Elastic on Google ...Elasticsearch
821 views25 slides
Empowering agencies using Elastic as a Service inside Government by
Empowering agencies using Elastic as a Service inside GovernmentEmpowering agencies using Elastic as a Service inside Government
Empowering agencies using Elastic as a Service inside GovernmentElasticsearch
1.2K views8 slides

What's hot(20)

Get involved with the security community at Elastic by Elasticsearch
Get involved with the security community at ElasticGet involved with the security community at Elastic
Get involved with the security community at Elastic
Elasticsearch171 views
Platform for the Research and Analysis of Cybernetic Threats by DataWorks Summit
Platform for the Research and Analysis of Cybernetic ThreatsPlatform for the Research and Analysis of Cybernetic Threats
Platform for the Research and Analysis of Cybernetic Threats
DataWorks Summit602 views
Get full visibility and find hidden security issues by Elasticsearch
Get full visibility and find hidden security issuesGet full visibility and find hidden security issues
Get full visibility and find hidden security issues
Elasticsearch77 views
Modernizing deployment in any environment with Elastic by Elasticsearch
Modernizing deployment in any environment with ElasticModernizing deployment in any environment with Elastic
Modernizing deployment in any environment with Elastic
Elasticsearch158 views
How Zebra Technologies delivers business intelligence with Elastic on Google ... by Elasticsearch
How Zebra Technologies delivers business intelligence with Elastic on Google ...How Zebra Technologies delivers business intelligence with Elastic on Google ...
How Zebra Technologies delivers business intelligence with Elastic on Google ...
Elasticsearch821 views
Empowering agencies using Elastic as a Service inside Government by Elasticsearch
Empowering agencies using Elastic as a Service inside GovernmentEmpowering agencies using Elastic as a Service inside Government
Empowering agencies using Elastic as a Service inside Government
Elasticsearch1.2K views
How eStruxture Data Centers is Using ECE to Rapidly Scale Their Business by Elasticsearch
How eStruxture Data Centers is Using ECE to Rapidly Scale Their BusinessHow eStruxture Data Centers is Using ECE to Rapidly Scale Their Business
How eStruxture Data Centers is Using ECE to Rapidly Scale Their Business
Elasticsearch2.3K views
Apache Kafka for Cybersecurity and SIEM / SOAR Modernization by Kai Wähner
Apache Kafka for Cybersecurity and SIEM / SOAR ModernizationApache Kafka for Cybersecurity and SIEM / SOAR Modernization
Apache Kafka for Cybersecurity and SIEM / SOAR Modernization
Kai Wähner872 views
Countering Threats with the Elastic Stack at CERDEC/ARL by Elasticsearch
Countering Threats with the Elastic Stack at CERDEC/ARLCountering Threats with the Elastic Stack at CERDEC/ARL
Countering Threats with the Elastic Stack at CERDEC/ARL
Elasticsearch1.8K views
Streamline search with Elasticsearch Service on Microsoft Azure by Elasticsearch
Streamline search with Elasticsearch Service on Microsoft AzureStreamline search with Elasticsearch Service on Microsoft Azure
Streamline search with Elasticsearch Service on Microsoft Azure
Elasticsearch600 views
Transforming data into actionable insights by Elasticsearch
Transforming data into actionable insightsTransforming data into actionable insights
Transforming data into actionable insights
Elasticsearch201 views
Elasticsearch on Azure by Elasticsearch
Elasticsearch on AzureElasticsearch on Azure
Elasticsearch on Azure
Elasticsearch1.3K views
Integrating advanced analytics with Elasticsearch by Elasticsearch
Integrating advanced analytics with ElasticsearchIntegrating advanced analytics with Elasticsearch
Integrating advanced analytics with Elasticsearch
Elasticsearch917 views
Improving Response Times at Optum with Elastic APM by Elasticsearch
Improving Response Times at Optum with Elastic APMImproving Response Times at Optum with Elastic APM
Improving Response Times at Optum with Elastic APM
Elasticsearch1.5K views
Combining Logs, Metrics, and Traces for Unified Observability by Elasticsearch
Combining Logs, Metrics, and Traces for Unified ObservabilityCombining Logs, Metrics, and Traces for Unified Observability
Combining Logs, Metrics, and Traces for Unified Observability
Elasticsearch1.1K views
Elastic community Abidjan #225 meetup 08 May 2021 by Yassine, LASRI
Elastic community Abidjan #225 meetup 08 May 2021Elastic community Abidjan #225 meetup 08 May 2021
Elastic community Abidjan #225 meetup 08 May 2021
Yassine, LASRI56 views
Owasp appsec container_security_supply_chain by Michele Chubirka
Owasp appsec container_security_supply_chainOwasp appsec container_security_supply_chain
Owasp appsec container_security_supply_chain
Michele Chubirka100 views
Building Identity Graph at Scale for Programmatic Media Buying Using Apache S... by Databricks
Building Identity Graph at Scale for Programmatic Media Buying Using Apache S...Building Identity Graph at Scale for Programmatic Media Buying Using Apache S...
Building Identity Graph at Scale for Programmatic Media Buying Using Apache S...
Databricks192 views
Opening Keynote: Why Elastic? by Elasticsearch
Opening Keynote: Why Elastic?Opening Keynote: Why Elastic?
Opening Keynote: Why Elastic?
Elasticsearch225 views
AWS on Splunk, Splunk on AWS by Splunk
AWS on Splunk, Splunk on AWSAWS on Splunk, Splunk on AWS
AWS on Splunk, Splunk on AWS
Splunk2.4K views

Similar to Monitor everything from physical hardware to application functionality

Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ... by
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...confluent
1.6K views37 slides
StreamAnalytix - Multi-Engine Streaming Analytics Platform by
StreamAnalytix - Multi-Engine Streaming Analytics PlatformStreamAnalytix - Multi-Engine Streaming Analytics Platform
StreamAnalytix - Multi-Engine Streaming Analytics PlatformAtul Sharma
167 views2 slides
DS_2016_StreamAnalytix_real_time_streaming_analytics_platform by
DS_2016_StreamAnalytix_real_time_streaming_analytics_platformDS_2016_StreamAnalytix_real_time_streaming_analytics_platform
DS_2016_StreamAnalytix_real_time_streaming_analytics_platformAditya Singh
83 views2 slides
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight by
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsightIngestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsightMicrosoft Tech Community
845 views31 slides
Service Discovery and Registration in a Microservices Architecture by
Service Discovery and Registration in a Microservices ArchitectureService Discovery and Registration in a Microservices Architecture
Service Discovery and Registration in a Microservices ArchitecturePLUMgrid
6.7K views38 slides
Activeeon technology for Big Compute and cloud migration by
Activeeon technology for Big Compute and cloud migrationActiveeon technology for Big Compute and cloud migration
Activeeon technology for Big Compute and cloud migrationActiveeon
399 views44 slides

Similar to Monitor everything from physical hardware to application functionality(20)

Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ... by confluent
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...
confluent1.6K views
StreamAnalytix - Multi-Engine Streaming Analytics Platform by Atul Sharma
StreamAnalytix - Multi-Engine Streaming Analytics PlatformStreamAnalytix - Multi-Engine Streaming Analytics Platform
StreamAnalytix - Multi-Engine Streaming Analytics Platform
Atul Sharma167 views
DS_2016_StreamAnalytix_real_time_streaming_analytics_platform by Aditya Singh
DS_2016_StreamAnalytix_real_time_streaming_analytics_platformDS_2016_StreamAnalytix_real_time_streaming_analytics_platform
DS_2016_StreamAnalytix_real_time_streaming_analytics_platform
Aditya Singh83 views
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight by Microsoft Tech Community
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsightIngestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
Service Discovery and Registration in a Microservices Architecture by PLUMgrid
Service Discovery and Registration in a Microservices ArchitectureService Discovery and Registration in a Microservices Architecture
Service Discovery and Registration in a Microservices Architecture
PLUMgrid6.7K views
Activeeon technology for Big Compute and cloud migration by Activeeon
Activeeon technology for Big Compute and cloud migrationActiveeon technology for Big Compute and cloud migration
Activeeon technology for Big Compute and cloud migration
Activeeon399 views
What's new in Elasticsearch v5 by Idan Tohami
What's new in Elasticsearch v5What's new in Elasticsearch v5
What's new in Elasticsearch v5
Idan Tohami4.1K views
Red hat's updates on the cloud & infrastructure strategy by Orgad Kimchi
Red hat's updates on the cloud & infrastructure strategyRed hat's updates on the cloud & infrastructure strategy
Red hat's updates on the cloud & infrastructure strategy
Orgad Kimchi413 views
Built on Pulsar: A Commercial Consent Management System for 80 Million Citizens by StreamNative
Built on Pulsar: A Commercial Consent Management System for 80 Million CitizensBuilt on Pulsar: A Commercial Consent Management System for 80 Million Citizens
Built on Pulsar: A Commercial Consent Management System for 80 Million Citizens
StreamNative265 views
Apache Kafka as Event Streaming Platform for Microservice Architectures by Kai Wähner
Apache Kafka as Event Streaming Platform for Microservice ArchitecturesApache Kafka as Event Streaming Platform for Microservice Architectures
Apache Kafka as Event Streaming Platform for Microservice Architectures
Kai Wähner3.5K views
Data Streaming with Apache Kafka & MongoDB by confluent
Data Streaming with Apache Kafka & MongoDBData Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDB
confluent13.7K views
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita... by Kai Wähner
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
Kai Wähner1.1K views
IoT and Event Streaming at Scale with Apache Kafka by confluent
IoT and Event Streaming at Scale with Apache KafkaIoT and Event Streaming at Scale with Apache Kafka
IoT and Event Streaming at Scale with Apache Kafka
confluent243 views
3 reasons to pick a time series platform for monitoring dev ops driven contai... by DevOps.com
3 reasons to pick a time series platform for monitoring dev ops driven contai...3 reasons to pick a time series platform for monitoring dev ops driven contai...
3 reasons to pick a time series platform for monitoring dev ops driven contai...
DevOps.com183 views
Scaling Your SaaS with Analytics-Driven Insights and Wavefront Integrations f... by Amazon Web Services
Scaling Your SaaS with Analytics-Driven Insights and Wavefront Integrations f...Scaling Your SaaS with Analytics-Driven Insights and Wavefront Integrations f...
Scaling Your SaaS with Analytics-Driven Insights and Wavefront Integrations f...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A... by confluent
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
confluent3.4K views
Introduction to OVH Analytics Data Platform by OVHcloud
Introduction to OVH Analytics Data PlatformIntroduction to OVH Analytics Data Platform
Introduction to OVH Analytics Data Platform
OVHcloud774 views
Partner Keynote: Intel - The New Frontier of Cloud Computing by Amazon Web Services
Partner Keynote: Intel - The New Frontier of Cloud ComputingPartner Keynote: Intel - The New Frontier of Cloud Computing
Partner Keynote: Intel - The New Frontier of Cloud Computing

Recently uploaded

SAP FOR CONTRACT MANUFACTURING.pdf by
SAP FOR CONTRACT MANUFACTURING.pdfSAP FOR CONTRACT MANUFACTURING.pdf
SAP FOR CONTRACT MANUFACTURING.pdfVirendra Rai, PMP
11 views2 slides
Cycleops - Automate deployments on top of bare metal.pptx by
Cycleops - Automate deployments on top of bare metal.pptxCycleops - Automate deployments on top of bare metal.pptx
Cycleops - Automate deployments on top of bare metal.pptxThanassis Parathyras
30 views12 slides
Consulting for Data Monetization Maximizing the Profit Potential of Your Data... by
Consulting for Data Monetization Maximizing the Profit Potential of Your Data...Consulting for Data Monetization Maximizing the Profit Potential of Your Data...
Consulting for Data Monetization Maximizing the Profit Potential of Your Data...Flexsin
15 views10 slides
Elevate your SAP landscape's efficiency and performance with HCL Workload Aut... by
Elevate your SAP landscape's efficiency and performance with HCL Workload Aut...Elevate your SAP landscape's efficiency and performance with HCL Workload Aut...
Elevate your SAP landscape's efficiency and performance with HCL Workload Aut...HCLSoftware
6 views2 slides
ict act 1.pptx by
ict act 1.pptxict act 1.pptx
ict act 1.pptxsanjaniarun08
13 views17 slides
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea... by
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...Safe Software
412 views59 slides

Recently uploaded(20)

Cycleops - Automate deployments on top of bare metal.pptx by Thanassis Parathyras
Cycleops - Automate deployments on top of bare metal.pptxCycleops - Automate deployments on top of bare metal.pptx
Cycleops - Automate deployments on top of bare metal.pptx
Consulting for Data Monetization Maximizing the Profit Potential of Your Data... by Flexsin
Consulting for Data Monetization Maximizing the Profit Potential of Your Data...Consulting for Data Monetization Maximizing the Profit Potential of Your Data...
Consulting for Data Monetization Maximizing the Profit Potential of Your Data...
Flexsin 15 views
Elevate your SAP landscape's efficiency and performance with HCL Workload Aut... by HCLSoftware
Elevate your SAP landscape's efficiency and performance with HCL Workload Aut...Elevate your SAP landscape's efficiency and performance with HCL Workload Aut...
Elevate your SAP landscape's efficiency and performance with HCL Workload Aut...
HCLSoftware6 views
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea... by Safe Software
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...
Safe Software412 views
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -... by Deltares
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...
Deltares6 views
DSD-INT 2023 Simulation of Coastal Hydrodynamics and Water Quality in Hong Ko... by Deltares
DSD-INT 2023 Simulation of Coastal Hydrodynamics and Water Quality in Hong Ko...DSD-INT 2023 Simulation of Coastal Hydrodynamics and Water Quality in Hong Ko...
DSD-INT 2023 Simulation of Coastal Hydrodynamics and Water Quality in Hong Ko...
Deltares11 views
DSD-INT 2023 Baseline studies for Strategic Coastal protection for Long Islan... by Deltares
DSD-INT 2023 Baseline studies for Strategic Coastal protection for Long Islan...DSD-INT 2023 Baseline studies for Strategic Coastal protection for Long Islan...
DSD-INT 2023 Baseline studies for Strategic Coastal protection for Long Islan...
Deltares11 views
DSD-INT 2023 SFINCS Modelling in the U.S. Pacific Northwest - Parker by Deltares
DSD-INT 2023 SFINCS Modelling in the U.S. Pacific Northwest - ParkerDSD-INT 2023 SFINCS Modelling in the U.S. Pacific Northwest - Parker
DSD-INT 2023 SFINCS Modelling in the U.S. Pacific Northwest - Parker
Deltares9 views
2023-November-Schneider Electric-Meetup-BCN Admin Group.pptx by animuscrm
2023-November-Schneider Electric-Meetup-BCN Admin Group.pptx2023-November-Schneider Electric-Meetup-BCN Admin Group.pptx
2023-November-Schneider Electric-Meetup-BCN Admin Group.pptx
animuscrm13 views
DSD-INT 2023 Thermobaricity in 3D DCSM-FM - taking pressure into account in t... by Deltares
DSD-INT 2023 Thermobaricity in 3D DCSM-FM - taking pressure into account in t...DSD-INT 2023 Thermobaricity in 3D DCSM-FM - taking pressure into account in t...
DSD-INT 2023 Thermobaricity in 3D DCSM-FM - taking pressure into account in t...
Deltares9 views
Citi TechTalk Session 2: Kafka Deep Dive by confluent
Citi TechTalk Session 2: Kafka Deep DiveCiti TechTalk Session 2: Kafka Deep Dive
Citi TechTalk Session 2: Kafka Deep Dive
confluent17 views
Advanced API Mocking Techniques by Dimpy Adhikary
Advanced API Mocking TechniquesAdvanced API Mocking Techniques
Advanced API Mocking Techniques
Dimpy Adhikary19 views
DSD-INT 2023 Delft3D FM Suite 2024.01 2D3D - New features + Improvements - Ge... by Deltares
DSD-INT 2023 Delft3D FM Suite 2024.01 2D3D - New features + Improvements - Ge...DSD-INT 2023 Delft3D FM Suite 2024.01 2D3D - New features + Improvements - Ge...
DSD-INT 2023 Delft3D FM Suite 2024.01 2D3D - New features + Improvements - Ge...
Deltares16 views
MariaDB stored procedures and why they should be improved by Federico Razzoli
MariaDB stored procedures and why they should be improvedMariaDB stored procedures and why they should be improved
MariaDB stored procedures and why they should be improved

Monitor everything from physical hardware to application functionality

  • 1. Only 4 days SUBHEADING TEXT Monitor everything from physical hardware to application functionality Welcome to our lavish smorgasbord offering within IT Monitoring. OP5 is the market leader of IT monitoring throughout the Nordic region and in over 50 countries around the world.
  • 2. Passionate software developer at OP5 AB. Particular interests are coding, cloud, software engineering and architecture, distributed and scalable systems. Nicolas Seyvet
  • 3. The IT Monitoring Software Solution. From Sweden. For a Global Market. Based on Open Source. OP5 is a Swedish company founded in 2004. The vision was to develop an IT monitoring software solution based on the Open Source project Nagios that would offer an unprecedented user experience. A solution that would be easy to implement, intuitive to work with and provide unparalleled scalability to support clients and their ever changing business needs. Today, OP5 has grown into an International company with a presence in over 60 countries. Thousands of IT professionals across the world rely daily on solutions from OP5 to monitor their business-critical IT services.
  • 4. The OP5 product Monitor is Nagios Based on: - Checks - Plugins - BUT static infrastructure
  • 5. Infrastructure: - Increased number of devices - Virtual Applications: - On-demand deployments (cloud) - Ephemeral/moving processes - Distributed Monitor everything in the data center? The three Vs of Big Data: - Volume - Velocity - Variety Dynamic, complex environment Outpacing humans Average DC -> ~ 20 000 servers
  • 7. Time series Event source Multiple series of timestamp, value pairs <series name> (t0, v0) (t1, v1) (t2, v2) (t3, v3) … metric/event produces Time pod.io.read_bytes_sec
  • 8. Not all sources are created equal Time Long lived Virtual Infrastructure Application layer Medium lived Ephemeral Physical Infrastructure
  • 9. An example Let’s assume 20 000 servers with 4 micro-services per server: Assume 100 metrics per instance: Out of which: Add dynamicity and elasticity → 0.01%/s replacement rate: Then, add the virtual infrastructure, failures in the DC, new racks, etc. → 20 000 + 4 x 20 000 = 100 000 instances → 10 000 000 active time series → 2 000 000 are long lived 8 000 000 are ephemeral → 0.01% * 8 000 000 = 80 new time series/s ~6 900 000 new time series per day
  • 11. Monasca (http://monasca.io/) is a open-source multi-tenant, massively scalable, fault-tolerant monitoring-as-a-service solution. Main features: - An event driven architecture. - A set of REST APIs for high-speed event processing and querying. - A real-time streaming engine (alarms and transformations).. - An agent (collector) with plugins. - A push based system. Part of the (but not limited to) OpenStack family. Monasca
  • 12. OpenStack began in 2010 as a joint project between NASA and Rackspace. Open source software for creating private and public clouds (Infrastructure as a Service).. Control large pools of compute, storage, and networking resources throughout a datacenter, managed through a dashboard or via RESTful APIs. OpenStack Key Features
  • 13. OpenStack Open Source projects MonascaMONASCA Monitoring
  • 14. Stack What is Monasca?
  • 15. The clients Monasca API Horizon Dashboard Grafana Dashboard Monasca Agent Users GET/POST Push Auth. Keystone Authentication/Authorization → Multi-tenancy Query, Create/define alarms and notifications
  • 16. Monasca API Data/Event Bus Publish/ Subscribe The core Kafka is an OpenSource massively scalable Pub-Sub message queue: - horizontally scalable - fault-tolerant - high throughput (>100K to millions of events/s) - at least once guarantee
  • 17. Monasca API Data/Event Bus Configuration Persister Streaming Engine Notification Engine Threshold Transform Anomaly Subscribe SubscribePublish/ Subscribe TSDB Logs/Events The backend Threshold engine: What to monitor in real-time (alarms) Transform engine: From raw to smart data.
  • 18. The Monasca stack Monasca API Horizon Dashboard Grafana Dashboard Monasca Agent Users GET/POST Push Data/Event Bus Configuration Persister Streaming Engine Notification Engine Threshold Transform Anomaly Subscribe SubscribePublish/ Subscribe TSDB Logs/Events Auth. Keystone
  • 20. Easy to extend Data/Event Bus My Function/App Persister Streaming Engine Notification Engine Event driven architecture. Publish/ Subscribe ...
  • 21. Highest level: What to alarm on? Domain Specific Language (DSL) Where a sub-expression: <sub_expression> ::= <function> '(' <metric> [',' period] ')' <operator> threshold_value ['times' periods] Example: <expression> ::= <subexpression> [(and | or) <subexpression>]* avg(disk.space_used_perc{hostname=compute_node_1}) >= 99 and count(log.error{hostname=compute_node_1,component=kafka},deterministic) >= 1 function min max sum avg count last
  • 23. To sum up: - Built for self-healing and elasticity (horizontal scalability) - Can handle billions of time-series at high throughput - Multi-tenant - Extensible - DSL to monitor what matters - Can combine different sources (metrics/events/logs) Built on top of Kubernetes, runs on AWS, OpenStack and VMWare. $ # Deploy in one line $ helm install op5_monasca OP5 Monasca
  • 24. OP5 HQ Norgegatan 2 SE-164 32 Kista Sweden +46 (0)8 58 83 01 00 www.OP5.com inkedin.com/company/OP5/ facebook.com/OP5ab twitter.com/OP5ab Call us Follow us Nicolas Seyvet Backend Engineer Email nseyvet@op5.com Twitter: @NicolasSeyvet Blog: http://babounehacks.blogspot.se/ Github: https://github.com/nseyvet https://github.com/baboune Questions?