SlideShare a Scribd company logo
1 of 72
@clementd & @waxzce
@clementd & @waxzce
Monitoring the Unknown
1000*100 series a day
Quentin Adam / @waxzce
Clément Delafargue / @clementd
@clementd & @waxzce
Who are we?
Quentin 🐧
ADAM
CEO
@ Clever Cloud
@waxzce on twitter
Clément
DELAFARGUE🐇
CTO
@ Clever Cloud
@clementd on Twitter
@clementd & @waxzce
What is Clever Cloud?
@clementd & @waxzce
You need metrics
@clementd & @waxzce
Metrics are not just for ops
@clementd & @waxzce
… but most tools stay ops-oriented
@clementd & @waxzce
Developers also deserve metrics
@clementd & @waxzce
Not just for forensics
@clementd & @waxzce
We need metrics for devs and ops
@clementd & @waxzce
We need metrics for DevOps
@clementd & @waxzce
We still live in a serverful world
@clementd & @waxzce
Cross-compare application-level
and system-level metrics
@clementd & @waxzce
A brave new world
@clementd & @waxzce
An immutable new world
Understand immutable infrastructure
by @waxzce
https://www.youtube.com/watch?v=WrZCbgQsPVU
@clementd & @waxzce
Pet versus Cattle
@clementd & @waxzce
1 service, many servers
@clementd & @waxzce
More is more
@clementd & @waxzce
More volume
@clementd & @waxzce
More series
@clementd & @waxzce
How we do it at
@clementd & @waxzce
We don’t know the apps we have to monitor
@clementd & @waxzce
Metrics collection
@clementd & @waxzce
Collection agent
@clementd & @waxzce
We chose telegraf
@clementd & @waxzce
JVM agent
to grab
JMX data
@clementd & @waxzce
Extensibility with statsd
@clementd & @waxzce
Extensibility with prometheus
@clementd & @waxzce
Push / Pull metrics
@clementd & @waxzce
How do we store all that?
@clementd & @waxzce
Our needs
@clementd & @waxzce
Humongous volume
@clementd & @waxzce
Various access patterns
@clementd & @waxzce
Time Series DataBase
@clementd & @waxzce
What are time series?
@clementd & @waxzce
What you need to pay attention to
@clementd & @waxzce
Real time raw points access
@clementd & @waxzce
Analysis capabilities
@clementd & @waxzce
Cardinality
@clementd & @waxzce
Influx
@clementd & @waxzce
Prometheus
@clementd & @waxzce
ElasticSearch
@clementd & @waxzce
OpenTSDB
@clementd & @waxzce
Home-made solution based on Cassandra
@clementd & @waxzcehttps://github.com/cityzendata/warp10-platform / http://www.warp10.io/
Warp10
Plain old java
Standalone version fat jar using levelDB
GeoTime Series
@clementd & @waxzce
Awesome Distributed system
@clementd & @waxzce
Token-based access
@clementd & @waxzce
Consumption
@clementd & @waxzce
Raw data access
@clementd & @waxzce
WarpScript
@clementd & @waxzce
"<TOKEN>"
'cpu.usage_user'
{ 'host' '<instance>' 'app_id' '=<app_id>'}
NOW -140
FETCH
<open in Quantum>
@clementd & @waxzce
[
$POINTS
bucketizer.mean // bucketizer
0 // lastbucket
0 // bucketspan
5 // bucketcount
]
BUCKETIZE
<open in quantum>
@clementd & @waxzce
[
$POINTS
mapper.rate // mapper
1 // before
0 // after
0 // points
]
MAP
<open in quantum>
@clementd & @waxzce
[
$POINTS
[ ‘app_id’ ]
reducer.mean.exclude-nulls
]
REDUCE
<open in quantum>
@clementd & @waxzce
Full reference
http://www.warp10.io/reference/reference/
@clementd & @waxzce
Triggers / μBatching
@clementd & @waxzce
Build a new series on a series, recompute the
data, become smarter and smarter
@clementd & @waxzce
Poke, good place to get sample
https://poke.digital
https://docs.google.com/presentation/d/1RfpX-KdfAa5ZxsnuRYi34JbidbVVUcz7bYy5x-
k6BBE/edit?usp=sharing
https://www.pscp.tv/waxzce/1OwGWEEvapkxQ?t=4m51s
@clementd & @waxzce
We’ve got a lot of data points.
Good.
Now what?
@clementd & @waxzce
Display
@clementd & @waxzce
Graphs are good.
@clementd & @waxzce
Graphs without units… not so good
@clementd & @waxzce
@clementd & @waxzce
Data good.
Meaning better.
@clementd & @waxzce
Metrics-spec
(collaboration with the OVH metrics team)
https://github.com/CleverCloud/metrics-spec
@clementd & @waxzce
@clementd & @waxzce
What you should look for
when you build
your own metrics pipeline.
@clementd & @waxzce
List what you’ll need,
@clementd & @waxzce
Collect more
@clementd & @waxzce
Mind the ACLs (or lack thereof)
@clementd & @waxzce
Visualize, then analyze
@clementd & @waxzce
Thanks
@waxzce & @clementd
Try 💡☁, here is a voucher:
metrics4devoxx

More Related Content

Similar to Monitorer l'inconnu, 1000 * 100 series par jour - talk avec @clementd à #devoxxFR 2018

Domain Driven Rails
Domain Driven RailsDomain Driven Rails
Domain Driven RailsYan Pritzker
 
Introduction to Serverless with AWS Lambda in C#.pptx
Introduction to Serverless with AWS Lambda in C#.pptxIntroduction to Serverless with AWS Lambda in C#.pptx
Introduction to Serverless with AWS Lambda in C#.pptxBrandon Minnick, MBA
 
Monitoring the unknown, 1000*100 series a day - Big Data Vilnius 2017
Monitoring the unknown, 1000*100 series a day - Big Data Vilnius 2017Monitoring the unknown, 1000*100 series a day - Big Data Vilnius 2017
Monitoring the unknown, 1000*100 series a day - Big Data Vilnius 2017Quentin Adam
 
Introduction to Serverless with AWS Lambda in C#.pptx
Introduction to Serverless with AWS Lambda in C#.pptxIntroduction to Serverless with AWS Lambda in C#.pptx
Introduction to Serverless with AWS Lambda in C#.pptxBrandon Minnick, MBA
 
Serverless Development To Production Pipeline
Serverless Development To Production PipelineServerless Development To Production Pipeline
Serverless Development To Production PipelineChase Douglas
 
Resilient microservices with Kubernetes - Mete Atamel
Resilient microservices with Kubernetes - Mete AtamelResilient microservices with Kubernetes - Mete Atamel
Resilient microservices with Kubernetes - Mete AtamelITCamp
 
qewd-ripple: The Ripple OSI Middle Tier
qewd-ripple: The Ripple OSI Middle Tierqewd-ripple: The Ripple OSI Middle Tier
qewd-ripple: The Ripple OSI Middle TierRob Tweed
 
Scaling Docker with Kubernetes
Scaling Docker with KubernetesScaling Docker with Kubernetes
Scaling Docker with KubernetesCarlos Sanchez
 
An introduction to Serverless
An introduction to ServerlessAn introduction to Serverless
An introduction to ServerlessAdrien Blind
 
[Container world 2017] The Questions You're Afraid to Ask about Containers
[Container world 2017] The Questions You're Afraid to Ask about Containers[Container world 2017] The Questions You're Afraid to Ask about Containers
[Container world 2017] The Questions You're Afraid to Ask about ContainersDustin Kirkland
 
Mesosphere and Contentteam: A New Way to Run Cassandra
Mesosphere and Contentteam: A New Way to Run CassandraMesosphere and Contentteam: A New Way to Run Cassandra
Mesosphere and Contentteam: A New Way to Run CassandraDataStax Academy
 
Cloud Formation
Cloud FormationCloud Formation
Cloud FormationAdron Hall
 
A Hitchhiker's Guide to the Cloud Native Stack
A Hitchhiker's Guide to the Cloud Native StackA Hitchhiker's Guide to the Cloud Native Stack
A Hitchhiker's Guide to the Cloud Native StackQAware GmbH
 
A Hitchhiker’s Guide to the Cloud Native Stack. #DevoxxPL
A Hitchhiker’s Guide to the Cloud Native Stack. #DevoxxPLA Hitchhiker’s Guide to the Cloud Native Stack. #DevoxxPL
A Hitchhiker’s Guide to the Cloud Native Stack. #DevoxxPLMario-Leander Reimer
 
Delivering Cloud-Native Data Pipelines with Kafka Connect on Kubernetes | Vik...
Delivering Cloud-Native Data Pipelines with Kafka Connect on Kubernetes | Vik...Delivering Cloud-Native Data Pipelines with Kafka Connect on Kubernetes | Vik...
Delivering Cloud-Native Data Pipelines with Kafka Connect on Kubernetes | Vik...HostedbyConfluent
 
Cassandra and Docker Lessons Learned
Cassandra and Docker Lessons LearnedCassandra and Docker Lessons Learned
Cassandra and Docker Lessons LearnedDataStax Academy
 
Clocker: Managing Container Networking and Placement
Clocker: Managing Container Networking and PlacementClocker: Managing Container Networking and Placement
Clocker: Managing Container Networking and PlacementDocker, Inc.
 
Clocker - The Docker Cloud Maker
Clocker - The Docker Cloud MakerClocker - The Docker Cloud Maker
Clocker - The Docker Cloud MakerAndrew Kennedy
 
Terraform-on-AWS-EKS-v5 adshdhddhowahaaaaaaaaaaa.pptx
Terraform-on-AWS-EKS-v5 adshdhddhowahaaaaaaaaaaa.pptxTerraform-on-AWS-EKS-v5 adshdhddhowahaaaaaaaaaaa.pptx
Terraform-on-AWS-EKS-v5 adshdhddhowahaaaaaaaaaaa.pptxvanitakirdak
 

Similar to Monitorer l'inconnu, 1000 * 100 series par jour - talk avec @clementd à #devoxxFR 2018 (20)

Domain Driven Rails
Domain Driven RailsDomain Driven Rails
Domain Driven Rails
 
Introduction to Serverless with AWS Lambda in C#.pptx
Introduction to Serverless with AWS Lambda in C#.pptxIntroduction to Serverless with AWS Lambda in C#.pptx
Introduction to Serverless with AWS Lambda in C#.pptx
 
Monitoring the unknown, 1000*100 series a day - Big Data Vilnius 2017
Monitoring the unknown, 1000*100 series a day - Big Data Vilnius 2017Monitoring the unknown, 1000*100 series a day - Big Data Vilnius 2017
Monitoring the unknown, 1000*100 series a day - Big Data Vilnius 2017
 
Introduction to Serverless with AWS Lambda in C#.pptx
Introduction to Serverless with AWS Lambda in C#.pptxIntroduction to Serverless with AWS Lambda in C#.pptx
Introduction to Serverless with AWS Lambda in C#.pptx
 
Docker and CloudStack
Docker and CloudStackDocker and CloudStack
Docker and CloudStack
 
Serverless Development To Production Pipeline
Serverless Development To Production PipelineServerless Development To Production Pipeline
Serverless Development To Production Pipeline
 
Resilient microservices with Kubernetes - Mete Atamel
Resilient microservices with Kubernetes - Mete AtamelResilient microservices with Kubernetes - Mete Atamel
Resilient microservices with Kubernetes - Mete Atamel
 
qewd-ripple: The Ripple OSI Middle Tier
qewd-ripple: The Ripple OSI Middle Tierqewd-ripple: The Ripple OSI Middle Tier
qewd-ripple: The Ripple OSI Middle Tier
 
Scaling Docker with Kubernetes
Scaling Docker with KubernetesScaling Docker with Kubernetes
Scaling Docker with Kubernetes
 
An introduction to Serverless
An introduction to ServerlessAn introduction to Serverless
An introduction to Serverless
 
[Container world 2017] The Questions You're Afraid to Ask about Containers
[Container world 2017] The Questions You're Afraid to Ask about Containers[Container world 2017] The Questions You're Afraid to Ask about Containers
[Container world 2017] The Questions You're Afraid to Ask about Containers
 
Mesosphere and Contentteam: A New Way to Run Cassandra
Mesosphere and Contentteam: A New Way to Run CassandraMesosphere and Contentteam: A New Way to Run Cassandra
Mesosphere and Contentteam: A New Way to Run Cassandra
 
Cloud Formation
Cloud FormationCloud Formation
Cloud Formation
 
A Hitchhiker's Guide to the Cloud Native Stack
A Hitchhiker's Guide to the Cloud Native StackA Hitchhiker's Guide to the Cloud Native Stack
A Hitchhiker's Guide to the Cloud Native Stack
 
A Hitchhiker’s Guide to the Cloud Native Stack. #DevoxxPL
A Hitchhiker’s Guide to the Cloud Native Stack. #DevoxxPLA Hitchhiker’s Guide to the Cloud Native Stack. #DevoxxPL
A Hitchhiker’s Guide to the Cloud Native Stack. #DevoxxPL
 
Delivering Cloud-Native Data Pipelines with Kafka Connect on Kubernetes | Vik...
Delivering Cloud-Native Data Pipelines with Kafka Connect on Kubernetes | Vik...Delivering Cloud-Native Data Pipelines with Kafka Connect on Kubernetes | Vik...
Delivering Cloud-Native Data Pipelines with Kafka Connect on Kubernetes | Vik...
 
Cassandra and Docker Lessons Learned
Cassandra and Docker Lessons LearnedCassandra and Docker Lessons Learned
Cassandra and Docker Lessons Learned
 
Clocker: Managing Container Networking and Placement
Clocker: Managing Container Networking and PlacementClocker: Managing Container Networking and Placement
Clocker: Managing Container Networking and Placement
 
Clocker - The Docker Cloud Maker
Clocker - The Docker Cloud MakerClocker - The Docker Cloud Maker
Clocker - The Docker Cloud Maker
 
Terraform-on-AWS-EKS-v5 adshdhddhowahaaaaaaaaaaa.pptx
Terraform-on-AWS-EKS-v5 adshdhddhowahaaaaaaaaaaa.pptxTerraform-on-AWS-EKS-v5 adshdhddhowahaaaaaaaaaaa.pptx
Terraform-on-AWS-EKS-v5 adshdhddhowahaaaaaaaaaaa.pptx
 

More from Quentin Adam

Biscuit, the cryptotoken you can share safely with your ap is
Biscuit, the cryptotoken you can share safely with your ap isBiscuit, the cryptotoken you can share safely with your ap is
Biscuit, the cryptotoken you can share safely with your ap isQuentin Adam
 
Keynot Cloud Expo Intel
Keynot Cloud Expo IntelKeynot Cloud Expo Intel
Keynot Cloud Expo IntelQuentin Adam
 
Pulsar 101 at devoxx
Pulsar 101 at devoxxPulsar 101 at devoxx
Pulsar 101 at devoxxQuentin Adam
 
Traversing hyper driven developpement to do great technical choices and make ...
Traversing hyper driven developpement to do great technical choices and make ...Traversing hyper driven developpement to do great technical choices and make ...
Traversing hyper driven developpement to do great technical choices and make ...Quentin Adam
 
How to make people work together? - ending keynote - devfest du bout du monde...
How to make people work together? - ending keynote - devfest du bout du monde...How to make people work together? - ending keynote - devfest du bout du monde...
How to make people work together? - ending keynote - devfest du bout du monde...Quentin Adam
 
Remove centralization on Authorization - API Days Paris 2018 (announcement fo...
Remove centralization on Authorization - API Days Paris 2018 (announcement fo...Remove centralization on Authorization - API Days Paris 2018 (announcement fo...
Remove centralization on Authorization - API Days Paris 2018 (announcement fo...Quentin Adam
 
PostgreSQL is the new NoSQL - at Devoxx 2018
PostgreSQL is the new NoSQL  - at Devoxx 2018PostgreSQL is the new NoSQL  - at Devoxx 2018
PostgreSQL is the new NoSQL - at Devoxx 2018Quentin Adam
 
Hype driven architecture - keynote at devfest Toulouse 2018
Hype driven architecture - keynote at devfest Toulouse 2018Hype driven architecture - keynote at devfest Toulouse 2018
Hype driven architecture - keynote at devfest Toulouse 2018Quentin Adam
 
Real world code, why are you so ashamed? - RivieraDev 2018
Real world code, why are you so ashamed? - RivieraDev 2018Real world code, why are you so ashamed? - RivieraDev 2018
Real world code, why are you so ashamed? - RivieraDev 2018Quentin Adam
 
Comment les contrôleurs de gestion ont fuck up mon IT - Lean Kanban France 2017
Comment les contrôleurs de gestion ont fuck up mon IT - Lean Kanban France 2017Comment les contrôleurs de gestion ont fuck up mon IT - Lean Kanban France 2017
Comment les contrôleurs de gestion ont fuck up mon IT - Lean Kanban France 2017Quentin Adam
 
Problems you’ll face in the Microservices World: Configuration, Authenticatio...
Problems you’ll face in the Microservices World: Configuration, Authenticatio...Problems you’ll face in the Microservices World: Configuration, Authenticatio...
Problems you’ll face in the Microservices World: Configuration, Authenticatio...Quentin Adam
 
MONITORING THE UNKNOWN, 1000*100 SERIES A DAY - DEVOXX MOROCCO 2017
MONITORING THE UNKNOWN, 1000*100 SERIES A DAY - DEVOXX MOROCCO 2017MONITORING THE UNKNOWN, 1000*100 SERIES A DAY - DEVOXX MOROCCO 2017
MONITORING THE UNKNOWN, 1000*100 SERIES A DAY - DEVOXX MOROCCO 2017Quentin Adam
 
Build a reverse proxy for modern immutable infrastructure - Sozu - Devops D D...
Build a reverse proxy for modern immutable infrastructure - Sozu - Devops D D...Build a reverse proxy for modern immutable infrastructure - Sozu - Devops D D...
Build a reverse proxy for modern immutable infrastructure - Sozu - Devops D D...Quentin Adam
 
What is Clever Cloud?
What is Clever Cloud?What is Clever Cloud?
What is Clever Cloud?Quentin Adam
 
Why all my software projects are late?
Why all my software projects are late?Why all my software projects are late?
Why all my software projects are late?Quentin Adam
 
Understand immutable infrastructure, what? Why? How? - Meta-Meetup DEVOPS NIGHT
Understand immutable infrastructure, what? Why? How? - Meta-Meetup DEVOPS NIGHT Understand immutable infrastructure, what? Why? How? - Meta-Meetup DEVOPS NIGHT
Understand immutable infrastructure, what? Why? How? - Meta-Meetup DEVOPS NIGHT Quentin Adam
 
What is systemd? Why use it? how does it work? - breizhcamp
What is systemd? Why use it? how does it work? - breizhcampWhat is systemd? Why use it? how does it work? - breizhcamp
What is systemd? Why use it? how does it work? - breizhcampQuentin Adam
 
Happy dev ... & ops
Happy dev ... & opsHappy dev ... & ops
Happy dev ... & opsQuentin Adam
 
What is systemd? Why use it? how does it work? - devoxx france 2017
What is systemd? Why use it? how does it work? - devoxx france 2017What is systemd? Why use it? how does it work? - devoxx france 2017
What is systemd? Why use it? how does it work? - devoxx france 2017Quentin Adam
 
Quelles changements de vision dans la cyber sécurité en 2017 ? - ADN OUEST, s...
Quelles changements de vision dans la cyber sécurité en 2017 ? - ADN OUEST, s...Quelles changements de vision dans la cyber sécurité en 2017 ? - ADN OUEST, s...
Quelles changements de vision dans la cyber sécurité en 2017 ? - ADN OUEST, s...Quentin Adam
 

More from Quentin Adam (20)

Biscuit, the cryptotoken you can share safely with your ap is
Biscuit, the cryptotoken you can share safely with your ap isBiscuit, the cryptotoken you can share safely with your ap is
Biscuit, the cryptotoken you can share safely with your ap is
 
Keynot Cloud Expo Intel
Keynot Cloud Expo IntelKeynot Cloud Expo Intel
Keynot Cloud Expo Intel
 
Pulsar 101 at devoxx
Pulsar 101 at devoxxPulsar 101 at devoxx
Pulsar 101 at devoxx
 
Traversing hyper driven developpement to do great technical choices and make ...
Traversing hyper driven developpement to do great technical choices and make ...Traversing hyper driven developpement to do great technical choices and make ...
Traversing hyper driven developpement to do great technical choices and make ...
 
How to make people work together? - ending keynote - devfest du bout du monde...
How to make people work together? - ending keynote - devfest du bout du monde...How to make people work together? - ending keynote - devfest du bout du monde...
How to make people work together? - ending keynote - devfest du bout du monde...
 
Remove centralization on Authorization - API Days Paris 2018 (announcement fo...
Remove centralization on Authorization - API Days Paris 2018 (announcement fo...Remove centralization on Authorization - API Days Paris 2018 (announcement fo...
Remove centralization on Authorization - API Days Paris 2018 (announcement fo...
 
PostgreSQL is the new NoSQL - at Devoxx 2018
PostgreSQL is the new NoSQL  - at Devoxx 2018PostgreSQL is the new NoSQL  - at Devoxx 2018
PostgreSQL is the new NoSQL - at Devoxx 2018
 
Hype driven architecture - keynote at devfest Toulouse 2018
Hype driven architecture - keynote at devfest Toulouse 2018Hype driven architecture - keynote at devfest Toulouse 2018
Hype driven architecture - keynote at devfest Toulouse 2018
 
Real world code, why are you so ashamed? - RivieraDev 2018
Real world code, why are you so ashamed? - RivieraDev 2018Real world code, why are you so ashamed? - RivieraDev 2018
Real world code, why are you so ashamed? - RivieraDev 2018
 
Comment les contrôleurs de gestion ont fuck up mon IT - Lean Kanban France 2017
Comment les contrôleurs de gestion ont fuck up mon IT - Lean Kanban France 2017Comment les contrôleurs de gestion ont fuck up mon IT - Lean Kanban France 2017
Comment les contrôleurs de gestion ont fuck up mon IT - Lean Kanban France 2017
 
Problems you’ll face in the Microservices World: Configuration, Authenticatio...
Problems you’ll face in the Microservices World: Configuration, Authenticatio...Problems you’ll face in the Microservices World: Configuration, Authenticatio...
Problems you’ll face in the Microservices World: Configuration, Authenticatio...
 
MONITORING THE UNKNOWN, 1000*100 SERIES A DAY - DEVOXX MOROCCO 2017
MONITORING THE UNKNOWN, 1000*100 SERIES A DAY - DEVOXX MOROCCO 2017MONITORING THE UNKNOWN, 1000*100 SERIES A DAY - DEVOXX MOROCCO 2017
MONITORING THE UNKNOWN, 1000*100 SERIES A DAY - DEVOXX MOROCCO 2017
 
Build a reverse proxy for modern immutable infrastructure - Sozu - Devops D D...
Build a reverse proxy for modern immutable infrastructure - Sozu - Devops D D...Build a reverse proxy for modern immutable infrastructure - Sozu - Devops D D...
Build a reverse proxy for modern immutable infrastructure - Sozu - Devops D D...
 
What is Clever Cloud?
What is Clever Cloud?What is Clever Cloud?
What is Clever Cloud?
 
Why all my software projects are late?
Why all my software projects are late?Why all my software projects are late?
Why all my software projects are late?
 
Understand immutable infrastructure, what? Why? How? - Meta-Meetup DEVOPS NIGHT
Understand immutable infrastructure, what? Why? How? - Meta-Meetup DEVOPS NIGHT Understand immutable infrastructure, what? Why? How? - Meta-Meetup DEVOPS NIGHT
Understand immutable infrastructure, what? Why? How? - Meta-Meetup DEVOPS NIGHT
 
What is systemd? Why use it? how does it work? - breizhcamp
What is systemd? Why use it? how does it work? - breizhcampWhat is systemd? Why use it? how does it work? - breizhcamp
What is systemd? Why use it? how does it work? - breizhcamp
 
Happy dev ... & ops
Happy dev ... & opsHappy dev ... & ops
Happy dev ... & ops
 
What is systemd? Why use it? how does it work? - devoxx france 2017
What is systemd? Why use it? how does it work? - devoxx france 2017What is systemd? Why use it? how does it work? - devoxx france 2017
What is systemd? Why use it? how does it work? - devoxx france 2017
 
Quelles changements de vision dans la cyber sécurité en 2017 ? - ADN OUEST, s...
Quelles changements de vision dans la cyber sécurité en 2017 ? - ADN OUEST, s...Quelles changements de vision dans la cyber sécurité en 2017 ? - ADN OUEST, s...
Quelles changements de vision dans la cyber sécurité en 2017 ? - ADN OUEST, s...
 

Recently uploaded

Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about usDynamic Netsoft
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptkotipi9215
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningVitsRangannavar
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfPower Karaoke
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 

Recently uploaded (20)

Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about us
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.ppt
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learning
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdf
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 

Monitorer l'inconnu, 1000 * 100 series par jour - talk avec @clementd à #devoxxFR 2018

Editor's Notes

  1. Clever Cloud: IT automation, on gère la prod, et on l’automatise Public Cloud, Entreprise, On Prem
  2. Interesting things happen when you’re not looking. For errors, you have logs, sure, but for subtler things, you need a finer view (CPU use, network usage, latency, …)
  3. Usually metrics are an ops thing: system-level metrics like CPU, RAM, network. Application-level metrics are important: JVM GC status, number of active sessions, number of validated carts…
  4. Zabbix, Centreon stuff like that are made for ops. Metrics are gathered host by host (ie machine by machine), and access control is complicated. Normally only ops have access to this platform, so it’s complex for devs to have access
  5. Application-level metrics are super important, and help to produce better applications.
  6. We tend to use metrics to gather information about outages or issues, but also to see how people use the applications, or to help with perf optimization efforts
  7. There are metrics system for application-level metrics, but both views are important.
  8. Servers still happen, so application-level and system-level metrics are both equally important, they give interesting context with each other.
  9. You need an unified metrics gathering pipeline, both for system and application-level metrics
  10. Systems like zabbix and centreon are not suited for new architectures
  11. Traditional monitoring / metrics in an immutable infrastructure does not make sense
  12. VMs are short-lived and disposable. In zabbix, metrics are lost when a VM is shut down.
  13. in an immutable infrastructure, one service is handled by many servers along its lifetime. you need to aggregate metrics by service, not see it server by server.
  14. Instead of a few servers, there are now hundreds or thousands of ephemeral virtual machines. Traditional metrics pipeline have a hard time with that.
  15. Clever Cloud: 20GB / hour
  16. ~100 series per VM, ~1000 new VMs everyday. Need to aggregate series with metadata, not feasible to consume data series by series.
  17. Immutable infrastructure, Thousands of applications, lots of tech stacks.
  18. Collection happens on the VMs, both for system & application level metrics
  19. We can’t just send metrics naively to the platform. Buffer: gather metrics and send batches periodically Retry: re-send metrics if needed Jitter: make sure all agents don’t send metrics at the same time
  20. Written in go, easy to deploy, lots of input plugins
  21. Understands the statsd protocol. Does local aggregation
  22. Polls a prometheus endpoint, does gathering
  23. statsd is push-based: metrics are sent directly. Less work on the app side prometheus is pull-based: app is queried by the agent. not so convenient, but with telegraf it’s not so bad.
  24. lots of data (1 point every minute * 100, + 1 point every 10 seconds for a few series, like CPU & RAM)
  25. we have specific needs: immutable infra, lots of applications, lots of tech stacks
  26. several thousand instances running at the same time
  27. We need both real time access for the dashboard, and analysis over longer periods.
  28. DBs made to store Time Series
  29. TimeSeries: a Series of points over time. Eg used RAM over time, or % Idle CPU time. Each point is the same thing, at a different time.
  30. TSDBs have different needs than regular DBs
  31. What is cardinality, why is it an important thing?
  32. Allow access delegation, everybody talks directly to the DB. Secure by default, simpler network config
  33. Different way to use the stored data.
  34. Just list points
  35. Aggregate and filter things
  36. Aggregate and filter things https://quantum.services.clever-cloud.com/#/warpscript/IlFCUGY0WkREYk5DNXE3TEwyVkUzVFRSZEdWZVk2V0Nqbm5JLnUxZ2hBRm9fMmNrSjNkZXMyQXlxcmsyWGd5T3RRZ3RKeFQ5d3dLblFaTi53WEdVVnlsNnZLbUN3Zzc4clIyWHYudE5fWHRXZ3JlUXZiQjhsVnVGQTVjS1NSd05tUkYzZUpReVFNOUYwdmR6NGdGRFJZVTNFNkRDOFZxUjhkMEhZZkwuTldxRnJxMk1OLmdRUGxoSXJBYUp6X29MZTdvZUpydFNXOFJ3IgondG9rZW4nIFNUT1JFCgokdG9rZW4KJ2Zhc3RfY3B1LnVzYWdlX2lkbGUnCnsgJ2hvc3QnICd%2BKDI2NjhiNDQ1LThiMmQtNDk1MS05OGRlLTgyYjcxNGVkOGU3YXw5ZTk0ZDU5NC0xOTU5LTQ2MjUtOTAwNC1iYmMzY2FlOGU2MTApJyAnYXBwX2lkJyAnPWFwcF9iODg2MTdhOC02MDRmLTQ2N2YtYTk2Zi02MDZjYWJhYjNjODYnICdjcHUnICdjcHUtdG90YWwnIH0KTk9XIC0xNDAKRkVUQ0g%3D/eyJ1cmwiOiJodHRwczovL2MxLXdhcnAxMC1jbGV2ZXJjbG91ZC1jdXN0b21lcnMuc2VydmljZXMuY2xldmVyLWNsb3VkLmNvbS9hcGkvdjAiLCJmZXRjaEVuZHBvaW50IjoiL2ZldGNoIiwiaGVhZGVyTmFtZSI6IlgtV2FycDEwIn0%3D
  37. https://quantum.services.clever-cloud.com/#/warpscript/IlFCUGY0WkREYk5DNXE3TEwyVkUzVFRSZEdWZVk2V0Nqbm5JLnUxZ2hBRm9fMmNrSjNkZXMyQXlxcmsyWGd5T3RRZ3RKeFQ5d3dLblFaTi53WEdVVnlsNnZLbUN3Zzc4clIyWHYudE5fWHRXZ3JlUXZiQjhsVnVGQTVjS1NSd05tUkYzZUpReVFNOUYwdmR6NGdGRFJZVTNFNkRDOFZxUjhkMEhZZkwuTldxRnJxMk1OLmdRUGxoSXJBYUp6X29MZTdvZUpydFNXOFJ3IgondG9rZW4nIFNUT1JFCgokdG9rZW4KJ2Zhc3RfY3B1LnVzYWdlX2lkbGUnCnsgJ2hvc3QnICd%2BKDI2NjhiNDQ1LThiMmQtNDk1MS05OGRlLTgyYjcxNGVkOGU3YXw5ZTk0ZDU5NC0xOTU5LTQ2MjUtOTAwNC1iYmMzY2FlOGU2MTApJyAnYXBwX2lkJyAnPWFwcF9iODg2MTdhOC02MDRmLTQ2N2YtYTk2Zi02MDZjYWJhYjNjODYnICdjcHUnICdjcHUtdG90YWwnIH0KTk9XIC0xNDAKRkVUQ0gKJ3BvaW50cycgU1RPUkUKClsKICAkcG9pbnRzCiAgYnVja2V0aXplci5tZWFuCiAgMAogIDAKICA1Cl0gQlVDS0VUSVpF/eyJ1cmwiOiJodHRwczovL2MxLXdhcnAxMC1jbGV2ZXJjbG91ZC1jdXN0b21lcnMuc2VydmljZXMuY2xldmVyLWNsb3VkLmNvbS9hcGkvdjAiLCJmZXRjaEVuZHBvaW50IjoiL2ZldGNoIiwiaGVhZGVyTmFtZSI6IlgtV2FycDEwIn0%3D
  38. https://quantum.services.clever-cloud.com/#/warpscript/IlFCUGY0WkREYk5DNXE3TEwyVkUzVFRSZEdWZVk2V0Nqbm5JLnUxZ2hBRm9fMmNrSjNkZXMyQXlxcmsyWGd5T3RRZ3RKeFQ5d3dLblFaTi53WEdVVnlsNnZLbUN3Zzc4clIyWHYudE5fWHRXZ3JlUXZiQjhsVnVGQTVjS1NSd05tUkYzZUpReVFNOUYwdmR6NGdGRFJZVTNFNkRDOFZxUjhkMEhZZkwuTldxRnJxMk1OLmdRUGxoSXJBYUp6X29MZTdvZUpydFNXOFJ3IgondG9rZW4nIFNUT1JFCgokdG9rZW4KJ2Zhc3RfY3B1LnVzYWdlX2lkbGUnCnsgJ2hvc3QnICd%2BKDI2NjhiNDQ1LThiMmQtNDk1MS05OGRlLTgyYjcxNGVkOGU3YXw5ZTk0ZDU5NC0xOTU5LTQ2MjUtOTAwNC1iYmMzY2FlOGU2MTApJyAnYXBwX2lkJyAnPWFwcF9iODg2MTdhOC02MDRmLTQ2N2YtYTk2Zi02MDZjYWJhYjNjODYnICdjcHUnICdjcHUtdG90YWwnIH0KTk9XIC0xNDAKRkVUQ0gKJ3BvaW50cycgU1RPUkUKClsKICAkcG9pbnRzCiAgbWFwcGVyLnJhdGUKICAxCiAgMAogIDAKXSBNQVA%3D/eyJ1cmwiOiJodHRwczovL2MxLXdhcnAxMC1jbGV2ZXJjbG91ZC1jdXN0b21lcnMuc2VydmljZXMuY2xldmVyLWNsb3VkLmNvbS9hcGkvdjAiLCJmZXRjaEVuZHBvaW50IjoiL2ZldGNoIiwiaGVhZGVyTmFtZSI6IlgtV2FycDEwIn0%3D
  39. https://quantum.services.clever-cloud.com/#/warpscript/IlFCUGY0WkREYk5DNXE3TEwyVkUzVFRSZEdWZVk2V0Nqbm5JLnUxZ2hBRm9fMmNrSjNkZXMyQXlxcmsyWGd5T3RRZ3RKeFQ5d3dLblFaTi53WEdVVnlsNnZLbUN3Zzc4clIyWHYudE5fWHRXZ3JlUXZiQjhsVnVGQTVjS1NSd05tUkYzZUpReVFNOUYwdmR6NGdGRFJZVTNFNkRDOFZxUjhkMEhZZkwuTldxRnJxMk1OLmdRUGxoSXJBYUp6X29MZTdvZUpydFNXOFJ3IgondG9rZW4nIFNUT1JFCgokdG9rZW4KJ2Zhc3RfY3B1LnVzYWdlX2lkbGUnCnsgJ2hvc3QnICd%2BKDI2NjhiNDQ1LThiMmQtNDk1MS05OGRlLTgyYjcxNGVkOGU3YXw5ZTk0ZDU5NC0xOTU5LTQ2MjUtOTAwNC1iYmMzY2FlOGU2MTApJyAnYXBwX2lkJyAnPWFwcF9iODg2MTdhOC02MDRmLTQ2N2YtYTk2Zi02MDZjYWJhYjNjODYnICdjcHUnICdjcHUtdG90YWwnIH0KTk9XIC0xNDAKRkVUQ0gKJ3BvaW50cycgU1RPUkUKClsKICAkcG9pbnRzCiAgWyAnYXBwX2lkJyBdCiAgcmVkdWNlci5tZWFuLmV4Y2x1ZGUtbnVsbHMKXSBSRURVQ0U%3D/eyJ1cmwiOiJodHRwczovL2MxLXdhcnAxMC1jbGV2ZXJjbG91ZC1jdXN0b21lcnMuc2VydmljZXMuY2xldmVyLWNsb3VkLmNvbS9hcGkvdjAiLCJmZXRjaEVuZHBvaW50IjoiL2ZldGNoIiwiaGVhZGVyTmFtZSI6IlgtV2FycDEwIn0%3D
  40. Before automation, visual exploration is super important.
  41. Great way to see what’s going on
  42. Metrics are stored without units. Are those bits, bytes, megabytes, number of packets?
  43. We have data, but that’s not enough. We’re not interested in raw data points, we want meaning.
  44. We have data, but that’s not enough. We’re not interested in raw data points, we want meaning.