SlideShare a Scribd company logo
1 of 30
Download to read offline
1
ClickHouse
at ContentSquare
2
Summary
What are we doing at ContentSquare?
What have we done so far with ClickHouse?
What do we think about ClickHouse?
3
Who are we
Christophe Kalenzaga
Senior Data Engineer
Vianney Foucault
Senior Platform Engineer
4
ContentSquare in a few words
● Web analytic Company
○ 600GB collected per day
○ 13 months retention (~230 TBytes)
○ websites and mobile applications
● 100+ clients Worldwide
○ Automotive | BFSI | Retail | Grocery | Luxury
○ B2B | Energy | Travel | Telco | Gaming
● Collected data doubles every year
5
One of the many challenges of ContentSquare
6
Our workload
● 30k queries per day
● Skewed load
● CPU intensive queries
● 20% of cacheable queries
● Can’t precompute queries
7
current backend infrastructure
● Many ElasticSearch clusters
● Storage cost
● Compute cost
● Can’t handle clients with too much data
Scala Microservices
Data Pipeline Components
(Scala/Spark)
8
to sum up
9
what we’ve done so far
Aug 2018Jun 2018Mar 2018Jan 2018Dec 2017
Selection of a few technologies
to replace ElasticSearch
Start mobile analytics
solution on ClickHouse
One-week POC on ClickHouse on a
specific feature of the web analytics
solution First release of the mobile analytics
solution on clickhouse.
insert data from the web analytics
solution into ClickHouse check if
there are no side effects
Start migrating the web analytic
solution to ClickHouse
Benchmarks to find the
right data model and the
right configuration
the mobile analytics solution is
released
Looking for a new technology
POC of multiple technologies
10
How to benchmark?
11
Benchmark methodology
● Define different types of query
● Get the statistical distribution of each type of query in production
● Create a dataset (10 TBytes)
● Create queries with the same statistical distribution as in production
● Be deterministic
12
Benchmark Constraints for a #Platfomer
Deployment
CPU’s
Memory
Configuration
Management
Deterministic
Automation Cost Optimisation
Monitoring
Data Load
13
Benchmark timeline
{ Time consuming Manual tasks }
Iteration
Setup
infrastructure
Deploy
Middlewares
~ 4 hours ~ 12 hours
Load test Data Benchmark
{ Time consuming }
Cleanup
14
Platformer’s Words
DevOps Motto:
Industrialise As Soon As Possible
15
Fully Automated Benchmark timeline
~ 4 hours ~ 12 hours5 Minutes 1 Minute
Iteration
Setup
infrastructure
Deploy
Middlewares
Load test Data Benchmark Cleanup
DevOps Approved
16
Platformer’s Words
DevOps Leitmotiv:
Monitor As Soon As Possible
17
Benchmark iterations on operating systems
http://openbenchmarking.org/result/1704225-TR-AMAZON38819
18
Benchmark iterations on different operating systems
ResponseTime
Percentiles
+60%
+60%
19
Benchmark iterations on Instances
Instance Type vCPU Mem (GiB) Networking Perf. $ Per Hour Perf Index
i3.large 2 15,25 Up to 10 Gigabit 0.172 1
i3.xlarge 4 30,5 Up to 10 Gigabit 0.344 1
i3.2xlarge 8 61 Up to 10 Gigabit 0.688 1
i3.4xlarge 16 122 Up to 10 Gigabit 1.376 1
i3.8xlarge 32 244 10 Gigabit 2.752 1
i3.16xlarge 64 488 25 Gigabit 5.504 1
i3.metal 72 512 25 Gigabit 5.504 1.3
20
Benchmark iterations on different Amazon servers
All the cluster configurations have the same price!
ResponseTime
Percentiles
Blue To Red +40%
21
Numa Architecture
QPI Bus
ClickHouse Instance 1
MEMORY CPU
I/O Bridge
MEMORYCPU
I/O Bridge
Linux OS
NUMA Node0 NUMA Node1
ClickHouse Instance 2
22
Benchmark iterations on scalability
ResponseTime
Percentiles
+85%
+55%
23
Benchmark iterations on parallelism
ResponseTime
Percentiles
24
New Backend Infrastructure (Zoom in)
1 EC2 (i3.metal) instance hosts 2 clickhouse servers
1
1’
2
2’
3
3’
4
4’
5
5’
6
6’
ClickhouseCluster
AZ-1aAZ-1b
Replication
25
New Backend Infrastructure (Big picture)
ClickHouse Cluster
Data Pipeline
M
assiveInserts
Scala
Microservices
Reads
CHProxy Servers
With local Cache
Monitoring: Prometheus / Grafana
Load Balancers
26
Next Steps
Cluster Extension
Major Disaster
Staging Refresh
Major ClickHouse
upgrade
Full InHouse Automation
27
What do we like with ClickHouse
Native sampling
Stability
Easy to understand
Fast
Active developments
28
What parts of ClickHouse still need to be improved
Difficult to master
Lack of tooling
Random stability of a new version
No query optimizer
29
Do we recommend ClickHouse?
30
Any question?
Thank you
We’re hiring!

More Related Content

What's hot

Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...
Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...
Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...
Flink Forward
 

What's hot (20)

Grafana introduction
Grafana introductionGrafana introduction
Grafana introduction
 
The perfect couple: Uniting Large Language Models and Knowledge Graphs for En...
The perfect couple: Uniting Large Language Models and Knowledge Graphs for En...The perfect couple: Uniting Large Language Models and Knowledge Graphs for En...
The perfect couple: Uniting Large Language Models and Knowledge Graphs for En...
 
ClickHouse Intro
ClickHouse IntroClickHouse Intro
ClickHouse Intro
 
Better than you think: Handling JSON data in ClickHouse
Better than you think: Handling JSON data in ClickHouseBetter than you think: Handling JSON data in ClickHouse
Better than you think: Handling JSON data in ClickHouse
 
Haystack 2019 - Making the case for human judgement relevance testing - Tara ...
Haystack 2019 - Making the case for human judgement relevance testing - Tara ...Haystack 2019 - Making the case for human judgement relevance testing - Tara ...
Haystack 2019 - Making the case for human judgement relevance testing - Tara ...
 
Learning to Rank for Recommender Systems - ACM RecSys 2013 tutorial
Learning to Rank for Recommender Systems -  ACM RecSys 2013 tutorialLearning to Rank for Recommender Systems -  ACM RecSys 2013 tutorial
Learning to Rank for Recommender Systems - ACM RecSys 2013 tutorial
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
A Fast Intro to Fast Query with ClickHouse, by Robert Hodges
A Fast Intro to Fast Query with ClickHouse, by Robert HodgesA Fast Intro to Fast Query with ClickHouse, by Robert Hodges
A Fast Intro to Fast Query with ClickHouse, by Robert Hodges
 
Shallow and Deep Latent Models for Recommender System
Shallow and Deep Latent Models for Recommender SystemShallow and Deep Latent Models for Recommender System
Shallow and Deep Latent Models for Recommender System
 
Explainability for Learning to Rank
Explainability for Learning to RankExplainability for Learning to Rank
Explainability for Learning to Rank
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
Apache Spark on K8S and HDFS Security with Ilan Flonenko
Apache Spark on K8S and HDFS Security with Ilan FlonenkoApache Spark on K8S and HDFS Security with Ilan Flonenko
Apache Spark on K8S and HDFS Security with Ilan Flonenko
 
ClickHouse ReplacingMergeTree in Telecom Apps
ClickHouse ReplacingMergeTree in Telecom AppsClickHouse ReplacingMergeTree in Telecom Apps
ClickHouse ReplacingMergeTree in Telecom Apps
 
Altinity Quickstart for ClickHouse
Altinity Quickstart for ClickHouseAltinity Quickstart for ClickHouse
Altinity Quickstart for ClickHouse
 
Tableau Conference 2018: Binging on Data - Enabling Analytics at Netflix
Tableau Conference 2018: Binging on Data - Enabling Analytics at NetflixTableau Conference 2018: Binging on Data - Enabling Analytics at Netflix
Tableau Conference 2018: Binging on Data - Enabling Analytics at Netflix
 
InfluxDB & Grafana
InfluxDB & GrafanaInfluxDB & Grafana
InfluxDB & Grafana
 
Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...
Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...
Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...
 
Scaling Spatial Analytics with Google Cloud & CARTO
Scaling Spatial Analytics with Google Cloud & CARTOScaling Spatial Analytics with Google Cloud & CARTO
Scaling Spatial Analytics with Google Cloud & CARTO
 
ML Infrastracture @ Dropbox
ML Infrastracture @ Dropbox ML Infrastracture @ Dropbox
ML Infrastracture @ Dropbox
 
How Uber scaled its Real Time Infrastructure to Trillion events per day
How Uber scaled its Real Time Infrastructure to Trillion events per dayHow Uber scaled its Real Time Infrastructure to Trillion events per day
How Uber scaled its Real Time Infrastructure to Trillion events per day
 

Similar to Clickhouse MeetUp@ContentSquare - ContentSquare's Experience Sharing

Network Automation Journey, A systems engineer NetOps perspective
Network Automation Journey, A systems engineer NetOps perspectiveNetwork Automation Journey, A systems engineer NetOps perspective
Network Automation Journey, A systems engineer NetOps perspective
Walid Shaari
 
Apache Beam and Google Cloud Dataflow - IDG - final
Apache Beam and Google Cloud Dataflow - IDG - finalApache Beam and Google Cloud Dataflow - IDG - final
Apache Beam and Google Cloud Dataflow - IDG - final
Sub Szabolcs Feczak
 

Similar to Clickhouse MeetUp@ContentSquare - ContentSquare's Experience Sharing (20)

The Practice of Presto & Alluxio in E-Commerce Big Data Platform
The Practice of Presto & Alluxio in E-Commerce Big Data PlatformThe Practice of Presto & Alluxio in E-Commerce Big Data Platform
The Practice of Presto & Alluxio in E-Commerce Big Data Platform
 
Network Automation Journey, A systems engineer NetOps perspective
Network Automation Journey, A systems engineer NetOps perspectiveNetwork Automation Journey, A systems engineer NetOps perspective
Network Automation Journey, A systems engineer NetOps perspective
 
Capacity Planning Infrastructure for Web Applications (Drupal)
Capacity Planning Infrastructure for Web Applications (Drupal)Capacity Planning Infrastructure for Web Applications (Drupal)
Capacity Planning Infrastructure for Web Applications (Drupal)
 
Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...
Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...
Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...
 
Streaming in the Wild with Apache Flink
Streaming in the Wild with Apache FlinkStreaming in the Wild with Apache Flink
Streaming in the Wild with Apache Flink
 
Challenges Encountered by Scaling Up Recommendation Services at Gravity R&D
Challenges Encountered by Scaling Up Recommendation Services at Gravity R&DChallenges Encountered by Scaling Up Recommendation Services at Gravity R&D
Challenges Encountered by Scaling Up Recommendation Services at Gravity R&D
 
Data Engineer's Lunch #60: Series - Developing Enterprise Consciousness
Data Engineer's Lunch #60: Series - Developing Enterprise ConsciousnessData Engineer's Lunch #60: Series - Developing Enterprise Consciousness
Data Engineer's Lunch #60: Series - Developing Enterprise Consciousness
 
Monitoring and Scaling Redis at DataDog - Ilan Rabinovitch, DataDog
 Monitoring and Scaling Redis at DataDog - Ilan Rabinovitch, DataDog Monitoring and Scaling Redis at DataDog - Ilan Rabinovitch, DataDog
Monitoring and Scaling Redis at DataDog - Ilan Rabinovitch, DataDog
 
PAD-3126 - Evolving the DevOps Organization around IBM PureApplication System...
PAD-3126 - Evolving the DevOps Organization around IBM PureApplication System...PAD-3126 - Evolving the DevOps Organization around IBM PureApplication System...
PAD-3126 - Evolving the DevOps Organization around IBM PureApplication System...
 
Adtech scala-performance-tuning-150323223738-conversion-gate01
Adtech scala-performance-tuning-150323223738-conversion-gate01Adtech scala-performance-tuning-150323223738-conversion-gate01
Adtech scala-performance-tuning-150323223738-conversion-gate01
 
Adtech x Scala x Performance tuning
Adtech x Scala x Performance tuningAdtech x Scala x Performance tuning
Adtech x Scala x Performance tuning
 
RightScale Roadtrip Boston: Accelerate to Cloud
RightScale Roadtrip Boston: Accelerate to CloudRightScale Roadtrip Boston: Accelerate to Cloud
RightScale Roadtrip Boston: Accelerate to Cloud
 
Streaming in the Wild with Apache Flink
Streaming in the Wild with Apache FlinkStreaming in the Wild with Apache Flink
Streaming in the Wild with Apache Flink
 
Cloud Computing – A CFO Briefing
Cloud Computing – A CFO BriefingCloud Computing – A CFO Briefing
Cloud Computing – A CFO Briefing
 
Thinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-AriThinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-Ari
 
Migration of a high-traffic E-commerce website from Legacy Monolith to Micros...
Migration of a high-traffic E-commerce website from Legacy Monolith to Micros...Migration of a high-traffic E-commerce website from Legacy Monolith to Micros...
Migration of a high-traffic E-commerce website from Legacy Monolith to Micros...
 
Path to continuous delivery
Path to continuous deliveryPath to continuous delivery
Path to continuous delivery
 
Mainframe Application Testing both With and Without Live Data
Mainframe Application Testing both With and Without Live DataMainframe Application Testing both With and Without Live Data
Mainframe Application Testing both With and Without Live Data
 
Apache Beam and Google Cloud Dataflow - IDG - final
Apache Beam and Google Cloud Dataflow - IDG - finalApache Beam and Google Cloud Dataflow - IDG - final
Apache Beam and Google Cloud Dataflow - IDG - final
 
Performance testing in scope of migration to cloud by Serghei Radov
Performance testing in scope of migration to cloud by Serghei RadovPerformance testing in scope of migration to cloud by Serghei Radov
Performance testing in scope of migration to cloud by Serghei Radov
 

Recently uploaded

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Recently uploaded (20)

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 

Clickhouse MeetUp@ContentSquare - ContentSquare's Experience Sharing

  • 2. 2 Summary What are we doing at ContentSquare? What have we done so far with ClickHouse? What do we think about ClickHouse?
  • 3. 3 Who are we Christophe Kalenzaga Senior Data Engineer Vianney Foucault Senior Platform Engineer
  • 4. 4 ContentSquare in a few words ● Web analytic Company ○ 600GB collected per day ○ 13 months retention (~230 TBytes) ○ websites and mobile applications ● 100+ clients Worldwide ○ Automotive | BFSI | Retail | Grocery | Luxury ○ B2B | Energy | Travel | Telco | Gaming ● Collected data doubles every year
  • 5. 5 One of the many challenges of ContentSquare
  • 6. 6 Our workload ● 30k queries per day ● Skewed load ● CPU intensive queries ● 20% of cacheable queries ● Can’t precompute queries
  • 7. 7 current backend infrastructure ● Many ElasticSearch clusters ● Storage cost ● Compute cost ● Can’t handle clients with too much data Scala Microservices Data Pipeline Components (Scala/Spark)
  • 9. 9 what we’ve done so far Aug 2018Jun 2018Mar 2018Jan 2018Dec 2017 Selection of a few technologies to replace ElasticSearch Start mobile analytics solution on ClickHouse One-week POC on ClickHouse on a specific feature of the web analytics solution First release of the mobile analytics solution on clickhouse. insert data from the web analytics solution into ClickHouse check if there are no side effects Start migrating the web analytic solution to ClickHouse Benchmarks to find the right data model and the right configuration the mobile analytics solution is released Looking for a new technology POC of multiple technologies
  • 11. 11 Benchmark methodology ● Define different types of query ● Get the statistical distribution of each type of query in production ● Create a dataset (10 TBytes) ● Create queries with the same statistical distribution as in production ● Be deterministic
  • 12. 12 Benchmark Constraints for a #Platfomer Deployment CPU’s Memory Configuration Management Deterministic Automation Cost Optimisation Monitoring Data Load
  • 13. 13 Benchmark timeline { Time consuming Manual tasks } Iteration Setup infrastructure Deploy Middlewares ~ 4 hours ~ 12 hours Load test Data Benchmark { Time consuming } Cleanup
  • 15. 15 Fully Automated Benchmark timeline ~ 4 hours ~ 12 hours5 Minutes 1 Minute Iteration Setup infrastructure Deploy Middlewares Load test Data Benchmark Cleanup DevOps Approved
  • 17. 17 Benchmark iterations on operating systems http://openbenchmarking.org/result/1704225-TR-AMAZON38819
  • 18. 18 Benchmark iterations on different operating systems ResponseTime Percentiles +60% +60%
  • 19. 19 Benchmark iterations on Instances Instance Type vCPU Mem (GiB) Networking Perf. $ Per Hour Perf Index i3.large 2 15,25 Up to 10 Gigabit 0.172 1 i3.xlarge 4 30,5 Up to 10 Gigabit 0.344 1 i3.2xlarge 8 61 Up to 10 Gigabit 0.688 1 i3.4xlarge 16 122 Up to 10 Gigabit 1.376 1 i3.8xlarge 32 244 10 Gigabit 2.752 1 i3.16xlarge 64 488 25 Gigabit 5.504 1 i3.metal 72 512 25 Gigabit 5.504 1.3
  • 20. 20 Benchmark iterations on different Amazon servers All the cluster configurations have the same price! ResponseTime Percentiles Blue To Red +40%
  • 21. 21 Numa Architecture QPI Bus ClickHouse Instance 1 MEMORY CPU I/O Bridge MEMORYCPU I/O Bridge Linux OS NUMA Node0 NUMA Node1 ClickHouse Instance 2
  • 22. 22 Benchmark iterations on scalability ResponseTime Percentiles +85% +55%
  • 23. 23 Benchmark iterations on parallelism ResponseTime Percentiles
  • 24. 24 New Backend Infrastructure (Zoom in) 1 EC2 (i3.metal) instance hosts 2 clickhouse servers 1 1’ 2 2’ 3 3’ 4 4’ 5 5’ 6 6’ ClickhouseCluster AZ-1aAZ-1b Replication
  • 25. 25 New Backend Infrastructure (Big picture) ClickHouse Cluster Data Pipeline M assiveInserts Scala Microservices Reads CHProxy Servers With local Cache Monitoring: Prometheus / Grafana Load Balancers
  • 26. 26 Next Steps Cluster Extension Major Disaster Staging Refresh Major ClickHouse upgrade Full InHouse Automation
  • 27. 27 What do we like with ClickHouse Native sampling Stability Easy to understand Fast Active developments
  • 28. 28 What parts of ClickHouse still need to be improved Difficult to master Lack of tooling Random stability of a new version No query optimizer
  • 29. 29 Do we recommend ClickHouse?