SlideShare a Scribd company logo
Submit Search
Upload
Login
Signup
1027 predictive models in 10 seconds, by David Pardo Villaverde, Corunet
Report
Altinity Ltd
Follow
Altinity Ltd
Apr. 9, 2019
•
0 likes
•
1,030 views
1
of
27
1027 predictive models in 10 seconds, by David Pardo Villaverde, Corunet
Apr. 9, 2019
•
0 likes
•
1,030 views
Report
Technology
Presented at ClickHouse Meetup in Madrid, April 2, 2019
Altinity Ltd
Follow
Altinity Ltd
Recommended
Altinity Cluster Manager: ClickHouse Management for Kubernetes and Cloud
Altinity Ltd
1.3K views
•
26 slides
ClickHouse on Kubernetes! By Robert Hodges, Altinity CEO
Altinity Ltd
2.1K views
•
22 slides
Building ClickHouse and Making Your First Contribution: A Tutorial_06.10.2021
Altinity Ltd
605 views
•
64 slides
Mux loves Clickhouse. By Adam Brown, Mux founder
Altinity Ltd
1.4K views
•
49 slides
Big Data and Beautiful Video: How ClickHouse enables Mux to Deliver Content a...
Altinity Ltd
857 views
•
44 slides
ClickHouse Monitoring 101: What to monitor and how
Altinity Ltd
2.3K views
•
38 slides
More Related Content
What's hot
High Performance, High Reliability Data Loading on ClickHouse
Altinity Ltd
2.3K views
•
37 slides
Introduction to the Mysteries of ClickHouse Replication, By Robert Hodges and...
Altinity Ltd
12.4K views
•
50 slides
ClickHouse Defense Against the Dark Arts - Intro to Security and Privacy
Altinity Ltd
1.3K views
•
43 slides
Data warehouse on Kubernetes - gentle intro to Clickhouse Operator, by Robert...
Altinity Ltd
967 views
•
43 slides
MariaDB and Clickhouse Percona Live 2019 talk
Alexander Rubin
354 views
•
50 slides
ClickHouse on Kubernetes, by Alexander Zaitsev, Altinity CTO
Altinity Ltd
4.1K views
•
34 slides
What's hot
(20)
High Performance, High Reliability Data Loading on ClickHouse
Altinity Ltd
•
2.3K views
Introduction to the Mysteries of ClickHouse Replication, By Robert Hodges and...
Altinity Ltd
•
12.4K views
ClickHouse Defense Against the Dark Arts - Intro to Security and Privacy
Altinity Ltd
•
1.3K views
Data warehouse on Kubernetes - gentle intro to Clickhouse Operator, by Robert...
Altinity Ltd
•
967 views
MariaDB and Clickhouse Percona Live 2019 talk
Alexander Rubin
•
354 views
ClickHouse on Kubernetes, by Alexander Zaitsev, Altinity CTO
Altinity Ltd
•
4.1K views
Tiered storage intro. By Robert Hodges, Altinity CEO
Altinity Ltd
•
817 views
Big Data in Real-Time: How ClickHouse powers Admiral's visitor relationships ...
Altinity Ltd
•
982 views
New features in ProxySQL 2.0 (updated to 2.0.9) by Rene Cannao (ProxySQL)
Altinity Ltd
•
2.3K views
ClickHouse in Real Life. Case Studies and Best Practices, by Alexander Zaitsev
Altinity Ltd
•
3.7K views
Your first ClickHouse data warehouse
Altinity Ltd
•
1.2K views
Webinar slides: Adding Fast Analytics to MySQL Applications with Clickhouse
Altinity Ltd
•
1.3K views
ClickHouse and the Magic of Materialized Views, By Robert Hodges and Altinity...
Altinity Ltd
•
8.7K views
Analytics at Speed: Introduction to ClickHouse and Common Use Cases. By Mikha...
Altinity Ltd
•
2.1K views
ClickHouse Mark Cache, by Mik Kocikowski, Cloudflare
Altinity Ltd
•
970 views
ClickHouse new features and development roadmap, by Aleksei Milovidov
Altinity Ltd
•
3.4K views
Bitquery GraphQL for Analytics on ClickHouse
Altinity Ltd
•
696 views
ClickHouse Materialized Views: The Magic Continues
Altinity Ltd
•
2K views
Data Warehouse on Kubernetes: lessons from Clickhouse Operator
Altinity Ltd
•
2.4K views
Five Great Ways to Lose Data on Kubernetes - KubeCon EU 2020
Altinity Ltd
•
912 views
Similar to 1027 predictive models in 10 seconds, by David Pardo Villaverde, Corunet
NodeJS for Beginner
Apaichon Punopas
14.9K views
•
54 slides
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
Amazon Web Services
800 views
•
52 slides
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
Amazon Web Services
332 views
•
51 slides
Deep Dive on Amazon EC2 instances
Amazon Web Services
2.4K views
•
50 slides
Defcon CTF quals
snyff
1.5K views
•
30 slides
Apache Pinot Meetup Sept02, 2020
Mayank Shrivastava
902 views
•
74 slides
Similar to 1027 predictive models in 10 seconds, by David Pardo Villaverde, Corunet
(20)
NodeJS for Beginner
Apaichon Punopas
•
14.9K views
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
Amazon Web Services
•
800 views
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
Amazon Web Services
•
332 views
Deep Dive on Amazon EC2 instances
Amazon Web Services
•
2.4K views
Defcon CTF quals
snyff
•
1.5K views
Apache Pinot Meetup Sept02, 2020
Mayank Shrivastava
•
902 views
Virtualization and Cloud Computing with Elastic Server On Demand
Yan Pritzker
•
33.5K views
Web 3, Week 1: Amazon Web Services for Beginners
jkosoy
•
1.2K views
Pen Testing Development
CTruncer
•
804 views
Scaling Docker Containers using Kubernetes and Azure Container Service
Ben Hall
•
803 views
Tales from the Field
MongoDB
•
1.4K views
Using GPUs to handle Big Data with Java by Adam Roberts.
J On The Beach
•
876 views
Become a Java GC Hero - All Day Devops
Tier1app
•
249 views
Nvidia® cuda™ 5 sample evaluationresult_2
Yukio Saito
•
977 views
KDB+ Lite
Sayanosauras
•
994 views
Node azure
Emanuele DelBono
•
1.6K views
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
Amazon Web Services
•
524 views
Let’s talk virtualization
Etienne Tremblay
•
611 views
Advanced Cassandra
DataStax Academy
•
1.6K views
New Jersey Red Hat Users Group Presentation: Provisioning anywhere
Rodrique Heron
•
142 views
More from Altinity Ltd
Building an Analytic Extension to MySQL with ClickHouse and Open Source.pptx
Altinity Ltd
17 views
•
36 slides
Cloud Native ClickHouse at Scale--Using the Altinity Kubernetes Operator-2022...
Altinity Ltd
439 views
•
43 slides
Building an Analytic Extension to MySQL with ClickHouse and Open Source
Altinity Ltd
149 views
•
36 slides
Fun with ClickHouse Window Functions-2021-08-19.pdf
Altinity Ltd
76 views
•
34 slides
Cloud Native Data Warehouses - Intro to ClickHouse on Kubernetes-2021-07.pdf
Altinity Ltd
54 views
•
31 slides
Building High Performance Apps with Altinity Stable Builds for ClickHouse | A...
Altinity Ltd
139 views
•
35 slides
More from Altinity Ltd
(20)
Building an Analytic Extension to MySQL with ClickHouse and Open Source.pptx
Altinity Ltd
•
17 views
Cloud Native ClickHouse at Scale--Using the Altinity Kubernetes Operator-2022...
Altinity Ltd
•
439 views
Building an Analytic Extension to MySQL with ClickHouse and Open Source
Altinity Ltd
•
149 views
Fun with ClickHouse Window Functions-2021-08-19.pdf
Altinity Ltd
•
76 views
Cloud Native Data Warehouses - Intro to ClickHouse on Kubernetes-2021-07.pdf
Altinity Ltd
•
54 views
Building High Performance Apps with Altinity Stable Builds for ClickHouse | A...
Altinity Ltd
•
139 views
Application Monitoring using Open Source - VictoriaMetrics & Altinity ClickHo...
Altinity Ltd
•
387 views
Own your ClickHouse data with Altinity.Cloud Anywhere-2023-01-17.pdf
Altinity Ltd
•
207 views
Adventures with the ClickHouse ReplacingMergeTree Engine
Altinity Ltd
•
307 views
Building a Real-Time Analytics Application with Apache Pulsar and Apache Pinot
Altinity Ltd
•
124 views
Altinity Webinar: Introduction to Altinity.Cloud-Platform for Real-Time Data.pdf
Altinity Ltd
•
150 views
OSA Con 2022 - What Data Engineering Can Learn from Frontend Engineering - Pe...
Altinity Ltd
•
27 views
OSA Con 2022 - Welcome to OSA CON Version 2022 - Robert Hodges - Altinity.pdf
Altinity Ltd
•
19 views
OSA Con 2022 - Using ClickHouse Database to Power Analytics and Customer Enga...
Altinity Ltd
•
65 views
OSA Con 2022 - Tips and Tricks to Keep Your Queries under 100ms with ClickHou...
Altinity Ltd
•
48 views
OSA Con 2022 - The Open Source Analytic Universe, Version 2022 - Robert Hodge...
Altinity Ltd
•
19 views
OSA Con 2022 - Switching Jaeger Distributed Tracing to ClickHouse to Enable A...
Altinity Ltd
•
72 views
OSA Con 2022 - Streaming Data Made Easy - Tim Spann & David Kjerrumgaard - St...
Altinity Ltd
•
12 views
OSA Con 2022 - State of Open Source Databases - Peter Zaitsev - Percona.pdf
Altinity Ltd
•
18 views
OSA Con 2022 - Specifics of data analysis in Time Series Databases - Roman Kh...
Altinity Ltd
•
38 views
Recently uploaded
UiPath Tips and Techniques for Debugging - Session 3
DianaGray10
67 views
•
9 slides
roomos_webinar_280923_v2.pptx
ThousandEyes
42 views
•
29 slides
Solving today’s Traffic Problems with Sustainable Ride Hailing Solution
On Demand Clone
51 views
•
9 slides
GDSC INFO SESSION 2023.pdf
Mustabshira
14 views
•
24 slides
grrcon-2023-scheduled-tasks.pdf
Brandon DeVault
16 views
•
29 slides
Lesson 1 - Algorithm and Flowcharting.pdf
ROWELL MARQUINA
68 views
•
32 slides
Recently uploaded
(20)
UiPath Tips and Techniques for Debugging - Session 3
DianaGray10
•
67 views
roomos_webinar_280923_v2.pptx
ThousandEyes
•
42 views
Solving today’s Traffic Problems with Sustainable Ride Hailing Solution
On Demand Clone
•
51 views
GDSC INFO SESSION 2023.pdf
Mustabshira
•
14 views
grrcon-2023-scheduled-tasks.pdf
Brandon DeVault
•
16 views
Lesson 1 - Algorithm and Flowcharting.pdf
ROWELL MARQUINA
•
68 views
Project Euler in Python
Tetsuo Koyama
•
31 views
9C Monthly Newsletter - SEPT 2023
PublishingTeam
•
276 views
Mule Meetup Calgary- API Governance & Conformance.pdf
NithaJoseph4
•
73 views
Regain Supply Chain Control
Converge
•
26 views
Cloud Study Jam ppt.pptx
Poorabpatel
•
44 views
From Project to Product - The Need for Speed
Cprime
•
14 views
Dev Dives: Mastering AI-powered Document Understanding
UiPathCommunity
•
1.1K views
AI Prompt Engineering
Jason J Pulikkottil
•
15 views
Cloud Native Application Development Guide – 2023
Lucy Zeniffer
•
10 views
WaveTech Investor Presentation
Dan Spottsville
•
13 views
Obsoleting Global Supply Chain Management
Converge
•
26 views
Deep Dive Microsoft Viva Insights - Collabdays Bletchley Park 2023
Chirag Patel
•
23 views
Product Research Presentation
DeahJadeArellano
•
42 views
THRIVING IN THE GEN AI ERA: NAVIGATING CHANGE IN TECH
Steve Poole
•
10 views
1027 predictive models in 10 seconds, by David Pardo Villaverde, Corunet
1.
1027 predictive models in
10 seconds
2.
Who? ● ● ● ● ●
3.
The problem ● ● ● ● ● ● ●
4.
The easy part. ● ● ● ● ●
5.
The easy part.
Weka
6.
250 million records?
You can solve that with a few indexes
7.
When you’ve got
a hammer... copy sales ("id","time","country"...) from 'd:tmpdata.csv' DELIMITER ',' CSV HEADER;
8.
We’ve got RAM,
let’s put it to use ● ● ● ● ● ... ● ¯_(ツ)_/¯
9.
clickwhat? deb http://repo.yandex.ru/clickhouse/deb/stable/main/ sudo apt-key
adv --keyserver keyserver.ubuntu.com --recv E0C56BD4 sudo apt-get update sudo apt-get install clickhouse-client clickhouse-server
10.
Importing CSV data
11.
2:37.82s elapsed
12.
You had my
curiosity Now you have my attention
13.
What?
15.
0.328s Out of the
box One node. No configuration
16.
How many models?
17.
Way too many.
Let’s reduce it a bit
18.
So, 1027 queries:
19.
Good enough. We
can work it out! ● ● ● ● ● It’s alive!
20.
Thank you?
21.
1027*713 = 732.251
rows ●
22.
The full query
23.
One million rows
24.
The results:
25.
The results:
26.
Conclusions ● ● ● ●
27.
Thank you!