Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

1027 predictive models in 10 seconds, by David Pardo Villaverde, Corunet

Presented at ClickHouse Meetup in Madrid,
April 2, 2019

  • Login to see the comments

1027 predictive models in 10 seconds, by David Pardo Villaverde, Corunet

  1. 1. 1027 predictive models in 10 seconds
  2. 2. Who? ● ● ● ● ●
  3. 3. The problem ● ● ● ● ● ● ●
  4. 4. The easy part. ● ● ● ● ●
  5. 5. The easy part. Weka
  6. 6. 250 million records? You can solve that with a few indexes
  7. 7. When you’ve got a hammer... copy sales ("id","time","country"...) from 'd:tmpdata.csv' DELIMITER ',' CSV HEADER;
  8. 8. We’ve got RAM, let’s put it to use ● ● ● ● ● ... ● ¯_(ツ)_/¯
  9. 9. clickwhat? deb http://repo.yandex.ru/clickhouse/deb/stable/main/ sudo apt-key adv --keyserver keyserver.ubuntu.com --recv E0C56BD4 sudo apt-get update sudo apt-get install clickhouse-client clickhouse-server
  10. 10. Importing CSV data
  11. 11. 2:37.82s elapsed
  12. 12. You had my curiosity Now you have my attention
  13. 13. What?
  14. 14. 0.328s Out of the box One node. No configuration
  15. 15. How many models?
  16. 16. Way too many. Let’s reduce it a bit
  17. 17. So, 1027 queries:
  18. 18. Good enough. We can work it out! ● ● ● ● ● It’s alive!
  19. 19. Thank you?
  20. 20. 1027*713 = 732.251 rows ●
  21. 21. The full query
  22. 22. One million rows
  23. 23. The results:
  24. 24. The results:
  25. 25. Conclusions ● ● ● ●
  26. 26. Thank you!

×