Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

How to go into production your machine learning models? #CWT2017

1,971 views

Published on

機械学習システムのデプロイパターンとそのHadoop/Sparkエコシステムのアーキテクチャ例を説明します。

Published in: Engineering
  • Be the first to comment

How to go into production your machine learning models? #CWT2017

  1. 1. 1© Cloudera, Inc. All rights reserved. 機械学習システムのデプロイパターン Aki Ariga | Field Data Scientist
  2. 2. 2© Cloudera, Inc. All rights reserved. • (Twitter/Github @chezou) • Field Data Scientist @ Cloudera • • NLP/ / • Rails • •
  3. 3. 3© Cloudera, Inc. All rights reserved.
  4. 4. 4© Cloudera, Inc. All rights reserved. 4 Artificial Intelligence ( ) Machine Learning by Splintax CC BY-SA 3.0 https://commons.wikimedia.org/wiki/File:IF- THEN-ELSE-END_flowchart.png Deep Learning Good to read: https://blogs.nvidia.com/blog/2016/07/29/whats-difference-artificial-intelligence-machine-learning-deep- learning-ai/
  5. 5. 5© Cloudera, Inc. All rights reserved. ● ○ e.g.) , ● ○ e.g.) ● ○ e.g.) ● ○ e.g.) ● ○ e.g.) Amazon, Netflix ● ○ e.g.) Alpha-Go, ● etc…
  6. 6. 6© Cloudera, Inc. All rights reserved. Non-Spam Spam
  7. 7. 7© Cloudera, Inc. All rights reserved. [0, 1, 0, 2.5, 0, -1, ...] [1, 0.5, 0.1, -2, 3, 2, ...] [1, 0, 1.0, 1.1, 0, 0, ...] Logistic Regression, SVM, Random Forest, NN... w1=1, w2=- 1, w3=0 ... )
  8. 8. 8© Cloudera, Inc. All rights reserved. From data to exploration to action Data Engineering Data Science (Exploratory) Production (Operational) Data Models Business ValuePredictions
  9. 9. 9© Cloudera, Inc. All rights reserved.
  10. 10. 10© Cloudera, Inc. All rights reserved. Reports Dashboards Scoring
  11. 11. 11© Cloudera, Inc. All rights reserved.
  12. 12. 12© Cloudera, Inc. All rights reserved. 1. REST API 2. DB 3. 4.
  13. 13. 13© Cloudera, Inc. All rights reserved. Web DB / API REST API User ID/ Item ID Microservices architecture: Web ML REST (or gRPC) API ML ML
  14. 14. 14© Cloudera, Inc. All rights reserved. DB Web API REST API User ID/ Item ID DB / ML
  15. 15. 15© Cloudera, Inc. All rights reserved. REST API Web DB / API REST API User ID/ Item ID ML
  16. 16. 16© Cloudera, Inc. All rights reserved.
  17. 17. 17© Cloudera, Inc. All rights reserved. & / & PMML export Model building layer Predicting & serving layer CDSW HDFS
  18. 18. 18© Cloudera, Inc. All rights reserved. & / & Model building layerPredicting & serving layer HDFS Docker image CDSW
  19. 19. 19© Cloudera, Inc. All rights reserved. Demo: https://github.com/chezou/cdsw-serve-docker
  20. 20. 20© Cloudera, Inc. All rights reserved. CDSW Amazon ECS Application Load Balancer Amazon S3 Docker HUBDocker image Source code Trained model Prediction request
  21. 21. 21© Cloudera, Inc. All rights reserved. Cloudera Data Science Workbench(CDSW) エンタープライズのためのセルフサービスデータサイエンス基盤 - GPU - fork - Docker - Spark -
  22. 22. 22© Cloudera, Inc. All rights reserved. ● Pros ○ Web ML ■ ■ ■ ○ ○ ○ A/B ○ ● Cons ○ API ○
  23. 23. 23© Cloudera, Inc. All rights reserved. Web DB ML / DB : Web ML DB
  24. 24. 24© Cloudera, Inc. All rights reserved. DB / Web DB ML /
  25. 25. 25© Cloudera, Inc. All rights reserved. DB / DB Web / DB ML
  26. 26. 26© Cloudera, Inc. All rights reserved. DB Web ML / DB
  27. 27. 27© Cloudera, Inc. All rights reserved. Kudu/HBase / & Model building & predicting layerServing layer HDFS CDSW
  28. 28. 28© Cloudera, Inc. All rights reserved. ● Pros ○ Web ML ■ ■ ■ ○ ○ : ○ ○ ● Cons ○ Web ○
  29. 29. 29© Cloudera, Inc. All rights reserved. Web ML ( : Spark Streaming) & - - - Kafka MQ (:Kafka)
  30. 30. 30© Cloudera, Inc. All rights reserved.
  31. 31. 31© Cloudera, Inc. All rights reserved. ● Pros ○ ○ ○ ○ ● Cons ○ ■
  32. 32. 32© Cloudera, Inc. All rights reserved. DB ML / / export/ DB &
  33. 33. 33© Cloudera, Inc. All rights reserved. & / & /export Model building layer Predicting & serving layer HDFS CDSW
  34. 34. 34© Cloudera, Inc. All rights reserved.
  35. 35. 35© Cloudera, Inc. All rights reserved. ● ○ CDSW Job ● ○ NA ( ) ● ○ 1 ■ PMML export OpenScoring API ■ API Docker ■ API ○ 2 ■ CDSW HBase/Kudu/RDB ○ 3 ■ ○ 4 ■ TensorFlow export/CoreML
  36. 36. 36© Cloudera, Inc. All rights reserved. Thank you Aki Ariga @chezou ariga@cloudera.com

×