6. • Support our various services from the view point “Data”
• High level analysis for the services.
• Machine learning model development.
• Hadoop clusters as a Data Lake storing huge data.
What is Data Labs?
11. • Build a model to predict something.
• Learn a new algorithm to improve accuracy.
• Read papers to get more knowledge.
• Understand what happens inside of the machine learning model.
• And so on…
What is necessary for machine learning?
Machine learning lifecycle
12. • Build a model to predict something.
• Learn a new algorithm to improve accuracy.
• Read papers to get more knowledge.
• Understand what happens inside of the machine learning model.
• And so on…
What is necessary for machine learning?
ALL STEPS ARE IMPORTANT
Machine learning lifecycle
13. • Build a model to predict something.
• Learn a new algorithm to improve accuracy.
• Read papers to get more knowledge.
• Understand what happens inside of the machine learning model.
• And so on…
What is necessary for machine learning?
However…
Machine learning lifecycle
15. • Collect data used for model training.
• Prepare a datastore to store the collected data.
• Do pre-processing such as cleaning, normalization, completion, and so on.
• Create a training environment (Distributed? Single?)
• Expose our prediction model as an API.
• Build an infrastructure to serve APIs.
• Monitor accuracy online and offline regularly.
• And others…
Machine learning lifecycle
There are so many other things we have to do.
18. Data
Collection
Data
Storing
Pre-
Processing
API
Development
Deployment
Accuracy
Monitoring
Feedback
• Everyone tends to learn these 2 parts.
• But other parts are still important.
• In addition…
Model
Training
Parameter
Tuning
Basic Flow
Prepare a storage
Code review
Test
Prepare a
dashboard
CI
Prepare servers
Data
Collection
Data
Storing
Pre-
Processing
API
Development
Deployment
Accuracy
Monitoring
Feedback
Basic Flow
Feedbac
k
Basic Flow
Data
Collection
Data
Storing
Pre-
Processing
API
Development
Deployment
Accuracy
Monitoring
Parameter
Tuning
Model
Training
Machine learning lifecycle
19. Data
Collection
Data
Storing
Pre-
Processing
API
Development
Deployment
Accuracy
Monitoring
Feedback
• Everyone tends to learn these 2 parts.
• But other parts are still important.
• In addition…
Model
Training
Parameter
Tuning
Basic Flow
• We need to iterate this process as a
lifecycle continuously.
Data
Collection
Data
Storing
Pre-
Processing
API
Development
Deployment
Accuracy
Monitoring
Feedback
Model
Training
Parameter
Tuning
Basic Flow
Prepare a storage
Code review
Test
Prepare a
dashboard
CI
Prepare servers
Data
Collection
Data
Storing
Pre-
Processing
API
Development
Deployment
Accuracy
Monitoring
Feedback
Basic Flow
Feedbac
k
Basic Flow
Data
Collection
Data
Storing
Pre-
Processing
API
Development
Deployment
Accuracy
Monitoring
Parameter
Tuning
Model
Training
Machine learning lifecycle
21. As you may know.
To achieve rapid continuous development in machine learning,
we need an environment for machine learning.
Platform for machine learning
33. • Of course, we need knowledge about training algorithms,
parameter tuning, and so on.
• In addition, the platform for machine learning is also important.
• It makes our development very rapid, robust, and easy.
What is necessary for machine learning?
Summary