SlideShare a Scribd company logo
1 of 40
Download to read offline
How To Get Started With
Machine Learning
11th October 2017
Alexander Schaefer, Project A
What is ML
Computation
Data
Program
Results
Traditional programming
What is ML
Computation
Data
Program
Results
Traditional programming
What is ML
Computation
Data
Program
{Cat or Dog}
Traditional programming
Data
Program
What is ML
Computation
Data
Program
Traditional programming
{Cat or Dog}
Data
Program
What is ML
Computation
Dog
Program
Computation
Data
Program
Traditional programming
ML Approach
{Cat or Dog}
Data
Program
Data
Result
What is ML
Computation
Data
Cat
Program
Computation
Data
Program
Traditional programming
ML Approach
{Cat or Dog}
Data
Result
Data
Program
What is ML
▪ “Machine learning systems automatically learn
programs from data” Domingos, 2012
Computation
Data
Cat
Program
ML Approach
Data
Result
What is ML
▪ “Machine learning systems automatically learn
programs from data” Domingos, 2012
What is ML
▪ “Machine learning systems automatically learn
programs from data” Domingos, 2012
▪ Coursera
What is ML
▪ “Machine learning systems automatically learn
programs from data” Domingos, 2012
▪ Coursera
▪ Kaggle
Your Problem
▪ Some examples from Kaggle:
▪ Which driver will file insurance in next 12 months?
kaggle.com
Your Problem
▪ Some examples from Kaggle:
▪ Which driver will file insurance in next 12 months?
▪ Which content will be clicked by a user?
kaggle.com
Your Problem
▪ Some examples from Kaggle:
▪ Which driver will file insurance in next 12 months?
▪ Which content will be clicked by a user?
▪ Which article will be bought by a customer in the next 12 months?
kaggle.com
Data
▪ Data is key
Data
Computation
Dog
Program
ML Approach
Data
Result
▪ Data is key
▪ More data is better
Data
▪ Data is key
▪ More data is better
Data
Banko & Brill, 2001
Halevy et al., 2009
▪ Data is key
▪ More data is better
Data
deeplearning.ai
Data
▪ Data is key
▪ More data is better
▪ Better data is better
Data
▪ Data is key
▪ More data is better
▪ Better data is better
2009 2012
Data
▪ Data is key
▪ More data is better
▪ Better data is better
2005
Models and Evaluation
Computation
Dog
Program
ML Approach
Data
Result
Models and Evaluation
1. Come up with a baseline
▪ Naive approach
▪ Something you can do without ML
(mean, median)
Models and Evaluation
1. Come up with a baseline
▪ Naive approach
▪ Something you can do without ML
(mean, median)
2. Start with a simple model
▪ “Draw the line, yes this counts as
machine learning”
▪ Try to be better than the naive
approach
Models and Evaluation
1. Come up with a baseline
▪ Naive approach
▪ Something you can do without ML
(mean, median)
2. Start with a simple model
▪ “Draw the line, yes this counts as
machine learning”
▪ Try to be better than the naive
approach
3. Build a more complex model
▪ Beat the simple model
▪ Iterate
Models and Evaluation
1. Come up with a baseline
▪ Naive approach
▪ Something you can do without ML
(mean, median)
2. Start with a simple model
▪ “Draw the line, yes this counts as
machine learning”
▪ Try to be better than the naive
approach
3. Build a more complex model
▪ Beat the simple model
▪ Iterate
spotify.com
Models and Evaluation
1. Come up with a baseline
▪ Naive approach
▪ Something you can do without ML
(mean, median)
2. Start with a simple model
▪ “Draw the line, yes this counts as
machine learning”
▪ Try to be better than the naive
approach
3. Build a more complex model
▪ Beat the simple model
▪ Iterate
▪ Ideally stop when effort > gain
spotify.com
Models and Evaluation
Computation
Data
Dog
Program
ML Approach
Data
Result
Models and Evaluation
Computation
Data
Dog
Program
Computation
ML Approach
unseen
Data
Results
on unseen data
Data
Result
Models and Evaluation
Program
Computation
Results
on unseen data
▪ Generalization
QA and Validation
QA and Validation
QA and Validation
Program
Computation
Results
on unseen data
QA and Validation
Program
Computation
Results
on unseen data
Summary - How to get started with ML
Summary - How to get started with ML
▪ Get the data
▪ Simple model
▪ Evaluate
Thank you!
Contact
https://alexandschaefer.github.io/
https://twitter.com/alexandschaefer
Appendix

More Related Content

Similar to How To Get Started With Machine Learning

SOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHub
SOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHubSOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHub
SOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHub
DevOpsDays Tel Aviv
 

Similar to How To Get Started With Machine Learning (20)

A few questions about large scale machine learning
A few questions about large scale machine learningA few questions about large scale machine learning
A few questions about large scale machine learning
 
AWS Certified Machine Learning Specialty
AWS Certified Machine Learning Specialty AWS Certified Machine Learning Specialty
AWS Certified Machine Learning Specialty
 
AI in the Real World: Challenges, and Risks and how to handle them?
AI in the Real World: Challenges, and Risks and how to handle them?AI in the Real World: Challenges, and Risks and how to handle them?
AI in the Real World: Challenges, and Risks and how to handle them?
 
Operationalizing Machine Learning
Operationalizing Machine LearningOperationalizing Machine Learning
Operationalizing Machine Learning
 
SearchLove San Diego 2017 | Will Critchlow | Knowing Ranking Factors Won't Be...
SearchLove San Diego 2017 | Will Critchlow | Knowing Ranking Factors Won't Be...SearchLove San Diego 2017 | Will Critchlow | Knowing Ranking Factors Won't Be...
SearchLove San Diego 2017 | Will Critchlow | Knowing Ranking Factors Won't Be...
 
Hacking Predictive Modeling - RoadSec 2018
Hacking Predictive Modeling - RoadSec 2018Hacking Predictive Modeling - RoadSec 2018
Hacking Predictive Modeling - RoadSec 2018
 
Gaps in the algorithm
Gaps in the algorithmGaps in the algorithm
Gaps in the algorithm
 
DutchMLSchool. ML Business Perspective
DutchMLSchool. ML Business PerspectiveDutchMLSchool. ML Business Perspective
DutchMLSchool. ML Business Perspective
 
PyData SF 2016 --- Moving forward through the darkness
PyData SF 2016 --- Moving forward through the darknessPyData SF 2016 --- Moving forward through the darkness
PyData SF 2016 --- Moving forward through the darkness
 
Free and Open Machine Learning
Free and Open Machine LearningFree and Open Machine Learning
Free and Open Machine Learning
 
SOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHub
SOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHubSOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHub
SOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHub
 
Module 7: Unsupervised Learning
Module 7:  Unsupervised LearningModule 7:  Unsupervised Learning
Module 7: Unsupervised Learning
 
BSSML17 - Introduction, Models, Evaluations
BSSML17 - Introduction, Models, EvaluationsBSSML17 - Introduction, Models, Evaluations
BSSML17 - Introduction, Models, Evaluations
 
DutchMLSchool. Introduction to Machine Learning with the BigML Platform
DutchMLSchool. Introduction to Machine Learning with the BigML PlatformDutchMLSchool. Introduction to Machine Learning with the BigML Platform
DutchMLSchool. Introduction to Machine Learning with the BigML Platform
 
2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx
 
Choosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your needChoosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your need
 
How to be a Good Machine Learning PM by Google Product Manager
How to be a Good Machine Learning PM by Google Product ManagerHow to be a Good Machine Learning PM by Google Product Manager
How to be a Good Machine Learning PM by Google Product Manager
 
Machine Learning Interviews_ Lessons from Both Sides - FSDL.pptx
Machine Learning Interviews_ Lessons from Both Sides - FSDL.pptxMachine Learning Interviews_ Lessons from Both Sides - FSDL.pptx
Machine Learning Interviews_ Lessons from Both Sides - FSDL.pptx
 
Machine Learning for SEOs - SMXL
Machine Learning for SEOs - SMXLMachine Learning for SEOs - SMXL
Machine Learning for SEOs - SMXL
 
Day 1 wazz up ai
Day 1  wazz up aiDay 1  wazz up ai
Day 1 wazz up ai
 

Recently uploaded

Recently uploaded (20)

Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the past
 
WSO2Con2024 - Unleashing the Financial Potential of 13 Million People
WSO2Con2024 - Unleashing the Financial Potential of 13 Million PeopleWSO2Con2024 - Unleashing the Financial Potential of 13 Million People
WSO2Con2024 - Unleashing the Financial Potential of 13 Million People
 
Artyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptxArtyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptx
 
WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...
WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...
WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...
 
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open SourceWSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
 
WSO2Con2024 - Facilitating Broadband Switching Services for UK Telecoms Provi...
WSO2Con2024 - Facilitating Broadband Switching Services for UK Telecoms Provi...WSO2Con2024 - Facilitating Broadband Switching Services for UK Telecoms Provi...
WSO2Con2024 - Facilitating Broadband Switching Services for UK Telecoms Provi...
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?
 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
 
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
 
WSO2CON 2024 - Architecting AI in the Enterprise: APIs and Applications
WSO2CON 2024 - Architecting AI in the Enterprise: APIs and ApplicationsWSO2CON 2024 - Architecting AI in the Enterprise: APIs and Applications
WSO2CON 2024 - Architecting AI in the Enterprise: APIs and Applications
 
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
 
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...
 
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of TransformationWSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
WSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go Platformless
 
WSO2Con2024 - Organization Management: The Revolution in B2B CIAM
WSO2Con2024 - Organization Management: The Revolution in B2B CIAMWSO2Con2024 - Organization Management: The Revolution in B2B CIAM
WSO2Con2024 - Organization Management: The Revolution in B2B CIAM
 
WSO2CON 2024 - Building a Digital Government in Uganda
WSO2CON 2024 - Building a Digital Government in UgandaWSO2CON 2024 - Building a Digital Government in Uganda
WSO2CON 2024 - Building a Digital Government in Uganda
 
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto
 
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
 
WSO2Con2024 - Software Delivery in Hybrid Environments
WSO2Con2024 - Software Delivery in Hybrid EnvironmentsWSO2Con2024 - Software Delivery in Hybrid Environments
WSO2Con2024 - Software Delivery in Hybrid Environments
 

How To Get Started With Machine Learning