SlideShare a Scribd company logo
@louisdorard
#papisconnect
PredictiveAPIs
Student Researcher Data Scientist Developer Non-technical
Machine Learning
Use cases
Limitations
Predictive APIs
Does it work?
Case study
ML Canvas
–Mike Gualtieri, Principal Analyst at Forrester
“Predictive apps are
the next big thing
in app development.”
–Waqar Hasan, VISA
“Predictive is the ‘killer app’ for big
data.”
1. Machine Learning
2. Data
BUT
–McKinsey & Co. (2011)
“A significant constraint on
realizing value from big data will
be a shortage of talent,
particularly of people with deep
expertise in statistics and machine
learning.”
Demystifying

Machine Learning
“Which type of email is this?
— Spam/Ham”


“Which type of email is this?
— Spam/Ham”


Classification
I
O
“Which type of email is this?
— Spam/Ham”


??
“How much is this house worth?
— X $”


-> Regression
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)
3 1 860 1950 house 565,000
3 1 1012 1951 house
2 1.5 968 1976 townhouse 447,000
4 1315 1950 house 648,000
3 2 1599 1964 house
3 2 987 1951 townhouse 790,000
1 1 530 2007 condo 122,000
4 2 1574 1964 house 835,000
4 2001 house 855,000
3 2.5 1472 2005 house
4 3.5 1714 2005 townhouse
2 2 1113 1999 condo
1 769 1999 condo 315,000
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)
3 1 860 1950 house 565,000
3 1 1012 1951 house
2 1.5 968 1976 townhouse 447,000
4 1315 1950 house 648,000
3 2 1599 1964 house
3 2 987 1951 townhouse 790,000
1 1 530 2007 condo 122,000
4 2 1574 1964 house 835,000
4 2001 house 855,000
3 2.5 1472 2005 house
4 3.5 1714 2005 townhouse
2 2 1113 1999 condo
1 769 1999 condo 315,000
ML is a set of AI techniques where
“intelligence” is built by referring to
examples
Use cases
• Real-estate
• Spam
• Priority inbox
• Crowd prediction
property price
email spam indicator
email importance indicator
location & context #people
Zillow
Gmail
Gmail
Tranquilien
I. Get more customers
• Reduce churn
• Score leads
• Optimize campaigns
customer churn indicator
customer revenue
customer & campaign interest indicator
II. Serve customers better
• Cross-sell
• Increase engagement
• Optimize pricing
customer & product purchase indicator
user & item interest indicator
product & price #sales
III. Serve customers more efficiently
• Predict demand
• Automate tasks
• Use predictive enterprise apps
context demand
credit application repayment indicator
Predictive enterprise apps
• Priority filtering
• Message routing
• Auto-configuration
message priority indicator
request employee
user & actions settings
RULES
–Katherine Barr, Partner at VC-firm MDV
"Pairing human workers with
machine learning and automation
will transform knowledge work
and unleash new levels of human
productivity and creativity."
Limitations
Need examples of inputs AND outputs
What if not enough data points?
What if similar inputs have dissimilar outputs?
Bedrooms Bathrooms Price ($)
3 2 500,000
3 2 800,000
1 1 300,000
1 1 800,000
Bedrooms Bathrooms Surface (foot²) Year built Price ($)
3 2 800 1950 500,000
3 2 1000 1950 800,000
1 1 500 1950 300,000
1 1 500 2014 800,000
–@louisdorard
“A model can only be as good as
the data it was given to train on”
Predictive APIs:

ML for all
HTML / CSS / JavaScript
HTML / CSS / JavaScript
squarespace.com
The two phases of machine learning:
• TRAIN a model
• PREDICT with a model
The two methods of predictive APIs:
• TRAIN a model
• PREDICT with a model
The two methods of predictive APIs:
• model = create_model(dataset)
• predicted_output =
create_prediction(model, new_input)
The two methods of predictive APIs:
• model = create_model(‘training.csv’)
• predicted_output =
create_prediction(model, new_input)
“Is this email important?
— Yes/No”
“Is this customer going to leave next month?
— Yes/No”
“What is the sentiment of this tweet?
— Positive/Neutral/Negative”
The two phases of machine learning:
• TRAIN a model
• PREDICT with a model
The two phases of machine learning:
• TRAIN a model
• PREDICT with an already existing model
“Is this email spam?
— Yes/No”
Does it work?
How well
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)
3 1 860 1950 house 565,000
3 1 1012 1951 house
2 1.5 968 1976 townhouse 447,000
4 1315 1950 house 648,000
3 2 1599 1964 house
3 2 987 1951 townhouse 790,000
1 1 530 2007 condo 122,000
4 2 1574 1964 house 835,000
4 2001 house 855,000
3 2.5 1472 2005 house
4 3.5 1714 2005 townhouse
2 2 1113 1999 condo
1 769 1999 condo 315,000
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)
3 1 860 1950 house 565,000
2 1.5 968 1976 townhouse 447,000
4 1315 1950 house 648,000
3 2 987 1951 townhouse 790,000
1 1 530 2007 condo 122,000
4 2 1574 1964 house 835,000
4 2001 house 855,000
1 769 1999 condo 315,000
3 1 1012 1951 house
3 2 1599 1964 house
3 2.5 1472 2005 house
4 3.5 1714 2005 townhouse
2 2 1113 1999 condo
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)
3 1 860 1950 house 565,000
2 1.5 968 1976 townhouse 447,000
4 1315 1950 house 648,000
3 2 987 1951 townhouse 790,000
1 1 530 2007 condo 122,000
4 2 1574 1964 house 835,000
4 2001 house 855,000
1 769 1999 condo 315,000
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)
2 1.5 968 1976 townhouse 447,000
3 1 860 1950 house 565,000
1 769 1999 condo 315,000
4 1315 1950 house 648,000
4 2 1574 1964 house 835,000
3 2 987 1951 townhouse 790,000
4 2001 house 855,000
1 1 530 2007 condo 122,000
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)
4 2 1574 1964 house 835,000
3 2 987 1951 townhouse 790,000
4 2001 house 855,000
1 1 530 2007 condo 122,000
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)
2 1.5 968 1976 townhouse 447,000
3 1 860 1950 house 565,000
1 769 1999 condo 315,000
4 1315 1950 house 648,000
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)
2 1.5 968 1976 townhouse 447,000
3 1 860 1950 house 565,000
1 769 1999 condo 315,000
4 1315 1950 house 648,000
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) Price ($)
4 2 1574 1964 house 835,000 835,000
3 2 987 1951 townhouse 790,000 790,000
4 2001 house 855,000 855,000
1 1 530 2007 condo 122,000 122,000
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)
2 1.5 968 1976 townhouse 447,000
3 1 860 1950 house 565,000
1 769 1999 condo 315,000
4 1315 1950 house 648,000
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) Price ($)
4 2 1574 1964 house 835,000
3 2 987 1951 townhouse 790,000
4 2001 house 855,000
1 1 530 2007 condo 122,000
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)
2 1.5 968 1976 townhouse 447,000
3 1 860 1950 house 565,000
1 769 1999 condo 315,000
4 1315 1950 house 648,000
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) Price ($)
4 2 1574 1964 house 835,000
3 2 987 1951 townhouse 790,000
4 2001 house 855,000
1 1 530 2007 condo 122,000
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)
2 1.5 968 1976 townhouse 447,000
3 1 860 1950 house 565,000
1 769 1999 condo 315,000
4 1315 1950 house 648,000
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) Price ($)
4 2 1574 1964 house 818,000 835,000
3 2 987 1951 townhouse 800,000 790,000
4 2001 house 915,000 855,000
1 1 530 2007 condo 100,000 122,000
Price ($) Price ($)
818,000 835,000
800,000 790,000
915,000 855,000
100,000 122,000
Need real-time machine learning?
The two phases of machine learning:
• TRAIN a model
• PREDICT with a model
• Training time
• Prediction time
• Accuracy
Case study:

churn analysis
• Who: SaaS company selling monthly subscription
• Question asked:“Is this customer going to leave
within 1 month?”
• Input: customer
• Output: no-churn (negative) or churn (positive)
• Data collection: history up until 1 month ago
• Baseline: if no usage for more than 15 days then
churn
Learning: OK
but
• How to represent customers?
• What to do after predicting churn?
Customer representation:
• basic info (age, income, etc.)
• usage of service (# times used app, avg time spent,
features used, etc.)
• interactions with customer support (how many,
topics of questions, satisfaction ratings)
Taking action to prevent churn:
• contact customers (in which order?)
• switch to different plan
• give special offer
• no action?
Measuring accuracy:
• #TP (we predict customer churns and he does)
• #FP (we predict customer churns but he doesn’t)
• #FN (we predict customer doesn’t churn but he does)
• Compare to baseline
Estimating Return On Investment:
• Taking action for #TP and #FP customers has a cost
• We earn #TP * success rate * revenue /cust. /month
• Compare to baseline
Machine Learning
Canvas
PREDICTIONS OBJECTIVES DATA
Context
Who will use the predictive system / who will be
affected by it? Provide some background.
Value Proposition
What are we trying to do? E.g. spend less time on
X, increase Y...
Data Sources
Where do/can we get data from? (internal
database, 3rd party API, etc.)
Problem
Question to predict answers to (in plain English)
Input (i.e. question "parameter")
Possible outputs (i.e. "answers")
Type of problem (e.g. classification, regression,
recommendation...)
Baseline
What is an alternative way of making predictions
(e.g. manual rules based on feature values)?
Performance evaluation
Domain-specific / bottom-line metrics for
monitoring performance in production
Prediction accuracy metrics (e.g. MSE if
regression; % accuracy, #FP for classification)
Offline performance evaluation method (e.g.
cross-validation or simple training/test split)
Dataset
How do we collect data (inputs and outputs)?
How many data points?
Features
Used to represent inputs and extracted from
data sources above. Group by types and
mention key features if too many to list all.
Using predictions
When do we make predictions and how many?
What is the time constraint for making those predictions?
How do we use predictions and confidence values?
Learning predictive models
When do we create/update models? With which data / how much?
What is the time constraint for creating a model?
Criteria for deploying model (e.g. minimum performance value — absolute,
relative to baseline or to previous model)
IDEASPECSDEPLOYMENT
BACKGROUND
ENGINE SPECS
INTEGRATION
PREDICTIONS OBJECTIVES DATA
BACKGROUND
ENGINE SPECS
INTEGRATION
PREDICTIONS OBJECTIVES DATA
BACKGROUND End-user Value prop Sources
ENGINE SPECS ML problem Perf eval Preparation
INTEGRATION Using pred Learning modelINTEGRATION Using pred Learning model
Why fill in ML canvas?
• Target the right problem for your company
• Choose right algorithm, infrastructure, or ML
solution
• Guide project management
• Improve team communication
machinelearningcanvas.com
Recap
• Need examples of inputs AND outputs
• Need enough examples
• ML to create value from data
• 2 phases: TRAIN and PREDICT
• Predictive APIs make it more accessible
• Good data is essential
• What do we do with predictions?
• Measure performance with accuracy, time and
bottom-line
• Also: deploy, maintain, improve…
louisdorard.com

More Related Content

Viewers also liked

Simple machine learning for the masses - Konstantin Davydov
Simple machine learning for the masses - Konstantin DavydovSimple machine learning for the masses - Konstantin Davydov
Simple machine learning for the masses - Konstantin Davydov
PAPIs.io
 
Leven Court
Leven CourtLeven Court
Leven Courtsolidred
 
Yourprezi
YourpreziYourprezi
Yourprezi
karenfermon
 
ประชาสัมพันธ์ส่งผลงานวิจัย
ประชาสัมพันธ์ส่งผลงานวิจัยประชาสัมพันธ์ส่งผลงานวิจัย
ประชาสัมพันธ์ส่งผลงานวิจัยPatchanida Yadawong
 
The African Skills Mobilization Program_Concept Paper
The African Skills Mobilization Program_Concept PaperThe African Skills Mobilization Program_Concept Paper
The African Skills Mobilization Program_Concept PaperVictor Kyerematen
 
Secep#1 05-11-2 nd
Secep#1 05-11-2 ndSecep#1 05-11-2 nd
MENTOR Quick Overview
MENTOR Quick OverviewMENTOR Quick Overview
MENTOR Quick OverviewCandace1953
 
Self introduction
Self introductionSelf introduction
Self introduction
Yunyun (Roy) Luo
 
The cost of digital newspapers (23-06-08)
The cost of digital newspapers   (23-06-08)The cost of digital newspapers   (23-06-08)
The cost of digital newspapers (23-06-08)
Richard Davies
 
S10200622 w5 p3_lesson plan_22 apr 13
S10200622 w5 p3_lesson plan_22 apr 13S10200622 w5 p3_lesson plan_22 apr 13
S10200622 w5 p3_lesson plan_22 apr 13bchszeto
 
Networking Online using Linkedin
Networking Online using LinkedinNetworking Online using Linkedin
Networking Online using LinkedinAmar Trivedi
 

Viewers also liked (15)

Simple machine learning for the masses - Konstantin Davydov
Simple machine learning for the masses - Konstantin DavydovSimple machine learning for the masses - Konstantin Davydov
Simple machine learning for the masses - Konstantin Davydov
 
Leven Court
Leven CourtLeven Court
Leven Court
 
Yourprezi
YourpreziYourprezi
Yourprezi
 
Archivo012
Archivo012Archivo012
Archivo012
 
ประชาสัมพันธ์ส่งผลงานวิจัย
ประชาสัมพันธ์ส่งผลงานวิจัยประชาสัมพันธ์ส่งผลงานวิจัย
ประชาสัมพันธ์ส่งผลงานวิจัย
 
The African Skills Mobilization Program_Concept Paper
The African Skills Mobilization Program_Concept PaperThe African Skills Mobilization Program_Concept Paper
The African Skills Mobilization Program_Concept Paper
 
Secep#1 05-11-2 nd
Secep#1 05-11-2 ndSecep#1 05-11-2 nd
Secep#1 05-11-2 nd
 
MENTOR Quick Overview
MENTOR Quick OverviewMENTOR Quick Overview
MENTOR Quick Overview
 
E Homes
E HomesE Homes
E Homes
 
Self introduction
Self introductionSelf introduction
Self introduction
 
The cost of digital newspapers (23-06-08)
The cost of digital newspapers   (23-06-08)The cost of digital newspapers   (23-06-08)
The cost of digital newspapers (23-06-08)
 
S10200622 w5 p3_lesson plan_22 apr 13
S10200622 w5 p3_lesson plan_22 apr 13S10200622 w5 p3_lesson plan_22 apr 13
S10200622 w5 p3_lesson plan_22 apr 13
 
Networking Online using Linkedin
Networking Online using LinkedinNetworking Online using Linkedin
Networking Online using Linkedin
 
October 17
October 17October 17
October 17
 
ใบงานที่2..
ใบงานที่2..ใบงานที่2..
ใบงานที่2..
 

Similar to Demystifying Machine Learning

Data Summit Brussels: Introduction
Data Summit Brussels: IntroductionData Summit Brussels: Introduction
Data Summit Brussels: IntroductionLouis Dorard
 
From data to AI with the Machine Learning Canvas by Louis Dorard Slides
From data to AI with the Machine Learning Canvas by Louis  Dorard SlidesFrom data to AI with the Machine Learning Canvas by Louis  Dorard Slides
From data to AI with the Machine Learning Canvas by Louis Dorard Slides
Big Data Spain
 
Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?
Maxim Salnikov
 
Why And How To Leverage Predictive APIs In Any Application
Why And How To Leverage Predictive APIs In Any Application Why And How To Leverage Predictive APIs In Any Application
Why And How To Leverage Predictive APIs In Any Application
ProgrammableWeb
 
From Data to AI with the Machine Learning Canvas
From Data to AI with the Machine Learning CanvasFrom Data to AI with the Machine Learning Canvas
From Data to AI with the Machine Learning Canvas
Louis Dorard
 
Predictive APIs at APIdays Berlin
Predictive APIs at APIdays BerlinPredictive APIs at APIdays Berlin
Predictive APIs at APIdays Berlin
Louis Dorard
 
Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?
Maxim Salnikov
 
Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?
Maxim Salnikov
 
What Is Religion And Its Role In Society Free Essay Ex
What Is Religion And Its Role In Society Free Essay ExWhat Is Religion And Its Role In Society Free Essay Ex
What Is Religion And Its Role In Society Free Essay Ex
Aparna Harrison
 
3rd annual Marcus & Millichap / IPA West and Central Florida Multifamily Forum
3rd annual Marcus & Millichap / IPA West and Central Florida Multifamily Forum3rd annual Marcus & Millichap / IPA West and Central Florida Multifamily Forum
3rd annual Marcus & Millichap / IPA West and Central Florida Multifamily Forum
Ryan Slack
 
A business level introduction to Artificial Intelligence - Louis Dorard @ PAP...
A business level introduction to Artificial Intelligence - Louis Dorard @ PAP...A business level introduction to Artificial Intelligence - Louis Dorard @ PAP...
A business level introduction to Artificial Intelligence - Louis Dorard @ PAP...
PAPIs.io
 
Tuskegee Airmen Essay Thesis. Online assignment writing service.
Tuskegee Airmen Essay Thesis. Online assignment writing service.Tuskegee Airmen Essay Thesis. Online assignment writing service.
Tuskegee Airmen Essay Thesis. Online assignment writing service.
April Eide
 
BSSML17 - Introduction, Models, Evaluations
BSSML17 - Introduction, Models, EvaluationsBSSML17 - Introduction, Models, Evaluations
BSSML17 - Introduction, Models, Evaluations
BigML, Inc
 
Multi Family Design & Development in LA
Multi Family Design & Development in LAMulti Family Design & Development in LA
Multi Family Design & Development in LA
garrjacobs
 
Design-led Approach to Big Data
Design-led Approach to Big Data Design-led Approach to Big Data
Design-led Approach to Big Data
ChrysSullivan
 
Rethinking residential real estate: 2023 and beyond
Rethinking residential real estate: 2023 and beyondRethinking residential real estate: 2023 and beyond
Rethinking residential real estate: 2023 and beyond
AppFolio
 
HouseCanary - PCBC Presentation
HouseCanary - PCBC PresentationHouseCanary - PCBC Presentation
HouseCanary - PCBC Presentation
HouseCanary
 
Corporate finance
Corporate financeCorporate finance
Corporate finance
Ekrem Tufan
 
Kibana_Data_analyst_7.1.0.pdf
Kibana_Data_analyst_7.1.0.pdfKibana_Data_analyst_7.1.0.pdf
Kibana_Data_analyst_7.1.0.pdf
AlexandreGiordanelli1
 
Solution presentation demonstation template
Solution presentation demonstation templateSolution presentation demonstation template
Solution presentation demonstation template
bptowers
 

Similar to Demystifying Machine Learning (20)

Data Summit Brussels: Introduction
Data Summit Brussels: IntroductionData Summit Brussels: Introduction
Data Summit Brussels: Introduction
 
From data to AI with the Machine Learning Canvas by Louis Dorard Slides
From data to AI with the Machine Learning Canvas by Louis  Dorard SlidesFrom data to AI with the Machine Learning Canvas by Louis  Dorard Slides
From data to AI with the Machine Learning Canvas by Louis Dorard Slides
 
Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?
 
Why And How To Leverage Predictive APIs In Any Application
Why And How To Leverage Predictive APIs In Any Application Why And How To Leverage Predictive APIs In Any Application
Why And How To Leverage Predictive APIs In Any Application
 
From Data to AI with the Machine Learning Canvas
From Data to AI with the Machine Learning CanvasFrom Data to AI with the Machine Learning Canvas
From Data to AI with the Machine Learning Canvas
 
Predictive APIs at APIdays Berlin
Predictive APIs at APIdays BerlinPredictive APIs at APIdays Berlin
Predictive APIs at APIdays Berlin
 
Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?
 
Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?
 
What Is Religion And Its Role In Society Free Essay Ex
What Is Religion And Its Role In Society Free Essay ExWhat Is Religion And Its Role In Society Free Essay Ex
What Is Religion And Its Role In Society Free Essay Ex
 
3rd annual Marcus & Millichap / IPA West and Central Florida Multifamily Forum
3rd annual Marcus & Millichap / IPA West and Central Florida Multifamily Forum3rd annual Marcus & Millichap / IPA West and Central Florida Multifamily Forum
3rd annual Marcus & Millichap / IPA West and Central Florida Multifamily Forum
 
A business level introduction to Artificial Intelligence - Louis Dorard @ PAP...
A business level introduction to Artificial Intelligence - Louis Dorard @ PAP...A business level introduction to Artificial Intelligence - Louis Dorard @ PAP...
A business level introduction to Artificial Intelligence - Louis Dorard @ PAP...
 
Tuskegee Airmen Essay Thesis. Online assignment writing service.
Tuskegee Airmen Essay Thesis. Online assignment writing service.Tuskegee Airmen Essay Thesis. Online assignment writing service.
Tuskegee Airmen Essay Thesis. Online assignment writing service.
 
BSSML17 - Introduction, Models, Evaluations
BSSML17 - Introduction, Models, EvaluationsBSSML17 - Introduction, Models, Evaluations
BSSML17 - Introduction, Models, Evaluations
 
Multi Family Design & Development in LA
Multi Family Design & Development in LAMulti Family Design & Development in LA
Multi Family Design & Development in LA
 
Design-led Approach to Big Data
Design-led Approach to Big Data Design-led Approach to Big Data
Design-led Approach to Big Data
 
Rethinking residential real estate: 2023 and beyond
Rethinking residential real estate: 2023 and beyondRethinking residential real estate: 2023 and beyond
Rethinking residential real estate: 2023 and beyond
 
HouseCanary - PCBC Presentation
HouseCanary - PCBC PresentationHouseCanary - PCBC Presentation
HouseCanary - PCBC Presentation
 
Corporate finance
Corporate financeCorporate finance
Corporate finance
 
Kibana_Data_analyst_7.1.0.pdf
Kibana_Data_analyst_7.1.0.pdfKibana_Data_analyst_7.1.0.pdf
Kibana_Data_analyst_7.1.0.pdf
 
Solution presentation demonstation template
Solution presentation demonstation templateSolution presentation demonstation template
Solution presentation demonstation template
 

Recently uploaded

FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
ThomasParaiso2
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
Peter Spielvogel
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 

Recently uploaded (20)

FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 

Demystifying Machine Learning

  • 2.
  • 4.
  • 5. Student Researcher Data Scientist Developer Non-technical
  • 6.
  • 7.
  • 8.
  • 9. Machine Learning Use cases Limitations Predictive APIs Does it work? Case study ML Canvas
  • 10. –Mike Gualtieri, Principal Analyst at Forrester “Predictive apps are the next big thing in app development.”
  • 11. –Waqar Hasan, VISA “Predictive is the ‘killer app’ for big data.”
  • 13. BUT
  • 14. –McKinsey & Co. (2011) “A significant constraint on realizing value from big data will be a shortage of talent, particularly of people with deep expertise in statistics and machine learning.”
  • 16.
  • 17. “Which type of email is this? — Spam/Ham” 

  • 18. “Which type of email is this? — Spam/Ham” 
 Classification
  • 19. I O “Which type of email is this? — Spam/Ham” 

  • 20. ??
  • 21.
  • 22. “How much is this house worth? — X $” 
 -> Regression
  • 23. Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) 3 1 860 1950 house 565,000 3 1 1012 1951 house 2 1.5 968 1976 townhouse 447,000 4 1315 1950 house 648,000 3 2 1599 1964 house 3 2 987 1951 townhouse 790,000 1 1 530 2007 condo 122,000 4 2 1574 1964 house 835,000 4 2001 house 855,000 3 2.5 1472 2005 house 4 3.5 1714 2005 townhouse 2 2 1113 1999 condo 1 769 1999 condo 315,000
  • 24. Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) 3 1 860 1950 house 565,000 3 1 1012 1951 house 2 1.5 968 1976 townhouse 447,000 4 1315 1950 house 648,000 3 2 1599 1964 house 3 2 987 1951 townhouse 790,000 1 1 530 2007 condo 122,000 4 2 1574 1964 house 835,000 4 2001 house 855,000 3 2.5 1472 2005 house 4 3.5 1714 2005 townhouse 2 2 1113 1999 condo 1 769 1999 condo 315,000
  • 25.
  • 26. ML is a set of AI techniques where “intelligence” is built by referring to examples
  • 27.
  • 29. • Real-estate • Spam • Priority inbox • Crowd prediction property price email spam indicator email importance indicator location & context #people Zillow Gmail Gmail Tranquilien
  • 30. I. Get more customers • Reduce churn • Score leads • Optimize campaigns customer churn indicator customer revenue customer & campaign interest indicator
  • 31. II. Serve customers better • Cross-sell • Increase engagement • Optimize pricing customer & product purchase indicator user & item interest indicator product & price #sales
  • 32. III. Serve customers more efficiently • Predict demand • Automate tasks • Use predictive enterprise apps context demand credit application repayment indicator
  • 33. Predictive enterprise apps • Priority filtering • Message routing • Auto-configuration message priority indicator request employee user & actions settings RULES
  • 34. –Katherine Barr, Partner at VC-firm MDV "Pairing human workers with machine learning and automation will transform knowledge work and unleash new levels of human productivity and creativity."
  • 36.
  • 37.
  • 38.
  • 39.
  • 40. Need examples of inputs AND outputs
  • 41.
  • 42. What if not enough data points?
  • 43.
  • 44. What if similar inputs have dissimilar outputs?
  • 45.
  • 46. Bedrooms Bathrooms Price ($) 3 2 500,000 3 2 800,000 1 1 300,000 1 1 800,000
  • 47. Bedrooms Bathrooms Surface (foot²) Year built Price ($) 3 2 800 1950 500,000 3 2 1000 1950 800,000 1 1 500 1950 300,000 1 1 500 2014 800,000
  • 48. –@louisdorard “A model can only be as good as the data it was given to train on”
  • 50.
  • 51. HTML / CSS / JavaScript
  • 52. HTML / CSS / JavaScript
  • 54.
  • 55.
  • 56. The two phases of machine learning: • TRAIN a model • PREDICT with a model
  • 57. The two methods of predictive APIs: • TRAIN a model • PREDICT with a model
  • 58. The two methods of predictive APIs: • model = create_model(dataset) • predicted_output = create_prediction(model, new_input)
  • 59. The two methods of predictive APIs: • model = create_model(‘training.csv’) • predicted_output = create_prediction(model, new_input)
  • 60.
  • 61.
  • 62. “Is this email important? — Yes/No”
  • 63. “Is this customer going to leave next month? — Yes/No”
  • 64. “What is the sentiment of this tweet? — Positive/Neutral/Negative”
  • 65. The two phases of machine learning: • TRAIN a model • PREDICT with a model
  • 66. The two phases of machine learning: • TRAIN a model • PREDICT with an already existing model
  • 67. “Is this email spam? — Yes/No”
  • 69. Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) 3 1 860 1950 house 565,000 3 1 1012 1951 house 2 1.5 968 1976 townhouse 447,000 4 1315 1950 house 648,000 3 2 1599 1964 house 3 2 987 1951 townhouse 790,000 1 1 530 2007 condo 122,000 4 2 1574 1964 house 835,000 4 2001 house 855,000 3 2.5 1472 2005 house 4 3.5 1714 2005 townhouse 2 2 1113 1999 condo 1 769 1999 condo 315,000
  • 70. Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) 3 1 860 1950 house 565,000 2 1.5 968 1976 townhouse 447,000 4 1315 1950 house 648,000 3 2 987 1951 townhouse 790,000 1 1 530 2007 condo 122,000 4 2 1574 1964 house 835,000 4 2001 house 855,000 1 769 1999 condo 315,000 3 1 1012 1951 house 3 2 1599 1964 house 3 2.5 1472 2005 house 4 3.5 1714 2005 townhouse 2 2 1113 1999 condo
  • 71. Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) 3 1 860 1950 house 565,000 2 1.5 968 1976 townhouse 447,000 4 1315 1950 house 648,000 3 2 987 1951 townhouse 790,000 1 1 530 2007 condo 122,000 4 2 1574 1964 house 835,000 4 2001 house 855,000 1 769 1999 condo 315,000
  • 72. Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) 2 1.5 968 1976 townhouse 447,000 3 1 860 1950 house 565,000 1 769 1999 condo 315,000 4 1315 1950 house 648,000 4 2 1574 1964 house 835,000 3 2 987 1951 townhouse 790,000 4 2001 house 855,000 1 1 530 2007 condo 122,000
  • 73. Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) 4 2 1574 1964 house 835,000 3 2 987 1951 townhouse 790,000 4 2001 house 855,000 1 1 530 2007 condo 122,000 Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) 2 1.5 968 1976 townhouse 447,000 3 1 860 1950 house 565,000 1 769 1999 condo 315,000 4 1315 1950 house 648,000
  • 74. Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) 2 1.5 968 1976 townhouse 447,000 3 1 860 1950 house 565,000 1 769 1999 condo 315,000 4 1315 1950 house 648,000 Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) Price ($) 4 2 1574 1964 house 835,000 835,000 3 2 987 1951 townhouse 790,000 790,000 4 2001 house 855,000 855,000 1 1 530 2007 condo 122,000 122,000
  • 75. Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) 2 1.5 968 1976 townhouse 447,000 3 1 860 1950 house 565,000 1 769 1999 condo 315,000 4 1315 1950 house 648,000 Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) Price ($) 4 2 1574 1964 house 835,000 3 2 987 1951 townhouse 790,000 4 2001 house 855,000 1 1 530 2007 condo 122,000
  • 76. Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) 2 1.5 968 1976 townhouse 447,000 3 1 860 1950 house 565,000 1 769 1999 condo 315,000 4 1315 1950 house 648,000 Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) Price ($) 4 2 1574 1964 house 835,000 3 2 987 1951 townhouse 790,000 4 2001 house 855,000 1 1 530 2007 condo 122,000
  • 77. Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) 2 1.5 968 1976 townhouse 447,000 3 1 860 1950 house 565,000 1 769 1999 condo 315,000 4 1315 1950 house 648,000 Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) Price ($) 4 2 1574 1964 house 818,000 835,000 3 2 987 1951 townhouse 800,000 790,000 4 2001 house 915,000 855,000 1 1 530 2007 condo 100,000 122,000
  • 78. Price ($) Price ($) 818,000 835,000 800,000 790,000 915,000 855,000 100,000 122,000
  • 80. The two phases of machine learning: • TRAIN a model • PREDICT with a model
  • 81. • Training time • Prediction time • Accuracy
  • 83. • Who: SaaS company selling monthly subscription • Question asked:“Is this customer going to leave within 1 month?” • Input: customer • Output: no-churn (negative) or churn (positive) • Data collection: history up until 1 month ago • Baseline: if no usage for more than 15 days then churn
  • 84. Learning: OK but • How to represent customers? • What to do after predicting churn?
  • 85. Customer representation: • basic info (age, income, etc.) • usage of service (# times used app, avg time spent, features used, etc.) • interactions with customer support (how many, topics of questions, satisfaction ratings)
  • 86. Taking action to prevent churn: • contact customers (in which order?) • switch to different plan • give special offer • no action?
  • 87. Measuring accuracy: • #TP (we predict customer churns and he does) • #FP (we predict customer churns but he doesn’t) • #FN (we predict customer doesn’t churn but he does) • Compare to baseline
  • 88. Estimating Return On Investment: • Taking action for #TP and #FP customers has a cost • We earn #TP * success rate * revenue /cust. /month • Compare to baseline
  • 90.
  • 91. PREDICTIONS OBJECTIVES DATA Context Who will use the predictive system / who will be affected by it? Provide some background. Value Proposition What are we trying to do? E.g. spend less time on X, increase Y... Data Sources Where do/can we get data from? (internal database, 3rd party API, etc.) Problem Question to predict answers to (in plain English) Input (i.e. question "parameter") Possible outputs (i.e. "answers") Type of problem (e.g. classification, regression, recommendation...) Baseline What is an alternative way of making predictions (e.g. manual rules based on feature values)? Performance evaluation Domain-specific / bottom-line metrics for monitoring performance in production Prediction accuracy metrics (e.g. MSE if regression; % accuracy, #FP for classification) Offline performance evaluation method (e.g. cross-validation or simple training/test split) Dataset How do we collect data (inputs and outputs)? How many data points? Features Used to represent inputs and extracted from data sources above. Group by types and mention key features if too many to list all. Using predictions When do we make predictions and how many? What is the time constraint for making those predictions? How do we use predictions and confidence values? Learning predictive models When do we create/update models? With which data / how much? What is the time constraint for creating a model? Criteria for deploying model (e.g. minimum performance value — absolute, relative to baseline or to previous model) IDEASPECSDEPLOYMENT
  • 94. PREDICTIONS OBJECTIVES DATA BACKGROUND End-user Value prop Sources ENGINE SPECS ML problem Perf eval Preparation INTEGRATION Using pred Learning modelINTEGRATION Using pred Learning model
  • 95. Why fill in ML canvas? • Target the right problem for your company • Choose right algorithm, infrastructure, or ML solution • Guide project management • Improve team communication
  • 97. Recap
  • 98. • Need examples of inputs AND outputs • Need enough examples
  • 99. • ML to create value from data • 2 phases: TRAIN and PREDICT • Predictive APIs make it more accessible • Good data is essential • What do we do with predictions? • Measure performance with accuracy, time and bottom-line • Also: deploy, maintain, improve…
  • 100.