SlideShare a Scribd company logo
@amcasari
Understanding Products Driven
by Machine Learning and AI:
A Data Scientist’s Perspective
A.M. Casari
Principal Product Manager + Data Scientist
Concur Labs @ SAP Concur
@amcasarihere to there via random walk
senior data scientist
@ SAP Concur
control systems
engineering +
robotics + legos
officer in USN
operations research
analyst
wandering dirtbag +
conservation volunteer
EE + applied math
+ complex systems
underwater robotics
consultant
extraordinaire
SAHM
product + data
@ Concur Labs
@amcasari
path
forward
today
§definitions
§data science as process
§data science as people
§data science as product
§data science as service
@amcasari
when we say…
DATA SCIENCE
§ We mean….the interdisciplinary intersection of methods,
processes, algorithms and problem solving techniques to
extract knowledge from data1
MACHINE LEARNING [ML]
§ We mean….the statistical class of algorithms which allow us
to systematically improve a computer’s ability to perform a
given task2
DEEP LEARNING [DL]
§ We mean….the family of ML methods based on
learning data representations3
ARTIFICIAL INTELLIGENCE [AI]
§ We mean….when a machine mimics cognitive functions
usually observed in animals, such as problem solving and
creativity4
hint:AI has not happened yet… +
our community is well represented inWikipedia
@amcasari
DL is ML is AI
h/t @mlhassett
AI
DL
ML
@amcasari
when we say
products are…
DATA DRIVEN
§ We mean….product strategy and engineering decisions
are made by qualitative + quantitative analysis of data
INTELLIGENT
§ We mean….users interact with features that thoughtfully
and seamlessly balance context and useful information
FUELED BY MACHINE LEARNING
§ We mean….somewhere in the backend, someone is
using data with some kind of predictor. More or less.
AUGMENTED
§ We mean….intelligent products which guide users through
a new experience without distracting from their purpose
@amcasari
data science as …
a process
@amcasari
avoid this…
xkcd #1838
@amcasari
more like
this…
@amcasari
more like
this…
idea
research
exploration
hypotheses
model
outcomes
feedback
@amcasari
when to move
on? “Models are not right or wrong; they're
always wrong. They're always approximations.
The question you have to ask is whether a
model tells you more information than you
would have had otherwise. If it does, it's
skillful.”
- Gavin Schmidt’s excellent TED Talk
@amcasari
be
responsible
technologists
§ Algorithmic Accountability Review
§ Responsibility
§ Explainability
§ Accuracy
§ Auditability
§ Fairness
§ Example Guiding Questions
§ How could this go south?
§ What social constructs am I modeling implicitly or
explicitly?
§ What are the impacts of the choices I have made in my
data modeling + feature selection?
§ Could the deployment of this work negatively impact a
subset of my users?
@amcasari
data science as …
people
explained through xkcd art
@amcasari
data science as
a team sport
v0
Cross Functional Team Cross Functional Team
Data Scientist
@amcasari
data science as
a team sport
v1
Cross Functional Team
Data Scientist Team
Cross Functional Team
Cross Functional Team
@amcasari
data science as
a team sport
v2
needs - “define the primary stages of leveraging Big Data with stakeholders representing the
domain. analysts usually drive from discovery toward integration, while the engineers tend to
drive from systems toward integration
NB: effective, hands-on management in Data Science must live in the space of integration, not
delegate it”
roles - “leverage different disciplines, opportunities, and risks... there’s great power in
pairing people with complementary skills, in team environments where they can recognize
each other’s priorities and perspectives
blurring these roles is wonderful... however, when businesses get into trouble, they also
tend to “push down” these roles, blurring boundaries in ways which stresses teams and
limits scalability”
diagram and description courtesy of Paco Nathan
@amcasari
data science as
a team sport
vNOW
Advanced Engineering Team
Data Science Team Cross Functional Team Cross Functional Team
Research Team Applied Research Team
@amcasari
Data science as …
a product
@amcasari
data products
v1
@MROGATI
@amcasari
data products
vNEXT
diagrams and description courtesy of Paco Nathan
The playbook on this is being written now…
personal digital stylists via StitchFix
augmented writing via Textio
artwork generation via Netflix
games to teach computers via Google
@amcasari
be
responsible
companies
§ Design for Fairness
§ Design for Accountability
§ Design for Transparency
§ Design for Privacy
§ Design for Ethics
§ Example Guiding Questions
§ Who is responsible if users are harmed by this product?
§ Who will have the power to decide on necessary changes
to the algorithmic system during design stage, pre-launch,
and post-launch?
§ How much of your system / algorithm can you explain to
your users and stakeholders?
§ What are realistic worst case scenarios in terms of how
errors might impact society, individuals, and stakeholders?
@amcasari
data science as …
a service
@amcasari
comparing
services +
vendors
§ Why are you asking for my data?
§ Cold-start versus warm-start
§ Evaluation comparison
§ How can your models work for me?
§ Feature Transfer
§ Define results by your business value, not their
metrics
§ All your services should uphold your data science
standards: Fairness, Accountability, Transparency,
Privacy, Ethics
§ What questions should I be asking about their
processes?
§ Privacy > GDPR compliant?
§ Where does the data live
§ Where do your services live
§ Who owns the trained models once they are trained
@amcasari
evaluating
during buy or
build
§ Do you have an defensible moat around
this data?
§ How long of a project runway would you
have to build a team?
§ Do you have internal resources who you
could leverage + build out a new team?
§ As this project/product scales, will the
cost of the services keep up with your
ARR?
§ What future-thinking, vertical specific
brainshare are you paying someone else
to gain?
@amcasari
Choose Your Own
Educational Adventure
Data science / ML / AI needs everyone
Approachable Resource Recommendations
Books!
• Python for Data Analysis, William McKinney
• Doing Data Science, Cathy O’Neil + Rachel
Schutt
• Data Science from Scratch, Joel Grus
• Machine Learning with Python Cookbook, Chris
Albon
MOOCs!
• Machine Learning, by Andrew Ng on Coursera
• Machine Learning Specialization, by Emily Fox +
Carlos Guestrin on Coursera
• fast.ai, by Jeremy Howard + Rachel Thomas
@amcasari
thank you

More Related Content

Similar to Understanding Products Driven by Machine Learning and AI: A Data Scientist's Perspective

Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Sri Ambati
 
The importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systemsThe importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systems
Francesca Lazzeri, PhD
 
Deciphering AI - Unlocking the Black Box of AIML with State-of-the-Art Techno...
Deciphering AI - Unlocking the Black Box of AIML with State-of-the-Art Techno...Deciphering AI - Unlocking the Black Box of AIML with State-of-the-Art Techno...
Deciphering AI - Unlocking the Black Box of AIML with State-of-the-Art Techno...
Analytics India Magazine
 
Learning to Learn Model Behavior ( Capital One: data intelligence conference )
Learning to Learn Model Behavior ( Capital One: data intelligence conference )Learning to Learn Model Behavior ( Capital One: data intelligence conference )
Learning to Learn Model Behavior ( Capital One: data intelligence conference )
Pramit Choudhary
 
Feature Engineering for Machine Learning at QConSP
Feature Engineering for Machine Learning at QConSPFeature Engineering for Machine Learning at QConSP
Feature Engineering for Machine Learning at QConSP
Amanda Casari
 
From Rocket Science to Data Science
From Rocket Science to Data ScienceFrom Rocket Science to Data Science
From Rocket Science to Data Science
Sanghamitra Deb
 
Alexis max-Creating a bot experience as good as your user experience - Alexis...
Alexis max-Creating a bot experience as good as your user experience - Alexis...Alexis max-Creating a bot experience as good as your user experience - Alexis...
Alexis max-Creating a bot experience as good as your user experience - Alexis...
WeLoveSEO
 
How to Build Data Science Teams
How to Build Data Science TeamsHow to Build Data Science Teams
How to Build Data Science Teams
Ganes Kesari
 
Data analytics career path
Data analytics career pathData analytics career path
Data analytics career path
Rubikal
 
Data Analytics Career Paths
Data Analytics Career PathsData Analytics Career Paths
Data Analytics Career Paths
Ahmed Amr Abdul-Fattah
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2
Roger Barga
 
Measuring Relevance in the Negative Space
Measuring Relevance in the Negative SpaceMeasuring Relevance in the Negative Space
Measuring Relevance in the Negative Space
Trey Grainger
 
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
Amazon Web Services
 
Future of data science as a profession
Future of data science as a professionFuture of data science as a profession
Future of data science as a profession
Jose Quesada
 
Machine learning in Banks
Machine learning in BanksMachine learning in Banks
Machine learning in Banks
Abhishek Upadhyay
 
Democratizing Advanced Analytics Propels Instant Analysis Results to the Ubiq...
Democratizing Advanced Analytics Propels Instant Analysis Results to the Ubiq...Democratizing Advanced Analytics Propels Instant Analysis Results to the Ubiq...
Democratizing Advanced Analytics Propels Instant Analysis Results to the Ubiq...
Dana Gardner
 
JavaZone 2018 - A Practical(ish) Introduction to Data Science
JavaZone 2018 - A Practical(ish) Introduction to Data ScienceJavaZone 2018 - A Practical(ish) Introduction to Data Science
JavaZone 2018 - A Practical(ish) Introduction to Data Science
Mark West
 
The future of jobs
The future of jobsThe future of jobs
The future of jobs
Padmaja Surendranath
 
Data Science Highlights
Data Science Highlights Data Science Highlights
Data Science Highlights
Joe Lamantia
 
Online course 6 14 2017
Online course 6 14 2017Online course 6 14 2017
Online course 6 14 2017
vaxelrod
 

Similar to Understanding Products Driven by Machine Learning and AI: A Data Scientist's Perspective (20)

Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
 
The importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systemsThe importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systems
 
Deciphering AI - Unlocking the Black Box of AIML with State-of-the-Art Techno...
Deciphering AI - Unlocking the Black Box of AIML with State-of-the-Art Techno...Deciphering AI - Unlocking the Black Box of AIML with State-of-the-Art Techno...
Deciphering AI - Unlocking the Black Box of AIML with State-of-the-Art Techno...
 
Learning to Learn Model Behavior ( Capital One: data intelligence conference )
Learning to Learn Model Behavior ( Capital One: data intelligence conference )Learning to Learn Model Behavior ( Capital One: data intelligence conference )
Learning to Learn Model Behavior ( Capital One: data intelligence conference )
 
Feature Engineering for Machine Learning at QConSP
Feature Engineering for Machine Learning at QConSPFeature Engineering for Machine Learning at QConSP
Feature Engineering for Machine Learning at QConSP
 
From Rocket Science to Data Science
From Rocket Science to Data ScienceFrom Rocket Science to Data Science
From Rocket Science to Data Science
 
Alexis max-Creating a bot experience as good as your user experience - Alexis...
Alexis max-Creating a bot experience as good as your user experience - Alexis...Alexis max-Creating a bot experience as good as your user experience - Alexis...
Alexis max-Creating a bot experience as good as your user experience - Alexis...
 
How to Build Data Science Teams
How to Build Data Science TeamsHow to Build Data Science Teams
How to Build Data Science Teams
 
Data analytics career path
Data analytics career pathData analytics career path
Data analytics career path
 
Data Analytics Career Paths
Data Analytics Career PathsData Analytics Career Paths
Data Analytics Career Paths
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2
 
Measuring Relevance in the Negative Space
Measuring Relevance in the Negative SpaceMeasuring Relevance in the Negative Space
Measuring Relevance in the Negative Space
 
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
 
Future of data science as a profession
Future of data science as a professionFuture of data science as a profession
Future of data science as a profession
 
Machine learning in Banks
Machine learning in BanksMachine learning in Banks
Machine learning in Banks
 
Democratizing Advanced Analytics Propels Instant Analysis Results to the Ubiq...
Democratizing Advanced Analytics Propels Instant Analysis Results to the Ubiq...Democratizing Advanced Analytics Propels Instant Analysis Results to the Ubiq...
Democratizing Advanced Analytics Propels Instant Analysis Results to the Ubiq...
 
JavaZone 2018 - A Practical(ish) Introduction to Data Science
JavaZone 2018 - A Practical(ish) Introduction to Data ScienceJavaZone 2018 - A Practical(ish) Introduction to Data Science
JavaZone 2018 - A Practical(ish) Introduction to Data Science
 
The future of jobs
The future of jobsThe future of jobs
The future of jobs
 
Data Science Highlights
Data Science Highlights Data Science Highlights
Data Science Highlights
 
Online course 6 14 2017
Online course 6 14 2017Online course 6 14 2017
Online course 6 14 2017
 

More from Amanda Casari

When Privacy Scales - Intelligent Product Design under GDPR
When Privacy Scales - Intelligent Product Design under GDPRWhen Privacy Scales - Intelligent Product Design under GDPR
When Privacy Scales - Intelligent Product Design under GDPR
Amanda Casari
 
Scaling Data Science Products, Not Data Science Teams
Scaling Data Science Products, Not Data Science TeamsScaling Data Science Products, Not Data Science Teams
Scaling Data Science Products, Not Data Science Teams
Amanda Casari
 
Spark Hearts GraphLab Create
Spark Hearts GraphLab CreateSpark Hearts GraphLab Create
Spark Hearts GraphLab Create
Amanda Casari
 
Apache Spark for Everyone - Women Who Code Workshop
Apache Spark for Everyone - Women Who Code WorkshopApache Spark for Everyone - Women Who Code Workshop
Apache Spark for Everyone - Women Who Code Workshop
Amanda Casari
 
20160512 apache-spark-for-everyone
20160512 apache-spark-for-everyone20160512 apache-spark-for-everyone
20160512 apache-spark-for-everyone
Amanda Casari
 
Design for X: Exploring Product Design with Apache Spark and GraphLab
Design for X: Exploring Product Design with Apache Spark and GraphLabDesign for X: Exploring Product Design with Apache Spark and GraphLab
Design for X: Exploring Product Design with Apache Spark and GraphLab
Amanda Casari
 
PyLadies Seattle - Lessons in Interactive Visualizations
PyLadies Seattle - Lessons in Interactive VisualizationsPyLadies Seattle - Lessons in Interactive Visualizations
PyLadies Seattle - Lessons in Interactive Visualizations
Amanda Casari
 

More from Amanda Casari (7)

When Privacy Scales - Intelligent Product Design under GDPR
When Privacy Scales - Intelligent Product Design under GDPRWhen Privacy Scales - Intelligent Product Design under GDPR
When Privacy Scales - Intelligent Product Design under GDPR
 
Scaling Data Science Products, Not Data Science Teams
Scaling Data Science Products, Not Data Science TeamsScaling Data Science Products, Not Data Science Teams
Scaling Data Science Products, Not Data Science Teams
 
Spark Hearts GraphLab Create
Spark Hearts GraphLab CreateSpark Hearts GraphLab Create
Spark Hearts GraphLab Create
 
Apache Spark for Everyone - Women Who Code Workshop
Apache Spark for Everyone - Women Who Code WorkshopApache Spark for Everyone - Women Who Code Workshop
Apache Spark for Everyone - Women Who Code Workshop
 
20160512 apache-spark-for-everyone
20160512 apache-spark-for-everyone20160512 apache-spark-for-everyone
20160512 apache-spark-for-everyone
 
Design for X: Exploring Product Design with Apache Spark and GraphLab
Design for X: Exploring Product Design with Apache Spark and GraphLabDesign for X: Exploring Product Design with Apache Spark and GraphLab
Design for X: Exploring Product Design with Apache Spark and GraphLab
 
PyLadies Seattle - Lessons in Interactive Visualizations
PyLadies Seattle - Lessons in Interactive VisualizationsPyLadies Seattle - Lessons in Interactive Visualizations
PyLadies Seattle - Lessons in Interactive Visualizations
 

Recently uploaded

Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
Nanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdfNanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdf
eddie19851
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 

Recently uploaded (20)

Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
Nanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdfNanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdf
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 

Understanding Products Driven by Machine Learning and AI: A Data Scientist's Perspective

  • 1. @amcasari Understanding Products Driven by Machine Learning and AI: A Data Scientist’s Perspective A.M. Casari Principal Product Manager + Data Scientist Concur Labs @ SAP Concur
  • 2. @amcasarihere to there via random walk senior data scientist @ SAP Concur control systems engineering + robotics + legos officer in USN operations research analyst wandering dirtbag + conservation volunteer EE + applied math + complex systems underwater robotics consultant extraordinaire SAHM product + data @ Concur Labs
  • 3. @amcasari path forward today §definitions §data science as process §data science as people §data science as product §data science as service
  • 4. @amcasari when we say… DATA SCIENCE § We mean….the interdisciplinary intersection of methods, processes, algorithms and problem solving techniques to extract knowledge from data1 MACHINE LEARNING [ML] § We mean….the statistical class of algorithms which allow us to systematically improve a computer’s ability to perform a given task2 DEEP LEARNING [DL] § We mean….the family of ML methods based on learning data representations3 ARTIFICIAL INTELLIGENCE [AI] § We mean….when a machine mimics cognitive functions usually observed in animals, such as problem solving and creativity4 hint:AI has not happened yet… + our community is well represented inWikipedia
  • 5. @amcasari DL is ML is AI h/t @mlhassett AI DL ML
  • 6. @amcasari when we say products are… DATA DRIVEN § We mean….product strategy and engineering decisions are made by qualitative + quantitative analysis of data INTELLIGENT § We mean….users interact with features that thoughtfully and seamlessly balance context and useful information FUELED BY MACHINE LEARNING § We mean….somewhere in the backend, someone is using data with some kind of predictor. More or less. AUGMENTED § We mean….intelligent products which guide users through a new experience without distracting from their purpose
  • 11. @amcasari when to move on? “Models are not right or wrong; they're always wrong. They're always approximations. The question you have to ask is whether a model tells you more information than you would have had otherwise. If it does, it's skillful.” - Gavin Schmidt’s excellent TED Talk
  • 12. @amcasari be responsible technologists § Algorithmic Accountability Review § Responsibility § Explainability § Accuracy § Auditability § Fairness § Example Guiding Questions § How could this go south? § What social constructs am I modeling implicitly or explicitly? § What are the impacts of the choices I have made in my data modeling + feature selection? § Could the deployment of this work negatively impact a subset of my users?
  • 13. @amcasari data science as … people explained through xkcd art
  • 14. @amcasari data science as a team sport v0 Cross Functional Team Cross Functional Team Data Scientist
  • 15. @amcasari data science as a team sport v1 Cross Functional Team Data Scientist Team Cross Functional Team Cross Functional Team
  • 16. @amcasari data science as a team sport v2 needs - “define the primary stages of leveraging Big Data with stakeholders representing the domain. analysts usually drive from discovery toward integration, while the engineers tend to drive from systems toward integration NB: effective, hands-on management in Data Science must live in the space of integration, not delegate it” roles - “leverage different disciplines, opportunities, and risks... there’s great power in pairing people with complementary skills, in team environments where they can recognize each other’s priorities and perspectives blurring these roles is wonderful... however, when businesses get into trouble, they also tend to “push down” these roles, blurring boundaries in ways which stresses teams and limits scalability” diagram and description courtesy of Paco Nathan
  • 17. @amcasari data science as a team sport vNOW Advanced Engineering Team Data Science Team Cross Functional Team Cross Functional Team Research Team Applied Research Team
  • 18. @amcasari Data science as … a product
  • 20. @amcasari data products vNEXT diagrams and description courtesy of Paco Nathan The playbook on this is being written now… personal digital stylists via StitchFix augmented writing via Textio artwork generation via Netflix games to teach computers via Google
  • 21. @amcasari be responsible companies § Design for Fairness § Design for Accountability § Design for Transparency § Design for Privacy § Design for Ethics § Example Guiding Questions § Who is responsible if users are harmed by this product? § Who will have the power to decide on necessary changes to the algorithmic system during design stage, pre-launch, and post-launch? § How much of your system / algorithm can you explain to your users and stakeholders? § What are realistic worst case scenarios in terms of how errors might impact society, individuals, and stakeholders?
  • 22. @amcasari data science as … a service
  • 23. @amcasari comparing services + vendors § Why are you asking for my data? § Cold-start versus warm-start § Evaluation comparison § How can your models work for me? § Feature Transfer § Define results by your business value, not their metrics § All your services should uphold your data science standards: Fairness, Accountability, Transparency, Privacy, Ethics § What questions should I be asking about their processes? § Privacy > GDPR compliant? § Where does the data live § Where do your services live § Who owns the trained models once they are trained
  • 24. @amcasari evaluating during buy or build § Do you have an defensible moat around this data? § How long of a project runway would you have to build a team? § Do you have internal resources who you could leverage + build out a new team? § As this project/product scales, will the cost of the services keep up with your ARR? § What future-thinking, vertical specific brainshare are you paying someone else to gain?
  • 25. @amcasari Choose Your Own Educational Adventure Data science / ML / AI needs everyone Approachable Resource Recommendations Books! • Python for Data Analysis, William McKinney • Doing Data Science, Cathy O’Neil + Rachel Schutt • Data Science from Scratch, Joel Grus • Machine Learning with Python Cookbook, Chris Albon MOOCs! • Machine Learning, by Andrew Ng on Coursera • Machine Learning Specialization, by Emily Fox + Carlos Guestrin on Coursera • fast.ai, by Jeremy Howard + Rachel Thomas