SlideShare a Scribd company logo
1 of 28
Download to read offline
BUILDINGAGILE&AI
STARTUPS
BASIC TIPS FOR PRODUCT MANAGERS
By John Fagan
R & Pandas & pythons & trees
Notebooks
Tensor flow
Models
AL / ML / Deep learning
Supervised learning
Unsupervised learning
Semi unsupervised
Transferred learning
Reinforcement learning
Back propagation
Forward propagation
Over fitting / under-fitting
Confusion Matrix
WENOWHAVE
DOCTORSINTHEHOUSE
INTIMIDATING NEW WORLD FOR PRODUCT MANAGERS
Inference
Classification, Regression, Correlation
FPs
Outlier / Normalize
Statistical Significance
Variance
Feature Engineering
Neural Networks
LSTMS
Quantile
Random Forest (may forests!)
Confidence interval
F1 Score
Precision / Recall
Mean Squared Err
@johnbfagan
WHATDOTHESEWIZARDSDO?@johnbfagan
https://towardsdatascience.com/demystifying-the-data-science-job-families-c74f2294b1bd
THEREARENOWIZARDS
@johnbfagan
https://hackernoon.com/the-ai-hierarchy-of-needs-18f111fcc007
FIRST.ISYOURDATASTACKREADY?
@johnbfagan
• Understand basic components and work flows
• Get familiar with basic terms, methods and
constructs e.g. supervised, unsupervised,
classification, algorithm and models, model
evaluation, variance, bias, overfitting, precision,
recall
• Listen to Podcasts
• Loads on Youtube
• Do a short course
ACQUIREDATASCIENCELITERACY
PREPAREBEFORETHE
DOCTORSARRIVE
@johnbfagan
Use the Retrospective to make small and continuous
improvements.
Data hygiene & integrity
Data models
Transaction ids
Data flows
Simple data archiving mechanic
Gradually tune up your Definition of Done
Consider hiring a data science consultant on short term
contract
PREPAREBEFORETHE
DOCTORSARRIVE
GET YOUR DATA READY.
@johnbfagan
https://medium.com/@hugh_data_science/the-pyramid-of-data-needs-and-why-it-matters-for-your-career-b0f695c13f11
https://courses.lumenlearning.com/wmintrobusiness/chapter/reading-stages-of-the-product-life-cycle/
PYRAMID OF DATA NEEDS
THINK.EVOLUTIONNOTREVOLUTION
ALIGNED TO PRODUCT DEVELOPMENT & STARTUP LIFECYCLE
@johnbfagan
https://medium.com/jalapeno-app/waterfall-agile-spectrum-e0bff7efa2e1
PLAN.FORCLASHOFCULTURES
🤔CANYOUHAVEA3POINTUSERSTORY ANDTHROWINA“DOTHEDATASCIENCE”SUBTASK?
@johnbfagan
https://www.oreilly.com/radar/a-manifesto-for-agile-data-science/
GREAT MODEL FOR UNLOCKING VALUE IN A AGILE FRAMEWORK, STORIES AT EACH LEVEL
THINK.ABOUTTHEDATAVALUEPYRAMID
@johnbfagan
READY?DEFINEYOURPROBLEM
Allows you to align expectations and outcomes.
You should spend time collaborating with your
team (engineers, data science, testers and
management) on defining the problem,
assumptions & expected outcomes. More so
than you would with a classic problem which is
solved by CRUD.
Luckily Machine Learning Mastery have a great
template, which I have adapted to agile stories.
https://machinelearningmastery.com/how-to-define-your-machine-learning-problem/
MUST.DEFINETHE
PROBLEM
PUT A LOT OF LOVE INTO THIS
@johnbfagan
MACHINELEARNING
MASTERY Tests the boundaries
and re-tests the
problem statement and
assumptions
Breaks the solution
down into layman level
@johnbfagan
IN ORDER TO become a social medial influencer

AS A regular twitter user

I NEED twitter to predict if my draft tweet content will get retweets

GIVEN there is a history of tweets from @illizian

AND some have retweets

AND some do not
WHEN @illizian composes a new tweet

THEN classify the tweet if its going to get retweets or not 

AND ensure the classification model has an accuracy score as a percentage.

AND the accuracy score is the number of tweets predicted correctly out of all tweets

AND The specific words he used in the tweet matter to the model.

AND The specific user that retweets does not matter to the model.

AND The number of retweets may matter to the model.

AND Older tweets are less predictive than more recent tweets.

TRANSLATETOUSERSTORIESWITHBDD
@johnbfagan
F1 score - measure of a test's accuracy. It considers both
the precision p and the recall r
False Positives - a test result which wrongly indicates that a
particular condition or attribute is present.
False Negatives - a test result which wrongly indicates that
a particular condition or attribute is absent.
Tradeoffs - impact mapping milestone a great way to
describe tradeoffs of quality versus, time and cost and
define you Go vs No-Go Metrics 
MUST. CAREABOUT
SUCCESSMETRICS
WHAT DOES SUCCESS LOOK LIKE?
https://www.productschool.com/blog/product-management-2/great-machine-learning-product-management-google/
https://www.impactmapping.org/
@johnbfagan
SOLVINGYOURPREDICTIVE
CHALLENGE
We all have a great solution, the best solution, but machine
learning is just one solution along with many others.
First create a super dumb baseline model (!AI), e.g.
• 100% certainty each tweet will be RT’d!
• Use average % of last 100 tweets that got RT’d
• If any words (excluding stop words) previously got
RT’d, then 100% certain tweet will get RT’d!
You might be surprised that your super simple solution is fit
for purpose
STARTWITHTHE
DUMBESTSOLUTION
@johnbfagan
https://www.youtube.com/watch?v=TK-2189UcKk
MUST. CAREABOUT
TESTDATA.
BUILD YOUR TEST DATA & AGREE KPIS
@johnbfagan
https://www.datarobot.com/wiki/cross-validation/
MUST.UNDERSTANDMODEL
EVALUATION
Validation data, later used for training too
Validation techniques
• Hold out - cross validation
• K fold - cross-validation
Beware of bias, overfitting & underfitting
CROSS VALIDATION
@johnbfagan
https://www.youtube.com/watch?v=EuBBz3bI-aA
VARIANCE AND BIAS
MUST.ENSUREYOUARETRAINING
FORTHERIGHTOUTCOMES.
@johnbfagan
https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/
FORGET.ALGORITHMS
YOU HIRE DATA SCIENTISTS TO WORRY ABOUT THIS.
@johnbfagan
https://www.oreilly.com/library/view/evaluating-machine-learning/9781492048756/ch01.html
MAKE SURE YOU LET YOUR TEAM INVEST IN THIS
MUST.WORRYABOUT INFRASTRUCTURE&DEPLOYMENT
@johnbfagan
GOINGFROMSTARTUPTOSCALEUP
https://research.aimultiple.com/data-science-tools/
BUILD.DATASCIENCESTACKS
@johnbfagan
Software is usually static, but data is
always changing.
Monitor algorithms performance for drift
Adapt by understanding, re-fitting,
updating, weighting, learning the
changes.
MONITOR.DRIFT
BEHAVIOURS ALWAYS CHANGE.
https://www.semanticscholar.org/paper/Concept-drift-adaptation-for-learning-with-data-Liu/5b105e357936f989cfb46ddd055ea44a2b0aed04
https://machinelearningmastery.com/gentle-introduction-concept-drift-machine-learning/
@johnbfagan
https://www.grazitti.com/blog/data-lake-vs-data-warehouse-which-one-should-you-go-for/
GROW.DATALAKES
@johnbfagan
HTTPS://WWW.GOOGLE.COM/URL?SA=I&URL=HTTPS%3A%2F%2FWWW.DIALOGUES.ORG%2F08%2F02%2F2018%2FSOFTWARE-ENGINEER-VS-DATA-SCIENTIST-INTERVIEW-FEATURING-JOMA-TECH-
%2F1533228768&PSIG=AOVVAW18WWIM9CIR1QCAHVJYJHVH&UST=1590168327220000&SOURCE=IMAGES&CD=VFE&VED=0CA0QJHXQFWOTCLITHOW8XEKCFQAAAAADAAAAABAI
ENABLE.COLLABORATION
@johnbfagan
THANKS

More Related Content

Similar to Building Agile & AI startups - Basic tips for Product Managers

Model Monitoring at Scale with Apache Spark and Verta
Model Monitoring at Scale with Apache Spark and VertaModel Monitoring at Scale with Apache Spark and Verta
Model Monitoring at Scale with Apache Spark and Verta
Databricks
 

Similar to Building Agile & AI startups - Basic tips for Product Managers (20)

Product Management & Statistics - ProductTank Helsinki 04/2020
Product Management & Statistics - ProductTank Helsinki 04/2020Product Management & Statistics - ProductTank Helsinki 04/2020
Product Management & Statistics - ProductTank Helsinki 04/2020
 
Model Monitoring at Scale with Apache Spark and Verta
Model Monitoring at Scale with Apache Spark and VertaModel Monitoring at Scale with Apache Spark and Verta
Model Monitoring at Scale with Apache Spark and Verta
 
Dave Davies @ Pubcon
Dave Davies @ PubconDave Davies @ Pubcon
Dave Davies @ Pubcon
 
Setting Up a Machine Learning Platform
Setting Up a Machine Learning PlatformSetting Up a Machine Learning Platform
Setting Up a Machine Learning Platform
 
DevOps Theory vs. Practice: A Song of Ice and Tire Fire
DevOps Theory vs. Practice: A Song of Ice and Tire FireDevOps Theory vs. Practice: A Song of Ice and Tire Fire
DevOps Theory vs. Practice: A Song of Ice and Tire Fire
 
Intro to Machine Learning by Google Product Manager
Intro to Machine Learning by Google Product ManagerIntro to Machine Learning by Google Product Manager
Intro to Machine Learning by Google Product Manager
 
Building Continuous Learning Systems
Building Continuous Learning SystemsBuilding Continuous Learning Systems
Building Continuous Learning Systems
 
Barga Data Science lecture 9
Barga Data Science lecture 9Barga Data Science lecture 9
Barga Data Science lecture 9
 
Streams on top of scala - #lambdaCon
Streams on top of scala - #lambdaConStreams on top of scala - #lambdaCon
Streams on top of scala - #lambdaCon
 
Digital Mornings Copenhagen - Mathew Sweezey presentation
Digital Mornings Copenhagen - Mathew Sweezey presentationDigital Mornings Copenhagen - Mathew Sweezey presentation
Digital Mornings Copenhagen - Mathew Sweezey presentation
 
Digital Mornings Copenhagen - Mathew Sweezey presentation
Digital Mornings Copenhagen - Mathew Sweezey presentationDigital Mornings Copenhagen - Mathew Sweezey presentation
Digital Mornings Copenhagen - Mathew Sweezey presentation
 
Leveraging AI & ML to Automoate Repetitive Tasks
Leveraging AI & ML to Automoate Repetitive TasksLeveraging AI & ML to Automoate Repetitive Tasks
Leveraging AI & ML to Automoate Repetitive Tasks
 
How to Use Machine Learning as a Product Manager by Wework PM
 How to Use Machine Learning as a Product Manager by Wework PM How to Use Machine Learning as a Product Manager by Wework PM
How to Use Machine Learning as a Product Manager by Wework PM
 
Generalist.pdf
Generalist.pdfGeneralist.pdf
Generalist.pdf
 
SentimentAnalysisofTwitterProductReviewsDocument.pdf
SentimentAnalysisofTwitterProductReviewsDocument.pdfSentimentAnalysisofTwitterProductReviewsDocument.pdf
SentimentAnalysisofTwitterProductReviewsDocument.pdf
 
Natural Language Classifier - Handbook (IBM)
Natural Language Classifier - Handbook (IBM)Natural Language Classifier - Handbook (IBM)
Natural Language Classifier - Handbook (IBM)
 
Automated Machine Learning
Automated Machine LearningAutomated Machine Learning
Automated Machine Learning
 
ODSC East 2020 : Continuous_learning_systems
ODSC East 2020 : Continuous_learning_systemsODSC East 2020 : Continuous_learning_systems
ODSC East 2020 : Continuous_learning_systems
 
Digital morning, Stockholm, 23 of March
Digital morning, Stockholm, 23 of MarchDigital morning, Stockholm, 23 of March
Digital morning, Stockholm, 23 of March
 
10 TRENDS IN STAFFING TECH — DON'T GET BLINDSIDED
10 TRENDS IN STAFFING TECH — DON'T GET BLINDSIDED10 TRENDS IN STAFFING TECH — DON'T GET BLINDSIDED
10 TRENDS IN STAFFING TECH — DON'T GET BLINDSIDED
 

More from John Fagan

Working From Home - Distributed Teams
Working From Home - Distributed TeamsWorking From Home - Distributed Teams
Working From Home - Distributed Teams
John Fagan
 
GeoWeb Services #WIN or #FAIL
GeoWeb Services #WIN or #FAILGeoWeb Services #WIN or #FAIL
GeoWeb Services #WIN or #FAIL
John Fagan
 

More from John Fagan (8)

Beyond the Farebox - Mobility-as-a-Platform
Beyond the Farebox - Mobility-as-a-PlatformBeyond the Farebox - Mobility-as-a-Platform
Beyond the Farebox - Mobility-as-a-Platform
 
Orchestrated Mobility - Changing the way we move (Barclays Ai Frenzy)
Orchestrated Mobility - Changing the way we move (Barclays Ai Frenzy)Orchestrated Mobility - Changing the way we move (Barclays Ai Frenzy)
Orchestrated Mobility - Changing the way we move (Barclays Ai Frenzy)
 
SyncNorwich 5 years
SyncNorwich 5 yearsSyncNorwich 5 years
SyncNorwich 5 years
 
#mapbots can win, where #chatbots fail
#mapbots can win, where #chatbots fail#mapbots can win, where #chatbots fail
#mapbots can win, where #chatbots fail
 
Working From Home - Distributed Teams
Working From Home - Distributed TeamsWorking From Home - Distributed Teams
Working From Home - Distributed Teams
 
GeoWeb Services #WIN or #FAIL
GeoWeb Services #WIN or #FAILGeoWeb Services #WIN or #FAIL
GeoWeb Services #WIN or #FAIL
 
Where2.0Now - Finding the heat in Thematic Maps
Where2.0Now - Finding the heat in Thematic MapsWhere2.0Now - Finding the heat in Thematic Maps
Where2.0Now - Finding the heat in Thematic Maps
 
John Fagan - The Black Art of Geocoding
John Fagan - The Black Art of GeocodingJohn Fagan - The Black Art of Geocoding
John Fagan - The Black Art of Geocoding
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 

Building Agile & AI startups - Basic tips for Product Managers