SlideShare a Scribd company logo
Thinking with Data
Max Shron
@mshron
Thinking with Data
Max Shron
@mshron
Big picture
• Data is too much fun, too easy to rabbit-hole
• Specialized knowledge is hard to communicate
• Not all stats is well-adapted to the real world
• We need techniques to handle that
Big picture
• Design — UX, consulting, etc.
• Humanities — philosophy, law, etc.
• Social science — sociology, psychology, etc.
Scoping
• First set of techniques: scoping.
• The world gives us vague requests.
• We should have things clear before we start, or
we end up with uninteresting questions.
• Write things down or say out loud.
Scoping
• Imagine we are working with a company with a
subscription business. The CEO asks us for a
churn model.
• Bad scope: “We will use R to create a logistic
regression to predict who will quit using the
product.”
• Not actionable, irrelevant detail.
Scoping
• CoNVO:
• Context
• Need
• Vision
• Outcome
• Iterative process — start simple, refine, refine, refine.
Scoping
• Context
• Who are we working with? What are the big
picture, long term goals?
• “The company has subscription model. CEO’s
goal is to improve profitability.”
Scoping
• Need
• What is the particular knowledge we are
missing?
• “We want to understand who drops off early
enough so that we can intervene.”
Scoping
• Vision
• What would it look like to solve the problem?
• “We will build a predictive model using
behavioral data to predict who will drop off —
early enough to be useful.”
• Sources of data: important. Kinds of offers:
important. Kind of experimentation: important.
Kind of model: unimportant.
Scoping
• Outcome
• Who will be responsible for next steps? How will we
know if we are correct?
• “The tech team will implement the model in a batch
process to run daily, automatically sending out email
offers. We will calculate success metrics (precision
and recall) on held out users, and send a weekly
email of stats to stay on top.”
• We need a control group!
Scoping
• How do we develop a CoNVO?
• interviews
• kitchen sink interrogation
• roleplaying
• story-telling
• mockups
• Clearer vision with mockups
John Smith is 36 years old,
he has seen 40 different
pages over the past two
weeks, his and he has a 20%
chance to convert.
Scoping
Scoping
• Context We are hired to work in a hospital system with
250K patients over 20 years. Report to CEO, who is
interested in building a tool for reducing medical issues.
• Need After talking to some doctors, some belief that
there is overuse of antibiotics, but hard to detect.
• Vision A pilot investigation. If we find signal, repeatable
flagging tool.
• Outcome CMO will decide if pilot is valuable based on
report. Automated tool would be run by CMO on demand.
Arguments
• Data is not a ray gun!
• People need to be convinced, including you.
• The world is not deductive logic, we need a
theory that includes that people have minds.
• Trusting a tool, making a point with a graph,
coming to terms on a definition, convincing
someone to act differently, etc etc.
Arguments
• General model is semi-deductive. We move from
what is known / agreed-upon and move towards
what is not yet known.
• Patterns of reasoning help us make stronger cases
in less time and effort. Take advantage of two
thousand years of research.
Arguments
• Example: Predicting Δ poverty from satellite data
• It takes 5-10 years to get small scale poverty
estimates in poor countries.
• The vision: predict whether the poverty estimates
will go up or down ahead of time, using cheap
satellite data.
• The outcome: use to informally guide policy
decisions, keeping track of interventions.
Arguments
• Claim - Your audience does not believe it yet but
you think you can make a case for it.
• “Poverty can be modeled effectively with satellite
data.”
• Prior knowledge - Things your audience already
believes before the case is started.
Arguments
• Evidence - Where data enters an argument. We
transform data into evidence. Counts, models, graphs,
etc. make up the evidence.
• Justification - The reasoning why the evidence should
cause us to believe the claim.
• “These graphs indicate that the residuals for our
model are as we had anticipated.”
• Rebuttal - Any of the reasons why the justification might
not hold in this particular case. Usually smart to know.
Arguments
• Patterns!
• Causal analysis
• Convincing takes more than math
• Categories of dispute
Arguments
• Disputes of fact — getting the details straight
• “The F1 for this model is .7”
• Two stock issues:
• What is a reasonable truth condition?
• Is it satisfied?
Arguments
• Disputes of definition — relating words to math
• “Poverty is defined as FGT, α = 2”
• Three stock issues:
• Does this definition make a useful distinction?
• How consistent is this definition with prior ideas?
• What, if any, are the reasonable alternatives?
Arguments
• Disputes of value — making the right trade-offs
• “Our model is simple enough.”
• Two stock issues:
• How do our goals determine which values are
most important?
• Have the values been properly applied here?
Arguments
• Disputes of policy — the right course of action
• “We should use this model to informally guide our decisions
between official estimates.”
• Four stock issues:
• Is there a problem? ill
• Where is credit or blame due? blame
• Will the proposal solve it? cure
• Will it be better on balance? cost
Summary
• Take half the math and tools and twice the listening
to what people actually need.
• This is the tip of the iceberg. In general, we have a
lot to learn from others
• Let’s talk! @mshron

More Related Content

What's hot

What Social Media Networks Should Your Brand Be On?
What Social Media Networks Should Your Brand Be On?What Social Media Networks Should Your Brand Be On?
What Social Media Networks Should Your Brand Be On?
Falcon.io
 
OTT & Video Advertising Playbook
OTT & Video Advertising PlaybookOTT & Video Advertising Playbook
OTT & Video Advertising Playbook
Tinuiti
 
Sunsilk Activation Case Study
Sunsilk Activation Case StudySunsilk Activation Case Study
Sunsilk Activation Case Study
Vũ Văn Hiển
 
Airbnb project
Airbnb projectAirbnb project
Airbnb project
Morgan Milza
 
Social Trends 2022: Báo cáo xu hướng mạng xã hội năm 2022
Social Trends 2022: Báo cáo xu hướng mạng xã hội năm 2022Social Trends 2022: Báo cáo xu hướng mạng xã hội năm 2022
Social Trends 2022: Báo cáo xu hướng mạng xã hội năm 2022
MarketingTrips
 
DM_MICA_TELTEK_ANJALI_BISOI.pdf
DM_MICA_TELTEK_ANJALI_BISOI.pdfDM_MICA_TELTEK_ANJALI_BISOI.pdf
DM_MICA_TELTEK_ANJALI_BISOI.pdf
anjalibisoi1
 
WeTipp Pitch Deck
WeTipp Pitch DeckWeTipp Pitch Deck
WeTipp Pitch Deck
startuphome
 
Case Studies of Social Media Marketing
Case Studies of Social Media MarketingCase Studies of Social Media Marketing
Case Studies of Social Media Marketing
S Harlalka
 
Pet grooming case study
Pet grooming case studyPet grooming case study
Pet grooming case study
janettewynn
 
Davai Pitchdeck
Davai Pitchdeck Davai Pitchdeck
Davai Pitchdeck
davaiapp
 
Tinder Pitch Deck
Tinder Pitch DeckTinder Pitch Deck
Tinder Pitch Deck
Ryan Gum
 
Moderator guide for Focus Group Discussion
Moderator guide for Focus Group DiscussionModerator guide for Focus Group Discussion
Moderator guide for Focus Group Discussion
Raja Babu
 
The product folks : leap scholar Product teardown
The product folks : leap scholar Product teardownThe product folks : leap scholar Product teardown
The product folks : leap scholar Product teardown
AbhinavThakur51
 
Airbnb Social Media Strategy
Airbnb Social Media StrategyAirbnb Social Media Strategy
Airbnb Social Media Strategy
Maria Vargas
 
EARNED MEDIA SUCCESS FOR BEAUTY BRANDS ON YOUTUBE
EARNED MEDIA SUCCESS FOR BEAUTY BRANDS ON YOUTUBEEARNED MEDIA SUCCESS FOR BEAUTY BRANDS ON YOUTUBE
EARNED MEDIA SUCCESS FOR BEAUTY BRANDS ON YOUTUBE
Octoly
 
Young Marketers Elite 3_Assignment 11_Brand Communication 2
Young Marketers Elite 3_Assignment 11_Brand Communication 2Young Marketers Elite 3_Assignment 11_Brand Communication 2
Young Marketers Elite 3_Assignment 11_Brand Communication 2
honvongphu
 
Spotify Pitch Deck
Spotify Pitch DeckSpotify Pitch Deck
Spotify Pitch Deck
EwanDoyle
 
Growth Lessons from Early Snapchat
Growth Lessons from Early SnapchatGrowth Lessons from Early Snapchat
Growth Lessons from Early Snapchat
Mozza
 
How To Start Social Media Marketing | Social Media Marketing Tutorial For Beg...
How To Start Social Media Marketing | Social Media Marketing Tutorial For Beg...How To Start Social Media Marketing | Social Media Marketing Tutorial For Beg...
How To Start Social Media Marketing | Social Media Marketing Tutorial For Beg...
Simplilearn
 
How Do Social Media Algorithms Work?
How Do Social Media Algorithms Work?How Do Social Media Algorithms Work?
How Do Social Media Algorithms Work?
JanszenMedia
 

What's hot (20)

What Social Media Networks Should Your Brand Be On?
What Social Media Networks Should Your Brand Be On?What Social Media Networks Should Your Brand Be On?
What Social Media Networks Should Your Brand Be On?
 
OTT & Video Advertising Playbook
OTT & Video Advertising PlaybookOTT & Video Advertising Playbook
OTT & Video Advertising Playbook
 
Sunsilk Activation Case Study
Sunsilk Activation Case StudySunsilk Activation Case Study
Sunsilk Activation Case Study
 
Airbnb project
Airbnb projectAirbnb project
Airbnb project
 
Social Trends 2022: Báo cáo xu hướng mạng xã hội năm 2022
Social Trends 2022: Báo cáo xu hướng mạng xã hội năm 2022Social Trends 2022: Báo cáo xu hướng mạng xã hội năm 2022
Social Trends 2022: Báo cáo xu hướng mạng xã hội năm 2022
 
DM_MICA_TELTEK_ANJALI_BISOI.pdf
DM_MICA_TELTEK_ANJALI_BISOI.pdfDM_MICA_TELTEK_ANJALI_BISOI.pdf
DM_MICA_TELTEK_ANJALI_BISOI.pdf
 
WeTipp Pitch Deck
WeTipp Pitch DeckWeTipp Pitch Deck
WeTipp Pitch Deck
 
Case Studies of Social Media Marketing
Case Studies of Social Media MarketingCase Studies of Social Media Marketing
Case Studies of Social Media Marketing
 
Pet grooming case study
Pet grooming case studyPet grooming case study
Pet grooming case study
 
Davai Pitchdeck
Davai Pitchdeck Davai Pitchdeck
Davai Pitchdeck
 
Tinder Pitch Deck
Tinder Pitch DeckTinder Pitch Deck
Tinder Pitch Deck
 
Moderator guide for Focus Group Discussion
Moderator guide for Focus Group DiscussionModerator guide for Focus Group Discussion
Moderator guide for Focus Group Discussion
 
The product folks : leap scholar Product teardown
The product folks : leap scholar Product teardownThe product folks : leap scholar Product teardown
The product folks : leap scholar Product teardown
 
Airbnb Social Media Strategy
Airbnb Social Media StrategyAirbnb Social Media Strategy
Airbnb Social Media Strategy
 
EARNED MEDIA SUCCESS FOR BEAUTY BRANDS ON YOUTUBE
EARNED MEDIA SUCCESS FOR BEAUTY BRANDS ON YOUTUBEEARNED MEDIA SUCCESS FOR BEAUTY BRANDS ON YOUTUBE
EARNED MEDIA SUCCESS FOR BEAUTY BRANDS ON YOUTUBE
 
Young Marketers Elite 3_Assignment 11_Brand Communication 2
Young Marketers Elite 3_Assignment 11_Brand Communication 2Young Marketers Elite 3_Assignment 11_Brand Communication 2
Young Marketers Elite 3_Assignment 11_Brand Communication 2
 
Spotify Pitch Deck
Spotify Pitch DeckSpotify Pitch Deck
Spotify Pitch Deck
 
Growth Lessons from Early Snapchat
Growth Lessons from Early SnapchatGrowth Lessons from Early Snapchat
Growth Lessons from Early Snapchat
 
How To Start Social Media Marketing | Social Media Marketing Tutorial For Beg...
How To Start Social Media Marketing | Social Media Marketing Tutorial For Beg...How To Start Social Media Marketing | Social Media Marketing Tutorial For Beg...
How To Start Social Media Marketing | Social Media Marketing Tutorial For Beg...
 
How Do Social Media Algorithms Work?
How Do Social Media Algorithms Work?How Do Social Media Algorithms Work?
How Do Social Media Algorithms Work?
 

Viewers also liked

Drew Conway: A Social Scientist's Perspective on Data Science
Drew Conway: A Social Scientist's Perspective on Data ScienceDrew Conway: A Social Scientist's Perspective on Data Science
Drew Conway: A Social Scientist's Perspective on Data Science
mortardata
 
Can Big Data Save the World? By Jake Porway
Can Big Data Save the World? By Jake PorwayCan Big Data Save the World? By Jake Porway
Can Big Data Save the World? By Jake Porway
mortardata
 
The Back of The Napkin (Dan Roam)
The Back of The Napkin (Dan Roam)The Back of The Napkin (Dan Roam)
The Back of The Napkin (Dan Roam)
Santo Thie
 
The Fry and the Cranberry
The Fry and the CranberryThe Fry and the Cranberry
The Fry and the Cranberry
Dan Roam
 
1 simple way to better presentations: don't outline, PUMA!
1 simple way to better presentations: don't outline, PUMA!1 simple way to better presentations: don't outline, PUMA!
1 simple way to better presentations: don't outline, PUMA!
Dan Roam
 
Navigating the AWS Compliance Framework | AWS Security Roadshow Dublin
Navigating the AWS Compliance Framework | AWS Security Roadshow DublinNavigating the AWS Compliance Framework | AWS Security Roadshow Dublin
Navigating the AWS Compliance Framework | AWS Security Roadshow Dublin
Amazon Web Services
 
Health Care Napkin 2
Health Care Napkin 2Health Care Napkin 2
Health Care Napkin 2
Dan Roam
 
Health care napkin 4
Health care napkin 4Health care napkin 4
Health care napkin 4
Dan Roam
 
My Speech to Congress (Health Care)
My Speech to Congress (Health Care)My Speech to Congress (Health Care)
My Speech to Congress (Health Care)
Dan Roam
 
Auditoría Odontológica
Auditoría OdontológicaAuditoría Odontológica
Auditoría Odontológica
Jorge Enrique Manrique-Chávez
 
Health Care Napkin 1
Health Care Napkin 1Health Care Napkin 1
Health Care Napkin 1
Dan Roam
 
Consequences of the Armed Conflict as a Stressor of Climate Change in Colombi...
Consequences of the Armed Conflict as a Stressor of Climate Change in Colombi...Consequences of the Armed Conflict as a Stressor of Climate Change in Colombi...
Consequences of the Armed Conflict as a Stressor of Climate Change in Colombi...
Global Risk Forum GRFDavos
 
Impact of a Collective Action in a Disaster-affected Community to Site a Temp...
Impact of a Collective Action in a Disaster-affected Community to Site a Temp...Impact of a Collective Action in a Disaster-affected Community to Site a Temp...
Impact of a Collective Action in a Disaster-affected Community to Site a Temp...
Global Risk Forum GRFDavos
 
Methods and Compliance Programs for Anti-Corruption, Pascal HANS
Methods and Compliance Programs for Anti-Corruption, Pascal HANSMethods and Compliance Programs for Anti-Corruption, Pascal HANS
Methods and Compliance Programs for Anti-Corruption, Pascal HANS
Global Risk Forum GRFDavos
 
Expected Skills, Required Program Content and Assessment System to Address th...
Expected Skills, Required Program Content and Assessment System to Address th...Expected Skills, Required Program Content and Assessment System to Address th...
Expected Skills, Required Program Content and Assessment System to Address th...
Global Risk Forum GRFDavos
 
Qualitative Risk Assessment for Business Continuity Management in University,...
Qualitative Risk Assessment for Business Continuity Management in University,...Qualitative Risk Assessment for Business Continuity Management in University,...
Qualitative Risk Assessment for Business Continuity Management in University,...
Global Risk Forum GRFDavos
 
Vulnerability Assessment Using Spatial Information in terms of Chemical Relea...
Vulnerability Assessment Using Spatial Information in terms of Chemical Relea...Vulnerability Assessment Using Spatial Information in terms of Chemical Relea...
Vulnerability Assessment Using Spatial Information in terms of Chemical Relea...
Global Risk Forum GRFDavos
 
Rock and Roll's Drawing Fable (A quick lesson on how to draw anything!)
Rock and Roll's Drawing Fable (A quick lesson on how to draw anything!) Rock and Roll's Drawing Fable (A quick lesson on how to draw anything!)
Rock and Roll's Drawing Fable (A quick lesson on how to draw anything!)
Dan Roam
 
Titan: Big Graph Data with Cassandra
Titan: Big Graph Data with CassandraTitan: Big Graph Data with Cassandra
Titan: Big Graph Data with Cassandra
Matthias Broecheler
 
How to Interview a Data Scientist
How to Interview a Data ScientistHow to Interview a Data Scientist
How to Interview a Data Scientist
Daniel Tunkelang
 

Viewers also liked (20)

Drew Conway: A Social Scientist's Perspective on Data Science
Drew Conway: A Social Scientist's Perspective on Data ScienceDrew Conway: A Social Scientist's Perspective on Data Science
Drew Conway: A Social Scientist's Perspective on Data Science
 
Can Big Data Save the World? By Jake Porway
Can Big Data Save the World? By Jake PorwayCan Big Data Save the World? By Jake Porway
Can Big Data Save the World? By Jake Porway
 
The Back of The Napkin (Dan Roam)
The Back of The Napkin (Dan Roam)The Back of The Napkin (Dan Roam)
The Back of The Napkin (Dan Roam)
 
The Fry and the Cranberry
The Fry and the CranberryThe Fry and the Cranberry
The Fry and the Cranberry
 
1 simple way to better presentations: don't outline, PUMA!
1 simple way to better presentations: don't outline, PUMA!1 simple way to better presentations: don't outline, PUMA!
1 simple way to better presentations: don't outline, PUMA!
 
Navigating the AWS Compliance Framework | AWS Security Roadshow Dublin
Navigating the AWS Compliance Framework | AWS Security Roadshow DublinNavigating the AWS Compliance Framework | AWS Security Roadshow Dublin
Navigating the AWS Compliance Framework | AWS Security Roadshow Dublin
 
Health Care Napkin 2
Health Care Napkin 2Health Care Napkin 2
Health Care Napkin 2
 
Health care napkin 4
Health care napkin 4Health care napkin 4
Health care napkin 4
 
My Speech to Congress (Health Care)
My Speech to Congress (Health Care)My Speech to Congress (Health Care)
My Speech to Congress (Health Care)
 
Auditoría Odontológica
Auditoría OdontológicaAuditoría Odontológica
Auditoría Odontológica
 
Health Care Napkin 1
Health Care Napkin 1Health Care Napkin 1
Health Care Napkin 1
 
Consequences of the Armed Conflict as a Stressor of Climate Change in Colombi...
Consequences of the Armed Conflict as a Stressor of Climate Change in Colombi...Consequences of the Armed Conflict as a Stressor of Climate Change in Colombi...
Consequences of the Armed Conflict as a Stressor of Climate Change in Colombi...
 
Impact of a Collective Action in a Disaster-affected Community to Site a Temp...
Impact of a Collective Action in a Disaster-affected Community to Site a Temp...Impact of a Collective Action in a Disaster-affected Community to Site a Temp...
Impact of a Collective Action in a Disaster-affected Community to Site a Temp...
 
Methods and Compliance Programs for Anti-Corruption, Pascal HANS
Methods and Compliance Programs for Anti-Corruption, Pascal HANSMethods and Compliance Programs for Anti-Corruption, Pascal HANS
Methods and Compliance Programs for Anti-Corruption, Pascal HANS
 
Expected Skills, Required Program Content and Assessment System to Address th...
Expected Skills, Required Program Content and Assessment System to Address th...Expected Skills, Required Program Content and Assessment System to Address th...
Expected Skills, Required Program Content and Assessment System to Address th...
 
Qualitative Risk Assessment for Business Continuity Management in University,...
Qualitative Risk Assessment for Business Continuity Management in University,...Qualitative Risk Assessment for Business Continuity Management in University,...
Qualitative Risk Assessment for Business Continuity Management in University,...
 
Vulnerability Assessment Using Spatial Information in terms of Chemical Relea...
Vulnerability Assessment Using Spatial Information in terms of Chemical Relea...Vulnerability Assessment Using Spatial Information in terms of Chemical Relea...
Vulnerability Assessment Using Spatial Information in terms of Chemical Relea...
 
Rock and Roll's Drawing Fable (A quick lesson on how to draw anything!)
Rock and Roll's Drawing Fable (A quick lesson on how to draw anything!) Rock and Roll's Drawing Fable (A quick lesson on how to draw anything!)
Rock and Roll's Drawing Fable (A quick lesson on how to draw anything!)
 
Titan: Big Graph Data with Cassandra
Titan: Big Graph Data with CassandraTitan: Big Graph Data with Cassandra
Titan: Big Graph Data with Cassandra
 
How to Interview a Data Scientist
How to Interview a Data ScientistHow to Interview a Data Scientist
How to Interview a Data Scientist
 

Similar to Max Shron, Thinking with Data at the NYC Data Science Meetup

Baworld adapting to whats happening
Baworld adapting to whats happeningBaworld adapting to whats happening
Baworld adapting to whats happening
Dave Davis PMP, PgMP, PBA
 
Fundamentals of Data science Introduction Unit 1
Fundamentals of Data science Introduction Unit 1Fundamentals of Data science Introduction Unit 1
Fundamentals of Data science Introduction Unit 1
sasi
 
Pdf analytics-and-witch-doctoring -why-executives-succumb-to-the-black-box-me...
Pdf analytics-and-witch-doctoring -why-executives-succumb-to-the-black-box-me...Pdf analytics-and-witch-doctoring -why-executives-succumb-to-the-black-box-me...
Pdf analytics-and-witch-doctoring -why-executives-succumb-to-the-black-box-me...
OrateTeam
 
Big Data LDN 2017: Preserving The Key Principles Of Academic Research In A Bu...
Big Data LDN 2017: Preserving The Key Principles Of Academic Research In A Bu...Big Data LDN 2017: Preserving The Key Principles Of Academic Research In A Bu...
Big Data LDN 2017: Preserving The Key Principles Of Academic Research In A Bu...
Matt Stubbs
 
The Human Side of Data By Colin Strong
The Human Side of Data By Colin StrongThe Human Side of Data By Colin Strong
The Human Side of Data By Colin Strong
MarTech Conference
 
Data science
Data scienceData science
Data science
Shourya Simha
 
6.6 Family and Youth Program Measurement Simplified
6.6 Family and Youth Program Measurement Simplified6.6 Family and Youth Program Measurement Simplified
6.6 Family and Youth Program Measurement Simplified
National Alliance to End Homelessness
 
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talkNYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
Vivian S. Zhang
 
CSI: Clinical Site Intelligence
CSI: Clinical Site IntelligenceCSI: Clinical Site Intelligence
CSI: Clinical Site Intelligence
goBalto
 
Pp 3
Pp 3Pp 3
Acceptance, accessible, actionable and auditable
Acceptance, accessible, actionable and auditableAcceptance, accessible, actionable and auditable
Acceptance, accessible, actionable and auditable
Alban Gérôme
 
Human computation, crowdsourcing and social: An industrial perspective
Human computation, crowdsourcing and social: An industrial perspectiveHuman computation, crowdsourcing and social: An industrial perspective
Human computation, crowdsourcing and social: An industrial perspective
oralonso
 
The Behavioural Science of Predictions, Forecasting and Decision Making
The Behavioural Science of Predictions, Forecasting and Decision MakingThe Behavioural Science of Predictions, Forecasting and Decision Making
The Behavioural Science of Predictions, Forecasting and Decision Making
Needle Partners
 
Market Sounding Brief: ACT Government Data Management
Market Sounding Brief: ACT Government Data ManagementMarket Sounding Brief: ACT Government Data Management
Market Sounding Brief: ACT Government Data Management
Christopher Norman
 
3 beliefs you need to let go to start you agile journey – Agile EE 2017
3 beliefs you need to let go to start you agile journey – Agile EE 20173 beliefs you need to let go to start you agile journey – Agile EE 2017
3 beliefs you need to let go to start you agile journey – Agile EE 2017
Antti Kirjavainen
 
AI in the Real World: Challenges, and Risks and how to handle them?
AI in the Real World: Challenges, and Risks and how to handle them?AI in the Real World: Challenges, and Risks and how to handle them?
AI in the Real World: Challenges, and Risks and how to handle them?
Srinath Perera
 
Management by data
Management by dataManagement by data
Management by data
Luca Foresti
 
DOES 2016 Sciencing the Crap Out of DevOps
DOES 2016 Sciencing the Crap Out of DevOpsDOES 2016 Sciencing the Crap Out of DevOps
DOES 2016 Sciencing the Crap Out of DevOps
Nicole Forsgren
 

Similar to Max Shron, Thinking with Data at the NYC Data Science Meetup (20)

Baworld adapting to whats happening
Baworld adapting to whats happeningBaworld adapting to whats happening
Baworld adapting to whats happening
 
Fundamentals of Data science Introduction Unit 1
Fundamentals of Data science Introduction Unit 1Fundamentals of Data science Introduction Unit 1
Fundamentals of Data science Introduction Unit 1
 
Pdf analytics-and-witch-doctoring -why-executives-succumb-to-the-black-box-me...
Pdf analytics-and-witch-doctoring -why-executives-succumb-to-the-black-box-me...Pdf analytics-and-witch-doctoring -why-executives-succumb-to-the-black-box-me...
Pdf analytics-and-witch-doctoring -why-executives-succumb-to-the-black-box-me...
 
Data Science-final7
Data Science-final7Data Science-final7
Data Science-final7
 
Big Data LDN 2017: Preserving The Key Principles Of Academic Research In A Bu...
Big Data LDN 2017: Preserving The Key Principles Of Academic Research In A Bu...Big Data LDN 2017: Preserving The Key Principles Of Academic Research In A Bu...
Big Data LDN 2017: Preserving The Key Principles Of Academic Research In A Bu...
 
The Human Side of Data By Colin Strong
The Human Side of Data By Colin StrongThe Human Side of Data By Colin Strong
The Human Side of Data By Colin Strong
 
Data science
Data scienceData science
Data science
 
6.6 Family and Youth Program Measurement Simplified
6.6 Family and Youth Program Measurement Simplified6.6 Family and Youth Program Measurement Simplified
6.6 Family and Youth Program Measurement Simplified
 
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talkNYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
 
CSI: Clinical Site Intelligence
CSI: Clinical Site IntelligenceCSI: Clinical Site Intelligence
CSI: Clinical Site Intelligence
 
Pp 3
Pp 3Pp 3
Pp 3
 
Acceptance, accessible, actionable and auditable
Acceptance, accessible, actionable and auditableAcceptance, accessible, actionable and auditable
Acceptance, accessible, actionable and auditable
 
Human computation, crowdsourcing and social: An industrial perspective
Human computation, crowdsourcing and social: An industrial perspectiveHuman computation, crowdsourcing and social: An industrial perspective
Human computation, crowdsourcing and social: An industrial perspective
 
The Behavioural Science of Predictions, Forecasting and Decision Making
The Behavioural Science of Predictions, Forecasting and Decision MakingThe Behavioural Science of Predictions, Forecasting and Decision Making
The Behavioural Science of Predictions, Forecasting and Decision Making
 
1.11 Data and Performance Simplified
1.11 Data and Performance Simplified1.11 Data and Performance Simplified
1.11 Data and Performance Simplified
 
Market Sounding Brief: ACT Government Data Management
Market Sounding Brief: ACT Government Data ManagementMarket Sounding Brief: ACT Government Data Management
Market Sounding Brief: ACT Government Data Management
 
3 beliefs you need to let go to start you agile journey – Agile EE 2017
3 beliefs you need to let go to start you agile journey – Agile EE 20173 beliefs you need to let go to start you agile journey – Agile EE 2017
3 beliefs you need to let go to start you agile journey – Agile EE 2017
 
AI in the Real World: Challenges, and Risks and how to handle them?
AI in the Real World: Challenges, and Risks and how to handle them?AI in the Real World: Challenges, and Risks and how to handle them?
AI in the Real World: Challenges, and Risks and how to handle them?
 
Management by data
Management by dataManagement by data
Management by data
 
DOES 2016 Sciencing the Crap Out of DevOps
DOES 2016 Sciencing the Crap Out of DevOpsDOES 2016 Sciencing the Crap Out of DevOps
DOES 2016 Sciencing the Crap Out of DevOps
 

More from mortardata

Daeil Kim: Machine Learning at the New York Times
Daeil Kim: Machine Learning at the New York TimesDaeil Kim: Machine Learning at the New York Times
Daeil Kim: Machine Learning at the New York Times
mortardata
 
Jonathan Coveney: Why Pig?
Jonathan Coveney: Why Pig?Jonathan Coveney: Why Pig?
Jonathan Coveney: Why Pig?
mortardata
 
Pig on Spark
Pig on SparkPig on Spark
Pig on Spark
mortardata
 
Data Science at Tumblr
Data Science at TumblrData Science at Tumblr
Data Science at Tumblrmortardata
 
Hadoop, Pig, and Python (PyData NYC 2012)
Hadoop, Pig, and Python (PyData NYC 2012)Hadoop, Pig, and Python (PyData NYC 2012)
Hadoop, Pig, and Python (PyData NYC 2012)
mortardata
 
Mortar: Hadoop-as-a-Service + Open Source Framework | AWS re: Invent public …
Mortar: Hadoop-as-a-Service + Open Source Framework | AWS re: Invent public …Mortar: Hadoop-as-a-Service + Open Source Framework | AWS re: Invent public …
Mortar: Hadoop-as-a-Service + Open Source Framework | AWS re: Invent public …
mortardata
 

More from mortardata (6)

Daeil Kim: Machine Learning at the New York Times
Daeil Kim: Machine Learning at the New York TimesDaeil Kim: Machine Learning at the New York Times
Daeil Kim: Machine Learning at the New York Times
 
Jonathan Coveney: Why Pig?
Jonathan Coveney: Why Pig?Jonathan Coveney: Why Pig?
Jonathan Coveney: Why Pig?
 
Pig on Spark
Pig on SparkPig on Spark
Pig on Spark
 
Data Science at Tumblr
Data Science at TumblrData Science at Tumblr
Data Science at Tumblr
 
Hadoop, Pig, and Python (PyData NYC 2012)
Hadoop, Pig, and Python (PyData NYC 2012)Hadoop, Pig, and Python (PyData NYC 2012)
Hadoop, Pig, and Python (PyData NYC 2012)
 
Mortar: Hadoop-as-a-Service + Open Source Framework | AWS re: Invent public …
Mortar: Hadoop-as-a-Service + Open Source Framework | AWS re: Invent public …Mortar: Hadoop-as-a-Service + Open Source Framework | AWS re: Invent public …
Mortar: Hadoop-as-a-Service + Open Source Framework | AWS re: Invent public …
 

Recently uploaded

GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
Alex Pruden
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
ThomasParaiso2
 

Recently uploaded (20)

GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
 

Max Shron, Thinking with Data at the NYC Data Science Meetup

  • 1. Thinking with Data Max Shron @mshron
  • 2.
  • 3. Thinking with Data Max Shron @mshron
  • 4.
  • 5. Big picture • Data is too much fun, too easy to rabbit-hole • Specialized knowledge is hard to communicate • Not all stats is well-adapted to the real world • We need techniques to handle that
  • 6. Big picture • Design — UX, consulting, etc. • Humanities — philosophy, law, etc. • Social science — sociology, psychology, etc.
  • 7.
  • 8. Scoping • First set of techniques: scoping. • The world gives us vague requests. • We should have things clear before we start, or we end up with uninteresting questions. • Write things down or say out loud.
  • 9. Scoping • Imagine we are working with a company with a subscription business. The CEO asks us for a churn model. • Bad scope: “We will use R to create a logistic regression to predict who will quit using the product.” • Not actionable, irrelevant detail.
  • 10. Scoping • CoNVO: • Context • Need • Vision • Outcome • Iterative process — start simple, refine, refine, refine.
  • 11. Scoping • Context • Who are we working with? What are the big picture, long term goals? • “The company has subscription model. CEO’s goal is to improve profitability.”
  • 12. Scoping • Need • What is the particular knowledge we are missing? • “We want to understand who drops off early enough so that we can intervene.”
  • 13. Scoping • Vision • What would it look like to solve the problem? • “We will build a predictive model using behavioral data to predict who will drop off — early enough to be useful.” • Sources of data: important. Kinds of offers: important. Kind of experimentation: important. Kind of model: unimportant.
  • 14. Scoping • Outcome • Who will be responsible for next steps? How will we know if we are correct? • “The tech team will implement the model in a batch process to run daily, automatically sending out email offers. We will calculate success metrics (precision and recall) on held out users, and send a weekly email of stats to stay on top.” • We need a control group!
  • 15. Scoping • How do we develop a CoNVO? • interviews • kitchen sink interrogation • roleplaying • story-telling • mockups
  • 16. • Clearer vision with mockups John Smith is 36 years old, he has seen 40 different pages over the past two weeks, his and he has a 20% chance to convert. Scoping
  • 17. Scoping • Context We are hired to work in a hospital system with 250K patients over 20 years. Report to CEO, who is interested in building a tool for reducing medical issues. • Need After talking to some doctors, some belief that there is overuse of antibiotics, but hard to detect. • Vision A pilot investigation. If we find signal, repeatable flagging tool. • Outcome CMO will decide if pilot is valuable based on report. Automated tool would be run by CMO on demand.
  • 18.
  • 19. Arguments • Data is not a ray gun! • People need to be convinced, including you. • The world is not deductive logic, we need a theory that includes that people have minds. • Trusting a tool, making a point with a graph, coming to terms on a definition, convincing someone to act differently, etc etc.
  • 20. Arguments • General model is semi-deductive. We move from what is known / agreed-upon and move towards what is not yet known. • Patterns of reasoning help us make stronger cases in less time and effort. Take advantage of two thousand years of research.
  • 21.
  • 22. Arguments • Example: Predicting Δ poverty from satellite data • It takes 5-10 years to get small scale poverty estimates in poor countries. • The vision: predict whether the poverty estimates will go up or down ahead of time, using cheap satellite data. • The outcome: use to informally guide policy decisions, keeping track of interventions.
  • 23. Arguments • Claim - Your audience does not believe it yet but you think you can make a case for it. • “Poverty can be modeled effectively with satellite data.” • Prior knowledge - Things your audience already believes before the case is started.
  • 24. Arguments • Evidence - Where data enters an argument. We transform data into evidence. Counts, models, graphs, etc. make up the evidence. • Justification - The reasoning why the evidence should cause us to believe the claim. • “These graphs indicate that the residuals for our model are as we had anticipated.” • Rebuttal - Any of the reasons why the justification might not hold in this particular case. Usually smart to know.
  • 25.
  • 26. Arguments • Patterns! • Causal analysis • Convincing takes more than math • Categories of dispute
  • 27. Arguments • Disputes of fact — getting the details straight • “The F1 for this model is .7” • Two stock issues: • What is a reasonable truth condition? • Is it satisfied?
  • 28. Arguments • Disputes of definition — relating words to math • “Poverty is defined as FGT, α = 2” • Three stock issues: • Does this definition make a useful distinction? • How consistent is this definition with prior ideas? • What, if any, are the reasonable alternatives?
  • 29. Arguments • Disputes of value — making the right trade-offs • “Our model is simple enough.” • Two stock issues: • How do our goals determine which values are most important? • Have the values been properly applied here?
  • 30. Arguments • Disputes of policy — the right course of action • “We should use this model to informally guide our decisions between official estimates.” • Four stock issues: • Is there a problem? ill • Where is credit or blame due? blame • Will the proposal solve it? cure • Will it be better on balance? cost
  • 31. Summary • Take half the math and tools and twice the listening to what people actually need. • This is the tip of the iceberg. In general, we have a lot to learn from others • Let’s talk! @mshron