Irmak Sirer
@frrmack
irmaksirer.com
The Anatomy
of a Data Science
Project
AGE 7
Oh cool.
Pretty good. Space and stuff.
AGE 14
Omigod Omigod Omigod.
Epic masterpiece is epic!!!!1!
I'm in love with Leia.
AGE 17
WTF?
AGE 30
When you think about it, it's not that good.
Ah, who am I kidding? It's amazing.
I'm still in love with Leia.
I mean... look at her.
What determines
how much I like a movie?
A personal question
on something
I am passionate about
How do I boost my sales?
A business question
Can I identify experts in each division
of my company and bring them
together to collaborate?
Another business question
What do customers out there
think and say
about my products?
Another business question
The Anatomy
of a
Data Science Project
Finding the right questions
Right metrics
Knowing what’s been done
Obsessed
with
Movies
Irmak
Sirer
Start with question, not data
Iterative design process
Moving targets
Find Data
Clean Data
Manage Data
BIG DATA
Machine Learning
Statistics
Applied Math
Open source tools
Python
Pandas, scikit-learn
SQL, Mongo
Javascript, d3, Flask
Hadoop, Spark, Hive, Mahout
Google
Interactive Dashboards
Easy to read graphs
Explaining well, adapting to audience
Ultimate Goal & Product: Insights
What determines
how much I like a movie?
Is my reaction to a
movie / book / song
predictable?
How much will I like
The Book of Eli?
2006
Cinematch
1 billion
user ratings
55,000
movies
Cinematch
I have a soulmate in taste
Irmak Frrmack
Watched the same movies
Gave the exact same ratings
Except The Book of Eli
Frrmack watched The Book of Eli
Oh man, it was…
FANTASTIC!
Predict
No perfect soulmates in real life
Irmak
Almost soulmate 1 Almost soulmate 2
Almost soulmate 4 Almost soulmate 3
87% soulmate 74% soulmate
95% soulmate 82% soulmate
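One way to turn "almost soulmates" into a prediction is a similarity-weighted average of their ratings. The sketch below uses the four similarity figures from the slide as weights. The star ratings for The Book of Eli are made up for illustration, and this is not a description of how Cinematch actually works.

```python
def predict_rating(neighbors):
    """Similarity-weighted average of the neighbors' ratings.

    neighbors: list of (similarity, rating) pairs, similarity in (0, 1].
    """
    total_weight = sum(similarity for similarity, _ in neighbors)
    if total_weight == 0:
        raise ValueError("need at least one neighbor with nonzero similarity")
    weighted_sum = sum(similarity * rating for similarity, rating in neighbors)
    return weighted_sum / total_weight


# Similarities come from the slide; the star ratings are hypothetical.
almost_soulmates = [(0.87, 4.0), (0.74, 3.0), (0.95, 5.0), (0.82, 4.0)]
prediction = predict_rating(almost_soulmates)
print(f"Predicted rating: {prediction:.2f} stars")
```

The closer a neighbor's taste, the more their opinion counts, so the 95% soulmate pulls the prediction hardest.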
Cinematch
Works well for movies that everybody rates
Cinematch
Quite bad with movies that only few people rate
Cinematch
Some movies are especially difficult to predict
Biggest error source: popular but weird
15% of all errors from ONE movie
Trivial: Mean score of everyone
Error: (RMSE) 1.0540 stars
Cinematch
Error: (RMSE) 0.9525 stars
Better rankings  Better recommendations
+ 8.6%  + 1200% people watch top recommendation
9.6%
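For reference, the error metric and the 9.6% figure can be reproduced in a few lines. This is a minimal sketch: the two error values are the ones quoted on the slide, while the toy ratings are invented to show the RMSE calculation itself.

```python
import math


def rmse(predicted, actual):
    """Root mean squared error between two equal-length lists of ratings."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("need two non-empty lists of the same length")
    squared_errors = [(p - a) ** 2 for p, a in zip(predicted, actual)]
    return math.sqrt(sum(squared_errors) / len(actual))


def relative_improvement(baseline_error, new_error):
    """Fraction by which new_error is lower than baseline_error."""
    return (baseline_error - new_error) / baseline_error


# Toy example (hypothetical ratings): always guessing the mean of 3.5 stars.
toy_error = rmse([3.5, 3.5, 3.5, 3.5], [4, 3, 5, 2])

# Error values quoted on the slide.
cinematch_gain = relative_improvement(1.0540, 0.9525)
print(f"Cinematch vs. trivial mean: {cinematch_gain:.1%}")
```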
BigChaos Netflix Prize Repo
Cinematch
Error: 0.9525 stars
Bring it down to:
Error: 0.8563 stars
$1,000,000
for a 10% improvement
2006
BellKor’s Pragmatic Chaos
How did they do it?
Before:
Solid assumptions
You have a certain taste.
Your taste dictates a hidden rating for Book of Eli.
When you watch it, this rating is revealed to you.
WRONG
How did they do it?
After:
Your rating changes with time.
It depends on...
how many you rated that day
your average rating for the day
which movies you rated on this day
shown Netflix prediction
Y. Koren, The BellKor Solution to the Netflix Grand Prize. 2009
Trivial: Mean score of everyone
Error: 1.0540 stars
Cinematch
Error: 0.9525 stars
Your time dependent rating tendencies
Error: 0.9278 stars
without looking at which movies you like/hate!
Y. Koren, The BellKor Solution to the Netflix Grand Prize. 2009
12.0%
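The idea of predicting from rating tendencies alone can be sketched as "global mean plus how far this user ran above or below it on that day". This is a heavily simplified stand-in, not the time-dependent baseline model from Koren's paper, and the sample ratings and names are invented.

```python
from collections import defaultdict


def fit_baseline(ratings):
    """Fit a tendency-only predictor.

    ratings: list of (user, day, stars) tuples.
    Returns predict(user, day), which never looks at the movie itself.
    """
    global_mean = sum(stars for _, _, stars in ratings) / len(ratings)
    user_offsets = defaultdict(list)
    user_day_offsets = defaultdict(list)
    for user, day, stars in ratings:
        user_offsets[user].append(stars - global_mean)
        user_day_offsets[(user, day)].append(stars - global_mean)

    def mean(values):
        return sum(values) / len(values)

    def predict(user, day):
        # Prefer the user's tendency on that specific day, then their
        # overall tendency, then fall back to the global mean.
        if (user, day) in user_day_offsets:
            return global_mean + mean(user_day_offsets[(user, day)])
        if user in user_offsets:
            return global_mean + mean(user_offsets[user])
        return global_mean

    return predict


# Hypothetical training ratings: generous on Tuesday, harsh on Friday.
predict = fit_baseline([
    ("irmak", "tue", 5), ("irmak", "tue", 4),
    ("irmak", "fri", 2), ("other", "tue", 3),
])
print(predict("irmak", "tue"), predict("irmak", "fri"))
```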
What does this suggest?
We cannot compare Book of Eli with all movies we've seen.
We compare it to a limited set.
Liking (real time & remembered) depends on time and
mood.
Other people's opinions affect our own (followers / hipsters)
An experiment
Music Lab: A website for downloading and rating music
M.J. Salganik, P.S. Dodds, D.J. Watts. Science, 311:854-856, 2006
Alternative A:
Other people's ratings invisible
More or less equal ratings
Alternative B:
All ratings visible
Several songs snowball in popularity
It's different songs for each trial
Social influence plays a big part in determining hits and misses
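The snowball effect can be illustrated with a toy simulation. This is a sketch under strong assumptions, not the model or data from the Music Lab study: every song is equally good, and in the "visible" condition each listener picks a song with probability proportional to one plus its current download count. The song and listener counts are arbitrary.

```python
import random


def simulate_downloads(n_songs, n_listeners, social, rng):
    """Return download counts per song after n_listeners choices."""
    downloads = [0] * n_songs
    for _ in range(n_listeners):
        if social:
            # Visible counts: popular songs attract more listeners.
            weights = [1 + count for count in downloads]
        else:
            # Invisible counts: every song is equally likely.
            weights = [1] * n_songs
        choice = rng.choices(range(n_songs), weights=weights)[0]
        downloads[choice] += 1
    return downloads


def top_share(downloads):
    """Share of all downloads captured by the most popular song."""
    return max(downloads) / sum(downloads)


rng = random.Random(0)
independent = simulate_downloads(10, 2000, social=False, rng=rng)
social_a = simulate_downloads(10, 2000, social=True, rng=rng)
social_b = simulate_downloads(10, 2000, social=True, rng=rng)
print(top_share(independent), top_share(social_a), top_share(social_b))
```

Running the social condition more than once typically crowns a different "hit" each time, even though no song is better than any other in this toy setup.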
Problems with rating movies
We cannot compare a movie with all others we've seen.
We compare it to a limited set.
Liking (real time & remembered) depends on time and
mood.
Other people's opinions affect our own.
Degree of liking is
sensitive and vague
Amazing! Total
garbage
Tuesday 3am Sunday 12pm
Liking (real time & remembered) depends on time and
mood.
Other people's opinions affect our own.
Degree of liking is
sensitive and vague
Degree of liking is
sensitive and vague
Dependent on many other
environmental factors
besides our taste
We cannot compare a movie with all others we've seen.
We compare it to a limited set.
Degree of liking is
sensitive and vague
Degree of liking is
sensitive and vague
Difficult to describe
accurately and consistently
with a number
Predicting aside,
can I even reliably rate & rank movies
I’ve seen in terms of enjoyment?
Irmak Frrmack
Well…
Ummm…
I like Star Wars.
What are your
top twenty
movies?
Degree of liking is
sensitive and vague
Can’t we do
something
about this?
“Enjoyment” from a movie is very
high dimensional information
Rating means projecting this onto a single
dimension
?
But sometimes you just want to do the
best projection you can
What is my top twenty?
We cannot compare a movie with all others we've seen.
We compare it to a limited set.
Degree of liking is
sensitive and vague
Trying to rate Star Wars
1. Map enjoyment
to a specific scale
2. Choose corresponding rating
for this degree of liking
But we cannot keep
this entire history of
enjoyment in mind
We fuzzily remember
a small subset
We map based on this subset
BIASED
SAMPLING
Tuesday
Friday
Degree of liking is
sensitive and vague
Can’t we do
something
about this?
We can certainly handle
single comparisons
less vague
little information
I can manually compare it with all others
And find exactly where it belongs
right after Indiana Jones
right before The Princess
Bride
Full ranking: Compare all pairs
That’s a bit
too much effort
for me
1,000,000 comparisons?
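The number of pairs grows quadratically with the number of movies: n items give n(n-1)/2 distinct pairs. A quick check shows that a collection of roughly 1,400 movies already reaches a million comparisons. The collection size used here is an illustration only, since the slide does not state one.

```python
import math


def pairs_needed(n_items):
    """Number of distinct pairs among n_items, i.e. n * (n - 1) / 2."""
    return math.comb(n_items, 2)


# Hypothetical collection sizes, chosen only for illustration.
for n in (20, 100, 1415):
    print(n, "movies ->", pairs_needed(n), "comparisons")
```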
We don’t need all of them
If
,
I have some information about
Compare a random sample of pairs
Use a ranking algorithm that utilizes
all the information
Good idea!
Elo rating system
7.00
“hotness” range
+1.50-1.50
Elo rating system
7.00 8.00
+1.50-1.50 +1.50-1.50
7.12 7.68
Elo rating system
7.00 8.00
+1.50-1.50 +1.50-1.50
36%
to win
64%
to win
Elo rating system
How do we find out what these ranges are?
Elo rating system
Start with the same guess for every contender
5.00 5.00 5.00 5.00 5.00 5.00
Elo rating system
5.00 5.00
?
Elo rating system
5.12 4.88
Update the best guesses accordingly
Elo rating system
5.12 5.00
?
Elo rating system
5.24 4.88
Elo rating system
5.24 5.00
?
Elo rating system
5.14 5.10
We don’t need all comparisons
If
,
I have some information about
Elo rating system
7.61 4.02
?
89%
to win
11%
to win
Elo rating system
7.61
+.02
4.02
-.02
89%
to win
11%
to win
Elo rating system
7.61
-.53
4.02
+.53
89%
to win
11%
to win
Elo rating system
We now have scores on a single scale
(estimates of people’s appreciation levels)
9.07 8.42 6.40 4.88 4.20 3.03
Elo rating system
and a ranking
1 2 3 4 5 6
9.07 8.42 6.40 4.88 4.20 3.03
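The Elo steps above can be sketched in a few lines. Two constants are assumptions chosen to match the slide's numbers, not values stated in the deck: a scale of 4.0 (the usual 400-point Elo scale divided by 100) reproduces the 36%/64% and 11%/89% win chances, and a step size of 0.24 reproduces the 5.00 to 5.12 update. Not every update shown on the slides follows exactly from these constants.

```python
SCALE = 4.0  # assumption: standard 400-point Elo scale divided by 100
K = 0.24     # assumption: step size that turns 5.00 vs 5.00 into 5.12 / 4.88


def win_probability(rating_a, rating_b):
    """Chance that A beats B under the standard Elo logistic curve."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / SCALE))


def update(winner, loser):
    """Return the new (winner, loser) ratings after one comparison."""
    expected = win_probability(winner, loser)
    delta = K * (1.0 - expected)  # small if the win was expected, large if not
    return winner + delta, loser - delta


print(f"{win_probability(8.00, 7.00):.0%}")  # favorite on the 7.00 vs 8.00 slide
print(update(5.00, 5.00))                    # first comparison between equals

# An upset moves the ratings much more than an expected result.
expected_result = update(7.61, 4.02)
upset = update(4.02, 7.61)
```

Sorting the contenders by their final ratings gives the ranking.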
Degree of liking is
sensitive and vague
Can we somehow apply
this to movies, then?
We can do better
Bayesian ranking algorithms
Glicko
(The Elo Killer)
1999
TrueSkill™
2007
Bayesian ranking
4.46 4.01
+- +-
Liking (real time & remembered) depends on time and
mood.
Other people's opinions affect our own.
Degree of liking is
sensitive and vague
Bayesian ranking
4.46 4.01
+- +-
82%
to win
15%
to win
3%
to draw
Bayesian ranking
Elo: Best guess for the center: 4.3
Bayesian: It could be centered around 4.3
Bayesian: It could also be centered around 4.2
Bayesian: or centered around 4.4
Bayesian: Less likely, but even around 4.5
Bayesian ranking
? 4.3
3.5 4 4.5 5
Probability
uncertainty
Few comparisons: Lots of uncertainty
(anything from 2.3 to 4.5 is quite possible)
[Plot: wide probability curve over scores 2.0 to 5]
After many comparisons: Quite sure
(pretty much between 4.11 and 4.18)
[Plot: narrow probability curve over scores 2.0 to 5]
Bayesian ranking
?
Bayesian ranking
Star
Wars
Lord of
the Rings
2.0 3.0 4.0 5.0
How did they do it?
After:
Your rating changes with time.
A small, constant increase
in uncertainty before each
comparison
3.5 4 4.5 5
Probability
uncertainty
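A rating with an uncertainty attached, plus a small increase in that uncertainty before each comparison, can be sketched with the Glicko-1 update rules. Glicko-1 is used here only because it fits in a few lines; the deck lists TrueSkill as the algorithm actually used, and this is not that implementation. The code works on Glicko's usual scale (ratings around 1500, deviations up to 350) rather than the 1 to 5 scale on the slides, and the drift constant `c` is a free parameter picked for illustration.

```python
import math

Q = math.log(10) / 400  # Glicko-1 scale constant
MAX_RD = 350.0          # rating deviation of a completely unknown item


def g(rd):
    """Discount factor: an uncertain opponent tells us less."""
    return 1.0 / math.sqrt(1.0 + 3.0 * Q**2 * rd**2 / math.pi**2)


def expected_score(r, r_opp, rd_opp):
    return 1.0 / (1.0 + 10 ** (-g(rd_opp) * (r - r_opp) / 400.0))


def inflate(rd, c):
    """Add a small, constant amount of uncertainty before a comparison."""
    return min(math.sqrt(rd**2 + c**2), MAX_RD)


def update(r, rd, results):
    """results: list of (opp_rating, opp_rd, score); score 1 = win, 0 = loss."""
    information = 0.0  # equals 1 / d^2 in Glickman's notation
    surprise = 0.0
    for r_opp, rd_opp, score in results:
        e = expected_score(r, r_opp, rd_opp)
        information += Q**2 * g(rd_opp) ** 2 * e * (1 - e)
        surprise += g(rd_opp) * (score - e)
    precision = 1.0 / rd**2 + information
    return r + (Q / precision) * surprise, math.sqrt(1.0 / precision)


# Worked example from Glickman's description of the system.
new_r, new_rd = update(1500, 200, [(1400, 30, 1), (1550, 100, 0), (1700, 300, 0)])
print(round(new_r), round(new_rd, 1))
```

Every comparison shrinks the deviation, while `inflate` keeps it from collapsing to zero, so a taste that drifts over time can still move the rating.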
Degree of liking is
sensitive and vague
Great! We have a system!
I don’t want to
spend too much
time on this
How many is too many?
Minimum Effort
Maximum Information
Not reliable by itself
Still carries a lot of information
1 3 5
I don’t want to
spend too much
time on this
What else can we do?
Minimum Effort
Maximum Information
?
I can calculate the expected amount
of information from a comparison!
Minimum Effort
Maximum Information
Certain about both movies
Won’t learn a lot
Don’t know much about either
Will learn a lot
regardless of outcome
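One way to put a number on "expected amount of information" is the reduction in entropy of each movie's belief after a comparison. The sketch below assumes Gaussian beliefs updated with the Glicko-1 rule, where the shrinkage in uncertainty does not depend on who wins, so the gain can be computed before the comparison is made. This is an illustrative choice, not the calculation used in the author's project, and the movie labels and numbers are hypothetical.

```python
import math
from itertools import combinations

Q = math.log(10) / 400  # Glicko-1 scale constant


def g(rd):
    return 1.0 / math.sqrt(1.0 + 3.0 * Q**2 * rd**2 / math.pi**2)


def entropy_drop(r, rd, r_opp, rd_opp):
    """Entropy reduction (in nats) of one item's Gaussian belief."""
    e = 1.0 / (1.0 + 10 ** (-g(rd_opp) * (r - r_opp) / 400.0))
    precision_before = 1.0 / rd**2
    precision_after = precision_before + Q**2 * g(rd_opp) ** 2 * e * (1 - e)
    return 0.5 * math.log(precision_after / precision_before)


def expected_info_gain(a, b):
    """Total entropy reduction for both items; a and b are (rating, rd)."""
    return entropy_drop(*a, *b) + entropy_drop(*b, *a)


def best_pair(beliefs):
    """Pick the pair of names whose comparison is expected to teach the most."""
    return max(
        combinations(beliefs, 2),
        key=lambda pair: expected_info_gain(beliefs[pair[0]], beliefs[pair[1]]),
    )


# Hypothetical beliefs: A and B are well known, C and D barely compared.
beliefs = {"A": (1500, 50), "B": (1520, 50), "C": (1500, 350), "D": (1480, 350)}
print(best_pair(beliefs))
```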
Python
Trueskill
Django
Javascript
MySQL
movievsmovie.datasco.pe
Irmak Frrmack
What are your
top twenty
movies?
Quantifying human reactions is hard
books
songs
food
politicians
products
celebrities
tv shows
importance of issues
what to spend ‘fun’ budget on
teams in different sports
Degree of liking is
sensitive and vague
Amazing! Total
garbage
Tuesday 3am Sunday 12pm
Quantifying human reactions is hard
Start with a rating,
pose the correct comparisons
Every decision gets us closer
Degree of liking is
sensitive and vague
Amazing! Total
garbage
Tuesday 3am Sunday 12pm
Many comparisons for a movie
over different days
averages out mood and other factors
movievsmovie.datasco.pe
The Anatomy
of a
Data Science Project
Thanks

More Related Content

What's hot

Audience Research - 4018
Audience Research - 4018Audience Research - 4018
Audience Research - 4018haverstockmedia
 
Viewers advisory slideshow (december 1, 2015)
Viewers advisory slideshow (december 1, 2015)Viewers advisory slideshow (december 1, 2015)
Viewers advisory slideshow (december 1, 2015)SAS2012
 
Dolly Questionnaire
Dolly QuestionnaireDolly Questionnaire
Dolly Questionnairejessicak95
 
Viewers advisory slideshow (fall 2016) slideshare
Viewers advisory slideshow (fall 2016) slideshareViewers advisory slideshow (fall 2016) slideshare
Viewers advisory slideshow (fall 2016) slideshareSAS2012
 
Film research task
Film research taskFilm research task
Film research taskBrunoSinger1
 
Viewers' Advisory slideshow (2012)
Viewers' Advisory slideshow (2012)Viewers' Advisory slideshow (2012)
Viewers' Advisory slideshow (2012)SAS2012
 
Elements questionnaire
Elements questionnaireElements questionnaire
Elements questionnaireashhough2207
 
Audience Research - 4018
Audience Research - 4018Audience Research - 4018
Audience Research - 4018haverstockmedia
 
Evalutaion question 1
Evalutaion question 1Evalutaion question 1
Evalutaion question 1Caxie
 
Transformation robot car action figure
Transformation robot car action figureTransformation robot car action figure
Transformation robot car action figureShop For Gamers
 
Criticisms and reviews research
Criticisms and reviews researchCriticisms and reviews research
Criticisms and reviews researchJamescooperabel1
 
Criticisms and reviews research
Criticisms and reviews researchCriticisms and reviews research
Criticisms and reviews researchJamescooperabel1
 

What's hot (20)

Survey results
Survey resultsSurvey results
Survey results
 
Audience profile2
Audience profile2Audience profile2
Audience profile2
 
Presentation1
Presentation1Presentation1
Presentation1
 
Audience Research - 4018
Audience Research - 4018Audience Research - 4018
Audience Research - 4018
 
Viewers advisory slideshow (december 1, 2015)
Viewers advisory slideshow (december 1, 2015)Viewers advisory slideshow (december 1, 2015)
Viewers advisory slideshow (december 1, 2015)
 
Audience profile2
Audience profile2Audience profile2
Audience profile2
 
Audience profile2
Audience profile2Audience profile2
Audience profile2
 
Dolly Questionnaire
Dolly QuestionnaireDolly Questionnaire
Dolly Questionnaire
 
Viewers advisory slideshow (fall 2016) slideshare
Viewers advisory slideshow (fall 2016) slideshareViewers advisory slideshow (fall 2016) slideshare
Viewers advisory slideshow (fall 2016) slideshare
 
Film research task
Film research taskFilm research task
Film research task
 
Viewers' Advisory slideshow (2012)
Viewers' Advisory slideshow (2012)Viewers' Advisory slideshow (2012)
Viewers' Advisory slideshow (2012)
 
Circ le rouge
Circ le rougeCirc le rouge
Circ le rouge
 
Feedback forms
Feedback formsFeedback forms
Feedback forms
 
Elements questionnaire
Elements questionnaireElements questionnaire
Elements questionnaire
 
Audience Research - 4018
Audience Research - 4018Audience Research - 4018
Audience Research - 4018
 
Bbfc presentation
Bbfc presentationBbfc presentation
Bbfc presentation
 
Evalutaion question 1
Evalutaion question 1Evalutaion question 1
Evalutaion question 1
 
Transformation robot car action figure
Transformation robot car action figureTransformation robot car action figure
Transformation robot car action figure
 
Criticisms and reviews research
Criticisms and reviews researchCriticisms and reviews research
Criticisms and reviews research
 
Criticisms and reviews research
Criticisms and reviews researchCriticisms and reviews research
Criticisms and reviews research
 

Viewers also liked

Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSara-Jayne Terp
 
National seminar on emergence of internet of things (io t) trends and challe...
National seminar on emergence of internet of things (io t)  trends and challe...National seminar on emergence of internet of things (io t)  trends and challe...
National seminar on emergence of internet of things (io t) trends and challe...Ajay Ohri
 
Data Science Project Lifecycle
Data Science Project LifecycleData Science Project Lifecycle
Data Science Project LifecycleJason Geng
 
Information processing cycle
Information processing cycleInformation processing cycle
Information processing cycleCriselda Liscano
 
01 Information Processing Cycle
01 Information Processing Cycle01 Information Processing Cycle
01 Information Processing Cycleryan.hackl
 
Introduction of Data Science
Introduction of Data ScienceIntroduction of Data Science
Introduction of Data ScienceJason Geng
 
Tools and techniques for data science
Tools and techniques for data scienceTools and techniques for data science
Tools and techniques for data scienceAjay Ohri
 
Putting the Magic in Data Science
Putting the Magic in Data SciencePutting the Magic in Data Science
Putting the Magic in Data ScienceSean Taylor
 
Introduction on Data Science
Introduction on Data ScienceIntroduction on Data Science
Introduction on Data ScienceEdureka!
 
Data Science lifecycle with Apache Zeppelin and Spark by Moonsoo Lee
Data Science lifecycle with Apache Zeppelin and Spark by Moonsoo LeeData Science lifecycle with Apache Zeppelin and Spark by Moonsoo Lee
Data Science lifecycle with Apache Zeppelin and Spark by Moonsoo LeeSpark Summit
 
Data Processing-Presentation
Data Processing-PresentationData Processing-Presentation
Data Processing-Presentationnibraspk
 
How to Become a Data Scientist
How to Become a Data ScientistHow to Become a Data Scientist
How to Become a Data Scientistryanorban
 
Data Preparation and Processing
Data Preparation and ProcessingData Preparation and Processing
Data Preparation and ProcessingMehul Gondaliya
 
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Data Science London
 

Viewers also liked (17)

Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science project
 
National seminar on emergence of internet of things (io t) trends and challe...
National seminar on emergence of internet of things (io t)  trends and challe...National seminar on emergence of internet of things (io t)  trends and challe...
National seminar on emergence of internet of things (io t) trends and challe...
 
Data Science Project Lifecycle
Data Science Project LifecycleData Science Project Lifecycle
Data Science Project Lifecycle
 
Information processing cycle
Information processing cycleInformation processing cycle
Information processing cycle
 
01 Information Processing Cycle
01 Information Processing Cycle01 Information Processing Cycle
01 Information Processing Cycle
 
Introduction of Data Science
Introduction of Data ScienceIntroduction of Data Science
Introduction of Data Science
 
Tools and techniques for data science
Tools and techniques for data scienceTools and techniques for data science
Tools and techniques for data science
 
Data processing cycle
Data processing cycleData processing cycle
Data processing cycle
 
Putting the Magic in Data Science
Putting the Magic in Data SciencePutting the Magic in Data Science
Putting the Magic in Data Science
 
Introduction on Data Science
Introduction on Data ScienceIntroduction on Data Science
Introduction on Data Science
 
Data Processing
Data ProcessingData Processing
Data Processing
 
Data Science lifecycle with Apache Zeppelin and Spark by Moonsoo Lee
Data Science lifecycle with Apache Zeppelin and Spark by Moonsoo LeeData Science lifecycle with Apache Zeppelin and Spark by Moonsoo Lee
Data Science lifecycle with Apache Zeppelin and Spark by Moonsoo Lee
 
Data Processing-Presentation
Data Processing-PresentationData Processing-Presentation
Data Processing-Presentation
 
How to Become a Data Scientist
How to Become a Data ScientistHow to Become a Data Scientist
How to Become a Data Scientist
 
Data Preparation and Processing
Data Preparation and ProcessingData Preparation and Processing
Data Preparation and Processing
 
Types of Data Processing
Types of Data ProcessingTypes of Data Processing
Types of Data Processing
 
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
 

Similar to The Anatomy of a Data Science Project

Questionnaire analysis
Questionnaire analysisQuestionnaire analysis
Questionnaire analysisalicemedia1
 
Primary Research
Primary ResearchPrimary Research
Primary Researchkatiebabey
 
Target audience questionnaire
Target audience questionnaireTarget audience questionnaire
Target audience questionnairesohaya123
 
Questionnaire results-media-a2
Questionnaire results-media-a2Questionnaire results-media-a2
Questionnaire results-media-a2gethpricee
 
Target Audience Questionnaire Results
Target Audience Questionnaire ResultsTarget Audience Questionnaire Results
Target Audience Questionnaire ResultsDora44
 
10691532 10202048433250846 1279343409_n
10691532 10202048433250846 1279343409_n10691532 10202048433250846 1279343409_n
10691532 10202048433250846 1279343409_nDora44
 
Questionnaire Results
Questionnaire Results Questionnaire Results
Questionnaire Results AlysLeMoignan
 
Horror genre questionnaire
Horror genre questionnaireHorror genre questionnaire
Horror genre questionnaireLillymay12
 
Questionnaire analysis
Questionnaire analysisQuestionnaire analysis
Questionnaire analysismisscarter123
 
Audience research slide
Audience research slideAudience research slide
Audience research slideBeccihammond
 
Evaluation question 3
Evaluation question 3Evaluation question 3
Evaluation question 3Matt leesley
 
Audience feedback 4
Audience feedback 4Audience feedback 4
Audience feedback 4Zayd Hillawi
 
Audience feedback 4
Audience feedback 4Audience feedback 4
Audience feedback 4Zayd Hillawi
 
Audience feedback 4
Audience feedback 4Audience feedback 4
Audience feedback 4Zayd Hillawi
 
Audience feedback 4
Audience feedback 4Audience feedback 4
Audience feedback 4Zayd Hillawi
 

Similar to The Anatomy of a Data Science Project (20)

Questionnaire analysis
Questionnaire analysisQuestionnaire analysis
Questionnaire analysis
 
Primary Research
Primary ResearchPrimary Research
Primary Research
 
Primary Research
Primary ResearchPrimary Research
Primary Research
 
Target audience questionnaire
Target audience questionnaireTarget audience questionnaire
Target audience questionnaire
 
Questionnaire results-media-a2
Questionnaire results-media-a2Questionnaire results-media-a2
Questionnaire results-media-a2
 
Target Audience Questionnaire Results
Target Audience Questionnaire ResultsTarget Audience Questionnaire Results
Target Audience Questionnaire Results
 
10691532 10202048433250846 1279343409_n
10691532 10202048433250846 1279343409_n10691532 10202048433250846 1279343409_n
10691532 10202048433250846 1279343409_n
 
Questionnaire Results
Questionnaire Results Questionnaire Results
Questionnaire Results
 
Primary research
Primary researchPrimary research
Primary research
 
Horror genre questionnaire
Horror genre questionnaireHorror genre questionnaire
Horror genre questionnaire
 
Horror genre questionnaire
Horror genre questionnaireHorror genre questionnaire
Horror genre questionnaire
 
Questionnaire analysis
Questionnaire analysisQuestionnaire analysis
Questionnaire analysis
 
Stats
StatsStats
Stats
 
Audience research slide
Audience research slideAudience research slide
Audience research slide
 
Question 4
Question 4Question 4
Question 4
 
Evaluation question 3
Evaluation question 3Evaluation question 3
Evaluation question 3
 
Audience feedback 4
Audience feedback 4Audience feedback 4
Audience feedback 4
 
Audience feedback 4
Audience feedback 4Audience feedback 4
Audience feedback 4
 
Audience feedback 4
Audience feedback 4Audience feedback 4
Audience feedback 4
 
Audience feedback 4
Audience feedback 4Audience feedback 4
Audience feedback 4
 

Recently uploaded

Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Editor's Notes

  1. Social influence plays a big part in determining hits and misses