SlideShare a Scribd company logo
berlin pydata | @gabegaster | 2015 february
what is data science?
berlin pydata | @gabegaster | 2015 february
what is data science?
berlin pydata | @gabegaster | 2015 february
what is data science?
who is a data scientist?
berlin pydata | @gabegaster | 2015 february
what is data science?
who is a data scientist?
review of literature
berlin pydata | @gabegaster | 2015 february
what is data science?
who is a data scientist?
review of literature
berlin pydata | @gabegaster | 2015 february
what is data science?
review of literature
berlin pydata | @gabegaster | 2015 february
what is data science?
review of literature
berlin pydata | @gabegaster | 2015 february
what is data science?
who is a data scientist?
berlin pydata | @gabegaster | 2015 february
what is data science?
who is a data scientist?
“a scientist who can code”
berlin pydata | @gabegaster | 2015 february
what is data science?
who is a data scientist?
“a scientist who can code”
• lower barrier to attack new problems
berlin pydata | @gabegaster | 2015 february
what is data science?
who is a data scientist?
“a scientist who can code”
• lower barrier to attack new problems
• repeatable analysis
berlin pydata | @gabegaster | 2015 february
what is data science?
who is a data scientist?
“a scientist who can code”
• lower barrier to attack new problems
• repeatable analysis
• freedom to think about problems new ways
berlin pydata | @gabegaster | 2015 february
what is data science?
berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
berlin pydata | @gabegaster | 2015 february
which were difficult to answer before
berlin pydata | @gabegaster | 2015 february
computing has progressed
which were difficult to answer before
berlin pydata | @gabegaster | 2015 february
1950
computing has progressed
berlin pydata | @gabegaster | 2015 february
1950
cost of new
analysis
computing has progressed
berlin pydata | @gabegaster | 2015 february
1950
cost of new
analysis
years
computing has progressed
berlin pydata | @gabegaster | 2015 february
1950
cost of new
analysis
years
today
computing has progressed
berlin pydata | @gabegaster | 2015 february
1950
cost of new
analysis
years
today
v
computing has progressed
berlin pydata | @gabegaster | 2015 february
1950
cost of new
analysis
years
today
hoursv
v
computing has progressed
berlin pydata | @gabegaster | 2015 february
1950
cost of new
analysis
years
today
same person thinking about the problem
can conduct experiments to answer it
hoursv
v
computing has progressed
berlin pydata | @gabegaster | 2015 february
computing has progressed
berlin pydata | @gabegaster | 2015 february
open-source code
computing has progressed
berlin pydata | @gabegaster | 2015 february
open-source code
standing on
shoulders of giants
computing has progressed
berlin pydata | @gabegaster | 2015 february
open-source code
standing on
shoulders of giants
computing has progressed
berlin pydata | @gabegaster | 2015 february
open-source code
standing on
shoulders of giants
computing has progressed
berlin pydata | @gabegaster | 2015 february
open-source code
standing on
shoulders of giants
reinventing the wheel
computing has progressed
berlin pydata | @gabegaster | 2015 february
open-source code
standing on
shoulders of giants
reinventing the wheel
computing has progressed
berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
knowing
what is possible
which were difficult to answer before
berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
HOW
berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
HOW WHY
berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
using
new
good
the right
tools
berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
using
new
good
the right
asking whytools
berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
using
new
good
the right
asking why
tools
berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
using
new
good
the right
asking whytools
berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
using
new
good
the right
asking whytools WHY
berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
using
new
good
the right
asking whytools WHY
WHY
berlin pydata | @gabegaster | 2015 february
why why why
what is data science?
berlin pydata | @gabegaster | 2015 february
why why why
what is data science?
science is about asking why
berlin pydata | @gabegaster | 2015 february
why why why
what is data science?
science is about asking why
start there
berlin pydata | @gabegaster | 2015 february
an anecdote
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
an example
from the real world
berlin pydata | @gabegaster | 2015 february
• e
an example
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
goal: save money
berlin pydata | @gabegaster | 2015 february
goal: save money
berlin pydata | @gabegaster | 2015 february
goal: save money
berlin pydata | @gabegaster | 2015 february
goal: save money
berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
goal: save money
task: find needle in the haystack (without poking yourself)
berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
goal: save money
task: find needle in the haystack (without poking yourself)
berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
goal: save money
task: find needle in the haystack (without poking yourself)
berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
aboutpatent
not
aboutpatent
goal: save money
task: find needle in the haystack (without poking yourself)
berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
aboutpatent
not
aboutpatent
turn over to plaintiff
don’t
turn over to plaintiff
adverse inference
goal: save money
task: find needle in the haystack (without poking yourself)
berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
aboutpatent
not
aboutpatent
turn over to plaintiff
don’t
turn over to plaintiff
adverse inference
give away trade secrets
goal: save money
task: find needle in the haystack (without poking yourself)
berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
aboutpatent
not
aboutpatent
turn over to plaintiff
don’t
turn over to plaintiff
adverse inference
give away trade secrets
goal: save money
task: find needle in the haystack (without poking yourself)
berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
turn over to plaintiff
don’t
turn over to plaintiff
goal: save money
task: find needle in the haystack (without poking yourself)
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
goal: save money
prototype
berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
goal: save money
prototype
berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
goal: save money
prototype — design for lawyers
berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
goal: save money
prototype — design for lawyers
berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
Sexier. Less nerdy. Tailored.
design for transparency
berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
http://www.daegis.com/judicial-acceptance-of-tar/
berlin pydata | @gabegaster | 2015 february
another example
contests
berlin pydata | @gabegaster | 2015 february
another example
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
task:
berlin pydata | @gabegaster | 2015 february
classify schizophrenia w MRItask:
berlin pydata | @gabegaster | 2015 february
why?
classify schizophrenia w MRItask:
berlin pydata | @gabegaster | 2015 february
why?
classify schizophrenia w MRItask:
improve understanding of disease
berlin pydata | @gabegaster | 2015 february
why?
classify schizophrenia w MRItask:
improve understanding of disease
how?
berlin pydata | @gabegaster | 2015 february
why?
classify schizophrenia w MRItask:
improve understanding of disease
how? … outside contest purview
berlin pydata | @gabegaster | 2015 february
why? outside contest purview
berlin pydata | @gabegaster | 2015 february
why? outside contest purview
berlin pydata | @gabegaster | 2015 february
why? outside contest purview
kaggle
berlin pydata | @gabegaster | 2015 february
why? outside contest purview
kaggle
getting data
&
making usable
berlin pydata | @gabegaster | 2015 february
why? outside contest purview
kaggle
getting data
&
making usable
WHY
berlin pydata | @gabegaster | 2015 february
timeline of contest
Accuracy of Classification
berlin pydata | @gabegaster | 2015 february
timeline of contest
AUC
Accuracy of Classification
berlin pydata | @gabegaster | 2015 february
what is AUC?
AUC
berlin pydata | @gabegaster | 2015 february
AUC
what is AUC? Area Under Curve
berlin pydata | @gabegaster | 2015 february
AUC
what is AUC? Area Under Curve
what curve?
berlin pydata | @gabegaster | 2015 february
AUC
what is AUC? Area Under Curve
what curve? Receiver Operating
Characteristic
berlin pydata | @gabegaster | 2015 february
AUC
what is AUC? Area Under Curve
what curve? Receiver Operating
Characteristic
berlin pydata | @gabegaster | 2015 february
AUC
what is AUC? Area Under Curve
what curve? Receiver Operating
Characteristic
berlin pydata | @gabegaster | 2015 february
balances:
AUC
what is AUC? Area Under Curve
what curve? Receiver Operating
Characteristic
berlin pydata | @gabegaster | 2015 february
balances:
True Positive Rate
False Positive Rate
AUC
what is AUC? Area Under Curve
what curve? Receiver Operating
Characteristic
berlin pydata | @gabegaster | 2015 february
balances:
True Positive Rate
False Positive Rate
AUC
what is AUC? Area Under Curve
what curve? Receiver Operating
Characteristic
berlin pydata | @gabegaster | 2015 february
AUC
what is AUC?
balances:
True Positive Rate
False Positive Rate
Area Under Curve
what curve? Receiver Operating
Characteristic
berlin pydata | @gabegaster | 2015 february
why?
AUC
what is AUC?
balances:
True Positive Rate
False Positive Rate
Area Under Curve
what curve? Receiver Operating
Characteristic
berlin pydata | @gabegaster | 2015 february
why?
…
AUC
what is AUC?
balances:
True Positive Rate
False Positive Rate
Area Under Curve
what curve? Receiver Operating
Characteristic
berlin pydata | @gabegaster | 2015 february
why?
…
upshot:
AUC
what is AUC?
balances:
True Positive Rate
False Positive Rate
Area Under Curve
what curve? Receiver Operating
Characteristic
berlin pydata | @gabegaster | 2015 february
why?
…
choice of metric matters a LOT
upshot:
in practice
AUC
what is AUC?
balances:
True Positive Rate
False Positive Rate
Area Under Curve
what curve? Receiver Operating
Characteristic
berlin pydata | @gabegaster | 2015 february
timeline of contest
Accuracy of Classification
AUC
berlin pydata | @gabegaster | 2015 february
timeline of contest
Accuracy of Classification
AUC
random guess
berlin pydata | @gabegaster | 2015 february
timeline of contest
Accuracy of Classification
AUC
random guess
basic SVM
berlin pydata | @gabegaster | 2015 february
timeline of contest
goal?
Accuracy of Classification
AUC
random guess
basic SVM
berlin pydata | @gabegaster | 2015 february
timeline of contest
goal: depends on why
Accuracy of Classification
AUC
random guess
basic SVM
berlin pydata | @gabegaster | 2015 february
random guess
basic SVM
timeline of contest
Accuracy of Classification
AUC
berlin pydata | @gabegaster | 2015 february
me
timeline of contest
Accuracy of Classification
AUC
berlin pydata | @gabegaster | 2015 february
me
timeline of contest
Accuracy of Classification
AUC
turned out to place 9th — because overfitting
berlin pydata | @gabegaster | 2015 february
me
timeline of contest
Accuracy of Classification
AUC
turned out to place 9th — because overfitting
very common problem
berlin pydata | @gabegaster | 2015 february
timeline of contest
Accuracy of Classification
worth it?
AUC
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
an example
just for fun
berlin pydata | @gabegaster | 2015 february
Chicago Bike Share System
!
!
kind of like call-a-bike
berlin pydata | @gabegaster | 2015 february
Show what I like about Bike share
!
Chicago Bike Share System
!
!
kind of like call-a-bike
berlin pydata | @gabegaster | 2015 february
Show what I like about Bike share
!
Think about how bike share has changed geography
Chicago Bike Share System
!
!
kind of like call-a-bike
berlin pydata | @gabegaster | 2015 february
a typical trip for me
berlin pydata | @gabegaster | 2015 february
Bus transit
times
=
a LIE
berlin pydata | @gabegaster | 2015 february
Chicago is a grid city
berlin pydata | @gabegaster | 2015 february
Difficult
Public
Transit on
the grid
=+
Diagonals
berlin pydata | @gabegaster | 2015 february
Difficult
Public
Transit on
the grid
=+
Diagonals
2+ buses = FAIL
berlin pydata | @gabegaster | 2015 february
Adding bikes to
public transit
=
win
berlin pydata | @gabegaster | 2015 february
show how has divvy
changed where people
can go
viz Goal:
berlin pydata | @gabegaster | 2015 february
show how has divvy
changed where people
can go
show where people
actually go
viz Goal:
berlin pydata | @gabegaster | 2015 february
demo
berlin pydata | @gabegaster | 2015 february
in conclusion
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
berlin pydata | @gabegaster | 2015 february
thanks!
@gabegaster

More Related Content

Viewers also liked

Korea
Korea Korea
Korea
tommyhulsen
 
What goes into making a Data Visualization?
What goes into making a Data Visualization?What goes into making a Data Visualization?
What goes into making a Data Visualization?
Gabriel Gaster
 
слайды к первому модулю
слайды к первому модулюслайды к первому модулю
слайды к первому модулю
Tatyana Gavrinyuk
 
Trabajo individual con quandary yoo
Trabajo individual con quandary   yooTrabajo individual con quandary   yoo
Trabajo individual con quandary yoo
chuyonyoo
 
Describing people (1)
Describing people (1)Describing people (1)
Describing people (1)
Gabi Camacho
 
Aerospace Engineering -Hannah Petty
Aerospace Engineering  -Hannah PettyAerospace Engineering  -Hannah Petty
Aerospace Engineering -Hannah Petty
Jackie Settles
 
Music Therapy Notes / More Reflecting, Less Wading
Music Therapy Notes / More Reflecting, Less WadingMusic Therapy Notes / More Reflecting, Less Wading
Music Therapy Notes / More Reflecting, Less Wading
rachelmsmith
 
Hul 101128100726-phpapp01
Hul 101128100726-phpapp01Hul 101128100726-phpapp01
Hul 101128100726-phpapp01
Jitender Kumar
 
Ruby data types and objects
Ruby   data types and objectsRuby   data types and objects
Ruby data types and objects
Harkamal Singh
 
13 Trends and Challenges with School Wireless Networks
13 Trends and Challenges with School Wireless Networks13 Trends and Challenges with School Wireless Networks
13 Trends and Challenges with School Wireless Networks
Corey Anderson
 
Data analysis
Data analysisData analysis
Data analysis
metalkid132
 
Viva201393(1).pptxbaru
Viva201393(1).pptxbaruViva201393(1).pptxbaru
Viva201393(1).pptxbaru
Nor'Aida Khairuddin
 
Máy công cụ tnut
Máy công cụ tnutMáy công cụ tnut
Máy công cụ tnut
chetaomaytnut
 
美容企業經營管理企劃書
美容企業經營管理企劃書美容企業經營管理企劃書
美容企業經營管理企劃書Sha Sha Liu
 
EMS Carbon Monoxide and CO-Ox monitoring Co Rad 57 Cole
EMS Carbon Monoxide and CO-Ox monitoring Co Rad 57 ColeEMS Carbon Monoxide and CO-Ox monitoring Co Rad 57 Cole
EMS Carbon Monoxide and CO-Ox monitoring Co Rad 57 Cole
Robert Cole
 
Advanced practice preparation pharmacodynamics[1]
Advanced practice preparation pharmacodynamics[1]Advanced practice preparation pharmacodynamics[1]
Advanced practice preparation pharmacodynamics[1]
University of Miami
 
Cadena Alimenticia
Cadena AlimenticiaCadena Alimenticia
Cadena Alimenticia
Gabi Camacho
 
Receiver Operating Characteristic (ROC) curve analysis. 19.12
Receiver Operating Characteristic (ROC) curve analysis. 19.12Receiver Operating Characteristic (ROC) curve analysis. 19.12
Receiver Operating Characteristic (ROC) curve analysis. 19.12
Kenisha S Russell Jonsson
 
Central gas equipment for industrial gases (2011 edition in english) uk619 10...
Central gas equipment for industrial gases (2011 edition in english) uk619 10...Central gas equipment for industrial gases (2011 edition in english) uk619 10...
Central gas equipment for industrial gases (2011 edition in english) uk619 10...
PRITAM JADHAV
 

Viewers also liked (20)

Korea
Korea Korea
Korea
 
What goes into making a Data Visualization?
What goes into making a Data Visualization?What goes into making a Data Visualization?
What goes into making a Data Visualization?
 
слайды к первому модулю
слайды к первому модулюслайды к первому модулю
слайды к первому модулю
 
Trabajo individual con quandary yoo
Trabajo individual con quandary   yooTrabajo individual con quandary   yoo
Trabajo individual con quandary yoo
 
Describing people (1)
Describing people (1)Describing people (1)
Describing people (1)
 
Cap 4, novena edc
Cap 4, novena edcCap 4, novena edc
Cap 4, novena edc
 
Aerospace Engineering -Hannah Petty
Aerospace Engineering  -Hannah PettyAerospace Engineering  -Hannah Petty
Aerospace Engineering -Hannah Petty
 
Music Therapy Notes / More Reflecting, Less Wading
Music Therapy Notes / More Reflecting, Less WadingMusic Therapy Notes / More Reflecting, Less Wading
Music Therapy Notes / More Reflecting, Less Wading
 
Hul 101128100726-phpapp01
Hul 101128100726-phpapp01Hul 101128100726-phpapp01
Hul 101128100726-phpapp01
 
Ruby data types and objects
Ruby   data types and objectsRuby   data types and objects
Ruby data types and objects
 
13 Trends and Challenges with School Wireless Networks
13 Trends and Challenges with School Wireless Networks13 Trends and Challenges with School Wireless Networks
13 Trends and Challenges with School Wireless Networks
 
Data analysis
Data analysisData analysis
Data analysis
 
Viva201393(1).pptxbaru
Viva201393(1).pptxbaruViva201393(1).pptxbaru
Viva201393(1).pptxbaru
 
Máy công cụ tnut
Máy công cụ tnutMáy công cụ tnut
Máy công cụ tnut
 
美容企業經營管理企劃書
美容企業經營管理企劃書美容企業經營管理企劃書
美容企業經營管理企劃書
 
EMS Carbon Monoxide and CO-Ox monitoring Co Rad 57 Cole
EMS Carbon Monoxide and CO-Ox monitoring Co Rad 57 ColeEMS Carbon Monoxide and CO-Ox monitoring Co Rad 57 Cole
EMS Carbon Monoxide and CO-Ox monitoring Co Rad 57 Cole
 
Advanced practice preparation pharmacodynamics[1]
Advanced practice preparation pharmacodynamics[1]Advanced practice preparation pharmacodynamics[1]
Advanced practice preparation pharmacodynamics[1]
 
Cadena Alimenticia
Cadena AlimenticiaCadena Alimenticia
Cadena Alimenticia
 
Receiver Operating Characteristic (ROC) curve analysis. 19.12
Receiver Operating Characteristic (ROC) curve analysis. 19.12Receiver Operating Characteristic (ROC) curve analysis. 19.12
Receiver Operating Characteristic (ROC) curve analysis. 19.12
 
Central gas equipment for industrial gases (2011 edition in english) uk619 10...
Central gas equipment for industrial gases (2011 edition in english) uk619 10...Central gas equipment for industrial gases (2011 edition in english) uk619 10...
Central gas equipment for industrial gases (2011 edition in english) uk619 10...
 

Similar to Why Why Why

Social media trends and audiences: March 2105
Social media trends  and audiences: March 2105Social media trends  and audiences: March 2105
Social media trends and audiences: March 2105
Bob Crawshaw
 
Embrace the future gaelle bertrand
Embrace the future gaelle bertrandEmbrace the future gaelle bertrand
Embrace the future gaelle bertrand
Ray Poynter
 
Listening on the Social Web: How To Make Sense of all the Noise
Listening on the Social Web: How To Make Sense of all the NoiseListening on the Social Web: How To Make Sense of all the Noise
Listening on the Social Web: How To Make Sense of all the Noise
EDIT. - Disruptive Digital Education
 
Winning with analytics - Data Innovation Summit 2015 - Made in Belgium
Winning with analytics - Data Innovation Summit 2015 - Made in BelgiumWinning with analytics - Data Innovation Summit 2015 - Made in Belgium
Winning with analytics - Data Innovation Summit 2015 - Made in Belgium
Python Predictions
 
Top 2015 B2B Marketing Trends to Consider
Top 2015 B2B Marketing Trends to ConsiderTop 2015 B2B Marketing Trends to Consider
Top 2015 B2B Marketing Trends to Consider
circle S studio
 
"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)
"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)
"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)
Tech in Asia ID
 
Social Media for Non Profits DTCC Dover Oct 2015
Social Media for Non Profits DTCC Dover Oct 2015Social Media for Non Profits DTCC Dover Oct 2015
Social Media for Non Profits DTCC Dover Oct 2015
Lisa Flowers
 
Content Strategy Meetup: Experience, Traction, Growth
Content Strategy Meetup: Experience, Traction, GrowthContent Strategy Meetup: Experience, Traction, Growth
Content Strategy Meetup: Experience, Traction, Growth
Vinish Garg
 
nVest
nVestnVest
nVest
incubes_to
 
Data-driven Growth - Analytics & Attribution for Marketers in 2016 | Turing F...
Data-driven Growth - Analytics & Attribution for Marketers in 2016 | Turing F...Data-driven Growth - Analytics & Attribution for Marketers in 2016 | Turing F...
Data-driven Growth - Analytics & Attribution for Marketers in 2016 | Turing F...
Andy Young
 
Andy Young — Data-Driven Growth: Analytics Tools and Tips for Marketers in 20...
Andy Young — Data-Driven Growth: Analytics Tools and Tips for Marketers in 20...Andy Young — Data-Driven Growth: Analytics Tools and Tips for Marketers in 20...
Andy Young — Data-Driven Growth: Analytics Tools and Tips for Marketers in 20...
Turing Fest
 
Is your content working better for someone else? @jonearnshaw
Is your content working better for someone else? @jonearnshawIs your content working better for someone else? @jonearnshaw
Is your content working better for someone else? @jonearnshaw
Jon Earnshaw
 
(Big)Data_just use it 30 sep2015_Frank_Vullers
(Big)Data_just use it 30 sep2015_Frank_Vullers(Big)Data_just use it 30 sep2015_Frank_Vullers
(Big)Data_just use it 30 sep2015_Frank_Vullers
Frank Vullers
 
Big Data Expo 2015 - Teradata Big Data : Just use it!
Big Data Expo 2015 - Teradata Big Data : Just use it!Big Data Expo 2015 - Teradata Big Data : Just use it!
Big Data Expo 2015 - Teradata Big Data : Just use it!
BigDataExpo
 
Content Marketing Meet-up - What is content marketing it, Should it do it, Wh...
Content Marketing Meet-up - What is content marketing it, Should it do it, Wh...Content Marketing Meet-up - What is content marketing it, Should it do it, Wh...
Content Marketing Meet-up - What is content marketing it, Should it do it, Wh...
Martijn Burgman
 
Liverpool HubSpot User Group - June 2017
Liverpool HubSpot User Group - June 2017Liverpool HubSpot User Group - June 2017
Liverpool HubSpot User Group - June 2017
Stewart Bennett
 
Creating A 2016 Content Strategy That Works
Creating A 2016 Content Strategy That WorksCreating A 2016 Content Strategy That Works
Creating A 2016 Content Strategy That Works
G3 Communications
 
SimilarWeb - NOAH15 Berlin
SimilarWeb -  NOAH15 BerlinSimilarWeb -  NOAH15 Berlin
SimilarWeb - NOAH15 Berlin
NOAH Advisors
 
pixelcamp
pixelcamppixelcamp
pixelcamp
Isabel Portugal
 
What Does it Mean to Be Helpful? with Scott Abel, The Content Wrangler
What Does it Mean to Be Helpful? with Scott Abel, The Content WranglerWhat Does it Mean to Be Helpful? with Scott Abel, The Content Wrangler
What Does it Mean to Be Helpful? with Scott Abel, The Content Wrangler
Information Development World
 

Similar to Why Why Why (20)

Social media trends and audiences: March 2105
Social media trends  and audiences: March 2105Social media trends  and audiences: March 2105
Social media trends and audiences: March 2105
 
Embrace the future gaelle bertrand
Embrace the future gaelle bertrandEmbrace the future gaelle bertrand
Embrace the future gaelle bertrand
 
Listening on the Social Web: How To Make Sense of all the Noise
Listening on the Social Web: How To Make Sense of all the NoiseListening on the Social Web: How To Make Sense of all the Noise
Listening on the Social Web: How To Make Sense of all the Noise
 
Winning with analytics - Data Innovation Summit 2015 - Made in Belgium
Winning with analytics - Data Innovation Summit 2015 - Made in BelgiumWinning with analytics - Data Innovation Summit 2015 - Made in Belgium
Winning with analytics - Data Innovation Summit 2015 - Made in Belgium
 
Top 2015 B2B Marketing Trends to Consider
Top 2015 B2B Marketing Trends to ConsiderTop 2015 B2B Marketing Trends to Consider
Top 2015 B2B Marketing Trends to Consider
 
"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)
"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)
"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)
 
Social Media for Non Profits DTCC Dover Oct 2015
Social Media for Non Profits DTCC Dover Oct 2015Social Media for Non Profits DTCC Dover Oct 2015
Social Media for Non Profits DTCC Dover Oct 2015
 
Content Strategy Meetup: Experience, Traction, Growth
Content Strategy Meetup: Experience, Traction, GrowthContent Strategy Meetup: Experience, Traction, Growth
Content Strategy Meetup: Experience, Traction, Growth
 
nVest
nVestnVest
nVest
 
Data-driven Growth - Analytics & Attribution for Marketers in 2016 | Turing F...
Data-driven Growth - Analytics & Attribution for Marketers in 2016 | Turing F...Data-driven Growth - Analytics & Attribution for Marketers in 2016 | Turing F...
Data-driven Growth - Analytics & Attribution for Marketers in 2016 | Turing F...
 
Andy Young — Data-Driven Growth: Analytics Tools and Tips for Marketers in 20...
Andy Young — Data-Driven Growth: Analytics Tools and Tips for Marketers in 20...Andy Young — Data-Driven Growth: Analytics Tools and Tips for Marketers in 20...
Andy Young — Data-Driven Growth: Analytics Tools and Tips for Marketers in 20...
 
Is your content working better for someone else? @jonearnshaw
Is your content working better for someone else? @jonearnshawIs your content working better for someone else? @jonearnshaw
Is your content working better for someone else? @jonearnshaw
 
(Big)Data_just use it 30 sep2015_Frank_Vullers
(Big)Data_just use it 30 sep2015_Frank_Vullers(Big)Data_just use it 30 sep2015_Frank_Vullers
(Big)Data_just use it 30 sep2015_Frank_Vullers
 
Big Data Expo 2015 - Teradata Big Data : Just use it!
Big Data Expo 2015 - Teradata Big Data : Just use it!Big Data Expo 2015 - Teradata Big Data : Just use it!
Big Data Expo 2015 - Teradata Big Data : Just use it!
 
Content Marketing Meet-up - What is content marketing it, Should it do it, Wh...
Content Marketing Meet-up - What is content marketing it, Should it do it, Wh...Content Marketing Meet-up - What is content marketing it, Should it do it, Wh...
Content Marketing Meet-up - What is content marketing it, Should it do it, Wh...
 
Liverpool HubSpot User Group - June 2017
Liverpool HubSpot User Group - June 2017Liverpool HubSpot User Group - June 2017
Liverpool HubSpot User Group - June 2017
 
Creating A 2016 Content Strategy That Works
Creating A 2016 Content Strategy That WorksCreating A 2016 Content Strategy That Works
Creating A 2016 Content Strategy That Works
 
SimilarWeb - NOAH15 Berlin
SimilarWeb -  NOAH15 BerlinSimilarWeb -  NOAH15 Berlin
SimilarWeb - NOAH15 Berlin
 
pixelcamp
pixelcamppixelcamp
pixelcamp
 
What Does it Mean to Be Helpful? with Scott Abel, The Content Wrangler
What Does it Mean to Be Helpful? with Scott Abel, The Content WranglerWhat Does it Mean to Be Helpful? with Scott Abel, The Content Wrangler
What Does it Mean to Be Helpful? with Scott Abel, The Content Wrangler
 

Recently uploaded

一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
1tyxnjpia
 
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
taqyea
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
Vietnam Cotton & Spinning Association
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
slg6lamcq
 
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
asyed10
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
nuttdpt
 
UofT毕业证如何办理
UofT毕业证如何办理UofT毕业证如何办理
UofT毕业证如何办理
exukyp
 
Template xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptxTemplate xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptx
TeukuEriSyahputra
 
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Kaxil Naik
 
writing report business partner b1+ .pdf
writing report business partner b1+ .pdfwriting report business partner b1+ .pdf
writing report business partner b1+ .pdf
VyNguyen709676
 
How To Control IO Usage using Resource Manager
How To Control IO Usage using Resource ManagerHow To Control IO Usage using Resource Manager
How To Control IO Usage using Resource Manager
Alireza Kamrani
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
Cell The Unit of Life for NEET Multiple Choice Questions.docx
Cell The Unit of Life for NEET Multiple Choice Questions.docxCell The Unit of Life for NEET Multiple Choice Questions.docx
Cell The Unit of Life for NEET Multiple Choice Questions.docx
vasanthatpuram
 
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
nyvan3
 
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
hyfjgavov
 
Sample Devops SRE Product Companies .pdf
Sample Devops SRE  Product Companies .pdfSample Devops SRE  Product Companies .pdf
Sample Devops SRE Product Companies .pdf
Vineet
 
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
ywqeos
 
一比一原版格里菲斯大学毕业证(Griffith毕业证书)学历如何办理
一比一原版格里菲斯大学毕业证(Griffith毕业证书)学历如何办理一比一原版格里菲斯大学毕业证(Griffith毕业证书)学历如何办理
一比一原版格里菲斯大学毕业证(Griffith毕业证书)学历如何办理
lzdvtmy8
 
Build applications with generative AI on Google Cloud
Build applications with generative AI on Google CloudBuild applications with generative AI on Google Cloud
Build applications with generative AI on Google Cloud
Márton Kodok
 

Recently uploaded (20)

一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
 
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
 
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
 
UofT毕业证如何办理
UofT毕业证如何办理UofT毕业证如何办理
UofT毕业证如何办理
 
Template xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptxTemplate xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptx
 
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
 
writing report business partner b1+ .pdf
writing report business partner b1+ .pdfwriting report business partner b1+ .pdf
writing report business partner b1+ .pdf
 
How To Control IO Usage using Resource Manager
How To Control IO Usage using Resource ManagerHow To Control IO Usage using Resource Manager
How To Control IO Usage using Resource Manager
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
Cell The Unit of Life for NEET Multiple Choice Questions.docx
Cell The Unit of Life for NEET Multiple Choice Questions.docxCell The Unit of Life for NEET Multiple Choice Questions.docx
Cell The Unit of Life for NEET Multiple Choice Questions.docx
 
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
 
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
 
Sample Devops SRE Product Companies .pdf
Sample Devops SRE  Product Companies .pdfSample Devops SRE  Product Companies .pdf
Sample Devops SRE Product Companies .pdf
 
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
 
一比一原版格里菲斯大学毕业证(Griffith毕业证书)学历如何办理
一比一原版格里菲斯大学毕业证(Griffith毕业证书)学历如何办理一比一原版格里菲斯大学毕业证(Griffith毕业证书)学历如何办理
一比一原版格里菲斯大学毕业证(Griffith毕业证书)学历如何办理
 
Build applications with generative AI on Google Cloud
Build applications with generative AI on Google CloudBuild applications with generative AI on Google Cloud
Build applications with generative AI on Google Cloud
 

Why Why Why

  • 1. berlin pydata | @gabegaster | 2015 february what is data science?
  • 2. berlin pydata | @gabegaster | 2015 february what is data science?
  • 3. berlin pydata | @gabegaster | 2015 february what is data science? who is a data scientist?
  • 4. berlin pydata | @gabegaster | 2015 february what is data science? who is a data scientist? review of literature
  • 5. berlin pydata | @gabegaster | 2015 february what is data science? who is a data scientist? review of literature
  • 6. berlin pydata | @gabegaster | 2015 february what is data science? review of literature
  • 7. berlin pydata | @gabegaster | 2015 february what is data science? review of literature
  • 8. berlin pydata | @gabegaster | 2015 february what is data science? who is a data scientist?
  • 9. berlin pydata | @gabegaster | 2015 february what is data science? who is a data scientist? “a scientist who can code”
  • 10. berlin pydata | @gabegaster | 2015 february what is data science? who is a data scientist? “a scientist who can code” • lower barrier to attack new problems
  • 11. berlin pydata | @gabegaster | 2015 february what is data science? who is a data scientist? “a scientist who can code” • lower barrier to attack new problems • repeatable analysis
  • 12. berlin pydata | @gabegaster | 2015 february what is data science? who is a data scientist? “a scientist who can code” • lower barrier to attack new problems • repeatable analysis • freedom to think about problems new ways
  • 13. berlin pydata | @gabegaster | 2015 february what is data science?
  • 14. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically
  • 15. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before
  • 16. berlin pydata | @gabegaster | 2015 february which were difficult to answer before
  • 17. berlin pydata | @gabegaster | 2015 february computing has progressed which were difficult to answer before
  • 18. berlin pydata | @gabegaster | 2015 february 1950 computing has progressed
  • 19. berlin pydata | @gabegaster | 2015 february 1950 cost of new analysis computing has progressed
  • 20. berlin pydata | @gabegaster | 2015 february 1950 cost of new analysis years computing has progressed
  • 21. berlin pydata | @gabegaster | 2015 february 1950 cost of new analysis years today computing has progressed
  • 22. berlin pydata | @gabegaster | 2015 february 1950 cost of new analysis years today v computing has progressed
  • 23. berlin pydata | @gabegaster | 2015 february 1950 cost of new analysis years today hoursv v computing has progressed
  • 24. berlin pydata | @gabegaster | 2015 february 1950 cost of new analysis years today same person thinking about the problem can conduct experiments to answer it hoursv v computing has progressed
  • 25. berlin pydata | @gabegaster | 2015 february computing has progressed
  • 26. berlin pydata | @gabegaster | 2015 february open-source code computing has progressed
  • 27. berlin pydata | @gabegaster | 2015 february open-source code standing on shoulders of giants computing has progressed
  • 28. berlin pydata | @gabegaster | 2015 february open-source code standing on shoulders of giants computing has progressed
  • 29. berlin pydata | @gabegaster | 2015 february open-source code standing on shoulders of giants computing has progressed
  • 30. berlin pydata | @gabegaster | 2015 february open-source code standing on shoulders of giants reinventing the wheel computing has progressed
  • 31. berlin pydata | @gabegaster | 2015 february open-source code standing on shoulders of giants reinventing the wheel computing has progressed
  • 32. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before
  • 33. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically knowing what is possible which were difficult to answer before
  • 34. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful
  • 35. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful HOW
  • 36. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful HOW WHY
  • 37. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful
  • 38. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful using new good the right tools
  • 39. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful using new good the right asking whytools
  • 40. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful using new good the right asking why tools
  • 41. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful using new good the right asking whytools
  • 42. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful using new good the right asking whytools WHY
  • 43. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful using new good the right asking whytools WHY WHY
  • 44. berlin pydata | @gabegaster | 2015 february why why why what is data science?
  • 45. berlin pydata | @gabegaster | 2015 february why why why what is data science? science is about asking why
  • 46. berlin pydata | @gabegaster | 2015 february why why why what is data science? science is about asking why start there
  • 47. berlin pydata | @gabegaster | 2015 february an anecdote
  • 48. berlin pydata | @gabegaster | 2015 february
  • 49. berlin pydata | @gabegaster | 2015 february
  • 50. berlin pydata | @gabegaster | 2015 february
  • 51. berlin pydata | @gabegaster | 2015 february an example from the real world
  • 52. berlin pydata | @gabegaster | 2015 february • e an example
  • 53. berlin pydata | @gabegaster | 2015 february
  • 54. berlin pydata | @gabegaster | 2015 february
  • 55. berlin pydata | @gabegaster | 2015 february
  • 56. berlin pydata | @gabegaster | 2015 february goal: save money
  • 57. berlin pydata | @gabegaster | 2015 february goal: save money
  • 58. berlin pydata | @gabegaster | 2015 february goal: save money
  • 59. berlin pydata | @gabegaster | 2015 february goal: save money
  • 60. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february goal: save money task: find needle in the haystack (without poking yourself)
  • 61. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february goal: save money task: find needle in the haystack (without poking yourself)
  • 62. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february goal: save money task: find needle in the haystack (without poking yourself)
  • 63. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february aboutpatent not aboutpatent goal: save money task: find needle in the haystack (without poking yourself)
  • 64. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february aboutpatent not aboutpatent turn over to plaintiff don’t turn over to plaintiff adverse inference goal: save money task: find needle in the haystack (without poking yourself)
  • 65. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february aboutpatent not aboutpatent turn over to plaintiff don’t turn over to plaintiff adverse inference give away trade secrets goal: save money task: find needle in the haystack (without poking yourself)
  • 66. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february aboutpatent not aboutpatent turn over to plaintiff don’t turn over to plaintiff adverse inference give away trade secrets goal: save money task: find needle in the haystack (without poking yourself)
  • 67. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february turn over to plaintiff don’t turn over to plaintiff goal: save money task: find needle in the haystack (without poking yourself)
  • 68. berlin pydata | @gabegaster | 2015 february
  • 69. berlin pydata | @gabegaster | 2015 february
  • 70. berlin pydata | @gabegaster | 2015 february
  • 71. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february goal: save money prototype
  • 72. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february goal: save money prototype
  • 73. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february goal: save money prototype — design for lawyers
  • 74. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february goal: save money prototype — design for lawyers
  • 75. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february Sexier. Less nerdy. Tailored. design for transparency
  • 76. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february http://www.daegis.com/judicial-acceptance-of-tar/
  • 77. berlin pydata | @gabegaster | 2015 february another example contests
  • 78. berlin pydata | @gabegaster | 2015 february another example
  • 79. berlin pydata | @gabegaster | 2015 february
  • 80. berlin pydata | @gabegaster | 2015 february task:
  • 81. berlin pydata | @gabegaster | 2015 february classify schizophrenia w MRItask:
  • 82. berlin pydata | @gabegaster | 2015 february why? classify schizophrenia w MRItask:
  • 83. berlin pydata | @gabegaster | 2015 february why? classify schizophrenia w MRItask: improve understanding of disease
  • 84. berlin pydata | @gabegaster | 2015 february why? classify schizophrenia w MRItask: improve understanding of disease how?
  • 85. berlin pydata | @gabegaster | 2015 february why? classify schizophrenia w MRItask: improve understanding of disease how? … outside contest purview
  • 86. berlin pydata | @gabegaster | 2015 february why? outside contest purview
  • 87. berlin pydata | @gabegaster | 2015 february why? outside contest purview
  • 88. berlin pydata | @gabegaster | 2015 february why? outside contest purview kaggle
  • 89. berlin pydata | @gabegaster | 2015 february why? outside contest purview kaggle getting data & making usable
  • 90. berlin pydata | @gabegaster | 2015 february why? outside contest purview kaggle getting data & making usable WHY
  • 91. berlin pydata | @gabegaster | 2015 february timeline of contest Accuracy of Classification
  • 92. berlin pydata | @gabegaster | 2015 february timeline of contest AUC Accuracy of Classification
  • 93. berlin pydata | @gabegaster | 2015 february what is AUC? AUC
  • 94. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve
  • 95. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve what curve?
  • 96. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve what curve? Receiver Operating Characteristic
  • 97. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve what curve? Receiver Operating Characteristic
  • 98. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve what curve? Receiver Operating Characteristic
  • 99. berlin pydata | @gabegaster | 2015 february balances: AUC what is AUC? Area Under Curve what curve? Receiver Operating Characteristic
  • 100. berlin pydata | @gabegaster | 2015 february balances: True Positive Rate False Positive Rate AUC what is AUC? Area Under Curve what curve? Receiver Operating Characteristic
  • 101. berlin pydata | @gabegaster | 2015 february balances: True Positive Rate False Positive Rate AUC what is AUC? Area Under Curve what curve? Receiver Operating Characteristic
  • 102. berlin pydata | @gabegaster | 2015 february AUC what is AUC? balances: True Positive Rate False Positive Rate Area Under Curve what curve? Receiver Operating Characteristic
  • 103. berlin pydata | @gabegaster | 2015 february why? AUC what is AUC? balances: True Positive Rate False Positive Rate Area Under Curve what curve? Receiver Operating Characteristic
  • 104. berlin pydata | @gabegaster | 2015 february why? … AUC what is AUC? balances: True Positive Rate False Positive Rate Area Under Curve what curve? Receiver Operating Characteristic
  • 105. berlin pydata | @gabegaster | 2015 february why? … upshot: AUC what is AUC? balances: True Positive Rate False Positive Rate Area Under Curve what curve? Receiver Operating Characteristic
  • 106. berlin pydata | @gabegaster | 2015 february why? … choice of metric matters a LOT upshot: in practice AUC what is AUC? balances: True Positive Rate False Positive Rate Area Under Curve what curve? Receiver Operating Characteristic
  • 107. berlin pydata | @gabegaster | 2015 february timeline of contest Accuracy of Classification AUC
  • 108. berlin pydata | @gabegaster | 2015 february timeline of contest Accuracy of Classification AUC random guess
  • 109. berlin pydata | @gabegaster | 2015 february timeline of contest Accuracy of Classification AUC random guess basic SVM
  • 110. berlin pydata | @gabegaster | 2015 february timeline of contest goal? Accuracy of Classification AUC random guess basic SVM
  • 111. berlin pydata | @gabegaster | 2015 february timeline of contest goal: depends on why Accuracy of Classification AUC random guess basic SVM
  • 112. berlin pydata | @gabegaster | 2015 february random guess basic SVM timeline of contest Accuracy of Classification AUC
  • 113. berlin pydata | @gabegaster | 2015 february me timeline of contest Accuracy of Classification AUC
  • 114. berlin pydata | @gabegaster | 2015 february me timeline of contest Accuracy of Classification AUC turned out to place 9th — because overfitting
  • 115. berlin pydata | @gabegaster | 2015 february me timeline of contest Accuracy of Classification AUC turned out to place 9th — because overfitting very common problem
  • 116. berlin pydata | @gabegaster | 2015 february timeline of contest Accuracy of Classification worth it? AUC
  • 117. berlin pydata | @gabegaster | 2015 february
  • 118. berlin pydata | @gabegaster | 2015 february
  • 119. berlin pydata | @gabegaster | 2015 february
  • 120. berlin pydata | @gabegaster | 2015 february
  • 121. berlin pydata | @gabegaster | 2015 february
  • 122. berlin pydata | @gabegaster | 2015 february
  • 123. berlin pydata | @gabegaster | 2015 february
  • 124. berlin pydata | @gabegaster | 2015 february
  • 125. berlin pydata | @gabegaster | 2015 february
  • 126. berlin pydata | @gabegaster | 2015 february an example just for fun
  • 127. berlin pydata | @gabegaster | 2015 february Chicago Bike Share System ! ! kind of like call-a-bike
  • 128. berlin pydata | @gabegaster | 2015 february Show what I like about Bike share ! Chicago Bike Share System ! ! kind of like call-a-bike
  • 129. berlin pydata | @gabegaster | 2015 february Show what I like about Bike share ! Think about how bike share has changed geography Chicago Bike Share System ! ! kind of like call-a-bike
  • 130. berlin pydata | @gabegaster | 2015 february a typical trip for me
  • 131. berlin pydata | @gabegaster | 2015 february Bus transit times = a LIE
  • 132. berlin pydata | @gabegaster | 2015 february Chicago is a grid city
  • 133. berlin pydata | @gabegaster | 2015 february Difficult Public Transit on the grid =+ Diagonals
  • 134. berlin pydata | @gabegaster | 2015 february Difficult Public Transit on the grid =+ Diagonals 2+ buses = FAIL
  • 135. berlin pydata | @gabegaster | 2015 february Adding bikes to public transit = win
  • 136. berlin pydata | @gabegaster | 2015 february show how has divvy changed where people can go viz Goal:
  • 137. berlin pydata | @gabegaster | 2015 february show how has divvy changed where people can go show where people actually go viz Goal:
  • 138. berlin pydata | @gabegaster | 2015 february demo
  • 139. berlin pydata | @gabegaster | 2015 february in conclusion
  • 140. berlin pydata | @gabegaster | 2015 february
  • 141. berlin pydata | @gabegaster | 2015 february
  • 142. berlin pydata | @gabegaster | 2015 february
  • 143. berlin pydata | @gabegaster | 2015 february
  • 144. berlin pydata | @gabegaster | 2015 february
  • 145. berlin pydata | @gabegaster | 2015 february thanks! @gabegaster