berlin pydata | @gabegaster | 2015 february
what is data science?
who is a data scientist?
review of literature
what is data science?
who is a data scientist?
“a scientist who can code”
• lower barrier to attack new problems
• repeatable analysis
• freedom to think about problems in new ways
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
computing has progressed
cost of new analysis: years in 1950, hours today
the same person thinking about the problem can conduct experiments to answer it
open-source code
standing on
shoulders of giants
reinventing the wheel
computing has progressed
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
HOW WHY
what is data science?
using emerging technologies to approach problems scientifically
which were difficult to answer before
knowing what is possible
doing something useful
using new / good / the right tools
asking why
WHY WHY
why why why
what is data science?
science is about asking why
start there
an anecdote
an example
from the real world
goal: save money
task: find needle in the haystack (without poking yourself)
documents: about patent | not about patent
choices: turn over to plaintiff | don’t turn over to plaintiff
risks: adverse inference | give away trade secrets
goal: save money
prototype — design for lawyers
Sexier. Less nerdy. Tailored.
design for transparency
http://www.daegis.com/judicial-acceptance-of-tar/
another example
contests
task: classify schizophrenia with MRI
why? improve understanding of disease
how? … outside contest purview
why? outside contest purview
kaggle
getting data & making it usable
WHY
timeline of contest (chart: Accuracy of Classification, AUC)
AUC
what is AUC? Area Under Curve
what curve? Receiver Operating Characteristic
balances: True Positive Rate, False Positive Rate
why? …
upshot: choice of metric matters a LOT in practice
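To make the AUC definition concrete, here is a minimal sketch in plain Python. It builds the ROC curve (true positive rate against false positive rate) and takes the area under it with the trapezoidal rule. The labels, the two score lists, and the 0.5 accuracy threshold are made-up toy values, not data from the contest; they are chosen only to show two models with the same accuracy and very different AUC.

```python
def roc_points(labels, scores):
    """Return ROC points as (false positive rate, true positive rate) pairs."""
    positives = sum(labels)
    negatives = len(labels) - positives
    # Sweep the decision threshold from the highest score down.
    ranked = sorted(zip(scores, labels), reverse=True)
    points = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(ranked):
        threshold = ranked[i][0]
        # Items with tied scores cross the threshold together.
        while i < len(ranked) and ranked[i][0] == threshold:
            if ranked[i][1] == 1:
                tp += 1
            else:
                fp += 1
            i += 1
        points.append((fp / negatives, tp / positives))
    return points


def auc(labels, scores):
    """Area under the ROC curve, by the trapezoidal rule."""
    points = roc_points(labels, scores)
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2
    return area


def accuracy(labels, scores, threshold=0.5):
    """Fraction of correct predictions when scores >= threshold mean class 1."""
    predictions = [1 if s >= threshold else 0 for s in scores]
    return sum(p == y for p, y in zip(predictions, labels)) / len(labels)


# Toy, imbalanced data (hypothetical): 8 negatives, 2 positives.
labels = [0, 0, 0, 0, 0, 0, 0, 0, 1, 1]

# Model A gives everyone the same low score: it cannot rank at all.
constant_scores = [0.1] * 10

# Model B ranks both positives on top, but every score stays below 0.5.
ranked_scores = [0.05, 0.10, 0.15, 0.20, 0.25, 0.30, 0.35, 0.38, 0.40, 0.45]

print(accuracy(labels, constant_scores), auc(labels, constant_scores))  # 0.8 0.5
print(accuracy(labels, ranked_scores), auc(labels, ranked_scores))      # 0.8 1.0
```

Both toy models score 0.8 on accuracy, yet one ranks at chance level (AUC 0.5) and the other ranks perfectly (AUC 1.0), which is one way to see why the choice of metric matters.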
timeline of contest (chart: Accuracy of Classification, AUC; marked: random guess, basic SVM)
goal: depends on why
timeline of contest (chart: Accuracy of Classification, AUC; marked: me)
turned out to place 9th because of overfitting
very common problem
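One common way overfitting shows up in contests can be sketched with a small simulation. This is an illustration under stated assumptions, not a reconstruction of what happened in this contest: every "submission" below is pure random noise, the public and private set sizes (20 and 1000), the number of submissions (200), and the seed are arbitrary, and `public_labels` / `private_labels` are hypothetical names.

```python
import random


def auc(labels, scores):
    """AUC as the share of (positive, negative) pairs ranked correctly; ties count half."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


rng = random.Random(0)  # fixed seed so the run is repeatable

# Small public leaderboard set, larger private set, both balanced.
public_labels = [i % 2 for i in range(20)]
private_labels = [i % 2 for i in range(1000)]

best_public = 0.0
best_private = 0.0
for _ in range(200):
    # A submission with no real signal: random scores on both sets.
    public_scores = [rng.random() for _ in public_labels]
    private_scores = [rng.random() for _ in private_labels]
    public_auc = auc(public_labels, public_scores)
    # Keep whichever submission looks best on the public leaderboard.
    if public_auc > best_public:
        best_public = public_auc
        best_private = auc(private_labels, private_scores)

print("best public AUC:", round(best_public, 3))
print("its private AUC:", round(best_private, 3))
```

Picking the best of many tries on a small public set rewards luck: the chosen submission looks strong there, while on the larger private set it stays close to the 0.5 of a random guess.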
worth it?
an example
just for fun
Chicago Bike Share System
kind of like call-a-bike
Show what I like about bike share
Think about how bike share has changed geography
a typical trip for me
Bus transit times = a LIE
Chicago is a grid city
Diagonals = Difficult Public Transit on the grid
2+ buses = FAIL
Adding bikes to public transit = win
viz goal:
show how Divvy has changed where people can go
show where people actually go
demo
in conclusion
thanks!
@gabegaster

More Related Content

Viewers also liked

What goes into making a Data Visualization?
What goes into making a Data Visualization?What goes into making a Data Visualization?
What goes into making a Data Visualization?Gabriel Gaster
 
слайды к первому модулю
слайды к первому модулюслайды к первому модулю
слайды к первому модулюTatyana Gavrinyuk
 
Trabajo individual con quandary yoo
Trabajo individual con quandary   yooTrabajo individual con quandary   yoo
Trabajo individual con quandary yoochuyonyoo
 
Describing people (1)
Describing people (1)Describing people (1)
Describing people (1)Gabi Camacho
 
Aerospace Engineering -Hannah Petty
Aerospace Engineering  -Hannah PettyAerospace Engineering  -Hannah Petty
Aerospace Engineering -Hannah PettyJackie Settles
 
Music Therapy Notes / More Reflecting, Less Wading
Music Therapy Notes / More Reflecting, Less WadingMusic Therapy Notes / More Reflecting, Less Wading
Music Therapy Notes / More Reflecting, Less Wadingrachelmsmith
 
Hul 101128100726-phpapp01
Hul 101128100726-phpapp01Hul 101128100726-phpapp01
Hul 101128100726-phpapp01Jitender Kumar
 
Ruby data types and objects
Ruby   data types and objectsRuby   data types and objects
Ruby data types and objectsHarkamal Singh
 
13 Trends and Challenges with School Wireless Networks
13 Trends and Challenges with School Wireless Networks13 Trends and Challenges with School Wireless Networks
13 Trends and Challenges with School Wireless NetworksCorey Anderson
 
Máy công cụ tnut
Máy công cụ tnutMáy công cụ tnut
Máy công cụ tnutchetaomaytnut
 
美容企業經營管理企劃書
美容企業經營管理企劃書美容企業經營管理企劃書
美容企業經營管理企劃書Sha Sha Liu
 
EMS Carbon Monoxide and CO-Ox monitoring Co Rad 57 Cole
EMS Carbon Monoxide and CO-Ox monitoring Co Rad 57 ColeEMS Carbon Monoxide and CO-Ox monitoring Co Rad 57 Cole
EMS Carbon Monoxide and CO-Ox monitoring Co Rad 57 ColeRobert Cole
 
Advanced practice preparation pharmacodynamics[1]
Advanced practice preparation pharmacodynamics[1]Advanced practice preparation pharmacodynamics[1]
Advanced practice preparation pharmacodynamics[1]University of Miami
 
Cadena Alimenticia
Cadena AlimenticiaCadena Alimenticia
Cadena AlimenticiaGabi Camacho
 
Receiver Operating Characteristic (ROC) curve analysis. 19.12
Receiver Operating Characteristic (ROC) curve analysis. 19.12Receiver Operating Characteristic (ROC) curve analysis. 19.12
Receiver Operating Characteristic (ROC) curve analysis. 19.12Kenisha S Russell Jonsson
 
Central gas equipment for industrial gases (2011 edition in english) uk619 10...
Central gas equipment for industrial gases (2011 edition in english) uk619 10...Central gas equipment for industrial gases (2011 edition in english) uk619 10...
Central gas equipment for industrial gases (2011 edition in english) uk619 10...PRITAM JADHAV
 

Viewers also liked (20)

Korea
Korea Korea
Korea
 
What goes into making a Data Visualization?
What goes into making a Data Visualization?What goes into making a Data Visualization?
What goes into making a Data Visualization?
 
слайды к первому модулю
слайды к первому модулюслайды к первому модулю
слайды к первому модулю
 
Trabajo individual con quandary yoo
Trabajo individual con quandary   yooTrabajo individual con quandary   yoo
Trabajo individual con quandary yoo
 
Describing people (1)
Describing people (1)Describing people (1)
Describing people (1)
 
Cap 4, novena edc
Cap 4, novena edcCap 4, novena edc
Cap 4, novena edc
 
Aerospace Engineering -Hannah Petty
Aerospace Engineering  -Hannah PettyAerospace Engineering  -Hannah Petty
Aerospace Engineering -Hannah Petty
 
Music Therapy Notes / More Reflecting, Less Wading
Music Therapy Notes / More Reflecting, Less WadingMusic Therapy Notes / More Reflecting, Less Wading
Music Therapy Notes / More Reflecting, Less Wading
 
Hul 101128100726-phpapp01
Hul 101128100726-phpapp01Hul 101128100726-phpapp01
Hul 101128100726-phpapp01
 
Ruby data types and objects
Ruby   data types and objectsRuby   data types and objects
Ruby data types and objects
 
13 Trends and Challenges with School Wireless Networks
13 Trends and Challenges with School Wireless Networks13 Trends and Challenges with School Wireless Networks
13 Trends and Challenges with School Wireless Networks
 
Data analysis
Data analysisData analysis
Data analysis
 
Viva201393(1).pptxbaru
Viva201393(1).pptxbaruViva201393(1).pptxbaru
Viva201393(1).pptxbaru
 
Máy công cụ tnut
Máy công cụ tnutMáy công cụ tnut
Máy công cụ tnut
 
美容企業經營管理企劃書
美容企業經營管理企劃書美容企業經營管理企劃書
美容企業經營管理企劃書
 
EMS Carbon Monoxide and CO-Ox monitoring Co Rad 57 Cole
EMS Carbon Monoxide and CO-Ox monitoring Co Rad 57 ColeEMS Carbon Monoxide and CO-Ox monitoring Co Rad 57 Cole
EMS Carbon Monoxide and CO-Ox monitoring Co Rad 57 Cole
 
Advanced practice preparation pharmacodynamics[1]
Advanced practice preparation pharmacodynamics[1]Advanced practice preparation pharmacodynamics[1]
Advanced practice preparation pharmacodynamics[1]
 
Cadena Alimenticia
Cadena AlimenticiaCadena Alimenticia
Cadena Alimenticia
 
Receiver Operating Characteristic (ROC) curve analysis. 19.12
Receiver Operating Characteristic (ROC) curve analysis. 19.12Receiver Operating Characteristic (ROC) curve analysis. 19.12
Receiver Operating Characteristic (ROC) curve analysis. 19.12
 
Central gas equipment for industrial gases (2011 edition in english) uk619 10...
Central gas equipment for industrial gases (2011 edition in english) uk619 10...Central gas equipment for industrial gases (2011 edition in english) uk619 10...
Central gas equipment for industrial gases (2011 edition in english) uk619 10...
 

Similar to Why Why Why

Social media trends and audiences: March 2105
Social media trends  and audiences: March 2105Social media trends  and audiences: March 2105
Social media trends and audiences: March 2105Bob Crawshaw
 
Embrace the future gaelle bertrand
Embrace the future gaelle bertrandEmbrace the future gaelle bertrand
Embrace the future gaelle bertrandRay Poynter
 
Winning with analytics - Data Innovation Summit 2015 - Made in Belgium
Winning with analytics - Data Innovation Summit 2015 - Made in BelgiumWinning with analytics - Data Innovation Summit 2015 - Made in Belgium
Winning with analytics - Data Innovation Summit 2015 - Made in BelgiumPython Predictions
 
Top 2015 B2B Marketing Trends to Consider
Top 2015 B2B Marketing Trends to ConsiderTop 2015 B2B Marketing Trends to Consider
Top 2015 B2B Marketing Trends to Considercircle S studio
 
"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)
"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)
"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)Tech in Asia ID
 
Social Media for Non Profits DTCC Dover Oct 2015
Social Media for Non Profits DTCC Dover Oct 2015Social Media for Non Profits DTCC Dover Oct 2015
Social Media for Non Profits DTCC Dover Oct 2015Lisa Flowers
 
Content Strategy Meetup: Experience, Traction, Growth
Content Strategy Meetup: Experience, Traction, GrowthContent Strategy Meetup: Experience, Traction, Growth
Content Strategy Meetup: Experience, Traction, GrowthVinish Garg
 
Data-driven Growth - Analytics & Attribution for Marketers in 2016 | Turing F...
Data-driven Growth - Analytics & Attribution for Marketers in 2016 | Turing F...Data-driven Growth - Analytics & Attribution for Marketers in 2016 | Turing F...
Data-driven Growth - Analytics & Attribution for Marketers in 2016 | Turing F...Andy Young
 
Andy Young — Data-Driven Growth: Analytics Tools and Tips for Marketers in 20...
Andy Young — Data-Driven Growth: Analytics Tools and Tips for Marketers in 20...Andy Young — Data-Driven Growth: Analytics Tools and Tips for Marketers in 20...
Andy Young — Data-Driven Growth: Analytics Tools and Tips for Marketers in 20...Turing Fest
 
Is your content working better for someone else? @jonearnshaw
Is your content working better for someone else? @jonearnshawIs your content working better for someone else? @jonearnshaw
Is your content working better for someone else? @jonearnshawJon Earnshaw
 
(Big)Data_just use it 30 sep2015_Frank_Vullers
(Big)Data_just use it 30 sep2015_Frank_Vullers(Big)Data_just use it 30 sep2015_Frank_Vullers
(Big)Data_just use it 30 sep2015_Frank_VullersFrank Vullers
 
Big Data Expo 2015 - Teradata Big Data : Just use it!
Big Data Expo 2015 - Teradata Big Data : Just use it!Big Data Expo 2015 - Teradata Big Data : Just use it!
Big Data Expo 2015 - Teradata Big Data : Just use it!BigDataExpo
 
Content Marketing Meet-up - What is content marketing it, Should it do it, Wh...
Content Marketing Meet-up - What is content marketing it, Should it do it, Wh...Content Marketing Meet-up - What is content marketing it, Should it do it, Wh...
Content Marketing Meet-up - What is content marketing it, Should it do it, Wh...Martijn Burgman
 
Liverpool HubSpot User Group - June 2017
Liverpool HubSpot User Group - June 2017Liverpool HubSpot User Group - June 2017
Liverpool HubSpot User Group - June 2017Stewart Bennett
 
Creating A 2016 Content Strategy That Works
Creating A 2016 Content Strategy That WorksCreating A 2016 Content Strategy That Works
Creating A 2016 Content Strategy That WorksG3 Communications
 
SimilarWeb - NOAH15 Berlin
SimilarWeb -  NOAH15 BerlinSimilarWeb -  NOAH15 Berlin
SimilarWeb - NOAH15 BerlinNOAH Advisors
 
What Does it Mean to Be Helpful? with Scott Abel, The Content Wrangler
What Does it Mean to Be Helpful? with Scott Abel, The Content WranglerWhat Does it Mean to Be Helpful? with Scott Abel, The Content Wrangler
What Does it Mean to Be Helpful? with Scott Abel, The Content WranglerInformation Development World
 

Similar to Why Why Why (20)

Social media trends and audiences: March 2105
Social media trends  and audiences: March 2105Social media trends  and audiences: March 2105
Social media trends and audiences: March 2105
 
Embrace the future gaelle bertrand
Embrace the future gaelle bertrandEmbrace the future gaelle bertrand
Embrace the future gaelle bertrand
 
Listening on the Social Web: How To Make Sense of all the Noise
Listening on the Social Web: How To Make Sense of all the NoiseListening on the Social Web: How To Make Sense of all the Noise
Listening on the Social Web: How To Make Sense of all the Noise
 
Winning with analytics - Data Innovation Summit 2015 - Made in Belgium
Winning with analytics - Data Innovation Summit 2015 - Made in BelgiumWinning with analytics - Data Innovation Summit 2015 - Made in Belgium
Winning with analytics - Data Innovation Summit 2015 - Made in Belgium
 
Top 2015 B2B Marketing Trends to Consider
Top 2015 B2B Marketing Trends to ConsiderTop 2015 B2B Marketing Trends to Consider
Top 2015 B2B Marketing Trends to Consider
 
"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)
"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)
"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)
 
Social Media for Non Profits DTCC Dover Oct 2015
Social Media for Non Profits DTCC Dover Oct 2015Social Media for Non Profits DTCC Dover Oct 2015
Social Media for Non Profits DTCC Dover Oct 2015
 
Content Strategy Meetup: Experience, Traction, Growth
Content Strategy Meetup: Experience, Traction, GrowthContent Strategy Meetup: Experience, Traction, Growth
Content Strategy Meetup: Experience, Traction, Growth
 
nVest
nVestnVest
nVest
 
Data-driven Growth - Analytics & Attribution for Marketers in 2016 | Turing F...
Data-driven Growth - Analytics & Attribution for Marketers in 2016 | Turing F...Data-driven Growth - Analytics & Attribution for Marketers in 2016 | Turing F...
Data-driven Growth - Analytics & Attribution for Marketers in 2016 | Turing F...
 
Andy Young — Data-Driven Growth: Analytics Tools and Tips for Marketers in 20...
Andy Young — Data-Driven Growth: Analytics Tools and Tips for Marketers in 20...Andy Young — Data-Driven Growth: Analytics Tools and Tips for Marketers in 20...
Andy Young — Data-Driven Growth: Analytics Tools and Tips for Marketers in 20...
 
Is your content working better for someone else? @jonearnshaw
Is your content working better for someone else? @jonearnshawIs your content working better for someone else? @jonearnshaw
Is your content working better for someone else? @jonearnshaw
 
(Big)Data_just use it 30 sep2015_Frank_Vullers
(Big)Data_just use it 30 sep2015_Frank_Vullers(Big)Data_just use it 30 sep2015_Frank_Vullers
(Big)Data_just use it 30 sep2015_Frank_Vullers
 
Big Data Expo 2015 - Teradata Big Data : Just use it!
Big Data Expo 2015 - Teradata Big Data : Just use it!Big Data Expo 2015 - Teradata Big Data : Just use it!
Big Data Expo 2015 - Teradata Big Data : Just use it!
 
Content Marketing Meet-up - What is content marketing it, Should it do it, Wh...
Content Marketing Meet-up - What is content marketing it, Should it do it, Wh...Content Marketing Meet-up - What is content marketing it, Should it do it, Wh...
Content Marketing Meet-up - What is content marketing it, Should it do it, Wh...
 
Liverpool HubSpot User Group - June 2017
Liverpool HubSpot User Group - June 2017Liverpool HubSpot User Group - June 2017
Liverpool HubSpot User Group - June 2017
 
Creating A 2016 Content Strategy That Works
Creating A 2016 Content Strategy That WorksCreating A 2016 Content Strategy That Works
Creating A 2016 Content Strategy That Works
 
SimilarWeb - NOAH15 Berlin
SimilarWeb -  NOAH15 BerlinSimilarWeb -  NOAH15 Berlin
SimilarWeb - NOAH15 Berlin
 
pixelcamp
pixelcamppixelcamp
pixelcamp
 
What Does it Mean to Be Helpful? with Scott Abel, The Content Wrangler
What Does it Mean to Be Helpful? with Scott Abel, The Content WranglerWhat Does it Mean to Be Helpful? with Scott Abel, The Content Wrangler
What Does it Mean to Be Helpful? with Scott Abel, The Content Wrangler
 

Why Why Why

  • 1. berlin pydata | @gabegaster | 2015 february what is data science?
  • 2. berlin pydata | @gabegaster | 2015 february what is data science?
  • 3. berlin pydata | @gabegaster | 2015 february what is data science? who is a data scientist?
  • 4. berlin pydata | @gabegaster | 2015 february what is data science? who is a data scientist? review of literature
  • 5. berlin pydata | @gabegaster | 2015 february what is data science? who is a data scientist? review of literature
  • 6. berlin pydata | @gabegaster | 2015 february what is data science? review of literature
  • 7. berlin pydata | @gabegaster | 2015 february what is data science? review of literature
  • 8. berlin pydata | @gabegaster | 2015 february what is data science? who is a data scientist?
  • 9. berlin pydata | @gabegaster | 2015 february what is data science? who is a data scientist? “a scientist who can code”
  • 10. berlin pydata | @gabegaster | 2015 february what is data science? who is a data scientist? “a scientist who can code” • lower barrier to attack new problems
  • 11. berlin pydata | @gabegaster | 2015 february what is data science? who is a data scientist? “a scientist who can code” • lower barrier to attack new problems • repeatable analysis
  • 12. berlin pydata | @gabegaster | 2015 february what is data science? who is a data scientist? “a scientist who can code” • lower barrier to attack new problems • repeatable analysis • freedom to think about problems new ways
  • 13. berlin pydata | @gabegaster | 2015 february what is data science?
  • 14. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically
  • 15. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before
  • 16. berlin pydata | @gabegaster | 2015 february which were difficult to answer before
  • 17. berlin pydata | @gabegaster | 2015 february computing has progressed which were difficult to answer before
  • 18. berlin pydata | @gabegaster | 2015 february computing has progressed: 1950
  • 19. berlin pydata | @gabegaster | 2015 february computing has progressed: cost of new analysis, 1950
  • 20. berlin pydata | @gabegaster | 2015 february computing has progressed: cost of new analysis, 1950: years
  • 21. berlin pydata | @gabegaster | 2015 february computing has progressed: cost of new analysis, 1950: years, today
  • 22. berlin pydata | @gabegaster | 2015 february computing has progressed: cost of new analysis, 1950: years, today
  • 23. berlin pydata | @gabegaster | 2015 february computing has progressed: cost of new analysis, 1950: years, today: hours
  • 24. berlin pydata | @gabegaster | 2015 february computing has progressed: cost of new analysis, 1950: years, today: hours; same person thinking about the problem can conduct experiments to answer it
  • 25. berlin pydata | @gabegaster | 2015 february computing has progressed
  • 26. berlin pydata | @gabegaster | 2015 february open-source code computing has progressed
  • 27. berlin pydata | @gabegaster | 2015 february open-source code standing on shoulders of giants computing has progressed
  • 28. berlin pydata | @gabegaster | 2015 february open-source code standing on shoulders of giants computing has progressed
  • 29. berlin pydata | @gabegaster | 2015 february open-source code standing on shoulders of giants computing has progressed
  • 30. berlin pydata | @gabegaster | 2015 february open-source code standing on shoulders of giants reinventing the wheel computing has progressed
  • 31. berlin pydata | @gabegaster | 2015 february open-source code standing on shoulders of giants reinventing the wheel computing has progressed
  • 32. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before
  • 33. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically knowing what is possible which were difficult to answer before
  • 34. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful
  • 35. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful HOW
  • 36. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful HOW WHY
  • 37. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful
  • 38. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful using new good the right tools
  • 39. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful using new good the right tools asking why
  • 40. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful using new good the right tools asking why
  • 41. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful using new good the right tools asking why
  • 42. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful using new good the right tools asking why WHY
  • 43. berlin pydata | @gabegaster | 2015 february what is data science? using emerging technologies to approach problems scientifically which were difficult to answer before knowing what is possible doing something useful using new good the right tools asking why WHY WHY
  • 44. berlin pydata | @gabegaster | 2015 february why why why what is data science?
  • 45. berlin pydata | @gabegaster | 2015 february why why why what is data science? science is about asking why
  • 46. berlin pydata | @gabegaster | 2015 february why why why what is data science? science is about asking why start there
  • 47. berlin pydata | @gabegaster | 2015 february an anecdote
  • 48. berlin pydata | @gabegaster | 2015 february
  • 49. berlin pydata | @gabegaster | 2015 february
  • 50. berlin pydata | @gabegaster | 2015 february
  • 51. berlin pydata | @gabegaster | 2015 february an example from the real world
  • 52. berlin pydata | @gabegaster | 2015 february an example
  • 53. berlin pydata | @gabegaster | 2015 february
  • 54. berlin pydata | @gabegaster | 2015 february
  • 55. berlin pydata | @gabegaster | 2015 february
  • 56. berlin pydata | @gabegaster | 2015 february goal: save money
  • 57. berlin pydata | @gabegaster | 2015 february goal: save money
  • 58. berlin pydata | @gabegaster | 2015 february goal: save money
  • 59. berlin pydata | @gabegaster | 2015 february goal: save money
  • 60. berlin pydata | @gabegaster | 2015 february goal: save money task: find needle in the haystack (without poking yourself)
  • 61. berlin pydata | @gabegaster | 2015 february goal: save money task: find needle in the haystack (without poking yourself)
  • 62. berlin pydata | @gabegaster | 2015 february goal: save money task: find needle in the haystack (without poking yourself)
  • 63. berlin pydata | @gabegaster | 2015 february goal: save money task: find needle in the haystack (without poking yourself) about patent; not about patent
  • 64. berlin pydata | @gabegaster | 2015 february goal: save money task: find needle in the haystack (without poking yourself) about patent; not about patent; turn over to plaintiff; don’t turn over to plaintiff; adverse inference
  • 65. berlin pydata | @gabegaster | 2015 february goal: save money task: find needle in the haystack (without poking yourself) about patent; not about patent; turn over to plaintiff; don’t turn over to plaintiff; adverse inference; give away trade secrets
  • 66. berlin pydata | @gabegaster | 2015 february goal: save money task: find needle in the haystack (without poking yourself) about patent; not about patent; turn over to plaintiff; don’t turn over to plaintiff; adverse inference; give away trade secrets
  • 67. berlin pydata | @gabegaster | 2015 february goal: save money task: find needle in the haystack (without poking yourself) turn over to plaintiff; don’t turn over to plaintiff
  • 68. berlin pydata | @gabegaster | 2015 february
  • 69. berlin pydata | @gabegaster | 2015 february
  • 70. berlin pydata | @gabegaster | 2015 february
  • 71. berlin pydata | @gabegaster | 2015 february goal: save money prototype
  • 72. berlin pydata | @gabegaster | 2015 february goal: save money prototype
  • 73. berlin pydata | @gabegaster | 2015 february goal: save money prototype - design for lawyers
  • 74. berlin pydata | @gabegaster | 2015 february goal: save money prototype - design for lawyers
  • 75. berlin pydata | @gabegaster | 2015 february Sexier. Less nerdy. Tailored. design for transparency
  • 76. berlin pydata | @gabegaster | 2015 february http://www.daegis.com/judicial-acceptance-of-tar/
  • 77. berlin pydata | @gabegaster | 2015 february another example contests
  • 78. berlin pydata | @gabegaster | 2015 february another example
  • 79. berlin pydata | @gabegaster | 2015 february
  • 80. berlin pydata | @gabegaster | 2015 february task:
  • 81. berlin pydata | @gabegaster | 2015 february task: classify schizophrenia w/ MRI
  • 82. berlin pydata | @gabegaster | 2015 february task: classify schizophrenia w/ MRI why?
  • 83. berlin pydata | @gabegaster | 2015 february task: classify schizophrenia w/ MRI why? improve understanding of disease
  • 84. berlin pydata | @gabegaster | 2015 february task: classify schizophrenia w/ MRI why? improve understanding of disease how?
  • 85. berlin pydata | @gabegaster | 2015 february task: classify schizophrenia w/ MRI why? improve understanding of disease how? … outside contest purview
  • 86. berlin pydata | @gabegaster | 2015 february why? outside contest purview
  • 87. berlin pydata | @gabegaster | 2015 february why? outside contest purview
  • 88. berlin pydata | @gabegaster | 2015 february why? outside contest purview kaggle
  • 89. berlin pydata | @gabegaster | 2015 february why? outside contest purview kaggle getting data & making usable
  • 90. berlin pydata | @gabegaster | 2015 february why? outside contest purview kaggle getting data & making usable WHY
  • 91. berlin pydata | @gabegaster | 2015 february timeline of contest Accuracy of Classification
  • 92. berlin pydata | @gabegaster | 2015 february timeline of contest AUC Accuracy of Classification
  • 93. berlin pydata | @gabegaster | 2015 february what is AUC? AUC
  • 94. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve
  • 95. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve what curve?
  • 96. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve what curve? Receiver Operating Characteristic
  • 97. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve what curve? Receiver Operating Characteristic
  • 98. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve what curve? Receiver Operating Characteristic
  • 99. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve what curve? Receiver Operating Characteristic balances:
  • 100. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve what curve? Receiver Operating Characteristic balances: True Positive Rate, False Positive Rate
  • 101. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve what curve? Receiver Operating Characteristic balances: True Positive Rate, False Positive Rate
  • 102. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve what curve? Receiver Operating Characteristic balances: True Positive Rate, False Positive Rate
  • 103. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve what curve? Receiver Operating Characteristic balances: True Positive Rate, False Positive Rate why?
  • 104. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve what curve? Receiver Operating Characteristic balances: True Positive Rate, False Positive Rate why? …
  • 105. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve what curve? Receiver Operating Characteristic balances: True Positive Rate, False Positive Rate why? … upshot:
  • 106. berlin pydata | @gabegaster | 2015 february AUC what is AUC? Area Under Curve what curve? Receiver Operating Characteristic balances: True Positive Rate, False Positive Rate why? … upshot: in practice, choice of metric matters a LOT
  • 107. berlin pydata | @gabegaster | 2015 february timeline of contest Accuracy of Classification AUC
  • 108. berlin pydata | @gabegaster | 2015 february timeline of contest Accuracy of Classification AUC random guess
  • 109. berlin pydata | @gabegaster | 2015 february timeline of contest Accuracy of Classification AUC random guess basic SVM
  • 110. berlin pydata | @gabegaster | 2015 february timeline of contest goal? Accuracy of Classification AUC random guess basic SVM
  • 111. berlin pydata | @gabegaster | 2015 february timeline of contest goal: depends on why Accuracy of Classification AUC random guess basic SVM
  • 112. berlin pydata | @gabegaster | 2015 february random guess basic SVM timeline of contest Accuracy of Classification AUC
  • 113. berlin pydata | @gabegaster | 2015 february me timeline of contest Accuracy of Classification AUC
  • 114. berlin pydata | @gabegaster | 2015 february me timeline of contest Accuracy of Classification AUC turned out to place 9th — because overfitting
  • 115. berlin pydata | @gabegaster | 2015 february me timeline of contest Accuracy of Classification AUC turned out to place 9th — because overfitting very common problem
  • 116. berlin pydata | @gabegaster | 2015 february timeline of contest Accuracy of Classification worth it? AUC
  • 117. berlin pydata | @gabegaster | 2015 february
  • 118. berlin pydata | @gabegaster | 2015 february
  • 119. berlin pydata | @gabegaster | 2015 february
  • 120. berlin pydata | @gabegaster | 2015 february
  • 121. berlin pydata | @gabegaster | 2015 february
  • 122. berlin pydata | @gabegaster | 2015 february
  • 123. berlin pydata | @gabegaster | 2015 february
  • 124. berlin pydata | @gabegaster | 2015 february
  • 125. berlin pydata | @gabegaster | 2015 february
  • 126. berlin pydata | @gabegaster | 2015 february an example just for fun
  • 127. berlin pydata | @gabegaster | 2015 february Chicago Bike Share System: kind of like call-a-bike
  • 128. berlin pydata | @gabegaster | 2015 february Chicago Bike Share System: kind of like call-a-bike. Show what I like about bike share
  • 129. berlin pydata | @gabegaster | 2015 february Chicago Bike Share System: kind of like call-a-bike. Show what I like about bike share. Think about how bike share has changed geography
  • 130. berlin pydata | @gabegaster | 2015 february a typical trip for me
  • 131. berlin pydata | @gabegaster | 2015 february Bus transit times = a LIE
  • 132. berlin pydata | @gabegaster | 2015 february Chicago is a grid city
  • 133. berlin pydata | @gabegaster | 2015 february Public Transit on the grid + Diagonals = Difficult
  • 134. berlin pydata | @gabegaster | 2015 february Public Transit on the grid + Diagonals = Difficult; 2+ buses = FAIL
  • 135. berlin pydata | @gabegaster | 2015 february Adding bikes to public transit = win
  • 136. berlin pydata | @gabegaster | 2015 february viz Goal: show how divvy has changed where people can go
  • 137. berlin pydata | @gabegaster | 2015 february viz Goal: show how divvy has changed where people can go; show where people actually go
  • 138. berlin pydata | @gabegaster | 2015 february demo
  • 139. berlin pydata | @gabegaster | 2015 february in conclusion
  • 140. berlin pydata | @gabegaster | 2015 february
  • 141. berlin pydata | @gabegaster | 2015 february
  • 142. berlin pydata | @gabegaster | 2015 february
  • 143. berlin pydata | @gabegaster | 2015 february
  • 144. berlin pydata | @gabegaster | 2015 february
  • 145. berlin pydata | @gabegaster | 2015 february thanks! @gabegaster
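
Slides 93 to 106 describe AUC as the area under the Receiver Operating Characteristic curve, which balances true positive rate against false positive rate, and conclude that the choice of metric matters a lot in practice. The sketch below is one way to see that point, not code from the talk: the function names `roc_auc` and `accuracy`, the 0.5 decision threshold, and the toy scores for `model_a` and `model_b` are all assumptions made for illustration. It computes AUC through the equivalent rank formulation (the fraction of positive/negative pairs in which the positive example gets the higher score, with ties counted as half).

```python
def roc_auc(labels, scores):
    """Area under the ROC curve, computed as the fraction of
    (positive, negative) pairs where the positive scores higher.
    Ties count as half a win. Labels are 1 (positive) or 0 (negative)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def accuracy(labels, scores, threshold=0.5):
    """Fraction of examples classified correctly at a fixed threshold
    (the 0.5 default is an assumption, not something from the talk)."""
    predictions = [1 if s >= threshold else 0 for s in scores]
    correct = sum(1 for y, p in zip(labels, predictions) if y == p)
    return correct / len(labels)


# Hypothetical toy data: four negatives followed by two positives.
labels = [0, 0, 0, 0, 1, 1]
# model_a ranks every positive above every negative, but all scores sit below 0.5.
model_a = [0.1, 0.2, 0.3, 0.4, 0.45, 0.48]
# model_b crosses the threshold more often but ranks one negative above one positive.
model_b = [0.1, 0.2, 0.3, 0.6, 0.4, 0.7]

for name, scores in [("model_a", model_a), ("model_b", model_b)]:
    print(name, "accuracy:", accuracy(labels, scores), "AUC:", roc_auc(labels, scores))
```

On this toy data both models reach the same accuracy at the 0.5 threshold, yet their AUC differs, so a leaderboard scored on one metric could rank them differently from a leaderboard scored on the other. A classifier that gives every example the same score gets an AUC of 0.5, which corresponds to the "random guess" baseline on the contest timeline.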