SlideShare a Scribd company logo
1 of 33
Download to read offline
© Manish Kurse, 2016
Data Science - a Perspective
Manish Kurse, Ph.D.
Data Scientist, Google
28 April 2016
1
This is my perspective and is not necessarily intended to represent that of my employer
© Manish Kurse, 2016
Agenda
An Introduction
2
Insights on being a Data Scientist in the Industry
Thoughts about this evolving field
Lessons learnt on transitioning to data science
© Manish Kurse, 2016
An Introduction
Insights on being a Data Scientist in the Industry
Thoughts about this evolving field
Lessons learnt on transitioning to data science
3
© Manish Kurse, 2016
Extracting insights from structured
and unstructured data
Creating actionable solutions and
products based on these insights
What is Data Science?
Courtesy : Drew Conway 4
(Programming)
(Domain expertise)
© Manish Kurse, 2016
Popular Examples of Data Science
5
Recommendation Systems Inventory planning Dynamic
pricing
© Manish Kurse, 2016
Interest in data science has grown rapidly!
6
© Manish Kurse, 2016
Why this rise in interest?
7
Digital Connected World
Data storage is cheap
Computational power is cheap
Need to make sense of data
© Manish Kurse, 2016
Blind Men and an Elephant
Taken from the internet. Original artist: Not sure
8
© Manish Kurse, 2016
99
What do data scientists do in the industry?
© Manish Kurse, 2016
Developing
models and
building products
using data
Data Science today is a spectrum
Business
analysts
generating
insights
Researchers
developing new
mathematical
techniques and
algorithms
Insight 1:
10
© Manish Kurse, 2016
Data Scientists wear several hats
Dashboards
Continuous
Business Insights
Insight 2:
Slide-decks
Actionable insights
Software
Products
Prototyping
Tools and
infrastructure
Data science
platforms
11
© Manish Kurse, 2016
Data Science Interfaces with Several Teams
Define project
Define data sources
Build pipelines
Build models
Visualization
Evaluate with users
Launch
Productionize
Determine need with stakeholders.
Experimentation
Data cleaning
Insight 3:
12
© Manish Kurse, 2016
Define project
Define data sources
Build pipelines
Build models
Visualization
Evaluate with users
Launch
Productionize
Work with engineers, set-up new data
logging
Experimentation
Data cleaning
13
Insight 3:
Data Science Interfaces with Several Teams
© Manish Kurse, 2016
Define project
Define data sources
Build pipelines
Build models
Visualization
Evaluate with users
Launch
Productionize
Data engineering
Experimentation
Data cleaning
14
Insight 3:
Data Science Interfaces with Several Teams
© Manish Kurse, 2016
Define project
Define data sources
Build pipelines
Build models
Visualization
Evaluate with users
Launch
Productionize
Experimentation
Data cleaning
Clean raw data, exploratory
analysis
Insight 3:
15
Data Science Interfaces with Several Teams
© Manish Kurse, 2016
Define project
Define data sources
Build pipelines
Build models
Visualization
Evaluate with users
Launch
Productionize
Experimentation
Machine learning/
computational models
Data cleaning
Insight 3:
16
Data Science Interfaces with Several Teams
© Manish Kurse, 2016
Define project
Define data sources
Build pipelines
Build models
Visualization
Evaluate with users
Launch
Productionize
U/X
Experimentation
Data cleaning
Data Science Interfaces with Several Teams
Insight 3:
17
© Manish Kurse, 2016
Define project
Define data sources
Build pipelines
Build models
Visualization
Evaluate with users
Launch
Productionize
Get user feedback
Experimentation
Data cleaning
Data Science Interfaces with Several Teams
Insight 3:
18
© Manish Kurse, 2016
Data Science Interfaces with Several Teams
Define project
Define data sources
Build pipelines
Build models
Visualization
Evaluate with users
Launch
ProductionizeWork with Software Engineers
Experimentation
Data cleaning
Insight 3:
19
© Manish Kurse, 2016
Data Science Interfaces with Several Teams
Define project
Define data sources
Build pipelines
Build models
Visualization
Evaluate with users
Launch
Productionize
Launch to customers/stakeholders
Experimentation
Data cleaning
Insight 3:
20
© Manish Kurse, 2016
Data Science Interfaces with Several Teams
Define project
Define data sources
Build pipelines
Build models
Visualization
Evaluate with users
Launch
Productionize
ExperimentationA/B Experiments
Data cleaning
Insight 3:
21
© Manish Kurse, 2016
Every Stage in Business is a Data Science Opportunity
Product
Sales
Customer SupportCustomer engagement
Marketing
Understanding need
Insight 4:
22
© Manish Kurse, 2016
Getting the right data could take time, effort
Change is constant and not everything can be modeled
Data cannot solve everything
Gaining stakeholder trust and showing value
Data Science is Challenging
Insight 5:
23
© Manish Kurse, 2016
24
Thoughts on Data Science Evolution
© Manish Kurse, 2016
Need for data scientists will continue to exist
Growing data science tools
Data scientists are needed to ask the
right questions
Define the data, the solution
Role of a data scientist will evolve
Google Cloud Machine
Learning
Thought 1:
25
© Manish Kurse, 2016
Data science will be an integral part of business strategy
Thought 2:
Data Infrastructure
Understanding
Business Need
Understanding
Customers
Data Logging
26
© Manish Kurse, 2016
Machine learning will influence non-data scientist roles
Thought 3:
Machine learning becomes
mainstream
Business analysts apply more
complex predictive models
Software engineers are trained in
building machine learning software
27
© Manish Kurse, 2016
Security and Privacy should/will be a focus
“With Great Power
Comes Great
Responsibility”
Data Science
Thought 4:
28Source: Marvel
© Manish Kurse, 2016
Journey towards Data Science
Source: rei.com
29
© Manish Kurse, 2016
Spend time to understand the field
Books
Data Science for
Business
Doing Data
Science
Big Data: A
Revolution...
Longer List
Podcasts
Linear Digressions
Data Skeptic
Partially Derivative
...
Longer list
Follow
Subscriptions on
online magazines
like Flipboard
Data scientists in
your field of
interest
Longer list
Blogs
KDNuggets
DataTau
Analytics Vidya
Longer List
Lesson 1:
30
© Manish Kurse, 2016
Online tutorials
Algorithms and data structures
Python: Tutorials, Python for Data
Analysis
R: Tutorials
SQL: Tutorials
Knowledge of tools is important, but understanding
of fundamentals is key
Lesson 2
Classes
MOOCs: Udacity, Coursera
Bootcamps: Logit, Insight, General
Assembly, Data Incubator
Mentored Courses: Thinkful, Springboard
Machine Learning, Statistics, Programming
31
© Manish Kurse, 2016
Free Datasets
Interesting data-sets for
statistics
Datasets curated by data
scientists
Data sources for cool data
science projects
Side-Projects are invaluable
Lesson 3
Side projects
Mini projects
Online contests like kaggle.
com
Article about choosing
projects
Create a web
portfolio
Host code on github
Creating a website
hosted on github
32
© Manish Kurse, 2016
Exciting Time to be in Data Science!
33
An Introduction
Insights on being a Data Scientist in the Industry
Thoughts about this evolving field
Lessons learnt on transitioning to data science

More Related Content

What's hot

Bigdata and Hadoop with applications
Bigdata and Hadoop with applicationsBigdata and Hadoop with applications
Bigdata and Hadoop with applicationsPadma Metta
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceLivePerson
 
Session 10 handling bigger data
Session 10 handling bigger dataSession 10 handling bigger data
Session 10 handling bigger databodaceacat
 
Differential Privacy Case Studies (CMU-MSR Mindswap on Privacy 2007)
Differential Privacy Case Studies (CMU-MSR Mindswap on Privacy 2007)Differential Privacy Case Studies (CMU-MSR Mindswap on Privacy 2007)
Differential Privacy Case Studies (CMU-MSR Mindswap on Privacy 2007)Denny Lee
 
Data mining-implementation-to-predict-sales-using-time-series-method By Raiha...
Data mining-implementation-to-predict-sales-using-time-series-method By Raiha...Data mining-implementation-to-predict-sales-using-time-series-method By Raiha...
Data mining-implementation-to-predict-sales-using-time-series-method By Raiha...raihansikdar
 
Hadoop infrastructure for education
Hadoop infrastructure for educationHadoop infrastructure for education
Hadoop infrastructure for educationDarko Marjanovic
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectbodaceacat
 
Jisc learning analytics scotland HEIDS
Jisc learning analytics scotland HEIDSJisc learning analytics scotland HEIDS
Jisc learning analytics scotland HEIDSPaul Bailey
 
Big Data in Education Sector
Big Data in Education SectorBig Data in Education Sector
Big Data in Education SectorKaran Sachdeva
 
DataCamp investor pitch deck April 2017
DataCamp investor pitch deck April 2017DataCamp investor pitch deck April 2017
DataCamp investor pitch deck April 2017Jonathan Cornelissen
 
Jisc learning analytics overview Oct2017
Jisc learning analytics overview Oct2017Jisc learning analytics overview Oct2017
Jisc learning analytics overview Oct2017Paul Bailey
 
Jisc learning analytics update Sept 2017
Jisc learning analytics update Sept 2017Jisc learning analytics update Sept 2017
Jisc learning analytics update Sept 2017Paul Bailey
 
Big Data in Education
Big Data in EducationBig Data in Education
Big Data in EducationAlfred Essa
 
Jisc Learning Analytics intro for digital leaders
Jisc Learning Analytics intro for digital leadersJisc Learning Analytics intro for digital leaders
Jisc Learning Analytics intro for digital leadersPaul Bailey
 

What's hot (20)

Bigdata and Hadoop with applications
Bigdata and Hadoop with applicationsBigdata and Hadoop with applications
Bigdata and Hadoop with applications
 
Unit 3 part 2
Unit  3 part 2Unit  3 part 2
Unit 3 part 2
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Session 10 handling bigger data
Session 10 handling bigger dataSession 10 handling bigger data
Session 10 handling bigger data
 
Gwi dm intro_20140605
Gwi dm intro_20140605Gwi dm intro_20140605
Gwi dm intro_20140605
 
Data science 101
Data science 101Data science 101
Data science 101
 
Differential Privacy Case Studies (CMU-MSR Mindswap on Privacy 2007)
Differential Privacy Case Studies (CMU-MSR Mindswap on Privacy 2007)Differential Privacy Case Studies (CMU-MSR Mindswap on Privacy 2007)
Differential Privacy Case Studies (CMU-MSR Mindswap on Privacy 2007)
 
Data mining-implementation-to-predict-sales-using-time-series-method By Raiha...
Data mining-implementation-to-predict-sales-using-time-series-method By Raiha...Data mining-implementation-to-predict-sales-using-time-series-method By Raiha...
Data mining-implementation-to-predict-sales-using-time-series-method By Raiha...
 
Hadoop infrastructure for education
Hadoop infrastructure for educationHadoop infrastructure for education
Hadoop infrastructure for education
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science project
 
Conrad - Separating the Wheat from the Chaff
Conrad - Separating the Wheat from the ChaffConrad - Separating the Wheat from the Chaff
Conrad - Separating the Wheat from the Chaff
 
DataMind Pitch August 2013
DataMind Pitch August 2013DataMind Pitch August 2013
DataMind Pitch August 2013
 
Jisc learning analytics scotland HEIDS
Jisc learning analytics scotland HEIDSJisc learning analytics scotland HEIDS
Jisc learning analytics scotland HEIDS
 
Big Data in Education Sector
Big Data in Education SectorBig Data in Education Sector
Big Data in Education Sector
 
DataCamp investor pitch deck April 2017
DataCamp investor pitch deck April 2017DataCamp investor pitch deck April 2017
DataCamp investor pitch deck April 2017
 
Jisc learning analytics overview Oct2017
Jisc learning analytics overview Oct2017Jisc learning analytics overview Oct2017
Jisc learning analytics overview Oct2017
 
Jisc learning analytics update Sept 2017
Jisc learning analytics update Sept 2017Jisc learning analytics update Sept 2017
Jisc learning analytics update Sept 2017
 
TOP Statistical Analysis Software
TOP Statistical Analysis SoftwareTOP Statistical Analysis Software
TOP Statistical Analysis Software
 
Big Data in Education
Big Data in EducationBig Data in Education
Big Data in Education
 
Jisc Learning Analytics intro for digital leaders
Jisc Learning Analytics intro for digital leadersJisc Learning Analytics intro for digital leaders
Jisc Learning Analytics intro for digital leaders
 

Viewers also liked

Digital Transformation 'Before and After' seminar 10th February, Edinburgh
Digital Transformation 'Before and After' seminar 10th February, EdinburghDigital Transformation 'Before and After' seminar 10th February, Edinburgh
Digital Transformation 'Before and After' seminar 10th February, EdinburghPrecedent
 
Robar Corporate Chair Massage Services
Robar Corporate Chair Massage ServicesRobar Corporate Chair Massage Services
Robar Corporate Chair Massage ServicesRuss Robar
 
Сулейменова Инкар + Выпечка на заказ + Идея
Сулейменова Инкар + Выпечка на заказ + ИдеяСулейменова Инкар + Выпечка на заказ + Идея
Сулейменова Инкар + Выпечка на заказ + ИдеяИнкар Сулейменова
 
Escrita 1º
Escrita 1ºEscrita 1º
Escrita 1ºAna Luis
 
Getting Started with the Hybrid Cloud: Enterprise Backup and Recovery
Getting Started with the Hybrid Cloud: Enterprise Backup and RecoveryGetting Started with the Hybrid Cloud: Enterprise Backup and Recovery
Getting Started with the Hybrid Cloud: Enterprise Backup and RecoveryAmazon Web Services
 
Oficio enviado por el fiscal Ramiro González
Oficio enviado por el fiscal Ramiro GonzálezOficio enviado por el fiscal Ramiro González
Oficio enviado por el fiscal Ramiro GonzálezLuis Ernesto Zegarra
 
Músculos de los miembros inferiores
Músculos de los miembros inferioresMúsculos de los miembros inferiores
Músculos de los miembros inferioresJhuyClau
 
The Why and How to Go All In on AWS
The Why and How to Go All In on AWSThe Why and How to Go All In on AWS
The Why and How to Go All In on AWSAmazon Web Services
 
Catalogo GYM MOSTO
Catalogo GYM MOSTOCatalogo GYM MOSTO
Catalogo GYM MOSTOgymmosto
 
ESTUDI_DEPENDENCIA_GENT_GRAN_EL_MASNOU__2008_
ESTUDI_DEPENDENCIA_GENT_GRAN_EL_MASNOU__2008_ESTUDI_DEPENDENCIA_GENT_GRAN_EL_MASNOU__2008_
ESTUDI_DEPENDENCIA_GENT_GRAN_EL_MASNOU__2008_Mariona Gené
 

Viewers also liked (12)

STYLE4CURE
STYLE4CURESTYLE4CURE
STYLE4CURE
 
Digital Transformation 'Before and After' seminar 10th February, Edinburgh
Digital Transformation 'Before and After' seminar 10th February, EdinburghDigital Transformation 'Before and After' seminar 10th February, Edinburgh
Digital Transformation 'Before and After' seminar 10th February, Edinburgh
 
Robar Corporate Chair Massage Services
Robar Corporate Chair Massage ServicesRobar Corporate Chair Massage Services
Robar Corporate Chair Massage Services
 
Hello FLARToolKit
Hello FLARToolKitHello FLARToolKit
Hello FLARToolKit
 
Сулейменова Инкар + Выпечка на заказ + Идея
Сулейменова Инкар + Выпечка на заказ + ИдеяСулейменова Инкар + Выпечка на заказ + Идея
Сулейменова Инкар + Выпечка на заказ + Идея
 
Escrita 1º
Escrita 1ºEscrita 1º
Escrita 1º
 
Getting Started with the Hybrid Cloud: Enterprise Backup and Recovery
Getting Started with the Hybrid Cloud: Enterprise Backup and RecoveryGetting Started with the Hybrid Cloud: Enterprise Backup and Recovery
Getting Started with the Hybrid Cloud: Enterprise Backup and Recovery
 
Oficio enviado por el fiscal Ramiro González
Oficio enviado por el fiscal Ramiro GonzálezOficio enviado por el fiscal Ramiro González
Oficio enviado por el fiscal Ramiro González
 
Músculos de los miembros inferiores
Músculos de los miembros inferioresMúsculos de los miembros inferiores
Músculos de los miembros inferiores
 
The Why and How to Go All In on AWS
The Why and How to Go All In on AWSThe Why and How to Go All In on AWS
The Why and How to Go All In on AWS
 
Catalogo GYM MOSTO
Catalogo GYM MOSTOCatalogo GYM MOSTO
Catalogo GYM MOSTO
 
ESTUDI_DEPENDENCIA_GENT_GRAN_EL_MASNOU__2008_
ESTUDI_DEPENDENCIA_GENT_GRAN_EL_MASNOU__2008_ESTUDI_DEPENDENCIA_GENT_GRAN_EL_MASNOU__2008_
ESTUDI_DEPENDENCIA_GENT_GRAN_EL_MASNOU__2008_
 

Similar to Data Science Perspective, Manish Kurse, 2016

The art of implementing data lineage
The art of implementing data lineageThe art of implementing data lineage
The art of implementing data lineageLeigh Hill
 
Moving from data to insights: How to effectively drive business decisions & g...
Moving from data to insights: How to effectively drive business decisions & g...Moving from data to insights: How to effectively drive business decisions & g...
Moving from data to insights: How to effectively drive business decisions & g...Cloudera, Inc.
 
GFW Partner Meeting 2017 - Parallel Discussions 2: Private Sector
GFW Partner Meeting 2017 - Parallel Discussions 2: Private SectorGFW Partner Meeting 2017 - Parallel Discussions 2: Private Sector
GFW Partner Meeting 2017 - Parallel Discussions 2: Private SectorWorld Resources Institute (WRI)
 
Algorithmic Systems Transparency and Accountability in Big Data & Cognitive Era
Algorithmic Systems Transparency and Accountability in Big Data & Cognitive EraAlgorithmic Systems Transparency and Accountability in Big Data & Cognitive Era
Algorithmic Systems Transparency and Accountability in Big Data & Cognitive EraNozha Boujemaa
 
Why Your Data and Analytics Should Live in the Cloud
Why Your Data and Analytics Should Live in the CloudWhy Your Data and Analytics Should Live in the Cloud
Why Your Data and Analytics Should Live in the CloudDavid Menninger
 
Modeling Big Data with the ArchiMate 3.0 Language
Modeling Big Data with the ArchiMate 3.0 LanguageModeling Big Data with the ArchiMate 3.0 Language
Modeling Big Data with the ArchiMate 3.0 LanguageIver Band
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science LandscapePhilip Bourne
 
MSOR 2016 Seminar 3rd presentation
MSOR 2016 Seminar 3rd presentationMSOR 2016 Seminar 3rd presentation
MSOR 2016 Seminar 3rd presentationAnwar Ali Mohamed
 
Crafting a Compelling Data Science Resume
Crafting a Compelling Data Science ResumeCrafting a Compelling Data Science Resume
Crafting a Compelling Data Science ResumeArushi Prakash, Ph.D.
 
Big Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBig Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBala Iyer
 
NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...
NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...
NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...NOVA DATASCIENCE
 
Big Data Analytics: A New Business Opportunity
Big Data Analytics: A New Business OpportunityBig Data Analytics: A New Business Opportunity
Big Data Analytics: A New Business OpportunityEdward Curry
 
Deploying Predictive Analytics in Healthcare
Deploying Predictive Analytics in HealthcareDeploying Predictive Analytics in Healthcare
Deploying Predictive Analytics in HealthcareHealth Catalyst
 
Help Me, Help You: Supporting Your Data
Help Me, Help You: Supporting Your DataHelp Me, Help You: Supporting Your Data
Help Me, Help You: Supporting Your DataData Con LA
 
Maturing User Research in a Unicorn - UXSEA Summit 2019
Maturing User Research in a Unicorn - UXSEA Summit 2019Maturing User Research in a Unicorn - UXSEA Summit 2019
Maturing User Research in a Unicorn - UXSEA Summit 2019Kuldeep Kulshreshtha
 
Big data analytics in banking sector
Big data analytics in banking sectorBig data analytics in banking sector
Big data analytics in banking sectorAnil Rana
 
Tips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data ScientistTips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data ScientistLisa Cohen
 
Governing and Preparing Data for Analytics and Business
Governing and Preparing Data for Analytics and BusinessGoverning and Preparing Data for Analytics and Business
Governing and Preparing Data for Analytics and BusinessMark Smith
 

Similar to Data Science Perspective, Manish Kurse, 2016 (20)

The art of implementing data lineage
The art of implementing data lineageThe art of implementing data lineage
The art of implementing data lineage
 
Moving from data to insights: How to effectively drive business decisions & g...
Moving from data to insights: How to effectively drive business decisions & g...Moving from data to insights: How to effectively drive business decisions & g...
Moving from data to insights: How to effectively drive business decisions & g...
 
GFW Partner Meeting 2017 - Parallel Discussions 2: Private Sector
GFW Partner Meeting 2017 - Parallel Discussions 2: Private SectorGFW Partner Meeting 2017 - Parallel Discussions 2: Private Sector
GFW Partner Meeting 2017 - Parallel Discussions 2: Private Sector
 
Algorithmic Systems Transparency and Accountability in Big Data & Cognitive Era
Algorithmic Systems Transparency and Accountability in Big Data & Cognitive EraAlgorithmic Systems Transparency and Accountability in Big Data & Cognitive Era
Algorithmic Systems Transparency and Accountability in Big Data & Cognitive Era
 
Why Your Data and Analytics Should Live in the Cloud
Why Your Data and Analytics Should Live in the CloudWhy Your Data and Analytics Should Live in the Cloud
Why Your Data and Analytics Should Live in the Cloud
 
Modeling Big Data with the ArchiMate 3.0 Language
Modeling Big Data with the ArchiMate 3.0 LanguageModeling Big Data with the ArchiMate 3.0 Language
Modeling Big Data with the ArchiMate 3.0 Language
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science Landscape
 
MSOR 2016 Seminar 3rd presentation
MSOR 2016 Seminar 3rd presentationMSOR 2016 Seminar 3rd presentation
MSOR 2016 Seminar 3rd presentation
 
Crafting a Compelling Data Science Resume
Crafting a Compelling Data Science ResumeCrafting a Compelling Data Science Resume
Crafting a Compelling Data Science Resume
 
Big Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBig Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the Marketspace
 
NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...
NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...
NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...
 
Become a citizen data scientist
Become a citizen data scientistBecome a citizen data scientist
Become a citizen data scientist
 
Big Data Analytics: A New Business Opportunity
Big Data Analytics: A New Business OpportunityBig Data Analytics: A New Business Opportunity
Big Data Analytics: A New Business Opportunity
 
Deploying Predictive Analytics in Healthcare
Deploying Predictive Analytics in HealthcareDeploying Predictive Analytics in Healthcare
Deploying Predictive Analytics in Healthcare
 
Help Me, Help You: Supporting Your Data
Help Me, Help You: Supporting Your DataHelp Me, Help You: Supporting Your Data
Help Me, Help You: Supporting Your Data
 
Maturing User Research in a Unicorn - UXSEA Summit 2019
Maturing User Research in a Unicorn - UXSEA Summit 2019Maturing User Research in a Unicorn - UXSEA Summit 2019
Maturing User Research in a Unicorn - UXSEA Summit 2019
 
Data analytics
Data analyticsData analytics
Data analytics
 
Big data analytics in banking sector
Big data analytics in banking sectorBig data analytics in banking sector
Big data analytics in banking sector
 
Tips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data ScientistTips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data Scientist
 
Governing and Preparing Data for Analytics and Business
Governing and Preparing Data for Analytics and BusinessGoverning and Preparing Data for Analytics and Business
Governing and Preparing Data for Analytics and Business
 

Recently uploaded

dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxFurkanTasci3
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...Suhani Kapoor
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 

Recently uploaded (20)

dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptx
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 

Data Science Perspective, Manish Kurse, 2016

  • 1. © Manish Kurse, 2016 Data Science - a Perspective Manish Kurse, Ph.D. Data Scientist, Google 28 April 2016 1 This is my perspective and is not necessarily intended to represent that of my employer
  • 2. © Manish Kurse, 2016 Agenda An Introduction 2 Insights on being a Data Scientist in the Industry Thoughts about this evolving field Lessons learnt on transitioning to data science
  • 3. © Manish Kurse, 2016 An Introduction Insights on being a Data Scientist in the Industry Thoughts about this evolving field Lessons learnt on transitioning to data science 3
  • 4. © Manish Kurse, 2016 Extracting insights from structured and unstructured data Creating actionable solutions and products based on these insights What is Data Science? Courtesy : Drew Conway 4 (Programming) (Domain expertise)
  • 5. © Manish Kurse, 2016 Popular Examples of Data Science 5 Recommendation Systems Inventory planning Dynamic pricing
  • 6. © Manish Kurse, 2016 Interest in data science has grown rapidly! 6
  • 7. © Manish Kurse, 2016 Why this rise in interest? 7 Digital Connected World Data storage is cheap Computational power is cheap Need to make sense of data
  • 8. © Manish Kurse, 2016 Blind Men and an Elephant Taken from the internet. Original artist: Not sure 8
  • 9. © Manish Kurse, 2016 99 What do data scientists do in the industry?
  • 10. © Manish Kurse, 2016 Developing models and building products using data Data Science today is a spectrum Business analysts generating insights Researchers developing new mathematical techniques and algorithms Insight 1: 10
  • 11. © Manish Kurse, 2016 Data Scientists wear several hats Dashboards Continuous Business Insights Insight 2: Slide-decks Actionable insights Software Products Prototyping Tools and infrastructure Data science platforms 11
  • 12. © Manish Kurse, 2016 Data Science Interfaces with Several Teams Define project Define data sources Build pipelines Build models Visualization Evaluate with users Launch Productionize Determine need with stakeholders. Experimentation Data cleaning Insight 3: 12
  • 13. © Manish Kurse, 2016 Define project Define data sources Build pipelines Build models Visualization Evaluate with users Launch Productionize Work with engineers, set-up new data logging Experimentation Data cleaning 13 Insight 3: Data Science Interfaces with Several Teams
  • 14. © Manish Kurse, 2016 Define project Define data sources Build pipelines Build models Visualization Evaluate with users Launch Productionize Data engineering Experimentation Data cleaning 14 Insight 3: Data Science Interfaces with Several Teams
  • 15. © Manish Kurse, 2016 Define project Define data sources Build pipelines Build models Visualization Evaluate with users Launch Productionize Experimentation Data cleaning Clean raw data, exploratory analysis Insight 3: 15 Data Science Interfaces with Several Teams
  • 16. © Manish Kurse, 2016 Define project Define data sources Build pipelines Build models Visualization Evaluate with users Launch Productionize Experimentation Machine learning/ computational models Data cleaning Insight 3: 16 Data Science Interfaces with Several Teams
  • 17. © Manish Kurse, 2016 Define project Define data sources Build pipelines Build models Visualization Evaluate with users Launch Productionize U/X Experimentation Data cleaning Data Science Interfaces with Several Teams Insight 3: 17
  • 18. © Manish Kurse, 2016 Define project Define data sources Build pipelines Build models Visualization Evaluate with users Launch Productionize Get user feedback Experimentation Data cleaning Data Science Interfaces with Several Teams Insight 3: 18
  • 19. © Manish Kurse, 2016 Data Science Interfaces with Several Teams Define project Define data sources Build pipelines Build models Visualization Evaluate with users Launch ProductionizeWork with Software Engineers Experimentation Data cleaning Insight 3: 19
  • 20. © Manish Kurse, 2016 Data Science Interfaces with Several Teams Define project Define data sources Build pipelines Build models Visualization Evaluate with users Launch Productionize Launch to customers/stakeholders Experimentation Data cleaning Insight 3: 20
  • 21. © Manish Kurse, 2016 Data Science Interfaces with Several Teams Define project Define data sources Build pipelines Build models Visualization Evaluate with users Launch Productionize ExperimentationA/B Experiments Data cleaning Insight 3: 21
  • 22. © Manish Kurse, 2016 Every Stage in Business is a Data Science Opportunity Product Sales Customer SupportCustomer engagement Marketing Understanding need Insight 4: 22
  • 23. © Manish Kurse, 2016 Getting the right data could take time, effort Change is constant and not everything can be modeled Data cannot solve everything Gaining stakeholder trust and showing value Data Science is Challenging Insight 5: 23
  • 24. © Manish Kurse, 2016 24 Thoughts on Data Science Evolution
  • 25. © Manish Kurse, 2016 Need for data scientists will continue to exist Growing data science tools Data scientists are needed to ask the right questions Define the data, the solution Role of a data scientist will evolve Google Cloud Machine Learning Thought 1: 25
  • 26. © Manish Kurse, 2016 Data science will be an integral part of business strategy Thought 2: Data Infrastructure Understanding Business Need Understanding Customers Data Logging 26
  • 27. © Manish Kurse, 2016 Machine learning will influence non-data scientist roles Thought 3: Machine learning becomes mainstream Business analysts apply more complex predictive models Software engineers are trained in building machine learning software 27
  • 28. © Manish Kurse, 2016 Security and Privacy should/will be a focus “With Great Power Comes Great Responsibility” Data Science Thought 4: 28Source: Marvel
  • 29. © Manish Kurse, 2016 Journey towards Data Science Source: rei.com 29
  • 30. © Manish Kurse, 2016 Spend time to understand the field Books Data Science for Business Doing Data Science Big Data: A Revolution... Longer List Podcasts Linear Digressions Data Skeptic Partially Derivative ... Longer list Follow Subscriptions on online magazines like Flipboard Data scientists in your field of interest Longer list Blogs KDNuggets DataTau Analytics Vidya Longer List Lesson 1: 30
  • 31. © Manish Kurse, 2016 Online tutorials Algorithms and data structures Python: Tutorials, Python for Data Analysis R: Tutorials SQL: Tutorials Knowledge of tools is important, but understanding of fundamentals is key Lesson 2 Classes MOOCs: Udacity, Coursera Bootcamps: Logit, Insight, General Assembly, Data Incubator Mentored Courses: Thinkful, Springboard Machine Learning, Statistics, Programming 31
  • 32. © Manish Kurse, 2016 Free Datasets Interesting data-sets for statistics Datasets curated by data scientists Data sources for cool data science projects Side-Projects are invaluable Lesson 3 Side projects Mini projects Online contests like kaggle. com Article about choosing projects Create a web portfolio Host code on github Creating a website hosted on github 32
  • 33. © Manish Kurse, 2016 Exciting Time to be in Data Science! 33 An Introduction Insights on being a Data Scientist in the Industry Thoughts about this evolving field Lessons learnt on transitioning to data science