SlideShare a Scribd company logo
DATA SCIENCE
FOR PRODUCT DESIGNERS AND NON-SCIENTISTS
ESSENTIAL
ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS
WHAT IS DATA SCIENCE?
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
LOTS OF DEFINITIONS:
▸ The art and science of utilizing data to produce actionable insights.
▸ The utilization of science and mathematics to extract knowledge and insights
▸ A branch of computer science that applies statistics on a dataset to make
predictions and find patterns.
▸ KDD - Knowledge Discovery in Databases
WORKING WITH DATA TO OBTAIN INSIGHTS, MAKE PREDICTIONS AND PROVIDE ADVICE
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
LOTS OF BRANCHES & DISCIPLINES
▸ Data Analytics
▸ Big Data
▸ Data Engineering
▸ Database Technology
▸ Statistics
▸ Data Visualization
▸ Machine Learning
▸ Neural Networks
▸ Artificial Intelligence
DATA SCIENCE
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
LET’S FOCUS ON THESE TO
ORIENT OURSELVES TO A
NARROWER FOCUS IN DATA
SCIENCE
▸ Data Analytics
▸ Machine Learning
▸ Data Engineering
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
BUSINESS INTELLIGENCE VS DATA SCIENCE
▸ Retrospective - Looking at historical data to predict
the future
▸ Pre-canned or pre-defined questions to submit to
system
▸ Data is largely siloed or warehoused
▸ Ask specific questions related to strategic business
operations
BUSINESS INTELLIGENCE DATA SCIENCE
▸ Prospective - forecasting the future
▸ Discovery of questions to ask or development of
questions in the form of hypothesis
▸ Data is distributed in data lakes or in warehouses; can
be real-time streams or near-real time streams
▸ Can ask questions about strategy but can be used for
any domains
ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS
WHAT PRODUCT DESIGNERS
NEED TO KNOW?
KNOW THE CONCEPTS AND CHARACTERISTICS OF DATA IS A GOOD START
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
GOALS FOR
PRODUCT
DESIGNERS
▸ Provide ability for users to make better decisions with new
knowledge and insights
▸ To optimize and unlock the value of service and operational
efficiencies
▸ Supply insights to make new, better products and services
▸ Converting data into stories that engage and empower users
▸ Employ summary reports or visualization to help decision makers
ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS
UNDERSTAND THE PROBLEM
WHAT IS THE PURPOSE? WHAT ARE THE QUESTIONS?
ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS
REVIEW THE DATA
CHECK THE ORIGIN, TYPE, PROPERTIES & CLASSES OF YOUR DATA
ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS
USING THE RESULTS
HOW ARE THE RESULTS REFLECTED IN YOUR PRODUCT?
ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS
DATA IS THE FUELTHE INPUTS FOR ANALYSIS
INSIGHTS AND PREDICTION
ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS
LOTS OF DATA POINTS
BIG DATATO EXTRACT SAMPLES
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
DATA SOURCES • IT Log Files
• Sensor Data (IOT)
• Website Clickstreams
• Social Media Feeds
• Machine Data
• Location Tracking Data
• Financial Transactions
• Commercial Feeds (3rd Party Vendor)
• Public Databases
• Academic Sources
WHERE TO OBTAIN DATA FROM?
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
PROPERTIES OF DATA : 4V
▸ Volume: How much data do you have?
▸ Velocity: How fast is the data coming in? How often?
▸ Variety: How heterogeneous or homogenous is the data?
▸ Veracity: What is the quality of the data?
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
VOLUME
Volume is usually data is measured in gigabytes (GB)
and terabytes (TB), sometimes hundreds of megabytes
(MB) can considered a healthy yield
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
VELOCITY The rate at which data is delivered into the system. This
can be streamed in real-time, near real-time or
delivered in batches.
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
VARIETY The types and kinds of data you expect in your system.
Is it homogenous or heterogenous? We cover this in
the next few slides.
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
VERACITY
Does the data accurately reflect what you are trying to
accomplish with the data? Important to rate the source
of the data. Do a cursory review to see if the data is
what you would expect.
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
LEVELS OF MEASURES (TYPES OF DATA TOO)
▸ Nominal - Distinct Categories (e.g. Gender)
▸ Ordinal - Ranking or Order (e.g. Service Ratings)
▸ Interval - difference between two values is
meaningful (e.g. Temperature)
▸ Ratio - like interval, but has a clear definition of 0
(e.g. Height)
QUALITATIVE OR QUANTITIVE
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
DATA TYPES
▸ Text Data
▸ Image Data
▸ Timestamps
▸ Video Data
▸ Audio
▸ Binary Data
▸ Counters
KINDS OF DATA YOU CAN EXPECT
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
DATA STRUCTURE FORMS
• Structured - Typically in a RDBMS, very organized and labeled
• Unstructured - unfiltrered, unlabeled data like images, video, raw
data from IOT
• Semi-Structured - mixed between labeled and unlabeled data
STRUCTURED, UNSTRUCTURED & SEMI-STRUCTURED
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
RECORDS AND FIELDS
SEE DATA AS A MATRIX
Variables Attributes Features
Observations
Samples
Tuples
DATA COLUMNS
ROWS
When encountering these words in
data analysis and machine
learning, think in terms of a
spreadsheet as simply columns
(fields) and records (rows) like in a
relational databases.
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
DATA QUALITY
๏ Complete
๏ Accurate
๏ Relevant
๏ Fresh or Outdate
๏ Distinct
๏ Accessible
WHAT IS THE LEVEL OF DATA QUALITY? IS IT…
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
OFF THE MARK
๏ Accuracy is an issue pertaining to the quality of data and the number of errors
contained in a dataset Precision
๏ Precision refers to the level of measurement and exactness of description in a
dataset; It is important to realize, however, that precise data--no matter how
carefully measured--may be inaccurate.
SEEK BOTH ACCURACY AND PRECISION
This also is related to variance and bias in data…
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
ERRORS: VARIANCE
๏ The variance is error from sensitivity to small fluctuations in the training set. High
variance can cause overfitting: modeling the random noise in the training data,
rather than the intended outputs
๏ Add more data to your dataset to mitigate high variance
MEASUREMENT SENSITIVITY
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
ERRORS: BIAS
๏ Bias measures how far off in general these models' predictions are from the correct
value.
๏ Review your data; may require you to revise your dataset
ERRONEOUS ASSUMPTIONS
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
DATA PIPELINE
1. HARVESTING / STORING
2. MUNGING / CLEANING
3. PREPARATION / PROCESSING
4. ANALYSIS
5. MODELING
6. VISUALIZATION / REPORTING / ACTING
6 STAGES
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
DATA PIPELINE
Harvesting > Wrangling > Processing > Analysis > Modeling > Visualization & Action
6 STAGES
DATA ENGINEERING DATA SCIENCE AND ANALYSIS
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
1. HARVESTING
Data Yield
Data Sources (Data Lake, Data Warehouse
Data Volume (GB -> TB)
Data Delivery
DATA COLLECTION & INGESTION
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
RAW DATA
Data can be sourced from a data lake or data warehouse; value-to-data ratio: low
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
2. DATA WRANGLING
Null Values
Duplicates
Incomplete Values
DATA PROCESSING AND PREP
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
SCRUB YOUR DATA FOR ANALYSIS
CLEAN DATA
Option 1: Identity and discard records with
null or wrong values
Option 2: Fill-in a placeholder common
value in the dataset
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
3. PROCESSING
‣ Ranking
‣ Scoring
‣ Sorting
‣ Grouping
‣ Manipulating
ARRANGING & MANIPULATING DATA
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
4. ANALYSISARRANGING THE DATA
Descriptive Analytics
Inferential Analytics
Advanced Statistical Analytics
STATISTICAL
5. MODELING
Density
Linear Regression
Nearest Neighbor
…many more modeling techniques
+
Utilizing stochastic models to validate hypothesis and make predictions
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
STATISTICAL
MODELING
Applying statistical models to validate hypothesis, simulate scenarios and make predictions
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
6. VISUALIZATION
REPORTING & INTERPRETATION
What does it show?
What does it say?
Exploratory or Explanation
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
OBJECTIVES OF DATA ANALYTICS
๏ Find patterns to test hypothesis
๏ Refine an existing hypothesis
๏ Provide actionable insights
๏ Make predictions
CLEARLY DEFINED
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
TYPES OF ANALYTICS
• Descriptive - What happened?
• Diagnostic - What went wrong?
• Prescriptive - What to do?
• Predictive - Can this happen?
EACH TYPE ANSWERS DIFFERENT QUESTIONS
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
ANSWERS THE QUESTION: WHAT HAPPENED?
Descripitive Analytics is based on historical data and current data
DESCRIPTIVE ANALYTICS
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
ANSWERS THE QUESTION: WHY DID THIS HAPPEN? WHAT WENT WRONG?
Deduce and infer the success and failure of a particular activity, initiative, campaign or program
DIAGNOSTIC ANALYTICS
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
ANSWERS THE QUESTIONS: WHAT DO I DO? WHAT ACTIONS SHOULD I TAKE?
Based on generated predictions, the analysis provides informed actions a decision
maker can or should take.
PRESCRIPTIVE ANALYTICS
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
ANSWERS THE QUESTION: COULD THIS HAPPEN?
Applying stochastic and mathematical models to predict outcomes
PREDICTIVE ANALYTICS
ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS
LEARNING FOR THE DATA
MACHINE LEARNINGSUPERVISED AND UNSUPERVISED LEARNING
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
IDENTIFYING PATTERNS IN THE DATASET
Input data is unlabeled; process is non-deterministic; use of inferential methods to find relationships, patterns and
correlations;
UNSUPERVISED
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
MAKING INFERENCES ON DATASETS
Data is labeled; deterministic process with an input and desired output; algorithms learn from labeled data
SUPERVISED
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
COMMON MACHINE LEARNING MODELS
ALGORITHMS
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
LINEAR REGRESSION
REGRESSION
LOGISTIC REGRESSION
Logistic regression is used to describe
data and to explain the relationship
between one dependent binary
variable and one or more nominal,
ordinal, interval or ratio-level
independent variables.
Regressions are used to quantify the
relationship between one variable
and the other variables that are
thought to explain it; regressions can
also identify how close and well
determined the relationship is.
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
CLASSIFICATION
DECISION TREES
Decision tree builds classification in the
form of a tree structure. It breaks down a
dataset into smaller and smaller subsets
while at the same time an associated
decision tree is incrementally developed
K NEAREST NEIGHBOR
K nearest neighbors is a simple algorithm
that stores all available cases and classifies
new cases based on a similarity measure
(e.g., distance functions)
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
CLUSTERING
K-MEANS
K-Means clustering intends to partition n
objects into k clusters in which each object
belongs to the cluster with the nearest
mean.
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
LOTS OF DIFFERENT
MACHINE LEARNING
ALGORITHMS
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
• What datasets do you have now? What Sources? What
parameters?
• Address 4Vs (Velocity, Variety, Volume, Veracity)
• Rate the data quality
• Identify the attributes, type and groups of data
• Do you need enrich your current dataset?
REMEMBER: APPRAISE YOUR DATA
LOOK AT YOUR PROPERTIES OF YOUR DATA
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
EXPECTED OUTCOMES AND YOUR OBJECTIVE
• How is your product going to use the data?
• How does your objective align with results?
• Does the current dataset allow you to make inferences or predictions?
• Get help from domain experts to determine if the attributes and data is sufficient
DOES YOUR DATASET & EXPECTED RESULTS MATCH YOUR OBJECTIVE?
ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS
APPLYING DATA SCIENCE TRADECRAFT TO BUILD
DATA PRODUCTS
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
USING RESULTS TO IMPROVE PRODUCT EXPERIENCES
Use the findings and insights and apply it to enhancing your product or service
DATA PRODUCT DEV
EXAMPLES
Sales Forecasts
Operational Predictions
Video Recommendations
Featured Content
Other stuff
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
USE THE FINDINGS AND INSIGHTS AND APPLY IT TO ENHANCING
YOUR PRODUCT OR SERVICE
DATA ENRICHING PRODUCTS
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
PROVIDING ANSWERS TO THE INQUIRIES
Utilize predictive and prescriptive analytics to forecast and give recommendations to decision-makers. Make it clear what actions to
take and why.
USE IN PRODUCTS
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
OUTPUTS FORMED FOR HUMAN CONSUMPTION
Key characteristic of data products is data visualization elements like charts, graphs, scoreboards and tables to
communicate the story. Utilize graphics and interactivity to tell your story.
DATA VISUALIZATION
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
• Show Relationships
• Make Comparisons
• Show Distribution
• Present Composition
• Make Predictions / ForecastsUSING DATA VISUALIZATIONS
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
• Dashboards
• Data Filters
• Data Exploration Features
• Custom Inputs
• Graph Selection
DATA PRODUCT CUSTOMIZATIONS
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
ECONOMICS OF WORKING WITH DATA
The cost of labor and resources to
prepare, process and maintain data can
be high. Consider using simpler
models.
COST CONSIDERATIONS
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
JCHRISA
THANK YOU!
https://www.linkedin.com/in/jchrisa
Creative Technologist. Product Designer
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
REFERENCES
http://insidebigdata.com/2014/06/05/data-munging-exploratory-data-analysis-feature-engineering/
http://www.colorado.edu/geography/gcraft/notes/error/error_f.html
Rhodes, Trey, and Kenneth Foote. "Error, Accuracy, and Precision." Error, Accuracy, and Precision. University of Colorado, n.d.
Web. 05 Apr. 2017.
Fortmann, Scott. "Bias and Variance." Understanding the Bias-Variance Tradeoff. N.p., n.d. Web. 05 June 2012.
http://scott.fortmann-roe.com/docs/BiasVariance.html
Gutierrez , Daniel. "Data Munging, Exploratory Data Analysis, and Feature Engineering." InsideBIGDATA. N.p., 20 June 2014.
Web. 05 Apr. 2017.
Causey, Trey. "Trey Causey – Getting started in data science." Trey Causey – Getting started in data science. N.p., 7
June 2014. Web. 05 Apr. 2017.
http://treycausey.com/getting_started.html
ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
REFERENCES CONTINUED
Sayed, Saed. "Data Mining Map." Data Mining Map. Saed Sayed, 2010. Web. 10 Apr. 2017.
http://www.saedsayad.com/modeling.htm
http://www.imf.org/external/pubs/ft/fandd/2006/03/basics.htm
Ramcharan, Rodney. "Finance and Development." Finance and Development | F&D. Finance & Development, Mar. 2006.
Web. 10 Apr. 2017.

More Related Content

Similar to Essential Data Science for Product Designers and Non-Scientists

How to Start People Analytics Practice?
How to Start People Analytics Practice?How to Start People Analytics Practice?
How to Start People Analytics Practice?
Martin Boudikianov
 
Big Data Analytics: How to Get Started? | OPTIMUS 2015 Atlanta
Big Data Analytics: How to Get Started? | OPTIMUS 2015 AtlantaBig Data Analytics: How to Get Started? | OPTIMUS 2015 Atlanta
Big Data Analytics: How to Get Started? | OPTIMUS 2015 Atlanta
ORTEC US
 
Introduction to Data Analytics - PPM.pptx
Introduction to Data Analytics - PPM.pptxIntroduction to Data Analytics - PPM.pptx
Introduction to Data Analytics - PPM.pptx
ssuser5cdaa93
 
Three Cool Things You Can Do with Standards
Three Cool Things You Can Do with StandardsThree Cool Things You Can Do with Standards
Three Cool Things You Can Do with Standards
Matt Turner
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecases
Sreenatha Reddy K R
 
Making an impact with data science
Making an impact  with data scienceMaking an impact  with data science
Making an impact with data scienceJordan Engbers
 
365 Data Science
365 Data Science365 Data Science
365 Data Science
IvanHo572682
 
The Path to Data and Analytics Modernization
The Path to Data and Analytics ModernizationThe Path to Data and Analytics Modernization
The Path to Data and Analytics Modernization
Analytics8
 
351315535-Module-1-Intro-to-Data-Science-pptx.pptx
351315535-Module-1-Intro-to-Data-Science-pptx.pptx351315535-Module-1-Intro-to-Data-Science-pptx.pptx
351315535-Module-1-Intro-to-Data-Science-pptx.pptx
XanGwaps
 
Big Data Analytics information And Tools
Big Data Analytics information And ToolsBig Data Analytics information And Tools
Big Data Analytics information And Tools
Sarvesh Bhagwat
 
DATA MINING DC Presentation.pptx
DATA MINING DC Presentation.pptxDATA MINING DC Presentation.pptx
DATA MINING DC Presentation.pptx
SaravanaD2
 
Data Analyics
Data AnalyicsData Analyics
Data Analyics
SysDiva Consultants
 
Business Analytics.pptx
Business Analytics.pptxBusiness Analytics.pptx
Business Analytics.pptx
Charlou Bautista
 
Introduction to business analytics.pptx
Introduction to business analytics.pptxIntroduction to business analytics.pptx
Introduction to business analytics.pptx
Charlou Bautista
 
Data science; why, what, how?
Data science; why, what, how?Data science; why, what, how?
Data science; why, what, how?
Muhammad Shahid
 
Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field
Domino Data Lab
 
Data Processing & Explain each term in details.pptx
Data Processing & Explain each term in details.pptxData Processing & Explain each term in details.pptx
Data Processing & Explain each term in details.pptx
PratikshaSurve4
 
Why You Need to STOP Using Spreadsheets for Audit Analysis
Why You Need to STOP Using Spreadsheets for Audit AnalysisWhy You Need to STOP Using Spreadsheets for Audit Analysis
Why You Need to STOP Using Spreadsheets for Audit Analysis
CaseWare IDEA
 
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
BICC Thomas More
 
How Can Analytics Improve Business?
How Can Analytics Improve Business?How Can Analytics Improve Business?
How Can Analytics Improve Business?
Inside Analysis
 

Similar to Essential Data Science for Product Designers and Non-Scientists (20)

How to Start People Analytics Practice?
How to Start People Analytics Practice?How to Start People Analytics Practice?
How to Start People Analytics Practice?
 
Big Data Analytics: How to Get Started? | OPTIMUS 2015 Atlanta
Big Data Analytics: How to Get Started? | OPTIMUS 2015 AtlantaBig Data Analytics: How to Get Started? | OPTIMUS 2015 Atlanta
Big Data Analytics: How to Get Started? | OPTIMUS 2015 Atlanta
 
Introduction to Data Analytics - PPM.pptx
Introduction to Data Analytics - PPM.pptxIntroduction to Data Analytics - PPM.pptx
Introduction to Data Analytics - PPM.pptx
 
Three Cool Things You Can Do with Standards
Three Cool Things You Can Do with StandardsThree Cool Things You Can Do with Standards
Three Cool Things You Can Do with Standards
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecases
 
Making an impact with data science
Making an impact  with data scienceMaking an impact  with data science
Making an impact with data science
 
365 Data Science
365 Data Science365 Data Science
365 Data Science
 
The Path to Data and Analytics Modernization
The Path to Data and Analytics ModernizationThe Path to Data and Analytics Modernization
The Path to Data and Analytics Modernization
 
351315535-Module-1-Intro-to-Data-Science-pptx.pptx
351315535-Module-1-Intro-to-Data-Science-pptx.pptx351315535-Module-1-Intro-to-Data-Science-pptx.pptx
351315535-Module-1-Intro-to-Data-Science-pptx.pptx
 
Big Data Analytics information And Tools
Big Data Analytics information And ToolsBig Data Analytics information And Tools
Big Data Analytics information And Tools
 
DATA MINING DC Presentation.pptx
DATA MINING DC Presentation.pptxDATA MINING DC Presentation.pptx
DATA MINING DC Presentation.pptx
 
Data Analyics
Data AnalyicsData Analyics
Data Analyics
 
Business Analytics.pptx
Business Analytics.pptxBusiness Analytics.pptx
Business Analytics.pptx
 
Introduction to business analytics.pptx
Introduction to business analytics.pptxIntroduction to business analytics.pptx
Introduction to business analytics.pptx
 
Data science; why, what, how?
Data science; why, what, how?Data science; why, what, how?
Data science; why, what, how?
 
Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field
 
Data Processing & Explain each term in details.pptx
Data Processing & Explain each term in details.pptxData Processing & Explain each term in details.pptx
Data Processing & Explain each term in details.pptx
 
Why You Need to STOP Using Spreadsheets for Audit Analysis
Why You Need to STOP Using Spreadsheets for Audit AnalysisWhy You Need to STOP Using Spreadsheets for Audit Analysis
Why You Need to STOP Using Spreadsheets for Audit Analysis
 
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
 
How Can Analytics Improve Business?
How Can Analytics Improve Business?How Can Analytics Improve Business?
How Can Analytics Improve Business?
 

Recently uploaded

FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
 

Recently uploaded (20)

FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 

Essential Data Science for Product Designers and Non-Scientists

  • 1. DATA SCIENCE FOR PRODUCT DESIGNERS AND NON-SCIENTISTS ESSENTIAL
  • 2. ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS WHAT IS DATA SCIENCE?
  • 3. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS LOTS OF DEFINITIONS: ▸ The art and science of utilizing data to produce actionable insights. ▸ The utilization of science and mathematics to extract knowledge and insights ▸ A branch of computer science that applies statistics on a dataset to make predictions and find patterns. ▸ KDD - Knowledge Discovery in Databases WORKING WITH DATA TO OBTAIN INSIGHTS, MAKE PREDICTIONS AND PROVIDE ADVICE
  • 4. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS LOTS OF BRANCHES & DISCIPLINES ▸ Data Analytics ▸ Big Data ▸ Data Engineering ▸ Database Technology ▸ Statistics ▸ Data Visualization ▸ Machine Learning ▸ Neural Networks ▸ Artificial Intelligence DATA SCIENCE
  • 5. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS LET’S FOCUS ON THESE TO ORIENT OURSELVES TO A NARROWER FOCUS IN DATA SCIENCE ▸ Data Analytics ▸ Machine Learning ▸ Data Engineering
  • 6. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS BUSINESS INTELLIGENCE VS DATA SCIENCE ▸ Retrospective - Looking at historical data to predict the future ▸ Pre-canned or pre-defined questions to submit to system ▸ Data is largely siloed or warehoused ▸ Ask specific questions related to strategic business operations BUSINESS INTELLIGENCE DATA SCIENCE ▸ Prospective - forecasting the future ▸ Discovery of questions to ask or development of questions in the form of hypothesis ▸ Data is distributed in data lakes or in warehouses; can be real-time streams or near-real time streams ▸ Can ask questions about strategy but can be used for any domains
  • 7. ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS WHAT PRODUCT DESIGNERS NEED TO KNOW? KNOW THE CONCEPTS AND CHARACTERISTICS OF DATA IS A GOOD START
  • 8. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS GOALS FOR PRODUCT DESIGNERS ▸ Provide ability for users to make better decisions with new knowledge and insights ▸ To optimize and unlock the value of service and operational efficiencies ▸ Supply insights to make new, better products and services ▸ Converting data into stories that engage and empower users ▸ Employ summary reports or visualization to help decision makers
  • 9. ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS UNDERSTAND THE PROBLEM WHAT IS THE PURPOSE? WHAT ARE THE QUESTIONS?
  • 10. ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS REVIEW THE DATA CHECK THE ORIGIN, TYPE, PROPERTIES & CLASSES OF YOUR DATA
  • 11. ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS USING THE RESULTS HOW ARE THE RESULTS REFLECTED IN YOUR PRODUCT?
  • 12. ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS DATA IS THE FUELTHE INPUTS FOR ANALYSIS INSIGHTS AND PREDICTION
  • 13. ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS LOTS OF DATA POINTS BIG DATATO EXTRACT SAMPLES
  • 14. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS DATA SOURCES • IT Log Files • Sensor Data (IOT) • Website Clickstreams • Social Media Feeds • Machine Data • Location Tracking Data • Financial Transactions • Commercial Feeds (3rd Party Vendor) • Public Databases • Academic Sources WHERE TO OBTAIN DATA FROM?
  • 15. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS PROPERTIES OF DATA : 4V ▸ Volume: How much data do you have? ▸ Velocity: How fast is the data coming in? How often? ▸ Variety: How heterogeneous or homogenous is the data? ▸ Veracity: What is the quality of the data?
  • 16. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS VOLUME Volume is usually data is measured in gigabytes (GB) and terabytes (TB), sometimes hundreds of megabytes (MB) can considered a healthy yield
  • 17. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS VELOCITY The rate at which data is delivered into the system. This can be streamed in real-time, near real-time or delivered in batches.
  • 18. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS VARIETY The types and kinds of data you expect in your system. Is it homogenous or heterogenous? We cover this in the next few slides.
  • 19. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS VERACITY Does the data accurately reflect what you are trying to accomplish with the data? Important to rate the source of the data. Do a cursory review to see if the data is what you would expect.
  • 20. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS LEVELS OF MEASURES (TYPES OF DATA TOO) ▸ Nominal - Distinct Categories (e.g. Gender) ▸ Ordinal - Ranking or Order (e.g. Service Ratings) ▸ Interval - difference between two values is meaningful (e.g. Temperature) ▸ Ratio - like interval, but has a clear definition of 0 (e.g. Height) QUALITATIVE OR QUANTITIVE
  • 21. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS DATA TYPES ▸ Text Data ▸ Image Data ▸ Timestamps ▸ Video Data ▸ Audio ▸ Binary Data ▸ Counters KINDS OF DATA YOU CAN EXPECT
  • 22. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS DATA STRUCTURE FORMS • Structured - Typically in a RDBMS, very organized and labeled • Unstructured - unfiltrered, unlabeled data like images, video, raw data from IOT • Semi-Structured - mixed between labeled and unlabeled data STRUCTURED, UNSTRUCTURED & SEMI-STRUCTURED
  • 23. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS RECORDS AND FIELDS SEE DATA AS A MATRIX Variables Attributes Features Observations Samples Tuples DATA COLUMNS ROWS When encountering these words in data analysis and machine learning, think in terms of a spreadsheet as simply columns (fields) and records (rows) like in a relational databases.
  • 24. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS DATA QUALITY ๏ Complete ๏ Accurate ๏ Relevant ๏ Fresh or Outdate ๏ Distinct ๏ Accessible WHAT IS THE LEVEL OF DATA QUALITY? IS IT…
  • 25. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS OFF THE MARK ๏ Accuracy is an issue pertaining to the quality of data and the number of errors contained in a dataset Precision ๏ Precision refers to the level of measurement and exactness of description in a dataset; It is important to realize, however, that precise data--no matter how carefully measured--may be inaccurate. SEEK BOTH ACCURACY AND PRECISION This also is related to variance and bias in data…
  • 26. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS ERRORS: VARIANCE ๏ The variance is error from sensitivity to small fluctuations in the training set. High variance can cause overfitting: modeling the random noise in the training data, rather than the intended outputs ๏ Add more data to your dataset to mitigate high variance MEASUREMENT SENSITIVITY
  • 27. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS ERRORS: BIAS ๏ Bias measures how far off in general these models' predictions are from the correct value. ๏ Review your data; may require you to revise your dataset ERRONEOUS ASSUMPTIONS
  • 28. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS DATA PIPELINE 1. HARVESTING / STORING 2. MUNGING / CLEANING 3. PREPARATION / PROCESSING 4. ANALYSIS 5. MODELING 6. VISUALIZATION / REPORTING / ACTING 6 STAGES
  • 29. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS DATA PIPELINE Harvesting > Wrangling > Processing > Analysis > Modeling > Visualization & Action 6 STAGES DATA ENGINEERING DATA SCIENCE AND ANALYSIS
  • 30. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS 1. HARVESTING Data Yield Data Sources (Data Lake, Data Warehouse Data Volume (GB -> TB) Data Delivery DATA COLLECTION & INGESTION
  • 31. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS RAW DATA Data can be sourced from a data lake or data warehouse; value-to-data ratio: low
  • 32. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS 2. DATA WRANGLING Null Values Duplicates Incomplete Values DATA PROCESSING AND PREP
  • 33. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS SCRUB YOUR DATA FOR ANALYSIS CLEAN DATA Option 1: Identity and discard records with null or wrong values Option 2: Fill-in a placeholder common value in the dataset
  • 34. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS 3. PROCESSING ‣ Ranking ‣ Scoring ‣ Sorting ‣ Grouping ‣ Manipulating ARRANGING & MANIPULATING DATA
  • 35. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS 4. ANALYSISARRANGING THE DATA Descriptive Analytics Inferential Analytics Advanced Statistical Analytics STATISTICAL 5. MODELING Density Linear Regression Nearest Neighbor …many more modeling techniques + Utilizing stochastic models to validate hypothesis and make predictions
  • 36. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS STATISTICAL MODELING Applying statistical models to validate hypothesis, simulate scenarios and make predictions
  • 37. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS 6. VISUALIZATION REPORTING & INTERPRETATION What does it show? What does it say? Exploratory or Explanation
  • 38. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS OBJECTIVES OF DATA ANALYTICS ๏ Find patterns to test hypothesis ๏ Refine an existing hypothesis ๏ Provide actionable insights ๏ Make predictions CLEARLY DEFINED
  • 39. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS TYPES OF ANALYTICS • Descriptive - What happened? • Diagnostic - What went wrong? • Prescriptive - What to do? • Predictive - Can this happen? EACH TYPE ANSWERS DIFFERENT QUESTIONS
  • 40. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS ANSWERS THE QUESTION: WHAT HAPPENED? Descripitive Analytics is based on historical data and current data DESCRIPTIVE ANALYTICS
  • 41. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS ANSWERS THE QUESTION: WHY DID THIS HAPPEN? WHAT WENT WRONG? Deduce and infer the success and failure of a particular activity, initiative, campaign or program DIAGNOSTIC ANALYTICS
  • 42. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS ANSWERS THE QUESTIONS: WHAT DO I DO? WHAT ACTIONS SHOULD I TAKE? Based on generated predictions, the analysis provides informed actions a decision maker can or should take. PRESCRIPTIVE ANALYTICS
  • 43. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS ANSWERS THE QUESTION: COULD THIS HAPPEN? Applying stochastic and mathematical models to predict outcomes PREDICTIVE ANALYTICS
  • 44. ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS LEARNING FOR THE DATA MACHINE LEARNINGSUPERVISED AND UNSUPERVISED LEARNING
  • 45. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS IDENTIFYING PATTERNS IN THE DATASET Input data is unlabeled; process is non-deterministic; use of inferential methods to find relationships, patterns and correlations; UNSUPERVISED
  • 46. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS MAKING INFERENCES ON DATASETS Data is labeled; deterministic process with an input and desired output; algorithms learn from labeled data SUPERVISED
  • 47. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS COMMON MACHINE LEARNING MODELS ALGORITHMS
  • 48. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS LINEAR REGRESSION REGRESSION LOGISTIC REGRESSION Logistic regression is used to describe data and to explain the relationship between one dependent binary variable and one or more nominal, ordinal, interval or ratio-level independent variables. Regressions are used to quantify the relationship between one variable and the other variables that are thought to explain it; regressions can also identify how close and well determined the relationship is.
  • 49. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS CLASSIFICATION DECISION TREES Decision tree builds classification in the form of a tree structure. It breaks down a dataset into smaller and smaller subsets while at the same time an associated decision tree is incrementally developed K NEAREST NEIGHBOR K nearest neighbors is a simple algorithm that stores all available cases and classifies new cases based on a similarity measure (e.g., distance functions)
  • 50. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS CLUSTERING K-MEANS K-Means clustering intends to partition n objects into k clusters in which each object belongs to the cluster with the nearest mean.
  • 51. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS LOTS OF DIFFERENT MACHINE LEARNING ALGORITHMS
  • 52. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS • What datasets do you have now? What Sources? What parameters? • Address 4Vs (Velocity, Variety, Volume, Veracity) • Rate the data quality • Identify the attributes, type and groups of data • Do you need enrich your current dataset? REMEMBER: APPRAISE YOUR DATA LOOK AT YOUR PROPERTIES OF YOUR DATA
  • 53. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS EXPECTED OUTCOMES AND YOUR OBJECTIVE • How is your product going to use the data? • How does your objective align with results? • Does the current dataset allow you to make inferences or predictions? • Get help from domain experts to determine if the attributes and data is sufficient DOES YOUR DATASET & EXPECTED RESULTS MATCH YOUR OBJECTIVE?
  • 54. ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS APPLYING DATA SCIENCE TRADECRAFT TO BUILD DATA PRODUCTS
  • 55. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS USING RESULTS TO IMPROVE PRODUCT EXPERIENCES Use the findings and insights and apply it to enhancing your product or service DATA PRODUCT DEV EXAMPLES Sales Forecasts Operational Predictions Video Recommendations Featured Content Other stuff
  • 56. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS USE THE FINDINGS AND INSIGHTS AND APPLY IT TO ENHANCING YOUR PRODUCT OR SERVICE DATA ENRICHING PRODUCTS
  • 57. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS PROVIDING ANSWERS TO THE INQUIRIES Utilize predictive and prescriptive analytics to forecast and give recommendations to decision-makers. Make it clear what actions to take and why. USE IN PRODUCTS
  • 58. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS OUTPUTS FORMED FOR HUMAN CONSUMPTION Key characteristic of data products is data visualization elements like charts, graphs, scoreboards and tables to communicate the story. Utilize graphics and interactivity to tell your story. DATA VISUALIZATION
  • 59. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS • Show Relationships • Make Comparisons • Show Distribution • Present Composition • Make Predictions / ForecastsUSING DATA VISUALIZATIONS
  • 60. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS • Dashboards • Data Filters • Data Exploration Features • Custom Inputs • Graph Selection DATA PRODUCT CUSTOMIZATIONS
  • 61. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS ECONOMICS OF WORKING WITH DATA The cost of labor and resources to prepare, process and maintain data can be high. Consider using simpler models. COST CONSIDERATIONS
  • 62. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS JCHRISA THANK YOU! https://www.linkedin.com/in/jchrisa Creative Technologist. Product Designer
  • 63. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS REFERENCES http://insidebigdata.com/2014/06/05/data-munging-exploratory-data-analysis-feature-engineering/ http://www.colorado.edu/geography/gcraft/notes/error/error_f.html Rhodes, Trey, and Kenneth Foote. "Error, Accuracy, and Precision." Error, Accuracy, and Precision. University of Colorado, n.d. Web. 05 Apr. 2017. Fortmann, Scott. "Bias and Variance." Understanding the Bias-Variance Tradeoff. N.p., n.d. Web. 05 June 2012. http://scott.fortmann-roe.com/docs/BiasVariance.html Gutierrez , Daniel. "Data Munging, Exploratory Data Analysis, and Feature Engineering." InsideBIGDATA. N.p., 20 June 2014. Web. 05 Apr. 2017. Causey, Trey. "Trey Causey – Getting started in data science." Trey Causey – Getting started in data science. N.p., 7 June 2014. Web. 05 Apr. 2017. http://treycausey.com/getting_started.html
  • 64. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS REFERENCES CONTINUED Sayed, Saed. "Data Mining Map." Data Mining Map. Saed Sayed, 2010. Web. 10 Apr. 2017. http://www.saedsayad.com/modeling.htm http://www.imf.org/external/pubs/ft/fandd/2006/03/basics.htm Ramcharan, Rodney. "Finance and Development." Finance and Development | F&D. Finance & Development, Mar. 2006. Web. 10 Apr. 2017.