Data Science: An Emerging Field for Future JobsJian Qin
Data deluge has become a reality in today's scientific research. What does it mean to future science workforce? How can you prepare yourself to embrace the data challenges and opportunities? This presentation will provide you with an overview of data science and what it means to you as future researchers and career scientists.
Data Science is all about Data or information ,research and decision-making.Data Science refers to how well data match reality.
content: HISTORY,WHAT IS DATA SCIENCE, NEED OF DATA SCIENCE, APPLICATION OF DATA SCIENCE, CHALLENGES OF DATA SCIENCE, DATA SCIENCE CAREER, SKILL REQUIRED FOR DATA SCIENCE, COURSE, JOBS, SALARY,CONCLUSION
What are the the main areas of analytics and how can they benefit your business? Learn the value of SAS analytics and how you can get better insight into your data to make more profitable decisions.
By getting a better understanding of your data you will know which part of the data can be reliably forecast using time series methods and which cannot. You will also gain an understanding of any hierarchical structure in the data that can be used.
Look no further than our comprehensive Data Science Training program in Chandigarh. Designed to equip individuals with the skills and knowledge required to thrive in today's data-centric world, our course offers a unique blend of theoretical foundations and hands-on practical experience.
Data Science: An Emerging Field for Future JobsJian Qin
Data deluge has become a reality in today's scientific research. What does it mean to future science workforce? How can you prepare yourself to embrace the data challenges and opportunities? This presentation will provide you with an overview of data science and what it means to you as future researchers and career scientists.
Data Science is all about Data or information ,research and decision-making.Data Science refers to how well data match reality.
content: HISTORY,WHAT IS DATA SCIENCE, NEED OF DATA SCIENCE, APPLICATION OF DATA SCIENCE, CHALLENGES OF DATA SCIENCE, DATA SCIENCE CAREER, SKILL REQUIRED FOR DATA SCIENCE, COURSE, JOBS, SALARY,CONCLUSION
What are the the main areas of analytics and how can they benefit your business? Learn the value of SAS analytics and how you can get better insight into your data to make more profitable decisions.
By getting a better understanding of your data you will know which part of the data can be reliably forecast using time series methods and which cannot. You will also gain an understanding of any hierarchical structure in the data that can be used.
Look no further than our comprehensive Data Science Training program in Chandigarh. Designed to equip individuals with the skills and knowledge required to thrive in today's data-centric world, our course offers a unique blend of theoretical foundations and hands-on practical experience.
The majority of organizations (54%) use people analytics to improve HR effectiveness today. Organizations more frequently rely on people analytics to improve business outcomes, organizational performance and achieve labor cost savings.
People Analytics allows HR to gain a more strategic role in the organization and clearly show its impact.
Advanced organizations use data to analyze the workforce proactively, make predictions, and create and monitor comprehensive workforce plans to achieve financial success.
HR data has become an strategic priority, but it takes efforts in order to enable the usage of it.
Three Cool Things You Can Do with StandardsMatt Turner
Standards organizations deliver some of the world's most critical information to ensure interoperability and safety across every industry.
I gave this talk at the Standards Technology and Business Forum and covered what people are doing today and how standards organizations can
1) Better Deliver What Customers Want
2) Connect Standards to Their Customer's Data
3) Deliver Standards as Data
In this presentation, let's have a look at What is Data Science and it's applications. We discussed most common use cases of Data Science.
I presented this at LSPE-IN meetup happened on 10th March 2018 at Walmart Global Technology Services.
The Path to Data and Analytics ModernizationAnalytics8
Learn about the business demands driving modernization, the benefits of doing so, and how to get started.
Can your data and analytics solutions handle today’s challenges?
To stay competitive in today’s market, companies must be able to use their data to make better decisions. However, we are living in a world flooded by data, new technologies, and demands from the business for better and more advanced analytics. Most companies do not have the modern technologies and processes in place to keep up with these growing demands. They need to modernize how they collect, analyze, use, and share their data.
In this webinar, we discuss how you can build modern data and analytics solutions that are future ready, scalable, real-time, high speed, and agile and that can enable better use of data throughout your company.
We cover:
-The business demands and industry shifts that are impacting the need to modernize
-The benefits of data and analytics modernization
-How to approach data and analytics modernization- steps you need to take and how to get it right
-The pillars of modern data management
-Tips for migrating from legacy analytics tools to modern, next-gen platforms
-Lessons learned from companies that have gone through the modernization process
Data analytics is the need of any organization using any branded erp software, home grown erp or using MS Excel. To grow business to new verticals Data Analytics show the insights of business!
What you till learn:
GOALS - What is the bar for data science teams
PITFALLS - What are common data science struggles
DIAGNOSES - Why so many of our efforts fail to deliver value
RECOMMENDATIONS - How to address these struggles with best practices
Presented by Mac Steele
Director of Product at Domino Data Lab
Why You Need to STOP Using Spreadsheets for Audit AnalysisCaseWare IDEA
Still using spreadsheets for audit analysis? This presentation reviews why auditors should STOP the practice.
SLIDESHARE: www.slideshare.net/CaseWare_Analytics
WEBSITE: www.casewareanalytics.com
BLOG: www.casewareanalytics.com/blog
TWITTER: www.twitter.com/CW_Analytic
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...BICC Thomas More
9de BI congres van het BICC-Thomas More: 24 maart 2016
Waar traditionele BI voornamelijk beschrijft van WAT er gebeurd is, kunnen we met Self-Service BI een stapje verder gaan en een eerste verklaring geven WAAROM iets zich voordoet. Als we echter tot de wortel willen geraken, moeten we gebruik maken van Analytics.
TechWise with Eric Kavanagh, Dr. Robin Bloor and Dr. Kirk Borne
Live Webcast on July 23, 2014
Watch the archive: https://bloorgroup.webex.com/bloorgroup/lsr.php?RCID=59d50a520542ee7ed00a0c38e8319b54
Analytical applications are everywhere these days, and for good reason. Organizations large and small are using analytics to better understand any aspect of their business: customers, processes, behaviors, even competitors. There are several critical success factors for using analytics effectively: 1) know which kind of apps make sense for your company; 2) figure out which data sets you can use, both internal and external; 3) determine optimal roles and responsibilities for your team; 4) identify where you need help, either by hiring new employees or using consultants 5) manage your program effectively over time.
Register for this episode of TechWise to learn from two of the most experienced analysts in the business: Dr. Robin Bloor, Chief Analyst of The Bloor Group, and Dr. Kirk Borne, Data Scientist, George Mason University. Each will provide their perspective on how companies can address each of the key success factors in building, refining and using analytics to improve their business. There will then be an extensive Q&A session in which attendees can ask detailed questions of our experts and get answers in real time. Registrants will also receive a consolidated deck of slides, not just from the main presenters, but also from a variety of software vendors who provide targeted solutions.
Visit InsideAnlaysis.com for more information.
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
More Related Content
Similar to Essential Data Science for Product Designers and Non-Scientists
The majority of organizations (54%) use people analytics to improve HR effectiveness today. Organizations more frequently rely on people analytics to improve business outcomes, organizational performance and achieve labor cost savings.
People Analytics allows HR to gain a more strategic role in the organization and clearly show its impact.
Advanced organizations use data to analyze the workforce proactively, make predictions, and create and monitor comprehensive workforce plans to achieve financial success.
HR data has become an strategic priority, but it takes efforts in order to enable the usage of it.
Three Cool Things You Can Do with StandardsMatt Turner
Standards organizations deliver some of the world's most critical information to ensure interoperability and safety across every industry.
I gave this talk at the Standards Technology and Business Forum and covered what people are doing today and how standards organizations can
1) Better Deliver What Customers Want
2) Connect Standards to Their Customer's Data
3) Deliver Standards as Data
In this presentation, let's have a look at What is Data Science and it's applications. We discussed most common use cases of Data Science.
I presented this at LSPE-IN meetup happened on 10th March 2018 at Walmart Global Technology Services.
The Path to Data and Analytics ModernizationAnalytics8
Learn about the business demands driving modernization, the benefits of doing so, and how to get started.
Can your data and analytics solutions handle today’s challenges?
To stay competitive in today’s market, companies must be able to use their data to make better decisions. However, we are living in a world flooded by data, new technologies, and demands from the business for better and more advanced analytics. Most companies do not have the modern technologies and processes in place to keep up with these growing demands. They need to modernize how they collect, analyze, use, and share their data.
In this webinar, we discuss how you can build modern data and analytics solutions that are future ready, scalable, real-time, high speed, and agile and that can enable better use of data throughout your company.
We cover:
-The business demands and industry shifts that are impacting the need to modernize
-The benefits of data and analytics modernization
-How to approach data and analytics modernization- steps you need to take and how to get it right
-The pillars of modern data management
-Tips for migrating from legacy analytics tools to modern, next-gen platforms
-Lessons learned from companies that have gone through the modernization process
Data analytics is the need of any organization using any branded erp software, home grown erp or using MS Excel. To grow business to new verticals Data Analytics show the insights of business!
What you till learn:
GOALS - What is the bar for data science teams
PITFALLS - What are common data science struggles
DIAGNOSES - Why so many of our efforts fail to deliver value
RECOMMENDATIONS - How to address these struggles with best practices
Presented by Mac Steele
Director of Product at Domino Data Lab
Why You Need to STOP Using Spreadsheets for Audit AnalysisCaseWare IDEA
Still using spreadsheets for audit analysis? This presentation reviews why auditors should STOP the practice.
SLIDESHARE: www.slideshare.net/CaseWare_Analytics
WEBSITE: www.casewareanalytics.com
BLOG: www.casewareanalytics.com/blog
TWITTER: www.twitter.com/CW_Analytic
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...BICC Thomas More
9de BI congres van het BICC-Thomas More: 24 maart 2016
Waar traditionele BI voornamelijk beschrijft van WAT er gebeurd is, kunnen we met Self-Service BI een stapje verder gaan en een eerste verklaring geven WAAROM iets zich voordoet. Als we echter tot de wortel willen geraken, moeten we gebruik maken van Analytics.
TechWise with Eric Kavanagh, Dr. Robin Bloor and Dr. Kirk Borne
Live Webcast on July 23, 2014
Watch the archive: https://bloorgroup.webex.com/bloorgroup/lsr.php?RCID=59d50a520542ee7ed00a0c38e8319b54
Analytical applications are everywhere these days, and for good reason. Organizations large and small are using analytics to better understand any aspect of their business: customers, processes, behaviors, even competitors. There are several critical success factors for using analytics effectively: 1) know which kind of apps make sense for your company; 2) figure out which data sets you can use, both internal and external; 3) determine optimal roles and responsibilities for your team; 4) identify where you need help, either by hiring new employees or using consultants 5) manage your program effectively over time.
Register for this episode of TechWise to learn from two of the most experienced analysts in the business: Dr. Robin Bloor, Chief Analyst of The Bloor Group, and Dr. Kirk Borne, Data Scientist, George Mason University. Each will provide their perspective on how companies can address each of the key success factors in building, refining and using analytics to improve their business. There will then be an extensive Q&A session in which attendees can ask detailed questions of our experts and get answers in real time. Registrants will also receive a consolidated deck of slides, not just from the main presenters, but also from a variety of software vendors who provide targeted solutions.
Visit InsideAnlaysis.com for more information.
Similar to Essential Data Science for Product Designers and Non-Scientists (20)
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
In this insightful webinar, Inflectra explores how artificial intelligence (AI) is transforming software development and testing. Discover how AI-powered tools are revolutionizing every stage of the software development lifecycle (SDLC), from design and prototyping to testing, deployment, and monitoring.
Learn about:
• The Future of Testing: How AI is shifting testing towards verification, analysis, and higher-level skills, while reducing repetitive tasks.
• Test Automation: How AI-powered test case generation, optimization, and self-healing tests are making testing more efficient and effective.
• Visual Testing: Explore the emerging capabilities of AI in visual testing and how it's set to revolutionize UI verification.
• Inflectra's AI Solutions: See demonstrations of Inflectra's cutting-edge AI tools like the ChatGPT plugin and Azure Open AI platform, designed to streamline your testing process.
Whether you're a developer, tester, or QA professional, this webinar will give you valuable insights into how AI is shaping the future of software delivery.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
3. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
LOTS OF DEFINITIONS:
▸ The art and science of utilizing data to produce actionable insights.
▸ The utilization of science and mathematics to extract knowledge and insights
▸ A branch of computer science that applies statistics on a dataset to make
predictions and find patterns.
▸ KDD - Knowledge Discovery in Databases
WORKING WITH DATA TO OBTAIN INSIGHTS, MAKE PREDICTIONS AND PROVIDE ADVICE
4. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
LOTS OF BRANCHES & DISCIPLINES
▸ Data Analytics
▸ Big Data
▸ Data Engineering
▸ Database Technology
▸ Statistics
▸ Data Visualization
▸ Machine Learning
▸ Neural Networks
▸ Artificial Intelligence
DATA SCIENCE
5. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
LET’S FOCUS ON THESE TO
ORIENT OURSELVES TO A
NARROWER FOCUS IN DATA
SCIENCE
▸ Data Analytics
▸ Machine Learning
▸ Data Engineering
6. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
BUSINESS INTELLIGENCE VS DATA SCIENCE
▸ Retrospective - Looking at historical data to predict
the future
▸ Pre-canned or pre-defined questions to submit to
system
▸ Data is largely siloed or warehoused
▸ Ask specific questions related to strategic business
operations
BUSINESS INTELLIGENCE DATA SCIENCE
▸ Prospective - forecasting the future
▸ Discovery of questions to ask or development of
questions in the form of hypothesis
▸ Data is distributed in data lakes or in warehouses; can
be real-time streams or near-real time streams
▸ Can ask questions about strategy but can be used for
any domains
7. ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS
WHAT PRODUCT DESIGNERS
NEED TO KNOW?
KNOW THE CONCEPTS AND CHARACTERISTICS OF DATA IS A GOOD START
8. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
GOALS FOR
PRODUCT
DESIGNERS
▸ Provide ability for users to make better decisions with new
knowledge and insights
▸ To optimize and unlock the value of service and operational
efficiencies
▸ Supply insights to make new, better products and services
▸ Converting data into stories that engage and empower users
▸ Employ summary reports or visualization to help decision makers
9. ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS
UNDERSTAND THE PROBLEM
WHAT IS THE PURPOSE? WHAT ARE THE QUESTIONS?
10. ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS
REVIEW THE DATA
CHECK THE ORIGIN, TYPE, PROPERTIES & CLASSES OF YOUR DATA
11. ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS
USING THE RESULTS
HOW ARE THE RESULTS REFLECTED IN YOUR PRODUCT?
12. ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS
DATA IS THE FUELTHE INPUTS FOR ANALYSIS
INSIGHTS AND PREDICTION
13. ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS
LOTS OF DATA POINTS
BIG DATATO EXTRACT SAMPLES
14. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
DATA SOURCES • IT Log Files
• Sensor Data (IOT)
• Website Clickstreams
• Social Media Feeds
• Machine Data
• Location Tracking Data
• Financial Transactions
• Commercial Feeds (3rd Party Vendor)
• Public Databases
• Academic Sources
WHERE TO OBTAIN DATA FROM?
15. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
PROPERTIES OF DATA : 4V
▸ Volume: How much data do you have?
▸ Velocity: How fast is the data coming in? How often?
▸ Variety: How heterogeneous or homogenous is the data?
▸ Veracity: What is the quality of the data?
16. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
VOLUME
Volume is usually data is measured in gigabytes (GB)
and terabytes (TB), sometimes hundreds of megabytes
(MB) can considered a healthy yield
17. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
VELOCITY The rate at which data is delivered into the system. This
can be streamed in real-time, near real-time or
delivered in batches.
18. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
VARIETY The types and kinds of data you expect in your system.
Is it homogenous or heterogenous? We cover this in
the next few slides.
19. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
VERACITY
Does the data accurately reflect what you are trying to
accomplish with the data? Important to rate the source
of the data. Do a cursory review to see if the data is
what you would expect.
20. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
LEVELS OF MEASURES (TYPES OF DATA TOO)
▸ Nominal - Distinct Categories (e.g. Gender)
▸ Ordinal - Ranking or Order (e.g. Service Ratings)
▸ Interval - difference between two values is
meaningful (e.g. Temperature)
▸ Ratio - like interval, but has a clear definition of 0
(e.g. Height)
QUALITATIVE OR QUANTITIVE
21. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
DATA TYPES
▸ Text Data
▸ Image Data
▸ Timestamps
▸ Video Data
▸ Audio
▸ Binary Data
▸ Counters
KINDS OF DATA YOU CAN EXPECT
22. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
DATA STRUCTURE FORMS
• Structured - Typically in a RDBMS, very organized and labeled
• Unstructured - unfiltrered, unlabeled data like images, video, raw
data from IOT
• Semi-Structured - mixed between labeled and unlabeled data
STRUCTURED, UNSTRUCTURED & SEMI-STRUCTURED
23. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
RECORDS AND FIELDS
SEE DATA AS A MATRIX
Variables Attributes Features
Observations
Samples
Tuples
DATA COLUMNS
ROWS
When encountering these words in
data analysis and machine
learning, think in terms of a
spreadsheet as simply columns
(fields) and records (rows) like in a
relational databases.
24. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
DATA QUALITY
๏ Complete
๏ Accurate
๏ Relevant
๏ Fresh or Outdate
๏ Distinct
๏ Accessible
WHAT IS THE LEVEL OF DATA QUALITY? IS IT…
25. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
OFF THE MARK
๏ Accuracy is an issue pertaining to the quality of data and the number of errors
contained in a dataset Precision
๏ Precision refers to the level of measurement and exactness of description in a
dataset; It is important to realize, however, that precise data--no matter how
carefully measured--may be inaccurate.
SEEK BOTH ACCURACY AND PRECISION
This also is related to variance and bias in data…
26. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
ERRORS: VARIANCE
๏ The variance is error from sensitivity to small fluctuations in the training set. High
variance can cause overfitting: modeling the random noise in the training data,
rather than the intended outputs
๏ Add more data to your dataset to mitigate high variance
MEASUREMENT SENSITIVITY
27. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
ERRORS: BIAS
๏ Bias measures how far off in general these models' predictions are from the correct
value.
๏ Review your data; may require you to revise your dataset
ERRONEOUS ASSUMPTIONS
28. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
DATA PIPELINE
1. HARVESTING / STORING
2. MUNGING / CLEANING
3. PREPARATION / PROCESSING
4. ANALYSIS
5. MODELING
6. VISUALIZATION / REPORTING / ACTING
6 STAGES
29. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
DATA PIPELINE
Harvesting > Wrangling > Processing > Analysis > Modeling > Visualization & Action
6 STAGES
DATA ENGINEERING DATA SCIENCE AND ANALYSIS
30. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
1. HARVESTING
Data Yield
Data Sources (Data Lake, Data Warehouse
Data Volume (GB -> TB)
Data Delivery
DATA COLLECTION & INGESTION
31. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
RAW DATA
Data can be sourced from a data lake or data warehouse; value-to-data ratio: low
32. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
2. DATA WRANGLING
Null Values
Duplicates
Incomplete Values
DATA PROCESSING AND PREP
33. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
SCRUB YOUR DATA FOR ANALYSIS
CLEAN DATA
Option 1: Identity and discard records with
null or wrong values
Option 2: Fill-in a placeholder common
value in the dataset
34. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
3. PROCESSING
‣ Ranking
‣ Scoring
‣ Sorting
‣ Grouping
‣ Manipulating
ARRANGING & MANIPULATING DATA
35. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
4. ANALYSISARRANGING THE DATA
Descriptive Analytics
Inferential Analytics
Advanced Statistical Analytics
STATISTICAL
5. MODELING
Density
Linear Regression
Nearest Neighbor
…many more modeling techniques
+
Utilizing stochastic models to validate hypothesis and make predictions
36. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
STATISTICAL
MODELING
Applying statistical models to validate hypothesis, simulate scenarios and make predictions
37. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
6. VISUALIZATION
REPORTING & INTERPRETATION
What does it show?
What does it say?
Exploratory or Explanation
38. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
OBJECTIVES OF DATA ANALYTICS
๏ Find patterns to test hypothesis
๏ Refine an existing hypothesis
๏ Provide actionable insights
๏ Make predictions
CLEARLY DEFINED
39. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
TYPES OF ANALYTICS
• Descriptive - What happened?
• Diagnostic - What went wrong?
• Prescriptive - What to do?
• Predictive - Can this happen?
EACH TYPE ANSWERS DIFFERENT QUESTIONS
40. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
ANSWERS THE QUESTION: WHAT HAPPENED?
Descripitive Analytics is based on historical data and current data
DESCRIPTIVE ANALYTICS
41. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
ANSWERS THE QUESTION: WHY DID THIS HAPPEN? WHAT WENT WRONG?
Deduce and infer the success and failure of a particular activity, initiative, campaign or program
DIAGNOSTIC ANALYTICS
42. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
ANSWERS THE QUESTIONS: WHAT DO I DO? WHAT ACTIONS SHOULD I TAKE?
Based on generated predictions, the analysis provides informed actions a decision
maker can or should take.
PRESCRIPTIVE ANALYTICS
43. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
ANSWERS THE QUESTION: COULD THIS HAPPEN?
Applying stochastic and mathematical models to predict outcomes
PREDICTIVE ANALYTICS
44. ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS
LEARNING FOR THE DATA
MACHINE LEARNINGSUPERVISED AND UNSUPERVISED LEARNING
45. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
IDENTIFYING PATTERNS IN THE DATASET
Input data is unlabeled; process is non-deterministic; use of inferential methods to find relationships, patterns and
correlations;
UNSUPERVISED
46. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
MAKING INFERENCES ON DATASETS
Data is labeled; deterministic process with an input and desired output; algorithms learn from labeled data
SUPERVISED
47. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
COMMON MACHINE LEARNING MODELS
ALGORITHMS
48. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
LINEAR REGRESSION
REGRESSION
LOGISTIC REGRESSION
Logistic regression is used to describe
data and to explain the relationship
between one dependent binary
variable and one or more nominal,
ordinal, interval or ratio-level
independent variables.
Regressions are used to quantify the
relationship between one variable
and the other variables that are
thought to explain it; regressions can
also identify how close and well
determined the relationship is.
49. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
CLASSIFICATION
DECISION TREES
Decision tree builds classification in the
form of a tree structure. It breaks down a
dataset into smaller and smaller subsets
while at the same time an associated
decision tree is incrementally developed
K NEAREST NEIGHBOR
K nearest neighbors is a simple algorithm
that stores all available cases and classifies
new cases based on a similarity measure
(e.g., distance functions)
50. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
CLUSTERING
K-MEANS
K-Means clustering intends to partition n
objects into k clusters in which each object
belongs to the cluster with the nearest
mean.
51. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
LOTS OF DIFFERENT
MACHINE LEARNING
ALGORITHMS
52. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
• What datasets do you have now? What Sources? What
parameters?
• Address 4Vs (Velocity, Variety, Volume, Veracity)
• Rate the data quality
• Identify the attributes, type and groups of data
• Do you need enrich your current dataset?
REMEMBER: APPRAISE YOUR DATA
LOOK AT YOUR PROPERTIES OF YOUR DATA
53. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
EXPECTED OUTCOMES AND YOUR OBJECTIVE
• How is your product going to use the data?
• How does your objective align with results?
• Does the current dataset allow you to make inferences or predictions?
• Get help from domain experts to determine if the attributes and data is sufficient
DOES YOUR DATASET & EXPECTED RESULTS MATCH YOUR OBJECTIVE?
54. ESSENTIAL DATA SCIENCE FOR PRODUCT DESIGNERS
APPLYING DATA SCIENCE TRADECRAFT TO BUILD
DATA PRODUCTS
55. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
USING RESULTS TO IMPROVE PRODUCT EXPERIENCES
Use the findings and insights and apply it to enhancing your product or service
DATA PRODUCT DEV
EXAMPLES
Sales Forecasts
Operational Predictions
Video Recommendations
Featured Content
Other stuff
56. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
USE THE FINDINGS AND INSIGHTS AND APPLY IT TO ENHANCING
YOUR PRODUCT OR SERVICE
DATA ENRICHING PRODUCTS
57. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
PROVIDING ANSWERS TO THE INQUIRIES
Utilize predictive and prescriptive analytics to forecast and give recommendations to decision-makers. Make it clear what actions to
take and why.
USE IN PRODUCTS
58. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
OUTPUTS FORMED FOR HUMAN CONSUMPTION
Key characteristic of data products is data visualization elements like charts, graphs, scoreboards and tables to
communicate the story. Utilize graphics and interactivity to tell your story.
DATA VISUALIZATION
59. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
• Show Relationships
• Make Comparisons
• Show Distribution
• Present Composition
• Make Predictions / ForecastsUSING DATA VISUALIZATIONS
60. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
• Dashboards
• Data Filters
• Data Exploration Features
• Custom Inputs
• Graph Selection
DATA PRODUCT CUSTOMIZATIONS
61. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
ECONOMICS OF WORKING WITH DATA
The cost of labor and resources to
prepare, process and maintain data can
be high. Consider using simpler
models.
COST CONSIDERATIONS
62. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
JCHRISA
THANK YOU!
https://www.linkedin.com/in/jchrisa
Creative Technologist. Product Designer
63. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
REFERENCES
http://insidebigdata.com/2014/06/05/data-munging-exploratory-data-analysis-feature-engineering/
http://www.colorado.edu/geography/gcraft/notes/error/error_f.html
Rhodes, Trey, and Kenneth Foote. "Error, Accuracy, and Precision." Error, Accuracy, and Precision. University of Colorado, n.d.
Web. 05 Apr. 2017.
Fortmann, Scott. "Bias and Variance." Understanding the Bias-Variance Tradeoff. N.p., n.d. Web. 05 June 2012.
http://scott.fortmann-roe.com/docs/BiasVariance.html
Gutierrez , Daniel. "Data Munging, Exploratory Data Analysis, and Feature Engineering." InsideBIGDATA. N.p., 20 June 2014.
Web. 05 Apr. 2017.
Causey, Trey. "Trey Causey – Getting started in data science." Trey Causey – Getting started in data science. N.p., 7
June 2014. Web. 05 Apr. 2017.
http://treycausey.com/getting_started.html
64. ESSENTIALS OF DATA SCIENCE FOR PRODUCT DESIGNERS
REFERENCES CONTINUED
Sayed, Saed. "Data Mining Map." Data Mining Map. Saed Sayed, 2010. Web. 10 Apr. 2017.
http://www.saedsayad.com/modeling.htm
http://www.imf.org/external/pubs/ft/fandd/2006/03/basics.htm
Ramcharan, Rodney. "Finance and Development." Finance and Development | F&D. Finance & Development, Mar. 2006.
Web. 10 Apr. 2017.