Developer Series
Join our webinar series to expand your developer skills!
—
Security Tuesdays
Data Science and AI Wednesdays
Cloud Native and Red Hat OpenShift Thursdays
meetup.com/IBM-Cloud-MEA
Gain hidden insights from your data
using IBM Watson Studio
—
Anam Mahmood
Developer Advocate, UAE
Hashim Noor
Client Technical Specialist, UAE
Gain hidden insights from your data using IBM Watson Studio, Sept 20 2021/ © 2021 IBM Corporation
Get started at: https://ibm.biz/BdfpKw
Let’s get started
• Sign up/Log in to your IBM Cloud account: https://ibm.biz/BdfpKw
• Follow along for the hands-on: https://github.com/IBMDeveloperMEA/Gain-hidden-insights-from-your-data-using-IBM-Watson-Studio
chat with everyone!
Q&A here!
Follow us to get notified
about upcoming events
View more info about this event
Workshop Resources
Agenda
• Data Science and Subsets
• Data Science Methodology
• Data Types
• Problems with Data
• Data Preprocessing
• Hands-on
Introduction to Artificial
Intelligence
“The science and engineering of
making intelligent machines”
- John McCarthy
The subsets of AI
What is Data Science?
Data science is an interdisciplinary
field leveraging insights from
many fields to extract knowledge
from data.
https://blog.finxter.com/artificial-intelligence-machine-learning-deep-learning-and-data-science-whats-the-difference/
Data Science Methodology
1. Problem to Approach
2. Requirements to Collection
3. Understanding to Preparation
4. Modelling to Evaluation
5. Deployment to Feedback
From Problem to Approach
What is the problem we are trying to solve?
How can we use data to answer the question?
Analytic Approach
1. Analytic Model
2. Descriptive Model
3. Statistical Analysis
4. Classification Model
http://www.clipartpanda.com/clipart_images/the-nominal-group-technique-63420908
From Requirements to Collection
What data do we need to answer the question?
Where is the data coming from, and how do we get it?
Data Requirements
Data Collection
From Understanding to Preparation
Does the data represent the problem we are trying to solve?
What additional work is required?
Run descriptive statistics
DataFrame.describe()
Data Preparation
1. Missing Data
2. Invalid Values
3. Remove Duplicates
4. Formatting
5. Feature Engineering
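As a rough illustration of the "run descriptive statistics" step, here is a minimal sketch using only the Python standard library. The column names and sample rows are hypothetical, and the `describe` helper is only an approximation of the kind of summary that pandas `DataFrame.describe()` reports; it is not taken from the workshop notebook. It also shows one preparation step from the list above (removing duplicates).

```python
import statistics

# Hypothetical sample rows; None marks a missing value.
rows = [
    {"applicant_income": 4000, "loan_amount": 120},
    {"applicant_income": 5200, "loan_amount": None},
    {"applicant_income": 3100, "loan_amount": 90},
    {"applicant_income": 4000, "loan_amount": 120},  # exact duplicate of the first row
]

def describe(rows, column):
    """Summarize one numeric column, skipping missing values."""
    values = [r[column] for r in rows if r[column] is not None]
    return {
        "count": len(values),
        "missing": len(rows) - len(values),
        "mean": statistics.mean(values),
        "std": statistics.stdev(values),
        "min": min(values),
        "median": statistics.median(values),
        "max": max(values),
    }

def drop_duplicates(rows):
    """Keep the first occurrence of each identical row."""
    seen, unique = set(), []
    for r in rows:
        key = tuple(sorted(r.items()))
        if key not in seen:
            seen.add(key)
            unique.append(r)
    return unique

income_summary = describe(rows, "applicant_income")
loan_summary = describe(rows, "loan_amount")
unique_rows = drop_duplicates(rows)

print(income_summary)
print(loan_summary)
print(len(unique_rows), "unique rows")
```

A summary like this is a quick way to spot which of the preparation steps (missing data, invalid values, duplicates) a dataset is likely to need.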
From Modeling to Evaluation
How can the data be visualized to get to the answer that is required?
Does the model used really answer the initial question or does it need to be adjusted?
DESCRIPTIVE ANALYSIS
PREDICTIVE ANALYSIS
The Diagnostic Measures phase
The Statistical Significance phase
From Deployment to Feedback
Can you put the model into practice?
Can you get constructive feedback that helps answer the question?
https://www.newbreedmarketing.com/blog/how-to-translate-customer-feedback-into-action
https://giphy.com/gifs/producthunt-push-to-deploy-3og0IAQG2BtR13joe4
Data Science Methodology
https://www.geeksforgeeks.org/data-science-methodology-and-approach/
Types of Data
https://www.pinterest.com/pin/404127766560273784/
Problems with Data
Getting Data
1. Getting the right data.
2. Often a problem needs data that does not exist, or that is held by another entity.
3. Another challenge is that data may come from a variety of different sources.
4. Data also comes in different formats:
• Database
• CSV
• Unstructured
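To make the point about multiple sources and formats concrete, here is a small sketch that pulls records from a CSV source, a database, and free text, then combines them into one structure. Everything here is an assumption made for illustration: the in-memory CSV string and SQLite table stand in for real files and databases, and the field names (`customer_id`, `income`, `loan_amount`) are hypothetical.

```python
import csv
import io
import sqlite3

# CSV source (an in-memory string stands in for a file on disk).
csv_text = "customer_id,income\n1,4000\n2,5200\n"
csv_rows = list(csv.DictReader(io.StringIO(csv_text)))

# Database source (in-memory SQLite stands in for a real database).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE loans (customer_id INTEGER, loan_amount REAL)")
conn.executemany("INSERT INTO loans VALUES (?, ?)", [(1, 120.0), (2, 90.0)])
loans = dict(conn.execute("SELECT customer_id, loan_amount FROM loans"))
conn.close()

# Unstructured source: free-text notes keyed by customer.
notes = {1: "Customer asked about early repayment.", 2: ""}

# Combine the three sources into one list of records with consistent types.
combined = []
for row in csv_rows:
    customer_id = int(row["customer_id"])  # CSV values arrive as strings
    combined.append({
        "customer_id": customer_id,
        "income": float(row["income"]),
        "loan_amount": loans.get(customer_id),
        "has_note": bool(notes.get(customer_id, "").strip()),
    })

for record in combined:
    print(record)
```

Note that each source delivers values in a different shape (strings from CSV, typed values from the database, raw text), which is why a conversion step is usually needed before the data can be analyzed together.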
Completeness and Quality
• Once the data is obtained, it is important
to ensure that the data is usable.
1. Are there missing values?
2. Does the data reflect reality?
3. Are there outliers in the data?
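Two of the checks above (missing values and outliers) can be sketched in a few lines. This example assumes a hypothetical list of incomes and uses the common 1.5 × IQR rule to flag outliers; that rule is one convention among several and is not something the slides prescribe.

```python
import statistics

# Hypothetical income values; None marks a missing value.
incomes = [3100, 3500, 4000, 4200, None, 4600, 5200, 81000]

# Check 1: are there missing values?
missing_count = sum(1 for v in incomes if v is None)
present = sorted(v for v in incomes if v is not None)

# Check 3: are there outliers? Flag values outside 1.5 * IQR of the quartiles.
q1, _, q3 = statistics.quantiles(present, n=4)
iqr = q3 - q1
lower, upper = q1 - 1.5 * iqr, q3 + 1.5 * iqr
outliers = [v for v in present if v < lower or v > upper]

print("missing:", missing_count)
print("outliers:", outliers)
```

Whether a flagged value is an error or a genuine extreme case is a judgment that needs domain knowledge, which ties back to the second question on the slide.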
Feature Engineering
• The process of using domain knowledge to
extract features from raw data via data
mining techniques.
• We use feature engineering when the data
or features we want don’t exist but are
related to other features.
• Example: a loan application
  Features available: Total Income, EMI
  New feature added: Balance Income
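The loan example could look roughly like the sketch below. The formulas are assumptions made for illustration, since the slide does not define them: Total Income is taken as applicant plus co-applicant income, EMI (equated monthly installment) is simplified to loan amount divided by term with interest ignored, and Balance Income is taken as Total Income minus EMI. The field names are hypothetical.

```python
# Hypothetical loan applications.
applications = [
    {"applicant_income": 4000, "coapplicant_income": 1500,
     "loan_amount": 108000, "loan_term_months": 360},
    {"applicant_income": 3000, "coapplicant_income": 0,
     "loan_amount": 60000, "loan_term_months": 120},
]

def add_engineered_features(app):
    """Return a copy of the application with derived features added."""
    total_income = app["applicant_income"] + app["coapplicant_income"]
    # Assumption: simplified EMI that ignores interest.
    emi = app["loan_amount"] / app["loan_term_months"]
    # Assumption: income left over each month after paying the installment.
    balance_income = total_income - emi
    return {**app, "total_income": total_income, "emi": emi,
            "balance_income": balance_income}

enriched = [add_engineered_features(app) for app in applications]

for app in enriched:
    print(app["total_income"], app["emi"], app["balance_income"])
```

The new feature does not exist in the raw data, but it is derived from features that do, and it may describe repayment capacity more directly than any single raw column.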
Data Preprocessing
Converting or mapping data
from raw form into another form
to prepare data for further
analysis
Data Preprocessing
Deal with missing data
• Check the data collection source
• Drop the missing value
o Drop the variable
o Drop the data entry
• Replace the missing values
o Use average
o Use frequency
• Leave it as missing value
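The options above can be sketched as small helper functions. This is a minimal standard-library illustration with hypothetical column names and data; which option is appropriate depends on the dataset, and the combination chosen at the end is just one plausible choice.

```python
import statistics

# Hypothetical rows; None marks a missing value.
rows = [
    {"income": 4000, "gender": "male", "credit_history": 1},
    {"income": None, "gender": "female", "credit_history": None},
    {"income": 5000, "gender": None, "credit_history": None},
    {"income": 6000, "gender": "female", "credit_history": None},
]

def drop_entries(rows, column):
    """Drop the data entry: remove rows where the column is missing."""
    return [r for r in rows if r[column] is not None]

def drop_variable(rows, column):
    """Drop the variable: remove the column from every row."""
    return [{k: v for k, v in r.items() if k != column} for r in rows]

def fill_with_mean(rows, column):
    """Replace missing numeric values with the column average."""
    mean = statistics.mean(r[column] for r in rows if r[column] is not None)
    return [{**r, column: mean if r[column] is None else r[column]} for r in rows]

def fill_with_mode(rows, column):
    """Replace missing categorical values with the most frequent value."""
    mode = statistics.mode(r[column] for r in rows if r[column] is not None)
    return [{**r, column: mode if r[column] is None else r[column]} for r in rows]

# One possible combination: drop a mostly empty column, then fill the rest.
cleaned = drop_variable(rows, "credit_history")
cleaned = fill_with_mean(cleaned, "income")
cleaned = fill_with_mode(cleaned, "gender")

for row in cleaned:
    print(row)
```

Dropping rows or columns loses information, while replacing values introduces an assumption about what the missing value would have been, so it is worth recording which choice was made for each column.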
Data Preprocessing
Data Formatting
• Ensure data is consistent and easily understandable to make meaningful comparisons.
Data Normalization
• Bring data into a similar range for more useful comparison.
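As a sketch of both ideas, the example below first formats inconsistent text values into one canonical form, then normalizes a numeric column in two common ways: min-max scaling to the range 0 to 1, and z-score standardization. The sample values are hypothetical, and the slides do not say which normalization method the workshop uses.

```python
import statistics

# Data formatting: the same city written several different ways.
raw_cities = ["Dubai", "dubai ", "DUBAI", "Abu Dhabi", "abu dhabi"]
formatted_cities = [c.strip().title() for c in raw_cities]

def min_max_scale(values):
    """Rescale values to the range [0, 1]; assumes they are not all equal."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

def z_score(values):
    """Center values on the mean and divide by the population std deviation."""
    mean = statistics.mean(values)
    std = statistics.pstdev(values)
    return [(v - mean) / std for v in values]

# Data normalization: bring incomes into a comparable range.
incomes = [2000, 4000, 6000, 10000]
scaled = min_max_scale(incomes)
standardized = z_score(incomes)

print(formatted_cities)
print(scaled)
print(standardized)
```

After scaling, a column measured in thousands (income) and a column measured in single digits can be compared or combined without the larger numbers dominating.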
Data Preprocessing
Data Binning
• Data binning gives a better understanding of the data distribution.
Turning categorical values into numerical values
• For example: turn the values of female/male into 0/1
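Both techniques can be sketched briefly. The example assumes equal-width bins with hypothetical labels and income values (other binning strategies, such as equal-frequency bins, are also common), and uses the female/male to 0/1 mapping from the slide.

```python
from collections import Counter

def bin_equal_width(values, labels):
    """Assign each value to one of len(labels) equal-width bins."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / len(labels)
    binned = []
    for v in values:
        # The maximum value would land one past the end, so clamp it.
        index = min(int((v - lo) / width), len(labels) - 1)
        binned.append(labels[index])
    return binned

# Data binning: group hypothetical incomes into three ranges.
incomes = [2000, 2500, 4000, 5500, 8000, 11000]
income_bins = bin_equal_width(incomes, ["low", "medium", "high"])
bin_counts = Counter(income_bins)

# Categorical to numerical: map female/male to 0/1.
gender_codes = {"female": 0, "male": 1}
genders = ["female", "male", "male", "female"]
encoded = [gender_codes[g] for g in genders]

print(income_bins)
print(bin_counts)
print(encoded)
```

The bin counts give a rough picture of the distribution, and the encoded column can be passed to models that accept only numeric input.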
Data Science Tools
• Data
• Machine Learning
• Visualization
• Code
• Ethics: AI Fairness 360, IBM OpenScale, LIME, SHAP
Let’s get started
• Sign up/Log in to your IBM Cloud account: https://ibm.biz/BdfpKw
• Follow along for the hands-on: https://github.com/IBMDeveloperMEA/Gain-hidden-insights-from-your-data-using-IBM-Watson-Studio
Survey
https://ibm.biz/BdfpKk
Summary
• Introduction to AI
• Introduction to Data Science
• Data Science Methodology
o From problem to approach
o From requirements to collection
o From understanding to preparation
o From modelling to evaluation
o From deployment to feedback
• Problems with Data
o Getting data
o Completeness and quality
o Feature engineering
o Data Preprocessing
• Hands-on: Use Watson Studio to gather insights from data
Resources
IBM Developer: https://developer.ibm.com/
Meetup: https://www.meetup.com/IBM-Cloud-MEA/
Learning:
– https://cognitiveclass.ai/
– https://learn.ibm.com/
Spark Fundamentals: https://cognitiveclass.ai/learn/spark
Data Science Methodology: https://cognitiveclass.ai/courses/data-science-methodology-2
Python for Data Science: https://cognitiveclass.ai/courses/python-for-data-science
Data visualization, preparation, and transformation using IBM Watson Studio: https://developer.ibm.com/tutorials/watson-studio-data-visualization-preparation-transformation/?mhsrc=ibmsearch_a&mhq=data%20visualization
Take control of your data with Watson Studio: https://developer.ibm.com/learningpaths/get-started-watson-studio/
Thank you
Anam Mahmood
Developer Advocate, UAE
anam.mahmood@ibm.com
Hashim Noor
Client Technical Specialist, UAE
hashim.noor1@ibm.com

Editor's Notes

  • #3 Hello everyone, and a very good afternoon to all of you. Thank you for joining our webinar. In today’s webinar we will be talking about AI and Data Science, and we will also present a use case on how you can use IBM Watson Studio to gain insights from your data. Before we proceed, I would like to introduce myself. My name is Hashim Noor and I am a Client Technical Specialist at IBM; my areas of focus are AI, Blockchain, and Web Application Development. Along with me today I have my colleague Anam. Anam, would you like to introduce yourself? With that, let’s get started.
  • #4 For today’s webinar we will be leveraging the IBM Cloud platform. If you want to do the hands-on exercise, you will need an IBM Cloud account. You can register or log in to your IBM Cloud account using this link, which Anam will put in the chat. Additionally, we also have a GitHub repository where all the steps we will be doing today are documented, so if there is anything you would like to go back to, you can always refer to the GitHub repository. Anam will also put the link to this GitHub repository in the chat.
  • #5 Before we get started, I quickly want to go over a few important things to note about Crowdcast. At the bottom you will see two panels: the chat panel and the Q&A panel. If at any point during the webinar you have any questions, you can post them in either of the two panels. To view the resources being used in today’s webinar, you can click on the “Get started Here” button. And if you would like to stay up to date with our upcoming events and webinars, you can click on the “Follow” button at the top right-hand side of the window.
  • #6 In today’s session we will talk about how you can collect, cleanse, and enhance your data. We will also talk about data science and its subsets. We will then move on to the problems you can encounter with data and how you can fix them. We will then talk about the different techniques for data preprocessing. After that, Anam will take you through the code lab.
  • #17 Textual data: unstructured data; JSON; tabular form (Excel, CSV)
  • #23 https://medium.com/analytics-vidhya/data-cleaning-and-preprocessing-a4b751f4066f
  • #28 For today’s webinar we will be leveraging the IBM Cloud platform. If you want to do the hands-on exercise, you will need an IBM Cloud account. You can register or log in to your IBM Cloud account using this link, which Anam will put in the chat. Additionally, we also have a GitHub repository where all the steps we will be doing today are documented, so if there is anything you would like to go back to, you can always refer to the GitHub repository. Anam will also put the link to this GitHub repository in the chat.