SlideShare a Scribd company logo
Machine Learning Workflow
Learning Objectives
✦Describe the seven steps of machine learning workflow
The Machine Learning Workflow
The job of the data scientist
The processes a data scientist follows to provide
feedback to decision makers
The machine learning process in business
environment
Goals of Machine Learning
Workflow
Goals
• Derive answers to business
challenges
• Derive meaningful conclusions for
complicated issues
• Identify actionable steps with a
given set of variables
Yes
No
Yes
Yes
7 steps of Machine Learning
1.Get
more data
2. Ask a
sharp
question
3. Add
the data
to table
4. Check
for quality
5.
Transfor
m
features
6. Answer
the
question
7. Use
the
answer
Step 1 : Get More Data
Data can be collected in different formats
Investigate a business challenge
Quality of the model depends upon quality
and quantity of the data gathered
Step 1 : Get More Data - data formats
Step 2 : Ask a sharp question
Need for a
sharp
question
It is direct and specific
It focuses on a single topic
It helps you to get clear
answers to the questions.
It focuses on the exact need
and requirement
Step 2 : Ask a sharp question (Vague vs
Sharp)
Vague questions
1. What should you do?
2. How should you live your life?
3. Which career path should you take?
4. Which data can tell you about your business?
Sharp questions
1. Which route will get you to work faster?
2. When are you planning to join the company?
3. How many times will a user use the new product
features?
4. Where did you go to college?
Step 2 : Ask a sharp question (Example)
✦Study different tables of data in the database and analyse your company’s
monthly sales performance
✦Understand how the company is doing in terms of market share
✦Analyse the historical data and predict the stock price for a future date
Step 3 : Add data to the table
Data analyst arranges data in database tables in a systematic
manner.01
Systematic arrangement of data helps in detailed analysis02
Data is stored in the table in the form of columns and rows.03
Table columns represent data of a single type and rows
represent records pertaining to one entity.04
Aggregate, distribute, compute of measure to derive data
analysis.05
Data Analysis in Machine Learning
The process of deriving new
findings from historical data
Focuses on aggregating
table data to find answers to
business problems
Performed by data analysts
to build machine learning
algorithms
Example : Add Data to the Table
• The stock price column shows the stock value across different dates
• Each table row represents observations across given attributes
Example : Data Analysis
Aggregate and distribute the data as shown here:
Example : Aggregate
• You can aggregate the data in the table to derive answers
• This process is called data analysis and involves counting total
observations in a table or combining data from multiple tables
Example : Distribute, Compute, Measure
• An example of performing aggregate, distribute, compute and measure operations on data
in tables
• Each feature and their observations are distributed across the table and then combined
Example : Estimate
• The market share column shows the estimated stock price values of the company that are
derived from the previous steps
Step 4 : Check for Quality
Determine if the data is acceptable for
further investigation
Ensure the data in a column is in a
consistent format
Check for Quality : Example
Check for Quality : Example
• There is inconsistencies in the data format in the Birth year column of the table
• Dates in the column need to be converted to a consistent format to make it readable for the
ML algorithm
Check for Quality : Example
• Denote the Birth year column numbers as numbers, without any special characters
• Checking data quality is a critical step
Step 5 : Transform Features
•Enables you to make sense out of the data, especially when
there are multiple features
•Help overcome challenges where some features may not
give useful information for the model, where as some
features may be combined to derive meaningful information
Feature Engineering
Tricks of Feature Engineering
•Scale Invariant Feature Transform (SIFT): Images
•Term Frequency-Inverse Document Frequency (TF-IDF):
Text
Data Specific
•Econometric, technological, agricultural and sociological
data engineering
Domain Specific
•Images, text and audio data engineering
Deep Learning
Transform Features : Example
• There are 3 columns and 65670 rows
• Features 0 and 1 have similar values
• The numbers are meaningless and scattered
Transform Features : Example
• Values of feature column 0 is multiplied with every observation in feature
column 1
• These values are plotted in image 2
Transform Features : Example
• By plotting the values obtained by subtracting feature 0 from feature 1, a
curve is formed
• This curve is a normal or gaussian distribution or bell-shaped curve
Step 6 : Answer the question
Helps to analyse if the obtained answers are clear
Question
s
How much or how many?
Which category?
Which group?
Does this look strange?
Which action?
Answer the Question: Type 1
What will be the temperature this Friday?01
How much or how many?
How many people will like your post?02
What will be your product sales next month?03
Answer the Question: Type 2
Is this an image of a dog?01
Which category?
What is the topic of this news article?02
Which hotel in your area offers free Wi-Fi?03
Answer the Question: Type 3
Which group of shoppers purchase similar
products?01
Which group?
Which group of viewers like horror movies?02
How best can you divide this book into ten
topics?03
Answer the Question: Type 4
Is this internet message typical?01
Does this look strange?
Is this heart beat reading abnormal?02
Do these transactions look unusual as opposed
to customers usual transactions?03
Answer the Question: Type 5
Should you vacuum again or not?01
Which action?
Should you beat the red light?02
Should you raise or lower the temperature?03
Step 7 : Use the Answer
Making up a decision01
There are plenty of ways to use the answer derived
form the previous step.
Proposing the price of an item02
Publishing the results obtained as part of a
research paper
03
Constructing a dashboard on a visualisation tool04
Making changes to product features05
Key Takeaway
✦Machine learning workflow involves seven steps.
✦The first step of machine learning workflow is used to collect data to answer different business
questions.
✦To get the desired response, always ask sharp questions and avoid vague ones.
✦Arrange raw data in tables for better data analysis.
✦To ensure data consistency data scientists must check for the quality of data.
✦The transform feature is used to increase the efficiency of the machine learning model.
✦The answer received by the model helps in solving business challenges.
✦Learn from the answer received by the model and implement it as a solution to the problem.

More Related Content

What's hot

CIS 111 STUDY Inspiring Innovation--cis111study.com
CIS 111 STUDY Inspiring Innovation--cis111study.comCIS 111 STUDY Inspiring Innovation--cis111study.com
CIS 111 STUDY Inspiring Innovation--cis111study.comKeatonJennings90
 
CIS 111 STUDY Education Your Life--cis111study
CIS 111 STUDY Education Your Life--cis111studyCIS 111 STUDY Education Your Life--cis111study
CIS 111 STUDY Education Your Life--cis111studythomashard44
 
Chap02 presenting data in chart & tables
Chap02 presenting data in chart & tablesChap02 presenting data in chart & tables
Chap02 presenting data in chart & tablesUni Azza Aunillah
 
Intro To Spreadsheets Y34
Intro To Spreadsheets Y34Intro To Spreadsheets Y34
Intro To Spreadsheets Y34birchfields
 
De vry math 399 all ilabs latest 2016 november
De vry math 399 all ilabs latest 2016 novemberDe vry math 399 all ilabs latest 2016 november
De vry math 399 all ilabs latest 2016 novemberlenasour
 
Chap14 multiple regression model building
Chap14 multiple regression model buildingChap14 multiple regression model building
Chap14 multiple regression model buildingUni Azza Aunillah
 
Supervised machine learning algorithms(strengths and weaknesses)
Supervised machine learning algorithms(strengths and weaknesses)Supervised machine learning algorithms(strengths and weaknesses)
Supervised machine learning algorithms(strengths and weaknesses)MonarchSaha
 
Complete Introduction to Business Data Analysis
Complete Introduction to Business Data AnalysisComplete Introduction to Business Data Analysis
Complete Introduction to Business Data AnalysisSam Dias
 
Importance of Numerical Methods in CSE.pptx
Importance of Numerical Methods in CSE.pptxImportance of Numerical Methods in CSE.pptx
Importance of Numerical Methods in CSE.pptxSanad Bhowmik
 
[PDF] Optimization Modeling with Spreadsheets
[PDF] Optimization Modeling with Spreadsheets[PDF] Optimization Modeling with Spreadsheets
[PDF] Optimization Modeling with SpreadsheetsAdrianaPeraltay67
 

What's hot (12)

CIS 111 STUDY Inspiring Innovation--cis111study.com
CIS 111 STUDY Inspiring Innovation--cis111study.comCIS 111 STUDY Inspiring Innovation--cis111study.com
CIS 111 STUDY Inspiring Innovation--cis111study.com
 
CIS 111 STUDY Education Your Life--cis111study
CIS 111 STUDY Education Your Life--cis111studyCIS 111 STUDY Education Your Life--cis111study
CIS 111 STUDY Education Your Life--cis111study
 
Chap02 presenting data in chart & tables
Chap02 presenting data in chart & tablesChap02 presenting data in chart & tables
Chap02 presenting data in chart & tables
 
Intro To Spreadsheets Y34
Intro To Spreadsheets Y34Intro To Spreadsheets Y34
Intro To Spreadsheets Y34
 
De vry math 399 all ilabs latest 2016 november
De vry math 399 all ilabs latest 2016 novemberDe vry math 399 all ilabs latest 2016 november
De vry math 399 all ilabs latest 2016 november
 
Chap14 multiple regression model building
Chap14 multiple regression model buildingChap14 multiple regression model building
Chap14 multiple regression model building
 
Conceptual modeling
Conceptual modelingConceptual modeling
Conceptual modeling
 
Microsoft Mathematics
Microsoft MathematicsMicrosoft Mathematics
Microsoft Mathematics
 
Supervised machine learning algorithms(strengths and weaknesses)
Supervised machine learning algorithms(strengths and weaknesses)Supervised machine learning algorithms(strengths and weaknesses)
Supervised machine learning algorithms(strengths and weaknesses)
 
Complete Introduction to Business Data Analysis
Complete Introduction to Business Data AnalysisComplete Introduction to Business Data Analysis
Complete Introduction to Business Data Analysis
 
Importance of Numerical Methods in CSE.pptx
Importance of Numerical Methods in CSE.pptxImportance of Numerical Methods in CSE.pptx
Importance of Numerical Methods in CSE.pptx
 
[PDF] Optimization Modeling with Spreadsheets
[PDF] Optimization Modeling with Spreadsheets[PDF] Optimization Modeling with Spreadsheets
[PDF] Optimization Modeling with Spreadsheets
 

Similar to 03 machine learning workflow v2

DWaltherGraniteStateUG 2023.pptx
DWaltherGraniteStateUG 2023.pptxDWaltherGraniteStateUG 2023.pptx
DWaltherGraniteStateUG 2023.pptxDeb Walther
 
Demystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep DiveDemystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep DiveHyderabad Scalability Meetup
 
Enterprise Architecture Surveys with LeanIX
Enterprise Architecture Surveys with LeanIXEnterprise Architecture Surveys with LeanIX
Enterprise Architecture Surveys with LeanIXLeanIX GmbH
 
AI-900 - Fundamental Principles of ML.pptx
AI-900 - Fundamental Principles of ML.pptxAI-900 - Fundamental Principles of ML.pptx
AI-900 - Fundamental Principles of ML.pptxkprasad8
 
19 11-13 MS Ignite The Tour Paris - Surfing Microsoft 365 waves
19 11-13 MS Ignite The Tour Paris -  Surfing Microsoft 365 waves19 11-13 MS Ignite The Tour Paris -  Surfing Microsoft 365 waves
19 11-13 MS Ignite The Tour Paris - Surfing Microsoft 365 wavesPatrick Guimonet
 
Tableau Visual analytics complete deck 2
Tableau Visual analytics complete deck 2Tableau Visual analytics complete deck 2
Tableau Visual analytics complete deck 2Arun K
 
Data preprocessing using Machine Learning
Data  preprocessing using Machine Learning Data  preprocessing using Machine Learning
Data preprocessing using Machine Learning Gopal Sakarkar
 
ML Framework for auto-responding to customer support queries
ML Framework for auto-responding to customer support queriesML Framework for auto-responding to customer support queries
ML Framework for auto-responding to customer support queriesVarun Nathan
 
Group 1 Report CRISP - DM METHODOLOGY.pptx
Group 1 Report CRISP - DM METHODOLOGY.pptxGroup 1 Report CRISP - DM METHODOLOGY.pptx
Group 1 Report CRISP - DM METHODOLOGY.pptxellamangapis2003
 
Accelerate business transformation with AI Builder
Accelerate business transformation with AI BuilderAccelerate business transformation with AI Builder
Accelerate business transformation with AI BuilderHamish Sheild
 
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docx
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docxSTAT200 Assignment #2 - Descriptive Statistics Analysis and.docx
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docxrafaelaj1
 
Data Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisionsData Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisionsVivastream
 
The 8 Step Data Mining Process
The 8 Step Data Mining ProcessThe 8 Step Data Mining Process
The 8 Step Data Mining ProcessMarc Berman
 
The Life Cycle Of Data Science PPT.pdf
The Life Cycle Of Data Science PPT.pdfThe Life Cycle Of Data Science PPT.pdf
The Life Cycle Of Data Science PPT.pdfPhurba Sherpa
 
Digital analytics lecture1
Digital analytics lecture1Digital analytics lecture1
Digital analytics lecture1Joni Salminen
 
Introducition to Data scinece compiled by hu
Introducition to Data scinece compiled by huIntroducition to Data scinece compiled by hu
Introducition to Data scinece compiled by huwekineheshete
 
1. Text mining – Text mining or text data mining is a process to e.docx
1. Text mining – Text mining or text data mining is a process to e.docx1. Text mining – Text mining or text data mining is a process to e.docx
1. Text mining – Text mining or text data mining is a process to e.docxstilliegeorgiana
 
Machine learning workshop @DYP Pune
Machine learning workshop @DYP PuneMachine learning workshop @DYP Pune
Machine learning workshop @DYP PuneGanesh Raskar
 
Practical Software Development Metrics
Practical Software Development MetricsPractical Software Development Metrics
Practical Software Development MetricsJari Kuusisto
 

Similar to 03 machine learning workflow v2 (20)

DWaltherGraniteStateUG 2023.pptx
DWaltherGraniteStateUG 2023.pptxDWaltherGraniteStateUG 2023.pptx
DWaltherGraniteStateUG 2023.pptx
 
Demystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep DiveDemystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep Dive
 
Enterprise Architecture Surveys with LeanIX
Enterprise Architecture Surveys with LeanIXEnterprise Architecture Surveys with LeanIX
Enterprise Architecture Surveys with LeanIX
 
AI-900 - Fundamental Principles of ML.pptx
AI-900 - Fundamental Principles of ML.pptxAI-900 - Fundamental Principles of ML.pptx
AI-900 - Fundamental Principles of ML.pptx
 
19 11-13 MS Ignite The Tour Paris - Surfing Microsoft 365 waves
19 11-13 MS Ignite The Tour Paris -  Surfing Microsoft 365 waves19 11-13 MS Ignite The Tour Paris -  Surfing Microsoft 365 waves
19 11-13 MS Ignite The Tour Paris - Surfing Microsoft 365 waves
 
Tableau Visual analytics complete deck 2
Tableau Visual analytics complete deck 2Tableau Visual analytics complete deck 2
Tableau Visual analytics complete deck 2
 
Data preprocessing using Machine Learning
Data  preprocessing using Machine Learning Data  preprocessing using Machine Learning
Data preprocessing using Machine Learning
 
ML Framework for auto-responding to customer support queries
ML Framework for auto-responding to customer support queriesML Framework for auto-responding to customer support queries
ML Framework for auto-responding to customer support queries
 
Group 1 Report CRISP - DM METHODOLOGY.pptx
Group 1 Report CRISP - DM METHODOLOGY.pptxGroup 1 Report CRISP - DM METHODOLOGY.pptx
Group 1 Report CRISP - DM METHODOLOGY.pptx
 
Accelerate business transformation with AI Builder
Accelerate business transformation with AI BuilderAccelerate business transformation with AI Builder
Accelerate business transformation with AI Builder
 
Presentation.pptx
Presentation.pptxPresentation.pptx
Presentation.pptx
 
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docx
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docxSTAT200 Assignment #2 - Descriptive Statistics Analysis and.docx
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docx
 
Data Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisionsData Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisions
 
The 8 Step Data Mining Process
The 8 Step Data Mining ProcessThe 8 Step Data Mining Process
The 8 Step Data Mining Process
 
The Life Cycle Of Data Science PPT.pdf
The Life Cycle Of Data Science PPT.pdfThe Life Cycle Of Data Science PPT.pdf
The Life Cycle Of Data Science PPT.pdf
 
Digital analytics lecture1
Digital analytics lecture1Digital analytics lecture1
Digital analytics lecture1
 
Introducition to Data scinece compiled by hu
Introducition to Data scinece compiled by huIntroducition to Data scinece compiled by hu
Introducition to Data scinece compiled by hu
 
1. Text mining – Text mining or text data mining is a process to e.docx
1. Text mining – Text mining or text data mining is a process to e.docx1. Text mining – Text mining or text data mining is a process to e.docx
1. Text mining – Text mining or text data mining is a process to e.docx
 
Machine learning workshop @DYP Pune
Machine learning workshop @DYP PuneMachine learning workshop @DYP Pune
Machine learning workshop @DYP Pune
 
Practical Software Development Metrics
Practical Software Development MetricsPractical Software Development Metrics
Practical Software Development Metrics
 

More from Anne Starr

I01letor20so201leutor2020
I01letor20so201leutor2020I01letor20so201leutor2020
I01letor20so201leutor2020Anne Starr
 
Iso27001leadauditor2020
Iso27001leadauditor2020Iso27001leadauditor2020
Iso27001leadauditor2020Anne Starr
 
Dncybersecurity
DncybersecurityDncybersecurity
DncybersecurityAnne Starr
 
Dancyrityshy 1foundatioieh
Dancyrityshy 1foundatioiehDancyrityshy 1foundatioieh
Dancyrityshy 1foundatioiehAnne Starr
 
2 slides(2ndvariadaystion)
2 slides(2ndvariadaystion)2 slides(2ndvariadaystion)
2 slides(2ndvariadaystion)Anne Starr
 
Awtitioneressentialsdeckscloudprac401-577
Awtitioneressentialsdeckscloudprac401-577Awtitioneressentialsdeckscloudprac401-577
Awtitioneressentialsdeckscloudprac401-577Anne Starr
 
01wslouAsentialsdeck2dpractitioneres-400
01wslouAsentialsdeck2dpractitioneres-40001wslouAsentialsdeck2dpractitioneres-400
01wslouAsentialsdeck2dpractitioneres-400Anne Starr
 
uderessAwscloentialsdeck1-2ion00
uderessAwscloentialsdeck1-2ion00uderessAwscloentialsdeck1-2ion00
uderessAwscloentialsdeck1-2ion00Anne Starr
 
Cloudhnologysstecociat
CloudhnologysstecociatCloudhnologysstecociat
CloudhnologysstecociatAnne Starr
 
Cmbysantocsddsh
CmbysantocsddshCmbysantocsddsh
CmbysantocsddshAnne Starr
 
Cddmbysantcsosh
CddmbysantcsoshCddmbysantcsosh
CddmbysantcsoshAnne Starr
 
Ccbysantsddosh
Ccbysantsddosh  Ccbysantsddosh
Ccbysantsddosh Anne Starr
 

More from Anne Starr (20)

I01letor20so201leutor2020
I01letor20so201leutor2020I01letor20so201leutor2020
I01letor20so201leutor2020
 
Iso27001leadauditor2020
Iso27001leadauditor2020Iso27001leadauditor2020
Iso27001leadauditor2020
 
Ccsddm5days
Ccsddm5daysCcsddm5days
Ccsddm5days
 
Dayblic
DayblicDayblic
Dayblic
 
Day1cspbeblic
Day1cspbeblicDay1cspbeblic
Day1cspbeblic
 
Dncybersecurity
DncybersecurityDncybersecurity
Dncybersecurity
 
Dancyrityshy 1foundatioieh
Dancyrityshy 1foundatioiehDancyrityshy 1foundatioieh
Dancyrityshy 1foundatioieh
 
2 slides(2ndvariadaystion)
2 slides(2ndvariadaystion)2 slides(2ndvariadaystion)
2 slides(2ndvariadaystion)
 
Sec4
Sec4Sec4
Sec4
 
Secuntialesse
SecuntialesseSecuntialesse
Secuntialesse
 
Securityic2
Securityic2Securityic2
Securityic2
 
)k
)k)k
)k
 
inte
inteinte
inte
 
Awtitioneressentialsdeckscloudprac401-577
Awtitioneressentialsdeckscloudprac401-577Awtitioneressentialsdeckscloudprac401-577
Awtitioneressentialsdeckscloudprac401-577
 
01wslouAsentialsdeck2dpractitioneres-400
01wslouAsentialsdeck2dpractitioneres-40001wslouAsentialsdeck2dpractitioneres-400
01wslouAsentialsdeck2dpractitioneres-400
 
uderessAwscloentialsdeck1-2ion00
uderessAwscloentialsdeck1-2ion00uderessAwscloentialsdeck1-2ion00
uderessAwscloentialsdeck1-2ion00
 
Cloudhnologysstecociat
CloudhnologysstecociatCloudhnologysstecociat
Cloudhnologysstecociat
 
Cmbysantocsddsh
CmbysantocsddshCmbysantocsddsh
Cmbysantocsddsh
 
Cddmbysantcsosh
CddmbysantcsoshCddmbysantcsosh
Cddmbysantcsosh
 
Ccbysantsddosh
Ccbysantsddosh  Ccbysantsddosh
Ccbysantsddosh
 

Recently uploaded

2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...Sandy Millin
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativePeter Windle
 
Basic Civil Engineering Notes of Chapter-6, Topic- Ecosystem, Biodiversity G...
Basic Civil Engineering Notes of Chapter-6,  Topic- Ecosystem, Biodiversity G...Basic Civil Engineering Notes of Chapter-6,  Topic- Ecosystem, Biodiversity G...
Basic Civil Engineering Notes of Chapter-6, Topic- Ecosystem, Biodiversity G...Denish Jangid
 
NLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptxNLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptxssuserbdd3e8
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxRaedMohamed3
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345beazzy04
 
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...AzmatAli747758
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfVivekanand Anglo Vedic Academy
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasGeoBlogs
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfkaushalkr1407
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaasiemaillard
 
plant breeding methods in asexually or clonally propagated crops
plant breeding methods in asexually or clonally propagated cropsplant breeding methods in asexually or clonally propagated crops
plant breeding methods in asexually or clonally propagated cropsparmarsneha2
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
 
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...Sayali Powar
 
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptxJose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptxricssacare
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxJisc
 
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...Nguyen Thanh Tu Collection
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxJheel Barad
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleCeline George
 

Recently uploaded (20)

2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
Basic Civil Engineering Notes of Chapter-6, Topic- Ecosystem, Biodiversity G...
Basic Civil Engineering Notes of Chapter-6,  Topic- Ecosystem, Biodiversity G...Basic Civil Engineering Notes of Chapter-6,  Topic- Ecosystem, Biodiversity G...
Basic Civil Engineering Notes of Chapter-6, Topic- Ecosystem, Biodiversity G...
 
NLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptxNLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptx
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
plant breeding methods in asexually or clonally propagated crops
plant breeding methods in asexually or clonally propagated cropsplant breeding methods in asexually or clonally propagated crops
plant breeding methods in asexually or clonally propagated crops
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
 
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptxJose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
 
B.ed spl. HI pdusu exam paper-2023-24.pdf
B.ed spl. HI pdusu exam paper-2023-24.pdfB.ed spl. HI pdusu exam paper-2023-24.pdf
B.ed spl. HI pdusu exam paper-2023-24.pdf
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS Module
 

03 machine learning workflow v2

  • 2. Learning Objectives ✦Describe the seven steps of machine learning workflow
  • 3. The Machine Learning Workflow The job of the data scientist The processes a data scientist follows to provide feedback to decision makers The machine learning process in business environment
  • 4. Goals of Machine Learning Workflow Goals • Derive answers to business challenges • Derive meaningful conclusions for complicated issues • Identify actionable steps with a given set of variables
  • 5. Yes No Yes Yes 7 steps of Machine Learning 1.Get more data 2. Ask a sharp question 3. Add the data to table 4. Check for quality 5. Transfor m features 6. Answer the question 7. Use the answer
  • 6. Step 1 : Get More Data Data can be collected in different formats Investigate a business challenge Quality of the model depends upon quality and quantity of the data gathered
  • 7. Step 1 : Get More Data - data formats
  • 8. Step 2 : Ask a sharp question Need for a sharp question It is direct and specific It focuses on a single topic It helps you to get clear answers to the questions. It focuses on the exact need and requirement
  • 9. Step 2 : Ask a sharp question (Vague vs Sharp) Vague questions 1. What should you do? 2. How should you live your life? 3. Which career path should you take? 4. Which data can tell you about your business? Sharp questions 1. Which route will get you to work faster? 2. When are you planning to join the company? 3. How many times will a user use the new product features? 4. Where did you go to college?
  • 10. Step 2 : Ask a sharp question (Example) ✦Study different tables of data in the database and analyse your company’s monthly sales performance ✦Understand how the company is doing in terms of market share ✦Analyse the historical data and predict the stock price for a future date
  • 11. Step 3 : Add data to the table Data analyst arranges data in database tables in a systematic manner.01 Systematic arrangement of data helps in detailed analysis02 Data is stored in the table in the form of columns and rows.03 Table columns represent data of a single type and rows represent records pertaining to one entity.04 Aggregate, distribute, compute of measure to derive data analysis.05
  • 12. Data Analysis in Machine Learning The process of deriving new findings from historical data Focuses on aggregating table data to find answers to business problems Performed by data analysts to build machine learning algorithms
  • 13. Example : Add Data to the Table • The stock price column shows the stock value across different dates • Each table row represents observations across given attributes
  • 14. Example : Data Analysis Aggregate and distribute the data as shown here:
  • 15. Example : Aggregate • You can aggregate the data in the table to derive answers • This process is called data analysis and involves counting total observations in a table or combining data from multiple tables
  • 16. Example : Distribute, Compute, Measure • An example of performing aggregate, distribute, compute and measure operations on data in tables • Each feature and their observations are distributed across the table and then combined
  • 17. Example : Estimate • The market share column shows the estimated stock price values of the company that are derived from the previous steps
  • 18. Step 4 : Check for Quality Determine if the data is acceptable for further investigation Ensure the data in a column is in a consistent format
  • 19. Check for Quality : Example
  • 20. Check for Quality : Example • There is inconsistencies in the data format in the Birth year column of the table • Dates in the column need to be converted to a consistent format to make it readable for the ML algorithm
  • 21. Check for Quality : Example • Denote the Birth year column numbers as numbers, without any special characters • Checking data quality is a critical step
  • 22. Step 5 : Transform Features •Enables you to make sense out of the data, especially when there are multiple features •Help overcome challenges where some features may not give useful information for the model, where as some features may be combined to derive meaningful information Feature Engineering
  • 23. Tricks of Feature Engineering •Scale Invariant Feature Transform (SIFT): Images •Term Frequency-Inverse Document Frequency (TF-IDF): Text Data Specific •Econometric, technological, agricultural and sociological data engineering Domain Specific •Images, text and audio data engineering Deep Learning
  • 24. Transform Features : Example • There are 3 columns and 65670 rows • Features 0 and 1 have similar values • The numbers are meaningless and scattered
  • 25. Transform Features : Example • Values of feature column 0 is multiplied with every observation in feature column 1 • These values are plotted in image 2
  • 26. Transform Features : Example • By plotting the values obtained by subtracting feature 0 from feature 1, a curve is formed • This curve is a normal or gaussian distribution or bell-shaped curve
  • 27. Step 6 : Answer the question Helps to analyse if the obtained answers are clear Question s How much or how many? Which category? Which group? Does this look strange? Which action?
  • 28. Answer the Question: Type 1 What will be the temperature this Friday?01 How much or how many? How many people will like your post?02 What will be your product sales next month?03
  • 29. Answer the Question: Type 2 Is this an image of a dog?01 Which category? What is the topic of this news article?02 Which hotel in your area offers free Wi-Fi?03
  • 30. Answer the Question: Type 3 Which group of shoppers purchase similar products?01 Which group? Which group of viewers like horror movies?02 How best can you divide this book into ten topics?03
  • 31. Answer the Question: Type 4 Is this internet message typical?01 Does this look strange? Is this heart beat reading abnormal?02 Do these transactions look unusual as opposed to customers usual transactions?03
  • 32. Answer the Question: Type 5 Should you vacuum again or not?01 Which action? Should you beat the red light?02 Should you raise or lower the temperature?03
  • 33. Step 7 : Use the Answer Making up a decision01 There are plenty of ways to use the answer derived form the previous step. Proposing the price of an item02 Publishing the results obtained as part of a research paper 03 Constructing a dashboard on a visualisation tool04 Making changes to product features05
  • 34. Key Takeaway ✦Machine learning workflow involves seven steps. ✦The first step of machine learning workflow is used to collect data to answer different business questions. ✦To get the desired response, always ask sharp questions and avoid vague ones. ✦Arrange raw data in tables for better data analysis. ✦To ensure data consistency data scientists must check for the quality of data. ✦The transform feature is used to increase the efficiency of the machine learning model. ✦The answer received by the model helps in solving business challenges. ✦Learn from the answer received by the model and implement it as a solution to the problem.

Editor's Notes

  1. The business must understand the various areas involved in machine learning
  2. Dont ask VAGUE questions