SlideShare a Scribd company logo
Module - 4
Prof. Vijay K S Bapuji B-Schools, Davangere
Typical data sources
• Operations
• Compensation
• Customer service
• Human resources information systems (HRIS)
• Learning management systems (LMS)
• Social media and non-traditional learning systems
• Engagement
• Surveys
• Performance management systems
• Interviews and estimation by experts
• Public data from outside the organizationProf. Vijay K S Bapuji B-Schools, Davangere
HR professionals gather data points across the
organization from sources like:
• Employee surveys
• Telemetric Data
• Attendance records
• Multi-rater reviews
• Salary and promotion history
• Employee work history
• Demographic data
• Personality/temperament data
• Recruitment process
• Employee databases
Prof. Vijay K S Bapuji B-Schools, Davangere
For each source of data, it is important to
know
• where the data are housed,
• who is the owner ,
• how you will collaborate with the owner,
• How you will obtain and integrate the information with other data,
and
• how valid and reliable it is.
Prof. Vijay K S Bapuji B-Schools, Davangere
Typical questions faced (survey)
• Consciousness towards the responses
• Raise the issue of confidentiality
• Objective behind the survey
• Detailed information gathering during the survey
• Revealing their personal information
• Jargons, Abbreviations and inner meanings associated with the survey
• Wording, Phrases and Options
Prof. Vijay K S Bapuji B-Schools, Davangere
Typical data issues
• Biggest Challenges of HR Analytics?
• Finding people with the right skillset to gather, manage,
and report on the data
• Data cleansing
• Data quality
• Too much data to parse or not knowing what data is most
important
• Data privacy and compliance
• Proving its worth to executive leadership
• Tying actions and insight to ROI
• Identifying the best HR technologies to keep track of the
data
Red Marked – Data Issues
Prof. Vijay K S Bapuji B-Schools, Davangere
Typical data issues
1) Poor Organization. If you're not able to easily search through
your data
2) Too Much Data. ...
3) Inconsistent Data. ...
4) Poor Data Security. ...
5) Poorly Defined Data. ...
6) Incorrect Data. ...
7) Poor Data Recovery.
Prof. Vijay K S Bapuji B-Schools, Davangere
Five Characteristics of Good data
1) Accuracy
2) Completeness
3) Consistency
4) Uniqueness
5) Timeliness
Prof. Vijay K S Bapuji B-Schools, Davangere
Techniques for establishing questions
Data Is Only As Good As The Questions You Ask
Few Questions need to be asked ….
1.What exactly do you want to find out?
2.What Data will you use that can help?
3.Where will your data come from?
4.How can you ensure data quality?
5.Which statistical analysis techniques do you want to apply?
6.What HR Analytics method need to be developed, if any?
7.Who are the final users of your analysis results?
8.What else do I need to know?
9.What data visualizations should you choose?
10.How can you create a data-driven culture?
Prof. Vijay K S Bapuji B-Schools, Davangere
Techniques for establishing questions
Few techniques
- Closed questions
- Open Questions
- Probing questions
- Leading questions – Leading to some desired
- Loaded Questions – Straight forward questions
- Funnel Questions – Started as broad and then narrow down
- Recall and process questions
- Rhetorical questions – Phrases
- A word on tone – Emojis and Images
Prof. Vijay K S Bapuji B-Schools, Davangere
Obtaining data, Cleaning data (exercise),
Supplementing data
Prof. Vijay K S Bapuji B-Schools, Davangere
Obtaining data
- This talks about the collected data from various sources
Prof. Vijay K S Bapuji B-Schools, Davangere
Cleaning data (exercise
- Remove Unwanted observations - This includes duplicate or irrelevant observations.
Duplicate observations
Duplicate observations most frequently arise during data collection, such as when you:
• Combine datasets from multiple places
• Scrape data
• Receive data from clients/other departments
Irrelevant observations
Irrelevant observations are those that don’t actually fit the specific problem that you’re
trying to solve.
• For example, if you were building a model for Single-Family homes only, you
wouldn't want observations for Apartments in there.
• This is also a great time to review your charts from Exploratory Analysis. You can look
at the distribution charts for categorical features to see if there are any classes that
shouldn’t be there.
• Checking for irrelevant observations before engineering features can save you many
headaches down the road.
Prof. Vijay K S Bapuji B-Schools, Davangere
Cleaning data (exercise
- Filter Unwanted Outliers-
• Outliers can cause problems with certain types of models. For example, linear
regression models are less robust to outliers than decision tree models.
• In general, if you have a legitimate reason to remove an outlier, it will help your
model’s performance.
• However, outliers are innocent until proven guilty. You should never remove an outlier
just because it’s a "big number." That big number could be very informative for your
model.
• We can’t stress this enough: you must have a good reason for removing an outlier,
such as suspicious measurements that are unlikely to be real data.
Prof. Vijay K S Bapuji B-Schools, Davangere
Cleaning data (exercise)
- Handle Missing Data - you cannot simply ignore missing values in your dataset.
Two ways to handle this
• Dropping observations that have missing values
• Imputing the missing values based on other observations
Prof. Vijay K S Bapuji B-Schools, Davangere
Cleaning data - Techniques
• Get Rid of Extra Spaces
• Select and Treat All Blank Cells
• Convert Numbers Stored as Text into Numbers
• Remove Duplicates
• Highlight Errors
• Change Text to Lower/Upper/Proper Case
• Spell Check
• Delete all Formatting
Prof. Vijay K S Bapuji B-Schools, Davangere
Supplementing data
• This basically sourcing data from various sources
• Something that completes or makes an addition
• Add something to something to make it larger or better
Here add from where? From various other sources viz. Social media,
Some secondary research data, research survey findings ……
Prof. Vijay K S Bapuji B-Schools, Davangere
Thank You
Prof. Vijay K S Bapuji B-Schools, Davangere

More Related Content

What's hot

Introduction to human resource management
Introduction to human resource managementIntroduction to human resource management
Introduction to human resource management
Vishal Kachhdiya
 
Human resource accounting and audit
Human resource accounting and auditHuman resource accounting and audit
Human resource accounting and audit
Shobitash Jamwal
 
Hr analytics
Hr analyticsHr analytics
Hr analytics
E P John
 
Role of hr in strategic formulation
Role of hr in strategic formulationRole of hr in strategic formulation
Role of hr in strategic formulation
Renu Sebastian
 

What's hot (20)

HRP(Human Resources Planning
HRP(Human Resources PlanningHRP(Human Resources Planning
HRP(Human Resources Planning
 
Macro level scenario of HRP
Macro level scenario of HRPMacro level scenario of HRP
Macro level scenario of HRP
 
JOB ANALYSIS - PROCESS
JOB ANALYSIS - PROCESSJOB ANALYSIS - PROCESS
JOB ANALYSIS - PROCESS
 
Labour market analysis
Labour market analysisLabour market analysis
Labour market analysis
 
Introduction to human resource management
Introduction to human resource managementIntroduction to human resource management
Introduction to human resource management
 
Introduction to Human Resource Management
Introduction to Human Resource ManagementIntroduction to Human Resource Management
Introduction to Human Resource Management
 
Human resource accounting and audit
Human resource accounting and auditHuman resource accounting and audit
Human resource accounting and audit
 
Job analysis
Job analysisJob analysis
Job analysis
 
Human Capital Management (HCM)
Human Capital Management (HCM)Human Capital Management (HCM)
Human Capital Management (HCM)
 
Techniques for Forecasting Human Resources
 Techniques  for Forecasting   Human Resources Techniques  for Forecasting   Human Resources
Techniques for Forecasting Human Resources
 
Hris
HrisHris
Hris
 
Hr analytics
Hr analyticsHr analytics
Hr analytics
 
HRP Demand and supply
HRP Demand and supplyHRP Demand and supply
HRP Demand and supply
 
Forecasting HR demand and supply
Forecasting HR demand and supplyForecasting HR demand and supply
Forecasting HR demand and supply
 
Role of hr in strategic formulation
Role of hr in strategic formulationRole of hr in strategic formulation
Role of hr in strategic formulation
 
Human Resource Information System, Accounting and Audit.
Human Resource Information System, Accounting and Audit.Human Resource Information System, Accounting and Audit.
Human Resource Information System, Accounting and Audit.
 
HR
HRHR
HR
 
HR Audit
HR AuditHR Audit
HR Audit
 
Hrm nature and scope
Hrm nature and scopeHrm nature and scope
Hrm nature and scope
 
EMPLOYEE ENGAGEMENT PROJECT
EMPLOYEE ENGAGEMENT PROJECTEMPLOYEE ENGAGEMENT PROJECT
EMPLOYEE ENGAGEMENT PROJECT
 

Similar to Unit 4 HR Analytics

School customer service presentation
School customer service presentationSchool customer service presentation
School customer service presentation
steve muzzy
 
Executive Summary First Break All The Rules
Executive Summary First Break All The RulesExecutive Summary First Break All The Rules
Executive Summary First Break All The Rules
Greg Crouch
 

Similar to Unit 4 HR Analytics (20)

Women in Tech Summit 2013 presentation
Women in Tech Summit 2013 presentationWomen in Tech Summit 2013 presentation
Women in Tech Summit 2013 presentation
 
Learning Analytics Primer: Getting Started with Learning and Performance Anal...
Learning Analytics Primer: Getting Started with Learning and Performance Anal...Learning Analytics Primer: Getting Started with Learning and Performance Anal...
Learning Analytics Primer: Getting Started with Learning and Performance Anal...
 
School customer service presentation
School customer service presentationSchool customer service presentation
School customer service presentation
 
Data That Sparks Joy by Heap Analytics Sr Product Manager
Data That Sparks Joy by Heap Analytics Sr Product ManagerData That Sparks Joy by Heap Analytics Sr Product Manager
Data That Sparks Joy by Heap Analytics Sr Product Manager
 
Great Survey Design
Great Survey DesignGreat Survey Design
Great Survey Design
 
#Firmday 22 oct London Broadbean Technology - Applying data to build an effec...
#Firmday 22 oct London Broadbean Technology - Applying data to build an effec...#Firmday 22 oct London Broadbean Technology - Applying data to build an effec...
#Firmday 22 oct London Broadbean Technology - Applying data to build an effec...
 
Cn 5 Day Presentation
Cn 5 Day PresentationCn 5 Day Presentation
Cn 5 Day Presentation
 
11 Habits of Highly Effective Tech Recruiters
11 Habits of Highly Effective Tech Recruiters11 Habits of Highly Effective Tech Recruiters
11 Habits of Highly Effective Tech Recruiters
 
Training Program Evaluation
Training Program EvaluationTraining Program Evaluation
Training Program Evaluation
 
Exaudio, llc Background + MR capabilities
Exaudio, llc Background + MR capabilitiesExaudio, llc Background + MR capabilities
Exaudio, llc Background + MR capabilities
 
Is Training the Right Solution?
Is Training the Right Solution?Is Training the Right Solution?
Is Training the Right Solution?
 
Recruitment and Selection
Recruitment and SelectionRecruitment and Selection
Recruitment and Selection
 
Choose to use your Career Center Resources
Choose to use your Career Center ResourcesChoose to use your Career Center Resources
Choose to use your Career Center Resources
 
Data Cleaning
Data CleaningData Cleaning
Data Cleaning
 
Competency-Based-Interviews.pdf
Competency-Based-Interviews.pdfCompetency-Based-Interviews.pdf
Competency-Based-Interviews.pdf
 
Talent Analytics: Maximizing the TA Value
Talent Analytics: Maximizing the TA ValueTalent Analytics: Maximizing the TA Value
Talent Analytics: Maximizing the TA Value
 
Planning your analytics journey - webinar slides
Planning your analytics journey  - webinar slidesPlanning your analytics journey  - webinar slides
Planning your analytics journey - webinar slides
 
The 8 Step Data Mining Process
The 8 Step Data Mining ProcessThe 8 Step Data Mining Process
The 8 Step Data Mining Process
 
Analytical skills
Analytical skillsAnalytical skills
Analytical skills
 
Executive Summary First Break All The Rules
Executive Summary First Break All The RulesExecutive Summary First Break All The Rules
Executive Summary First Break All The Rules
 

More from Vijay K S

More from Vijay K S (19)

Business statistics and analytics
Business statistics and analyticsBusiness statistics and analytics
Business statistics and analytics
 
Unit - 6_Strategic Management (18MBA25)_Strategy Implementation
Unit - 6_Strategic Management (18MBA25)_Strategy ImplementationUnit - 6_Strategic Management (18MBA25)_Strategy Implementation
Unit - 6_Strategic Management (18MBA25)_Strategy Implementation
 
Unit - 5_Part B_Strategic Management (18MBA25)_Corporate Level Strategies
Unit - 5_Part B_Strategic Management (18MBA25)_Corporate Level Strategies Unit - 5_Part B_Strategic Management (18MBA25)_Corporate Level Strategies
Unit - 5_Part B_Strategic Management (18MBA25)_Corporate Level Strategies
 
Unit - 5_Part A_Strategic Management (18MBA25)_Entrepreneurship
Unit - 5_Part A_Strategic Management (18MBA25)_EntrepreneurshipUnit - 5_Part A_Strategic Management (18MBA25)_Entrepreneurship
Unit - 5_Part A_Strategic Management (18MBA25)_Entrepreneurship
 
Unit - 4_Part C_Strategic Management (18MBA25)_Cooperative Strategies
Unit - 4_Part C_Strategic Management (18MBA25)_Cooperative StrategiesUnit - 4_Part C_Strategic Management (18MBA25)_Cooperative Strategies
Unit - 4_Part C_Strategic Management (18MBA25)_Cooperative Strategies
 
Unit - 4_Part B_Strategic Management (18MBA25)_Business Level Strategies
Unit - 4_Part B_Strategic Management (18MBA25)_Business Level StrategiesUnit - 4_Part B_Strategic Management (18MBA25)_Business Level Strategies
Unit - 4_Part B_Strategic Management (18MBA25)_Business Level Strategies
 
Unit - 4_Part A_Strategic Management (18MBA25)_Internal Analysis
Unit - 4_Part A_Strategic Management (18MBA25)_Internal AnalysisUnit - 4_Part A_Strategic Management (18MBA25)_Internal Analysis
Unit - 4_Part A_Strategic Management (18MBA25)_Internal Analysis
 
Unit - 3_Strategic Management (18MBA25)_ External Analysis
Unit - 3_Strategic Management (18MBA25)_ External AnalysisUnit - 3_Strategic Management (18MBA25)_ External Analysis
Unit - 3_Strategic Management (18MBA25)_ External Analysis
 
Unit - 2_Strategic Management (18MBA25)_Formulation & Intent
Unit - 2_Strategic Management (18MBA25)_Formulation & IntentUnit - 2_Strategic Management (18MBA25)_Formulation & Intent
Unit - 2_Strategic Management (18MBA25)_Formulation & Intent
 
Unit - 1_Strategic Management (18MBA25)_Introduction
Unit - 1_Strategic Management (18MBA25)_IntroductionUnit - 1_Strategic Management (18MBA25)_Introduction
Unit - 1_Strategic Management (18MBA25)_Introduction
 
Strategic formulation, intent & balance score card
Strategic formulation, intent & balance score cardStrategic formulation, intent & balance score card
Strategic formulation, intent & balance score card
 
An introduction to strategic management unit - 1
An introduction to strategic management   unit - 1An introduction to strategic management   unit - 1
An introduction to strategic management unit - 1
 
Corporate level strategies
Corporate level strategiesCorporate level strategies
Corporate level strategies
 
Strategic Implementation
Strategic ImplementationStrategic Implementation
Strategic Implementation
 
Business level strategies
Business level strategiesBusiness level strategies
Business level strategies
 
Strategic formulation
Strategic formulationStrategic formulation
Strategic formulation
 
Internal analysis strategic management
Internal analysis   strategic managementInternal analysis   strategic management
Internal analysis strategic management
 
Environmental scanning - External Analysis
Environmental scanning - External AnalysisEnvironmental scanning - External Analysis
Environmental scanning - External Analysis
 
Introduction to strategic management
Introduction to strategic managementIntroduction to strategic management
Introduction to strategic management
 

Recently uploaded

Cree_Rey_BrandIdentityKit.PDF_PersonalBd
Cree_Rey_BrandIdentityKit.PDF_PersonalBdCree_Rey_BrandIdentityKit.PDF_PersonalBd
Cree_Rey_BrandIdentityKit.PDF_PersonalBd
creerey
 
anas about venice for grade 6f about venice
anas about venice for grade 6f about veniceanas about venice for grade 6f about venice
anas about venice for grade 6f about venice
anasabutalha2013
 
NewBase 24 May 2024 Energy News issue - 1727 by Khaled Al Awadi_compresse...
NewBase   24 May  2024  Energy News issue - 1727 by Khaled Al Awadi_compresse...NewBase   24 May  2024  Energy News issue - 1727 by Khaled Al Awadi_compresse...
NewBase 24 May 2024 Energy News issue - 1727 by Khaled Al Awadi_compresse...
Khaled Al Awadi
 

Recently uploaded (20)

Equinox Gold Corporate Deck May 24th 2024
Equinox Gold Corporate Deck May 24th 2024Equinox Gold Corporate Deck May 24th 2024
Equinox Gold Corporate Deck May 24th 2024
 
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
 
Event Report - IBM Think 2024 - It is all about AI and hybrid
Event Report - IBM Think 2024 - It is all about AI and hybridEvent Report - IBM Think 2024 - It is all about AI and hybrid
Event Report - IBM Think 2024 - It is all about AI and hybrid
 
falcon-invoice-discounting-a-premier-platform-for-investors-in-india
falcon-invoice-discounting-a-premier-platform-for-investors-in-indiafalcon-invoice-discounting-a-premier-platform-for-investors-in-india
falcon-invoice-discounting-a-premier-platform-for-investors-in-india
 
BeMetals Presentation_May_22_2024 .pdf
BeMetals Presentation_May_22_2024   .pdfBeMetals Presentation_May_22_2024   .pdf
BeMetals Presentation_May_22_2024 .pdf
 
Using Generative AI for Content Marketing
Using Generative AI for Content MarketingUsing Generative AI for Content Marketing
Using Generative AI for Content Marketing
 
HR and Employment law update: May 2024.
HR and Employment law update:  May 2024.HR and Employment law update:  May 2024.
HR and Employment law update: May 2024.
 
Cree_Rey_BrandIdentityKit.PDF_PersonalBd
Cree_Rey_BrandIdentityKit.PDF_PersonalBdCree_Rey_BrandIdentityKit.PDF_PersonalBd
Cree_Rey_BrandIdentityKit.PDF_PersonalBd
 
Special Purpose Vehicle (Purpose, Formation & examples)
Special Purpose Vehicle (Purpose, Formation & examples)Special Purpose Vehicle (Purpose, Formation & examples)
Special Purpose Vehicle (Purpose, Formation & examples)
 
RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...
RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...
RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...
 
anas about venice for grade 6f about venice
anas about venice for grade 6f about veniceanas about venice for grade 6f about venice
anas about venice for grade 6f about venice
 
NewBase 24 May 2024 Energy News issue - 1727 by Khaled Al Awadi_compresse...
NewBase   24 May  2024  Energy News issue - 1727 by Khaled Al Awadi_compresse...NewBase   24 May  2024  Energy News issue - 1727 by Khaled Al Awadi_compresse...
NewBase 24 May 2024 Energy News issue - 1727 by Khaled Al Awadi_compresse...
 
Improving profitability for small business
Improving profitability for small businessImproving profitability for small business
Improving profitability for small business
 
IPTV Subscription UK: Your Guide to Choosing the Best Service
IPTV Subscription UK: Your Guide to Choosing the Best ServiceIPTV Subscription UK: Your Guide to Choosing the Best Service
IPTV Subscription UK: Your Guide to Choosing the Best Service
 
April 2024 Nostalgia Products Newsletter
April 2024 Nostalgia Products NewsletterApril 2024 Nostalgia Products Newsletter
April 2024 Nostalgia Products Newsletter
 
Matt Conway - Attorney - A Knowledgeable Professional - Kentucky.pdf
Matt Conway - Attorney - A Knowledgeable Professional - Kentucky.pdfMatt Conway - Attorney - A Knowledgeable Professional - Kentucky.pdf
Matt Conway - Attorney - A Knowledgeable Professional - Kentucky.pdf
 
Transforming Max Life Insurance with PMaps Job-Fit Assessments- Case Study
Transforming Max Life Insurance with PMaps Job-Fit Assessments- Case StudyTransforming Max Life Insurance with PMaps Job-Fit Assessments- Case Study
Transforming Max Life Insurance with PMaps Job-Fit Assessments- Case Study
 
Unlock Your TikTok Potential: Free TikTok Likes with InstBlast
Unlock Your TikTok Potential: Free TikTok Likes with InstBlastUnlock Your TikTok Potential: Free TikTok Likes with InstBlast
Unlock Your TikTok Potential: Free TikTok Likes with InstBlast
 
LinkedIn Masterclass Techweek 2024 v4.1.pptx
LinkedIn Masterclass Techweek 2024 v4.1.pptxLinkedIn Masterclass Techweek 2024 v4.1.pptx
LinkedIn Masterclass Techweek 2024 v4.1.pptx
 
Easy Way to Download and Set Up Gen TDS Software on Your Computer
Easy Way to Download and Set Up Gen TDS Software on Your ComputerEasy Way to Download and Set Up Gen TDS Software on Your Computer
Easy Way to Download and Set Up Gen TDS Software on Your Computer
 

Unit 4 HR Analytics

  • 1. Module - 4 Prof. Vijay K S Bapuji B-Schools, Davangere
  • 2. Typical data sources • Operations • Compensation • Customer service • Human resources information systems (HRIS) • Learning management systems (LMS) • Social media and non-traditional learning systems • Engagement • Surveys • Performance management systems • Interviews and estimation by experts • Public data from outside the organizationProf. Vijay K S Bapuji B-Schools, Davangere
  • 3. HR professionals gather data points across the organization from sources like: • Employee surveys • Telemetric Data • Attendance records • Multi-rater reviews • Salary and promotion history • Employee work history • Demographic data • Personality/temperament data • Recruitment process • Employee databases Prof. Vijay K S Bapuji B-Schools, Davangere
  • 4. For each source of data, it is important to know • where the data are housed, • who is the owner , • how you will collaborate with the owner, • How you will obtain and integrate the information with other data, and • how valid and reliable it is. Prof. Vijay K S Bapuji B-Schools, Davangere
  • 5. Typical questions faced (survey) • Consciousness towards the responses • Raise the issue of confidentiality • Objective behind the survey • Detailed information gathering during the survey • Revealing their personal information • Jargons, Abbreviations and inner meanings associated with the survey • Wording, Phrases and Options Prof. Vijay K S Bapuji B-Schools, Davangere
  • 6. Typical data issues • Biggest Challenges of HR Analytics? • Finding people with the right skillset to gather, manage, and report on the data • Data cleansing • Data quality • Too much data to parse or not knowing what data is most important • Data privacy and compliance • Proving its worth to executive leadership • Tying actions and insight to ROI • Identifying the best HR technologies to keep track of the data Red Marked – Data Issues Prof. Vijay K S Bapuji B-Schools, Davangere
  • 7. Typical data issues 1) Poor Organization. If you're not able to easily search through your data 2) Too Much Data. ... 3) Inconsistent Data. ... 4) Poor Data Security. ... 5) Poorly Defined Data. ... 6) Incorrect Data. ... 7) Poor Data Recovery. Prof. Vijay K S Bapuji B-Schools, Davangere
  • 8. Five Characteristics of Good data 1) Accuracy 2) Completeness 3) Consistency 4) Uniqueness 5) Timeliness Prof. Vijay K S Bapuji B-Schools, Davangere
  • 9. Techniques for establishing questions Data Is Only As Good As The Questions You Ask Few Questions need to be asked …. 1.What exactly do you want to find out? 2.What Data will you use that can help? 3.Where will your data come from? 4.How can you ensure data quality? 5.Which statistical analysis techniques do you want to apply? 6.What HR Analytics method need to be developed, if any? 7.Who are the final users of your analysis results? 8.What else do I need to know? 9.What data visualizations should you choose? 10.How can you create a data-driven culture? Prof. Vijay K S Bapuji B-Schools, Davangere
  • 10. Techniques for establishing questions Few techniques - Closed questions - Open Questions - Probing questions - Leading questions – Leading to some desired - Loaded Questions – Straight forward questions - Funnel Questions – Started as broad and then narrow down - Recall and process questions - Rhetorical questions – Phrases - A word on tone – Emojis and Images Prof. Vijay K S Bapuji B-Schools, Davangere
  • 11. Obtaining data, Cleaning data (exercise), Supplementing data Prof. Vijay K S Bapuji B-Schools, Davangere
  • 12. Obtaining data - This talks about the collected data from various sources Prof. Vijay K S Bapuji B-Schools, Davangere
  • 13. Cleaning data (exercise - Remove Unwanted observations - This includes duplicate or irrelevant observations. Duplicate observations Duplicate observations most frequently arise during data collection, such as when you: • Combine datasets from multiple places • Scrape data • Receive data from clients/other departments Irrelevant observations Irrelevant observations are those that don’t actually fit the specific problem that you’re trying to solve. • For example, if you were building a model for Single-Family homes only, you wouldn't want observations for Apartments in there. • This is also a great time to review your charts from Exploratory Analysis. You can look at the distribution charts for categorical features to see if there are any classes that shouldn’t be there. • Checking for irrelevant observations before engineering features can save you many headaches down the road. Prof. Vijay K S Bapuji B-Schools, Davangere
  • 14. Cleaning data (exercise - Filter Unwanted Outliers- • Outliers can cause problems with certain types of models. For example, linear regression models are less robust to outliers than decision tree models. • In general, if you have a legitimate reason to remove an outlier, it will help your model’s performance. • However, outliers are innocent until proven guilty. You should never remove an outlier just because it’s a "big number." That big number could be very informative for your model. • We can’t stress this enough: you must have a good reason for removing an outlier, such as suspicious measurements that are unlikely to be real data. Prof. Vijay K S Bapuji B-Schools, Davangere
  • 15. Cleaning data (exercise) - Handle Missing Data - you cannot simply ignore missing values in your dataset. Two ways to handle this • Dropping observations that have missing values • Imputing the missing values based on other observations Prof. Vijay K S Bapuji B-Schools, Davangere
  • 16. Cleaning data - Techniques • Get Rid of Extra Spaces • Select and Treat All Blank Cells • Convert Numbers Stored as Text into Numbers • Remove Duplicates • Highlight Errors • Change Text to Lower/Upper/Proper Case • Spell Check • Delete all Formatting Prof. Vijay K S Bapuji B-Schools, Davangere
  • 17. Supplementing data • This basically sourcing data from various sources • Something that completes or makes an addition • Add something to something to make it larger or better Here add from where? From various other sources viz. Social media, Some secondary research data, research survey findings …… Prof. Vijay K S Bapuji B-Schools, Davangere
  • 18. Thank You Prof. Vijay K S Bapuji B-Schools, Davangere