Flipkart
Business Analytics with SAS
GROUP 04
“Numbers have an important story to tell. They
rely on you to give them a voice.”
– Stephen Few
Introduction
• To predict which factors get a candidate
hired at Flipkart.
• The dataset covers the recruitment process
at Flipkart, in which candidates are hired or
rejected at various stages of application
screening.
Project motivation/Background
• Considered multiple data sets
• Zeroed in on using live data for analysis
• Firsthand dataset from an Indian e-commerce
firm, Flipkart Internet Pvt. Ltd.
Objectives
• Do some predictors influence the hiring process more than
others?
• Have we considered all the important independent variables
that contribute to getting a candidate hired at Flipkart?
• Can we predict if a person will be hired or rejected based on
predictors?
• What business strategies can we implement to increase a
person’s chances of getting hired?
Tools/Techniques Used
• We used SAS Enterprise Miner 9.4 and Excel for
analysis.
• Applied data mining techniques: decision
trees, regression and neural networks.
• Created confusion matrices from the results of
these techniques to determine which model
performs better.
Data Set
• The dataset was shared by the company's HR on
request, for analysis of the recruitment process.
• Because it includes the application details of rejected
candidates, it is not primary data on Flipkart
employees.
• The data file consists of 47,798 rows and 24 columns.
• For our analysis, we took a sample of 4,782 rows.
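As a rough illustration of the sampling step, the sketch below draws 4,782 rows from 47,798 by simple random sampling without replacement. The sampling method, the fixed seed and the use of row ids as stand-ins for application records are assumptions; the slides do not say how the sample was configured.

```python
import random


def draw_sample(rows, size, seed=42):
    """Return a simple random sample of `rows` without replacement.

    The seed is an illustrative assumption so that the draw is repeatable.
    """
    rng = random.Random(seed)
    return rng.sample(rows, size)


# Row ids standing in for the 47,798 application records (hypothetical data).
applications = list(range(47798))

# 4,782 rows is roughly 10% of the file, matching the sample size on the slide.
sample = draw_sample(applications, size=4782)
print(len(sample), "of", len(applications), "rows sampled")
```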
Current Predictors
Exploring the Data Set
• Most of our data was categorical.
• Preprocessed the data and added the following variables:
1. LastCoKnown - Whether the candidate has work experience
(last company known) or not.
2. HasSM - Whether the candidate has a social media
profile, for example LinkedIn.
3. Referral/Non Referral - Whether a candidate is referred or
not referred.
4. Hema/notHema - Hema accounts for 38% of the dataset, so we
introduced a field Hema/notHema.
5. Sunil/not Sunil - Sunil accounts for 14% of the dataset, so we
introduced a field Sunil/not Sunil.
6. Sunil/Hema - The TAMs with the highest number of recruits.
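The derived variables listed above are all yes/no flags, so they can be sketched as a small function over one applicant record. The input field names (`last_company`, `social_media_url`, `source`, `tam`) and the output name `SunilOrHema` are hypothetical; the original column names are not shown on the slides.

```python
def derive_flags(record):
    """Return a copy of `record` with the binary indicator fields added.

    Input keys are hypothetical stand-ins for the real dataset columns.
    """
    last_company = (record.get("last_company") or "").strip()
    social = (record.get("social_media_url") or "").strip()
    source = (record.get("source") or "").strip().lower()
    tam = (record.get("tam") or "").strip().lower()
    return {
        **record,
        "LastCoKnown": int(bool(last_company)),      # 1 if a last company is recorded
        "HasSM": int(bool(social)),                  # 1 if a social media profile is recorded
        "Referral": int(source == "referral"),       # 1 if the candidate was referred
        "Hema": int(tam == "hema"),                  # Hema / notHema
        "Sunil": int(tam == "sunil"),                # Sunil / not Sunil
        "SunilOrHema": int(tam in {"hema", "sunil"}),
    }


# Made-up applicant, for illustration only.
example = derive_flags(
    {"last_company": "Acme", "social_media_url": "", "source": "Referral", "tam": "Hema"}
)
print(example)
```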
What we selected and why?
• Predictors that had a significant impact on the
output.
• The aim was a model that accurately
classifies new applicants based on their
predictor information.
Preprocessing the data
• Data Redundancy
• Used the Sample node.
• Used the Impute node to treat missing values.
• Interpretation/ evaluation
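Since most of the data is categorical, one common way to treat missing values is to fill them with the most frequent category. The sketch below shows that idea only; whether the Impute node was configured this way is an assumption, and the column name and values are made up.

```python
from collections import Counter


def impute_mode(rows, column, missing=(None, "")):
    """Fill missing values in `column` with the most frequent observed value."""
    observed = [r[column] for r in rows if r.get(column) not in missing]
    if not observed:
        # Nothing to learn a fill value from; return unchanged copies.
        return [dict(r) for r in rows]
    mode = Counter(observed).most_common(1)[0][0]
    return [
        {**r, column: mode if r.get(column) in missing else r[column]}
        for r in rows
    ]


# Hypothetical rows; "source" is an illustrative column name.
rows = [
    {"source": "Referral"},
    {"source": "Job board"},
    {"source": ""},
    {"source": "Referral"},
    {"source": None},
]
filled = impute_mode(rows, "source")
print([r["source"] for r in filled])
```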
Methods for Analysis
Predictive Analytics:
• Logistic Regression
• Decision Tree
• Interactive Decision Tree
• Neural Network
• Neural Network with Regression
Logistic Regression
• We chose the stepwise method, with validation
misclassification as the selection criterion.
• For this model we are getting an accuracy of
approximately 95%.
Interactive Decision Tree
• This model considers whether a candidate's last company is
known, whether they have a professional social media profile, and
whether they were referred by a Flipkart employee.
• For this model (using the confusion matrix) we are getting an
accuracy of approximately 77%.
Interactive Decision Tree
• This model considers whether a candidate comes under the TAM “Hema”
or “Sunil”. Hema has 38% of applicants and Sunil 14%.
• For this model (using the confusion matrix) we are getting an
accuracy of approximately 77%.
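The accuracy figures quoted for these models come from a confusion matrix: correct predictions lie on the diagonal, and accuracy is their share of all cases. A minimal sketch, using made-up counts for a three-level target (these are not the project's results):

```python
def accuracy(confusion):
    """confusion[i][j] = number of cases of actual class i predicted as class j."""
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return correct / total


def misclassification(confusion):
    """Share of cases that fall off the diagonal."""
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return (total - correct) / total


# Hypothetical counts for illustration only.
matrix = [
    [70, 8, 2],
    [10, 45, 5],
    [5, 10, 45],
]
print(accuracy(matrix), misclassification(matrix))
```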
Neural Network
Neural Network
Neural Network with Regression
Neural Network with Regression
Main Model
Model Comparison
• The decision tree is the best model, with the lowest
misclassification rate of 8.6%.
• The target variable is nominal, with 3 possible
values.
• Misclassification rate is a suitable way to compare the
models because, for nominal response prediction,
misclassification rates are often examined as a
means of assessing the performance of the
classifier.
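The comparison rule above can be sketched as: score every model on the same held-out cases and keep the one with the lowest misclassification rate. The labels, model names and predictions below are hypothetical and exist only to make the example run; they are not the project's outputs.

```python
def misclassification_rate(actual, predicted):
    """Fraction of cases where the predicted label differs from the actual one."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    wrong = sum(a != p for a, p in zip(actual, predicted))
    return wrong / len(actual)


def best_model(actual, predictions_by_model):
    """Return (name of the model with the lowest rate, all rates)."""
    rates = {
        name: misclassification_rate(actual, preds)
        for name, preds in predictions_by_model.items()
    }
    return min(rates, key=rates.get), rates


# Hypothetical three-level target ("A", "B", "C") and made-up predictions.
actual = ["A", "B", "C", "A", "B", "C", "A", "A", "B", "C"]
predictions = {
    "decision_tree": ["A", "B", "C", "A", "B", "C", "A", "A", "B", "A"],
    "regression":    ["A", "B", "C", "A", "C", "C", "A", "B", "B", "A"],
    "neural_net":    ["A", "A", "C", "A", "C", "C", "B", "B", "B", "A"],
}
winner, rates = best_model(actual, predictions)
print(winner, rates)
```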
Model Comparison
Business Strategy
• The company should advertise job openings on the job boards that
show a higher % of hiring (Jobs on GitHub, Glassdoor, etc.).
• Referrals play a very important role in hiring, as is
evident from the decision tree (91%).
• Flipkart can introduce incentives for successful referrals,
which in turn will encourage other employees to refer
skilled and qualified candidates they know for a particular job
opening.
• Flipkart can save time and money on the application
screening process. It would not be spending on candidates
who need to be called on-site for interviews, so money
can be saved on travel and dearness allowances.
Some meaningful Implications &
Visualizations

Editor's Notes

  1. Because the self-splitting decision tree was difficult to interpret, we used an interactive decision tree.
     Observation: This model predicts that candidates who are referred by Flipkart employees and whose last company (work experience) is known get hired more often than those who have no work experience or whose last company is not known. Approximately 91% of referrals by Flipkart employees are for candidates with relevant work experience rather than none.
     Business Perspective Inference: The tree and leaf statistics show that at least 91% of referrals by Flipkart employees are for candidates whose previous company is known rather than unknown, and a significant number of the candidates whose previous company is known are getting hired. Hence, Flipkart should offer more incentives and bonuses to employees whose referrals get hired, so that capital and time are not wasted on screening candidates who do not meet the expected criteria. This could in turn imply that an employee who refers a candidate knows that candidate well and has a good idea of the requisite skills.
  2. Observation: Sunil has a smaller department but a greater demand for employees, as he heads the AD’s Group, which is the backbone for any eCommerce firm in reaching its customer base.
     Business Perspective Inference: The number of applications to a department is not proportional to the size of the department, so the department applied to should not be a rejection criterion for any candidate.