SlideShare a Scribd company logo
1 of 48
Human Resource
Management Analytics
Navaneesh Gangala
Data Scientist
https://www.linkedin.com/in/navaneesh/
Problem statement:
• The main goal here is to predict whether an employee will stay or leave within
company.
• Besides this,
• Key drivers for attrition are to be identified
• Employees shall be classified into High ,Medium and Low risk profiles in terms of attrition
• Predicting thresholds for key drivers of attrition
• Employee level risk analysis showing supporting and contradicting features
Approach
• The dependent or target variable is binary in nature i.e. Terminated or not
Terminated
• In the given data, the target variable “Terminated” (0 = stay, 1 = leave) shall
be predicted using the other features.
• Hence classification could be the solution.
• To check which of the classification algorithm suits to the situation ROC -
AUC Analysis has been done
• From the model selected, the following have been derived
Key drivers that influence the employee Termination,
Thresholds for each driver
Various segments of risk profile has been derived.
• Initial Exploration :
• Total : 15 Variables
• Target : Terminated
• Unique Key : Employee_code (can be used to get the risk profile of the person once the model is
ready)
• Remaining 13 predictors are identified as
• Numerical : Tenure, TimeLastPos, LastRating, Annual.Income, Year.of.Birth
• Categorical :
Rehire,Department,Job.Level,Has.been.promoted,Client.work.travel,Education,Gender,Marital.Status
• Anomaly Detection :
• Structure and factor levels are good
• Tenure - Negative values detected and removed.
• Employee_code – few records found not to be unique hence removed
• LastRating - Median(2) can be used to replace imputed mean (1.95) to have better results
• Education - Two levels with similar meaning made into one( Bachelor's Degree  Bachelors Degree )
• Last Time Promoted values < Tenure - Records where Validation failed were removed
Data Preparation
Rehire Terminated Employee_code Department Job Level Tenure TimeLastPos Has been promoted LastRating Client work travel Education Gender Marital Status Annual Income Year of Birth
FALSE 1 133715Finance Lead Analyst 16 16 No 2Medium Travel Msc Analytics M Married 6638 1971
FALSE 1 91202Audit Senior Analyst 19 19 No 2Medium Travel Msc Analytics M Married 1004 1972
FALSE 1 51471Audit Staff I 33 33 No 2Medium Travel Msc Analytics M Married 643 1989
FALSE 1 95874Consulting Manager 39 39 No 2Medium Travel Msc Analytics M Married 5769 1964
FALSE 1 105869Finance Analyst 44 44 No 2Medium Travel Msc Analytics M Married 1043 1981
FALSE 1 20105Tax Staff II 79 79 No 2Medium Travel Msc Analytics M Married 1552 1963
FALSE 1 31135Audit Staff II 82 82 No 2Medium Travel Msc Analytics M Married 2884 1984
FALSE 1 52977Risk Management Lead Analyst 85 85 No 2Medium Travel Msc Analytics M Married 6828 1964
FALSE 1 115564Risk Management Analyst 89 89 No 2Medium Travel Msc Analytics M Married 2705 1988
Cleaned Data : HR SAMPLE V2.csv
• Imbalance Problem has not being detected
Target
variable
Unique
key
Project : Work flow
Exploratory Analysis Model Building Model Validation
Test data
1. Exploratory Analysis of Data
Test data
Promotion can be very significant
Promoted Higher Levels (GM and Director job levels) very less attrition
Mid Level employees who are Promoted have less attrition
Promotion is less Significant in this lot
Job Level – With Department
Employee Termination looks higher
Job levels has significance but not as great as Promotion (High level jobs – less exits)
More terminations can be seen in high population hence model should reveal more about this
Staff 1 sales employee less attrition
High levels of job profile very less attrition
Salary could be Significant -> low salary can make person to exit
Rating - not much Significant (looks like terminations follow population i.e uniform distribution)
YOB and Gender not Significant (Uniform distribution)
Travel not very significant (Uniform distribution)
Martial Status not significant (Uniform distribution)
Education Significance could also be less(Uniform distribution)
Rating Positive Effect
Salary Negative Effect
Rating Negative Effect
Salary Negative Effect
Rating Positive Effect
Salary Positive Effect
Rating Negative Effect
Salary Positive Effect
Combined effects of Rating and salary on exit of employee
Avg Tenure is less than 3 years if Not
Promoted
Effect of Promotion on Tenure and exit of employee
Tenure(days) Distribution across various job levels
Tenure vs exit
High termination below 1000 days (3 yrs)
(Other factors like promotion, salary play an important role.
Fluctuations here can be attributed to this)
Annual Income vs exit
High termination at low salaries(<3000)
Basic Decision tree (part of exploratory analysis)
Insights from Exploratory data analysis
• Promotion, Tenure, Job levels, Department, Annual income are significant
• Demographic variables like gender, Year of birth, marital status are less
significant. Same goes with travel time.
• Promotion turns out to be a key parameter in assessing the risk profile.
• Promotion in High and mid level management positions could significantly reduce
attrition in this segment.
• Along with promotion, salary turns out to be an important driver at entry level jobs
to retain them
• All these insights have to be validated using Machine learning algorithms.
Various Models
Model comparisons
Choosing a right classification model
Data
•.90-1 = excellent (A)
•.80-.90 = good (B)
•.70-.80 = fair (C)
•.60-.70 = poor (D)
•.50-.60 = fail (F)
Area Under the Curve (AUC)
Area under ROC curve is often used as a measure of
quality of the classification models. A random classifier
has an area under the curve of 0.5, while AUC for a
perfect classifier is equal to 1. In practice, most of the
classification models have an AUC between 0.5 and 1.
Accuracy(CA): the proportion of the total number of
predictions that were correct.
Precision : the proportion of positive cases that were
correctly identified.
Sensitivity or Recall : the proportion of actual positive
cases which are correctly identified.
Choosing a right classification model
Model comparisons
ROC - AUC Analysis
Random forest is performing well in a balanced way
Choosing a right classification model
By removing less significant variables (from exploratory analysis) and running the
model few improvements in KPIs is seen
Choosing a right classification model
Objectives of model
Model :
Random
forest
Cleaned Data
Key Drivers
Risk Profiles
Thresholds in
key drivers
Model chosen: Random forest
Useful ideas/future
prospects may pop up
during the process
Random forest (a brief overview)
Random forest (a brief overview)
m
m
m
Growing decision trees with m
predictors taken at each instance
Training data
(70% of given data)
Testing data
(30% of
given data)
Cleaned
data
m
m
m
Growing decision trees with m
predictors taken at each instance
To find the value of m predictors to be taken at once in
building the forest of decision trees
m= 3
i.e 3 predictors shall be suitable to be taken at once to build the forest of
decision trees.
> Growing random forest
> Results from random forest
As per the model, Has.been.promoted is the most important predictor followed by Tenure
TimeLastPos and so on till Rehire
> Finding important predictors or key drivers of attrition from the model
> Plotting them
Now, We can use our model to predict the outcome of our testing dataset.
Accuracy of the model : 84.67%
ROC Curve
Confusion Matrix
Random Forest Predicted
0 1
Actual
0 1878 368 0.836153161
1 131 879 0.87029703
0.93479343 0.70489174 0.846744472
1. Key Drivers for Termination or attrition
Decreasing order
of importance
Has.been.promoted, Tenure, Time.LastPos, Annual income, Year.of.birth, Department, Job.Level turns out to be key
drivers for termination or attrition
2. Risk Profiling
High  Error Rate : 1.14 %
Low  Error Rate : 2.45 %
Probability of termination of each employee can be obtained from the model.
These can be divided into High, Medium and Low risk segments
2. 1 Risk Profiling segmented approach
2.2 Tenure thresholds for High, medium and low risk divisions
Terminated
3. Thresholds - Tenure
3.1 Terminated Thresholds
3.2 Job Profile Thresholds
Terminated Terminated – Not Promoted Terminated – Promoted
3.3 Department Thresholds
Terminated Terminated – Not Promoted Terminated – Promoted
4. Risk Profiling Application Development
Web/standalone application can be developed where a HR Manager can understand attrition risk of each employee
Integrating Data from several sources like Social Media , Behavioral ,Work related ,feedbacks and Peer to Peer reviews etc.
Focusing on modeling techniques we can classifier them into profiles and once model build can help us to
understand the future attritions and features supporting and contradicting to particular choice employee made .
Can add Alerts and other more informative dashboards for HR managers to take necessary actions at right time
with the help of relative thresholds.
4.1 Risk Profile : For each Individual : Model ready for deployment
Contradicts Supports
4.2 Risk Profile : Matrix (Future Scope)
* Future Scope of work
Can be created from more behavior
Data to justify and to reduce the
attrition of the employee
Thank you
- Navaneesh Gangala

More Related Content

What's hot

Employee Attrition
Employee AttritionEmployee Attrition
Employee Attrition
Vinay sattur
 
Overview of HR Shared Services
Overview of HR Shared ServicesOverview of HR Shared Services
Overview of HR Shared Services
William Chin
 
Human resource planing
Human resource planingHuman resource planing
Human resource planing
Nisarg Shah
 
Employer branding
Employer brandingEmployer branding
Employer branding
mbusokhoza
 

What's hot (12)

HR Planning
HR PlanningHR Planning
HR Planning
 
Employee Attrition
Employee AttritionEmployee Attrition
Employee Attrition
 
Application of algebra
Application of algebraApplication of algebra
Application of algebra
 
Employer Branding
Employer BrandingEmployer Branding
Employer Branding
 
Overview of HR Shared Services
Overview of HR Shared ServicesOverview of HR Shared Services
Overview of HR Shared Services
 
Human Resources Management (HRM) Value Chain Processes, Organizational Cultur...
Human Resources Management (HRM) Value Chain Processes, Organizational Cultur...Human Resources Management (HRM) Value Chain Processes, Organizational Cultur...
Human Resources Management (HRM) Value Chain Processes, Organizational Cultur...
 
Human resource planing
Human resource planingHuman resource planing
Human resource planing
 
What's Next: Employer Branding
What's Next: Employer BrandingWhat's Next: Employer Branding
What's Next: Employer Branding
 
Employee Engagement
Employee EngagementEmployee Engagement
Employee Engagement
 
Presentation -(Human Resource)Current Recruitment Trends- A global view
Presentation -(Human Resource)Current Recruitment Trends- A global viewPresentation -(Human Resource)Current Recruitment Trends- A global view
Presentation -(Human Resource)Current Recruitment Trends- A global view
 
HR in Boardrooms
HR in BoardroomsHR in Boardrooms
HR in Boardrooms
 
Employer branding
Employer brandingEmployer branding
Employer branding
 

Similar to Employee Termination

Caliper talent audit
Caliper talent auditCaliper talent audit
Caliper talent audit
olovnordgren
 
Creditscore
CreditscoreCreditscore
Creditscore
kevinlan
 
wk_1ScoreWeek 1.Measurement and Description - chapters 1 and 21 .docx
wk_1ScoreWeek 1.Measurement and Description - chapters 1 and 21 .docxwk_1ScoreWeek 1.Measurement and Description - chapters 1 and 21 .docx
wk_1ScoreWeek 1.Measurement and Description - chapters 1 and 21 .docx
ericbrooks84875
 

Similar to Employee Termination (20)

Employee Retension Capstone Project - Neeraj Bubby.pptx
Employee Retension Capstone Project - Neeraj Bubby.pptxEmployee Retension Capstone Project - Neeraj Bubby.pptx
Employee Retension Capstone Project - Neeraj Bubby.pptx
 
HR Analytics, Done Right
HR Analytics, Done RightHR Analytics, Done Right
HR Analytics, Done Right
 
Employee Turnover Solution Using Analytical Techniques
Employee Turnover Solution Using Analytical TechniquesEmployee Turnover Solution Using Analytical Techniques
Employee Turnover Solution Using Analytical Techniques
 
HR six sigma process
HR six sigma process HR six sigma process
HR six sigma process
 
Talent Analytics ERE 2015
Talent Analytics ERE 2015 Talent Analytics ERE 2015
Talent Analytics ERE 2015
 
Employee Attrition Analysis / Churn Prediction
Employee Attrition Analysis / Churn PredictionEmployee Attrition Analysis / Churn Prediction
Employee Attrition Analysis / Churn Prediction
 
An Approach Towards Migration Analysis of Construction Employees
An Approach Towards Migration Analysis of Construction EmployeesAn Approach Towards Migration Analysis of Construction Employees
An Approach Towards Migration Analysis of Construction Employees
 
Forward-Looking ALLL: Computing Qualitative Adjustments
Forward-Looking ALLL: Computing Qualitative AdjustmentsForward-Looking ALLL: Computing Qualitative Adjustments
Forward-Looking ALLL: Computing Qualitative Adjustments
 
Hr q professional development series—workforce planning panel
Hr q professional development series—workforce planning panelHr q professional development series—workforce planning panel
Hr q professional development series—workforce planning panel
 
Hr Analytics
Hr AnalyticsHr Analytics
Hr Analytics
 
Job Analysis Project
Job Analysis ProjectJob Analysis Project
Job Analysis Project
 
Caliper talent audit
Caliper talent auditCaliper talent audit
Caliper talent audit
 
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
 
Creditscore
CreditscoreCreditscore
Creditscore
 
Analytics offerings
Analytics offeringsAnalytics offerings
Analytics offerings
 
Intro to data science
Intro to data scienceIntro to data science
Intro to data science
 
Introduction To Data Science Using R
Introduction To Data Science Using RIntroduction To Data Science Using R
Introduction To Data Science Using R
 
wk_1ScoreWeek 1.Measurement and Description - chapters 1 and 21 .docx
wk_1ScoreWeek 1.Measurement and Description - chapters 1 and 21 .docxwk_1ScoreWeek 1.Measurement and Description - chapters 1 and 21 .docx
wk_1ScoreWeek 1.Measurement and Description - chapters 1 and 21 .docx
 
Fahim Karim: Attrition Prevention
Fahim Karim: Attrition PreventionFahim Karim: Attrition Prevention
Fahim Karim: Attrition Prevention
 
Deviation management new (1)
Deviation management new (1)Deviation management new (1)
Deviation management new (1)
 

Recently uploaded

Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
HyderabadDolls
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
gajnagarg
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
ranjankumarbehera14
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
gajnagarg
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
wsppdmt
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
chadhar227
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
gajnagarg
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
ahmedjiabur940
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
HyderabadDolls
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Klinik kandungan
 

Recently uploaded (20)

Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
 
Statistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbersStatistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbers
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 

Employee Termination

  • 1. Human Resource Management Analytics Navaneesh Gangala Data Scientist https://www.linkedin.com/in/navaneesh/
  • 2. Problem statement: • The main goal here is to predict whether an employee will stay or leave within company. • Besides this, • Key drivers for attrition are to be identified • Employees shall be classified into High ,Medium and Low risk profiles in terms of attrition • Predicting thresholds for key drivers of attrition • Employee level risk analysis showing supporting and contradicting features
  • 3. Approach • The dependent or target variable is binary in nature i.e. Terminated or not Terminated • In the given data, the target variable “Terminated” (0 = stay, 1 = leave) shall be predicted using the other features. • Hence classification could be the solution. • To check which of the classification algorithm suits to the situation ROC - AUC Analysis has been done • From the model selected, the following have been derived Key drivers that influence the employee Termination, Thresholds for each driver Various segments of risk profile has been derived.
  • 4. • Initial Exploration : • Total : 15 Variables • Target : Terminated • Unique Key : Employee_code (can be used to get the risk profile of the person once the model is ready) • Remaining 13 predictors are identified as • Numerical : Tenure, TimeLastPos, LastRating, Annual.Income, Year.of.Birth • Categorical : Rehire,Department,Job.Level,Has.been.promoted,Client.work.travel,Education,Gender,Marital.Status • Anomaly Detection : • Structure and factor levels are good • Tenure - Negative values detected and removed. • Employee_code – few records found not to be unique hence removed • LastRating - Median(2) can be used to replace imputed mean (1.95) to have better results • Education - Two levels with similar meaning made into one( Bachelor's Degree  Bachelors Degree ) • Last Time Promoted values < Tenure - Records where Validation failed were removed Data Preparation
  • 5. Rehire Terminated Employee_code Department Job Level Tenure TimeLastPos Has been promoted LastRating Client work travel Education Gender Marital Status Annual Income Year of Birth FALSE 1 133715Finance Lead Analyst 16 16 No 2Medium Travel Msc Analytics M Married 6638 1971 FALSE 1 91202Audit Senior Analyst 19 19 No 2Medium Travel Msc Analytics M Married 1004 1972 FALSE 1 51471Audit Staff I 33 33 No 2Medium Travel Msc Analytics M Married 643 1989 FALSE 1 95874Consulting Manager 39 39 No 2Medium Travel Msc Analytics M Married 5769 1964 FALSE 1 105869Finance Analyst 44 44 No 2Medium Travel Msc Analytics M Married 1043 1981 FALSE 1 20105Tax Staff II 79 79 No 2Medium Travel Msc Analytics M Married 1552 1963 FALSE 1 31135Audit Staff II 82 82 No 2Medium Travel Msc Analytics M Married 2884 1984 FALSE 1 52977Risk Management Lead Analyst 85 85 No 2Medium Travel Msc Analytics M Married 6828 1964 FALSE 1 115564Risk Management Analyst 89 89 No 2Medium Travel Msc Analytics M Married 2705 1988 Cleaned Data : HR SAMPLE V2.csv • Imbalance Problem has not being detected Target variable Unique key
  • 6. Project : Work flow Exploratory Analysis Model Building Model Validation Test data
  • 7. 1. Exploratory Analysis of Data Test data
  • 8. Promotion can be very significant Promoted Higher Levels (GM and Director job levels) very less attrition Mid Level employees who are Promoted have less attrition Promotion is less Significant in this lot
  • 9. Job Level – With Department Employee Termination looks higher Job levels has significance but not as great as Promotion (High level jobs – less exits) More terminations can be seen in high population hence model should reveal more about this Staff 1 sales employee less attrition High levels of job profile very less attrition
  • 10. Salary could be Significant -> low salary can make person to exit
  • 11. Rating - not much Significant (looks like terminations follow population i.e uniform distribution)
  • 12. YOB and Gender not Significant (Uniform distribution)
  • 13. Travel not very significant (Uniform distribution)
  • 14. Martial Status not significant (Uniform distribution)
  • 15. Education Significance could also be less(Uniform distribution)
  • 16. Rating Positive Effect Salary Negative Effect Rating Negative Effect Salary Negative Effect Rating Positive Effect Salary Positive Effect Rating Negative Effect Salary Positive Effect Combined effects of Rating and salary on exit of employee
  • 17. Avg Tenure is less than 3 years if Not Promoted Effect of Promotion on Tenure and exit of employee
  • 18. Tenure(days) Distribution across various job levels
  • 19. Tenure vs exit High termination below 1000 days (3 yrs) (Other factors like promotion, salary play an important role. Fluctuations here can be attributed to this)
  • 20. Annual Income vs exit High termination at low salaries(<3000)
  • 21. Basic Decision tree (part of exploratory analysis)
  • 22. Insights from Exploratory data analysis • Promotion, Tenure, Job levels, Department, Annual income are significant • Demographic variables like gender, Year of birth, marital status are less significant. Same goes with travel time. • Promotion turns out to be a key parameter in assessing the risk profile. • Promotion in High and mid level management positions could significantly reduce attrition in this segment. • Along with promotion, salary turns out to be an important driver at entry level jobs to retain them • All these insights have to be validated using Machine learning algorithms.
  • 23. Various Models Model comparisons Choosing a right classification model Data
  • 24. •.90-1 = excellent (A) •.80-.90 = good (B) •.70-.80 = fair (C) •.60-.70 = poor (D) •.50-.60 = fail (F) Area Under the Curve (AUC) Area under ROC curve is often used as a measure of quality of the classification models. A random classifier has an area under the curve of 0.5, while AUC for a perfect classifier is equal to 1. In practice, most of the classification models have an AUC between 0.5 and 1. Accuracy(CA): the proportion of the total number of predictions that were correct. Precision : the proportion of positive cases that were correctly identified. Sensitivity or Recall : the proportion of actual positive cases which are correctly identified. Choosing a right classification model Model comparisons
  • 25. ROC - AUC Analysis Random forest is performing well in a balanced way Choosing a right classification model
  • 26. By removing less significant variables (from exploratory analysis) and running the model few improvements in KPIs is seen Choosing a right classification model
  • 27. Objectives of model Model : Random forest Cleaned Data Key Drivers Risk Profiles Thresholds in key drivers Model chosen: Random forest Useful ideas/future prospects may pop up during the process
  • 28. Random forest (a brief overview)
  • 29. Random forest (a brief overview) m m m Growing decision trees with m predictors taken at each instance
  • 30.
  • 31. Training data (70% of given data) Testing data (30% of given data) Cleaned data
  • 32. m m m Growing decision trees with m predictors taken at each instance To find the value of m predictors to be taken at once in building the forest of decision trees
  • 33. m= 3 i.e 3 predictors shall be suitable to be taken at once to build the forest of decision trees.
  • 34. > Growing random forest > Results from random forest
  • 35. As per the model, Has.been.promoted is the most important predictor followed by Tenure TimeLastPos and so on till Rehire > Finding important predictors or key drivers of attrition from the model > Plotting them
  • 36. Now, We can use our model to predict the outcome of our testing dataset. Accuracy of the model : 84.67%
  • 38. Confusion Matrix Random Forest Predicted 0 1 Actual 0 1878 368 0.836153161 1 131 879 0.87029703 0.93479343 0.70489174 0.846744472
  • 39. 1. Key Drivers for Termination or attrition Decreasing order of importance Has.been.promoted, Tenure, Time.LastPos, Annual income, Year.of.birth, Department, Job.Level turns out to be key drivers for termination or attrition
  • 40. 2. Risk Profiling High  Error Rate : 1.14 % Low  Error Rate : 2.45 % Probability of termination of each employee can be obtained from the model. These can be divided into High, Medium and Low risk segments 2. 1 Risk Profiling segmented approach
  • 41. 2.2 Tenure thresholds for High, medium and low risk divisions Terminated
  • 42. 3. Thresholds - Tenure 3.1 Terminated Thresholds
  • 43. 3.2 Job Profile Thresholds Terminated Terminated – Not Promoted Terminated – Promoted
  • 44. 3.3 Department Thresholds Terminated Terminated – Not Promoted Terminated – Promoted
  • 45. 4. Risk Profiling Application Development Web/standalone application can be developed where a HR Manager can understand attrition risk of each employee Integrating Data from several sources like Social Media , Behavioral ,Work related ,feedbacks and Peer to Peer reviews etc. Focusing on modeling techniques we can classifier them into profiles and once model build can help us to understand the future attritions and features supporting and contradicting to particular choice employee made . Can add Alerts and other more informative dashboards for HR managers to take necessary actions at right time with the help of relative thresholds.
  • 46. 4.1 Risk Profile : For each Individual : Model ready for deployment Contradicts Supports
  • 47. 4.2 Risk Profile : Matrix (Future Scope) * Future Scope of work Can be created from more behavior Data to justify and to reduce the attrition of the employee