SlideShare a Scribd company logo
RESEARCH POSTER PRESENTATION DESIGN © 2019
www.PosterPresentations.com
Abstract
Our work is about the study of
performance and eligibility of decision
trees for solving imbalanced data
problems.
First, we have investigated classical
decision trees on imbalanced data, we
noticed this yield a poor performance on
minority class, this stands for decision
trees is sensitive to imbalance class
problem. To improve this, we have been
able to combine multiple decision trees.
This combination lead us to think about
random forest (RF), including Balanced
random forest (BRF) and weighted random
forest (WRF). Our results show that
Balanced random forest outperform
Random forest, decision tree and weighted
random forest.
Keywords: Machine Learning, imbalanced
classes, Decision trees, Ensemble methods.
Supervisor : Prof. Kafunda Katalay
Author :Henock Makumbu Mboko
"Application of Ensemble methods for solving imbalanced data problems"
Data sources and subject context Estimation of Sensitivity with respect to the models studied Estimation of Recall for the Positives in relation to models studied
Interpretation of results
Estimation of the error rate in relation to the models studies
Data
Distribution
(D/N/ & P.D/
Models Error Rate Sensitivity Specificity RP RN
Data set 4523 4000 ND TD 14,0014738 42,5149701 97,1428571 43,030303 91,9463087
521 PD RF 11,274871 28,742515 97,1428571 58,5365854 90,6666667
Training set 3164 2810 D.N WRF 10,8327192 40,7185629 95,9663866 58,6206897 92,0225624
354 D.P
Test Set 1357 1190 D.N BRF 7,00073692 71,8562874 95,6912029 71,4285714 95,7771788
167 D.P
In this experiment, we use data from a
bank which contains the transactional
information of customers contacted in
advance by the bank in order to invite
them to make the deposit operations
and are observable over a given period
by the bank.
These data are so imbalanced meaning
that customers who responded
positively and negatively constitute the
minority and majority classes
respectively.
those who responded positively
constitute the minority class and those
who did not respond negatively form
the majority class.
Our dataset highlighted that the minority class is the
positive class and this constitute our interest class on which
we aim to improve the performance. Our measures of
performance are error rate, specificity, sensitivity and recall
for positive/negative.
The above table and figures highlighted that BFR
outperform RF, WRF and decision tree due to its lower
error rate, higher values for both, sensitivity and
specificity.
Although the BFR improves the minority class performance,
but we are still have a problem, because BFR will hurt the
majority class performance for the benefit of minority class.
WRF seems to outperform slightly BFR with respect to the
performance of negative class whereas BRF outperform all
other methods with respect to positive class performance.
As future work, we aim to improve BFR in order to trade-
off the majority and minority classes performance.
Tabular Presentation of the results
University of Kinshasa_RDC
Department of Mathematics and Computer Sciences, Faculty of Sciences

More Related Content

Similar to PosterPresentations.Henock_Makumbu.pdf

L017358286
L017358286L017358286
L017358286
IOSR Journals
 
UTILIZING IMBALANCED DATA AND CLASSIFICATION COST MATRIX TO PREDICT MOVIE PRE...
UTILIZING IMBALANCED DATA AND CLASSIFICATION COST MATRIX TO PREDICT MOVIE PRE...UTILIZING IMBALANCED DATA AND CLASSIFICATION COST MATRIX TO PREDICT MOVIE PRE...
UTILIZING IMBALANCED DATA AND CLASSIFICATION COST MATRIX TO PREDICT MOVIE PRE...
ijaia
 
Instance Selection and Optimization of Neural Networks
Instance Selection and Optimization of Neural NetworksInstance Selection and Optimization of Neural Networks
Instance Selection and Optimization of Neural Networks
ITIIIndustries
 
AHP-ANALYTIC HIERARCHY PROCESS- How To Slove AHP in Excel
AHP-ANALYTIC HIERARCHY PROCESS- How To Slove AHP in ExcelAHP-ANALYTIC HIERARCHY PROCESS- How To Slove AHP in Excel
AHP-ANALYTIC HIERARCHY PROCESS- How To Slove AHP in Excel
Megha Ahuja
 
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...
ijtsrd
 
228-SE3001_2
228-SE3001_2228-SE3001_2
228-SE3001_2
Boshra Albayaty
 
Statistics For Bi
Statistics For BiStatistics For Bi
Statistics For Bi
Angela Hays
 
Case Study 2 SCADA WormProtecting the nation’s critical infra.docx
Case Study 2 SCADA WormProtecting the nation’s critical infra.docxCase Study 2 SCADA WormProtecting the nation’s critical infra.docx
Case Study 2 SCADA WormProtecting the nation’s critical infra.docx
wendolynhalbert
 
Examples Of Comparing And Contrast Essays
Examples Of Comparing And Contrast EssaysExamples Of Comparing And Contrast Essays
Examples Of Comparing And Contrast Essays
Angelavette Dowdy
 
Feature selection
Feature selectionFeature selection
Feature selection
Abhinav Katoch
 
Results Based Accountabillty
Results Based AccountabilltyResults Based Accountabillty
Results Based Accountabillty
SuncoastMeetings
 
CariesICTAI2008
CariesICTAI2008CariesICTAI2008
CariesICTAI2008
Robson D. Montenegro
 
Top 100+ Google Data Science Interview Questions.pdf
Top 100+ Google Data Science Interview Questions.pdfTop 100+ Google Data Science Interview Questions.pdf
Top 100+ Google Data Science Interview Questions.pdf
Datacademy.ai
 
Impact Of Perceptual Mapping Of Star Hotel
Impact Of Perceptual Mapping Of Star HotelImpact Of Perceptual Mapping Of Star Hotel
Impact Of Perceptual Mapping Of Star Hotel
Rachel Phillips
 
BALLARD INTEGRATED MANAGED SERVICES, INC. PART 2TEAM CINST.docx
BALLARD INTEGRATED MANAGED SERVICES, INC. PART 2TEAM CINST.docxBALLARD INTEGRATED MANAGED SERVICES, INC. PART 2TEAM CINST.docx
BALLARD INTEGRATED MANAGED SERVICES, INC. PART 2TEAM CINST.docx
ikirkton
 
Recession Proof Your Marketing Spend
Recession Proof Your Marketing SpendRecession Proof Your Marketing Spend
Recession Proof Your Marketing Spend
absorbedjug4032
 

Similar to PosterPresentations.Henock_Makumbu.pdf (16)

L017358286
L017358286L017358286
L017358286
 
UTILIZING IMBALANCED DATA AND CLASSIFICATION COST MATRIX TO PREDICT MOVIE PRE...
UTILIZING IMBALANCED DATA AND CLASSIFICATION COST MATRIX TO PREDICT MOVIE PRE...UTILIZING IMBALANCED DATA AND CLASSIFICATION COST MATRIX TO PREDICT MOVIE PRE...
UTILIZING IMBALANCED DATA AND CLASSIFICATION COST MATRIX TO PREDICT MOVIE PRE...
 
Instance Selection and Optimization of Neural Networks
Instance Selection and Optimization of Neural NetworksInstance Selection and Optimization of Neural Networks
Instance Selection and Optimization of Neural Networks
 
AHP-ANALYTIC HIERARCHY PROCESS- How To Slove AHP in Excel
AHP-ANALYTIC HIERARCHY PROCESS- How To Slove AHP in ExcelAHP-ANALYTIC HIERARCHY PROCESS- How To Slove AHP in Excel
AHP-ANALYTIC HIERARCHY PROCESS- How To Slove AHP in Excel
 
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...
 
228-SE3001_2
228-SE3001_2228-SE3001_2
228-SE3001_2
 
Statistics For Bi
Statistics For BiStatistics For Bi
Statistics For Bi
 
Case Study 2 SCADA WormProtecting the nation’s critical infra.docx
Case Study 2 SCADA WormProtecting the nation’s critical infra.docxCase Study 2 SCADA WormProtecting the nation’s critical infra.docx
Case Study 2 SCADA WormProtecting the nation’s critical infra.docx
 
Examples Of Comparing And Contrast Essays
Examples Of Comparing And Contrast EssaysExamples Of Comparing And Contrast Essays
Examples Of Comparing And Contrast Essays
 
Feature selection
Feature selectionFeature selection
Feature selection
 
Results Based Accountabillty
Results Based AccountabilltyResults Based Accountabillty
Results Based Accountabillty
 
CariesICTAI2008
CariesICTAI2008CariesICTAI2008
CariesICTAI2008
 
Top 100+ Google Data Science Interview Questions.pdf
Top 100+ Google Data Science Interview Questions.pdfTop 100+ Google Data Science Interview Questions.pdf
Top 100+ Google Data Science Interview Questions.pdf
 
Impact Of Perceptual Mapping Of Star Hotel
Impact Of Perceptual Mapping Of Star HotelImpact Of Perceptual Mapping Of Star Hotel
Impact Of Perceptual Mapping Of Star Hotel
 
BALLARD INTEGRATED MANAGED SERVICES, INC. PART 2TEAM CINST.docx
BALLARD INTEGRATED MANAGED SERVICES, INC. PART 2TEAM CINST.docxBALLARD INTEGRATED MANAGED SERVICES, INC. PART 2TEAM CINST.docx
BALLARD INTEGRATED MANAGED SERVICES, INC. PART 2TEAM CINST.docx
 
Recession Proof Your Marketing Spend
Recession Proof Your Marketing SpendRecession Proof Your Marketing Spend
Recession Proof Your Marketing Spend
 

More from kwadwoAmedi

IndabaX2022Agishaposter.pdf
IndabaX2022Agishaposter.pdfIndabaX2022Agishaposter.pdf
IndabaX2022Agishaposter.pdf
kwadwoAmedi
 
POSTER_Ewonye.pdf
POSTER_Ewonye.pdfPOSTER_Ewonye.pdf
POSTER_Ewonye.pdf
kwadwoAmedi
 
Dengue Fever Presentation.pdf
Dengue Fever Presentation.pdfDengue Fever Presentation.pdf
Dengue Fever Presentation.pdf
kwadwoAmedi
 
IndabaX Ghana Poster.pdf
IndabaX Ghana Poster.pdfIndabaX Ghana Poster.pdf
IndabaX Ghana Poster.pdf
kwadwoAmedi
 
Poster Presentation GDSS 2022 (IndabaX Ghana) Adjei Boateng.pdf
Poster Presentation GDSS 2022 (IndabaX Ghana)  Adjei Boateng.pdfPoster Presentation GDSS 2022 (IndabaX Ghana)  Adjei Boateng.pdf
Poster Presentation GDSS 2022 (IndabaX Ghana) Adjei Boateng.pdf
kwadwoAmedi
 
GDSS_IndabaX_Maranatha.pdf
GDSS_IndabaX_Maranatha.pdfGDSS_IndabaX_Maranatha.pdf
GDSS_IndabaX_Maranatha.pdf
kwadwoAmedi
 
malnutration.pdf
malnutration.pdfmalnutration.pdf
malnutration.pdf
kwadwoAmedi
 

More from kwadwoAmedi (7)

IndabaX2022Agishaposter.pdf
IndabaX2022Agishaposter.pdfIndabaX2022Agishaposter.pdf
IndabaX2022Agishaposter.pdf
 
POSTER_Ewonye.pdf
POSTER_Ewonye.pdfPOSTER_Ewonye.pdf
POSTER_Ewonye.pdf
 
Dengue Fever Presentation.pdf
Dengue Fever Presentation.pdfDengue Fever Presentation.pdf
Dengue Fever Presentation.pdf
 
IndabaX Ghana Poster.pdf
IndabaX Ghana Poster.pdfIndabaX Ghana Poster.pdf
IndabaX Ghana Poster.pdf
 
Poster Presentation GDSS 2022 (IndabaX Ghana) Adjei Boateng.pdf
Poster Presentation GDSS 2022 (IndabaX Ghana)  Adjei Boateng.pdfPoster Presentation GDSS 2022 (IndabaX Ghana)  Adjei Boateng.pdf
Poster Presentation GDSS 2022 (IndabaX Ghana) Adjei Boateng.pdf
 
GDSS_IndabaX_Maranatha.pdf
GDSS_IndabaX_Maranatha.pdfGDSS_IndabaX_Maranatha.pdf
GDSS_IndabaX_Maranatha.pdf
 
malnutration.pdf
malnutration.pdfmalnutration.pdf
malnutration.pdf
 

Recently uploaded

4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
Social Samosa
 
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
y3i0qsdzb
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
wyddcwye1
 
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdfUdemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Fernanda Palhano
 
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docxDATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
SaffaIbrahim1
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
Sm321
 
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
ihavuls
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
bopyb
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
Timothy Spann
 
"Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens"
"Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens""Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens"
"Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens"
sameer shah
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
nuttdpt
 
Build applications with generative AI on Google Cloud
Build applications with generative AI on Google CloudBuild applications with generative AI on Google Cloud
Build applications with generative AI on Google Cloud
Márton Kodok
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
taqyea
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
xclpvhuk
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Aggregage
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
v7oacc3l
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
jitskeb
 

Recently uploaded (20)

4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
 
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
 
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdfUdemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
 
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docxDATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
 
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
 
"Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens"
"Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens""Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens"
"Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens"
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
 
Build applications with generative AI on Google Cloud
Build applications with generative AI on Google CloudBuild applications with generative AI on Google Cloud
Build applications with generative AI on Google Cloud
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
 

PosterPresentations.Henock_Makumbu.pdf

  • 1. RESEARCH POSTER PRESENTATION DESIGN © 2019 www.PosterPresentations.com Abstract Our work is about the study of performance and eligibility of decision trees for solving imbalanced data problems. First, we have investigated classical decision trees on imbalanced data, we noticed this yield a poor performance on minority class, this stands for decision trees is sensitive to imbalance class problem. To improve this, we have been able to combine multiple decision trees. This combination lead us to think about random forest (RF), including Balanced random forest (BRF) and weighted random forest (WRF). Our results show that Balanced random forest outperform Random forest, decision tree and weighted random forest. Keywords: Machine Learning, imbalanced classes, Decision trees, Ensemble methods. Supervisor : Prof. Kafunda Katalay Author :Henock Makumbu Mboko "Application of Ensemble methods for solving imbalanced data problems" Data sources and subject context Estimation of Sensitivity with respect to the models studied Estimation of Recall for the Positives in relation to models studied Interpretation of results Estimation of the error rate in relation to the models studies Data Distribution (D/N/ & P.D/ Models Error Rate Sensitivity Specificity RP RN Data set 4523 4000 ND TD 14,0014738 42,5149701 97,1428571 43,030303 91,9463087 521 PD RF 11,274871 28,742515 97,1428571 58,5365854 90,6666667 Training set 3164 2810 D.N WRF 10,8327192 40,7185629 95,9663866 58,6206897 92,0225624 354 D.P Test Set 1357 1190 D.N BRF 7,00073692 71,8562874 95,6912029 71,4285714 95,7771788 167 D.P In this experiment, we use data from a bank which contains the transactional information of customers contacted in advance by the bank in order to invite them to make the deposit operations and are observable over a given period by the bank. These data are so imbalanced meaning that customers who responded positively and negatively constitute the minority and majority classes respectively. those who responded positively constitute the minority class and those who did not respond negatively form the majority class. Our dataset highlighted that the minority class is the positive class and this constitute our interest class on which we aim to improve the performance. Our measures of performance are error rate, specificity, sensitivity and recall for positive/negative. The above table and figures highlighted that BFR outperform RF, WRF and decision tree due to its lower error rate, higher values for both, sensitivity and specificity. Although the BFR improves the minority class performance, but we are still have a problem, because BFR will hurt the majority class performance for the benefit of minority class. WRF seems to outperform slightly BFR with respect to the performance of negative class whereas BRF outperform all other methods with respect to positive class performance. As future work, we aim to improve BFR in order to trade- off the majority and minority classes performance. Tabular Presentation of the results University of Kinshasa_RDC Department of Mathematics and Computer Sciences, Faculty of Sciences