SlideShare a Scribd company logo
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Robust Ensemble Classifier Combination Based on Noise
Removal with One-Class SVM
Ferhat ¨Ozg¨ur C¸atak
ozgur.catak@tubitak.gov.tr
T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute
Kocaeli, Turkey
Nov. 10 2015
22nd International Conference on Neural Information Processing
(ICONIP2015)
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Table of Contents
1 Introduction
Noise
Contributions
2 Preliminaries
One-Class SVM
AdaBoost
Data Partitioning Strategies
3 Proposed Approach
Basic Idea
Analysis of the proposed algorithm
4 Experiments
Experimental Setup
Effect of Noise Removing on Input Matrix
Simulation Results
5 Conclusions
Conclusions
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Noise
Contributions
Table of Contents
1 Introduction
Noise
Contributions
2 Preliminaries
One-Class SVM
AdaBoost
Data Partitioning Strategies
3 Proposed Approach
Basic Idea
Analysis of the proposed algorithm
4 Experiments
Experimental Setup
Effect of Noise Removing on Input Matrix
Simulation Results
5 Conclusions
Conclusions
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Noise
Contributions
Noise in Machine Learning I
Definition
Noise: random error or variance in a measured variable
Noise is ”irrelevant or meaningless data”
Real-world data are affected by several components; the presence of noise
is a key factor.
Data in the real world is noisy, mainly due to:
With Big Data technologies, we store every type of data
Messages, images, tweets etc.
80% of the information generated today is of an unstructured variety 1
.
Noisy data unnecessarily increases the amount of storage space required
and can also adversely affect the results of any machine learning
algorithm.
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Noise
Contributions
Noise in Machine Learning II
The performance of the
classifiers will heavily
depend on
Quality of the
training data
Robustness against
noise of the
classifier itself
Classification
Maximum Accuracy
Data Quality
Corruptions
Learning Algorithm
Figure: The performance of a classifier
1Forbes/Tech, http://www.forbes.com/sites/ciocentral/2013/02/04/big-data-big-hype/
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Noise
Contributions
Contributions
Our Aim :
Partition the entire data set into sub-datasets to reduce the training time
The detection and removal of noise that is the result of an imperfect data
collection process
In literature, One-Class SVM is a solution to create Representational
Model of a Data Set.
Improve the classification accuracy using ensemble method (AdaBoost)
For many classification algorithms, AdaBoost results in dramatic
improvements in performance.
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
One-Class SVM
AdaBoost
Data Partitioning Strategies
Table of Contents
1 Introduction
Noise
Contributions
2 Preliminaries
One-Class SVM
AdaBoost
Data Partitioning Strategies
3 Proposed Approach
Basic Idea
Analysis of the proposed algorithm
4 Experiments
Experimental Setup
Effect of Noise Removing on Input Matrix
Simulation Results
5 Conclusions
Conclusions
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
One-Class SVM
AdaBoost
Data Partitioning Strategies
One-Class SVM I
SVM method is used to find
classification models using
the maximum margin
separating hyper plane.
Sch¨olkopf et al. proposed a
training methodology that
handles only one class
classification called as
”one-class” classification.
Basically,
The method finds soft
boundaries of the data
set
Then, model determines
whether new instance
belongs to this dataset
or not
−1 0 1 2 3 4 5
X1
−1
0
1
2
3
4
5
X2
Origin
-1
+1
Decision
Boundary
One-Class SVM
Decision Boundary
Training Instances
Figure: One-class SVM. The origin, (0, 0), is the single
instance with label −1.
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
One-Class SVM
AdaBoost
Data Partitioning Strategies
One-Class SVM II
Input instances : S = {(xi , yi )|xi ∈ Rn
, y ∈ {1, . . . , K}}m
i=1
Kernel function : φ : X → H
minw,ξ,ρ
1
2
||w||2
+
1
mC i
ξi − ρ
subject to
(w.φ(xi )) ≥ ρ − ξi
ξi ≥ 0, ∀i = 1, . . . , m
(1)
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
One-Class SVM
AdaBoost
Data Partitioning Strategies
AdaBoost
AdaBoost
AdaBoost (adaptive boosting) is an ensemble learning algorithm that can
be used for classification or regression
Adaboost improves the accuracy performance of learning algorithms
Given a space of feature vectors X and two possible class labels,
y ∈ {−1, +1}, AdaBoost goal is to learn a strong classifier H(x) as a
weighted ensemble of weak classifiers ht(x) predicting the label of any
instance x ∈ X.
AdaBoost is an algorithm for constructing a strong classifier as linear
combination
H(x) =
T
t=1
αtht(x)
of simple weak classifiers ht(x)
ht(x) : weak classifier/hypothesis
H(x) = sign(f (x)) : strong or final classifier/hypothesis
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
One-Class SVM
AdaBoost
Data Partitioning Strategies
Data Partitioning Strategies
Data Partitioning
In explatory data analysis, partition the large-scale data sets is a general
need.
Computationally infeasible data sets.
Data partitioning reasons:
Diversity : Uncorrelated base classifiers 2 3
Complexity : reducing the input complexity of large-scale data sets 4
Specific : build classifier models for the specific part of the input instances 5
.
2Kuncheva, Ludmila I. ”Using diversity measures for generating error-correcting output codes in classifier ensembles.” Pattern Recognition Letters
26.1 (2005): 83-90.
3Dara, Rozita, Masoud Makrehchi, and Mohamed S. Kamel. ”Filter-based data partitioning for training multiple classifier systems.” Knowledge and
Data Engineering, IEEE Transactions on 22.4 (2010): 508-522.
4Chawla, Nitesh V., et al. ”Distributed learning with bagging-like performance.” Pattern recognition letters 24.1 (2003): 455-471.
5Woods, Kevin, Kevin Bowyer, and W. Philip Kegelmeyer Jr. ”Combination of multiple classifiers using local accuracy estimates.” Computer Vision
and Pattern Recognition, 1996. Proceedings CVPR’96, 1996 IEEE Computer Society Conference on. IEEE, 1996.
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Basic Idea
Analysis of the proposed algorithm
Table of Contents
1 Introduction
Noise
Contributions
2 Preliminaries
One-Class SVM
AdaBoost
Data Partitioning Strategies
3 Proposed Approach
Basic Idea
Analysis of the proposed algorithm
4 Experiments
Experimental Setup
Effect of Noise Removing on Input Matrix
Simulation Results
5 Conclusions
Conclusions
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Basic Idea
Analysis of the proposed algorithm
Basic Idea
Main Objective
Partition the input data set into sub-data sets, (Xm, Ym)
Pre-processing: Noise removing is applied to each individual sub-data set.
Create local classifier ensembles for each sub data chunk
Classifier combination: Weighted voting method is used to combine the
each ensemble classifier
Sub data set: X1
Sub data set: X2
Sub data set: Xm
...
X1 → X1,clean
X2 → X2,clean
Xm → Xm,clean
H(X1,clean) → Y
H(X2,clean) → Y
H(Xm,clean) → Y
Ensemble Model Building:
... ...
InputDataSet:
S={(xi,yi)|xi∈Rn
,y∈{1,...,K}}m
i=1
EnsembleModelsCombination
Noise Removing
Figure: Overall of proposed approach.
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Basic Idea
Analysis of the proposed algorithm
Analysis of the proposed algorithm
Ensemble methods of neural networks gets better accuracy performance
over unseen examples 6
Proposed method :
For each sub-data set, there is a set of classifier functions (ensemble
classifier), H(m)
H(m)
(x) = arg max
k
M
t=1
αtht(x) (2)
combined into one single classification model, ˆH(x)
ˆH(x) = arg max
k
m
i=1
βmH(m)
(x) (3)
where βm is the accuracy of the ensemble classifier m.
6Krogh, Anders, and Jesper Vedelsby. ”Neural network ensembles, cross validation, and
active learning.” Advances in neural information processing systems 7 (1995): 231-238.
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Experimental Setup
Effect of Noise Removing on Input Matrix
Simulation Results
Table of Contents
1 Introduction
Noise
Contributions
2 Preliminaries
One-Class SVM
AdaBoost
Data Partitioning Strategies
3 Proposed Approach
Basic Idea
Analysis of the proposed algorithm
4 Experiments
Experimental Setup
Effect of Noise Removing on Input Matrix
Simulation Results
5 Conclusions
Conclusions
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Experimental Setup
Effect of Noise Removing on Input Matrix
Simulation Results
Experimental Setup
Our approach is applied to five different data sets to verify its model
effectivity and efficiency.
Table: Description of the testing data sets used in the experiments.
Data set #Train #Test #Classes #Attributes
cod-rna 59,535 157,413 2 8
ijcnn1 49,990 91,701 2 22
letter 15,000 5,000 26 16
shuttle 43,500 14,500 7 9
SensIT Vehicle 78,823 19,705 3 100
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Experimental Setup
Effect of Noise Removing on Input Matrix
Simulation Results
Effect of Noise Removing on Input Matrix I
Gini
Gini impurity is a measure of how often a randomly chosen element from the set
would be incorrectly labeled if it were randomly labeled according to the
distribution of labels in the subset.
Gini Impurity is used to measure the quality of procedure.
Gini approaches deal appropriately with data diversity of a data.
The Gini measures the class distribution of variable y = {y1, · · · , ym}
g = 1 −
k
p2
j (4)
where pj is the probability of class k, in data set D.
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Experimental Setup
Effect of Noise Removing on Input Matrix
Simulation Results
Effect of Noise Removing on Input Matrix II
0 0.1 0.2 0.3 0.4 0.5 0.6
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Noise Removing Percantage
GiniImpurity
cod−rna
SensIT Vehicle
ijcnn1
letter
shuttle
(a) Cleaned part.
0 0.1 0.2 0.3 0.4 0.5 0.6
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Noise Removing Percantage
GiniImpurity
cod−rna
SensIT Vehicle
ijcnn1
letter
shuttle
(b) Removed part.
Figure: The impact of one-class SVM on the performance in terms of Gini impurity.
The Gini impurity value decreases with the cleaning of noisy instances from
the data, and increases on separated noisy data.
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Experimental Setup
Effect of Noise Removing on Input Matrix
Simulation Results
Effect of Noise Removing on Input Matrix III
Aim is to minimizing the Gini impurity on clean data set Xclean, maximizing
the value on noisy data set Xnoisy .
Split Percantage = arg min
p
Gini(Xclean, p)
Gini(Xnoisy , p)
(5)
Table: The best noise removal percentages of each data sets.
Data Sets Percentage Gini(Xclean) Gini(Xnoisy ) Gini(Xclean)
Gini(Xnoisy )
cod-rna 0,55 0,331344656 0,56831775 0,583027112
SensIT Vehicle 0,60 0,461155751 0,568847982 0,810683638
ijcnn1 0,60 0,167652825 0,179397223 0,934534117
letter 0,60 0,9039108 0,926430562 0,975691905
shuttle 0,30 0,214142833 0,506938229 0,422423918
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Experimental Setup
Effect of Noise Removing on Input Matrix
Simulation Results
Simulation Results
Table: Classification performance on example datasets using One-Class SVM noise
removing and without removing for the proposed learning algorithm.
Extra trees K-nn SVM
Data set All Clean All Clean All Clean
cod-rna 0,75929 0,78652 0,91955 0,93513 0,88553 0,89806
ijcnn1 0,69758 0,72175 0,75533 0,77833 0,82989 0,84326
letter 0,91255 0,90853 0,90594 0,90318 0,92121 0,9199
shuttle 0,47801 0,44792 0,20262 0,26974 0,62301 0,63939
SensIT Vehicle 0,9602 0,90558 0,87904 0,88266 0,99232 0,99174
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Conclusions
Table of Contents
1 Introduction
Noise
Contributions
2 Preliminaries
One-Class SVM
AdaBoost
Data Partitioning Strategies
3 Proposed Approach
Basic Idea
Analysis of the proposed algorithm
4 Experiments
Experimental Setup
Effect of Noise Removing on Input Matrix
Simulation Results
5 Conclusions
Conclusions
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Conclusions
Conclusions
Classification Performance
Improving classification performance
Removing the noisy instances using one-class SVM
Find best noise removing ratio with Gini impurity value
Experiments
Experimental results show that
Input complexity of the classification algorithms reduced remarkably
The accuracy increased by using the noise removal process
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
Introduction
Preliminaries
Proposed Approach
Experiments
Conclusions
Conclusions
Thank You
Ferhat ¨Ozg¨ur C¸atak
ozgur.catak@tubitak.gov.tr
Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S

More Related Content

What's hot

Developing Document Image Retrieval System
Developing Document Image Retrieval SystemDeveloping Document Image Retrieval System
Developing Document Image Retrieval System
Konstantinos Zagoris
 
Unsupervised Feature Learning
Unsupervised Feature LearningUnsupervised Feature Learning
Unsupervised Feature LearningAmgad Muhammad
 
analysis on concealing information within non secret data
analysis on concealing information within non secret dataanalysis on concealing information within non secret data
analysis on concealing information within non secret data
Vema Reddy
 
Approximate bounded-knowledge-extractionusing-type-i-fuzzy-logic
Approximate bounded-knowledge-extractionusing-type-i-fuzzy-logicApproximate bounded-knowledge-extractionusing-type-i-fuzzy-logic
Approximate bounded-knowledge-extractionusing-type-i-fuzzy-logicCemal Ardil
 
Accelerating Deep Learning Inference 
on Mobile Systems
Accelerating Deep Learning Inference 
on Mobile SystemsAccelerating Deep Learning Inference 
on Mobile Systems
Accelerating Deep Learning Inference 
on Mobile Systems
Darian Frajberg
 
Neural Network Models on the Prediction of Tool Wear in Turning Processes: A ...
Neural Network Models on the Prediction of Tool Wear in Turning Processes: A ...Neural Network Models on the Prediction of Tool Wear in Turning Processes: A ...
Neural Network Models on the Prediction of Tool Wear in Turning Processes: A ...
Dept. Mechanical Engineering, Faculty of Engineering, Pharos University in Alexandria PUA
 
Neural Network Classification and its Applications in Insurance Industry
Neural Network Classification and its Applications in Insurance IndustryNeural Network Classification and its Applications in Insurance Industry
Neural Network Classification and its Applications in Insurance IndustryInderjeet Singh
 
A Novel GA-SVM Model For Vehicles And Pedestrial Classification In Videos
A Novel GA-SVM Model For Vehicles And Pedestrial Classification In VideosA Novel GA-SVM Model For Vehicles And Pedestrial Classification In Videos
A Novel GA-SVM Model For Vehicles And Pedestrial Classification In Videos
ijtsrd
 
Image Recognition With the Help of Auto-Associative Neural Network
Image Recognition With the Help of Auto-Associative Neural NetworkImage Recognition With the Help of Auto-Associative Neural Network
Image Recognition With the Help of Auto-Associative Neural Network
CSCJournals
 
Piotr Mirowski - Review Autoencoders (Deep Learning) - CIUUK14
Piotr Mirowski - Review Autoencoders (Deep Learning) - CIUUK14Piotr Mirowski - Review Autoencoders (Deep Learning) - CIUUK14
Piotr Mirowski - Review Autoencoders (Deep Learning) - CIUUK14
Daniel Lewis
 
Data Steganography for Optical Color Image Cryptosystems
Data Steganography for Optical Color Image CryptosystemsData Steganography for Optical Color Image Cryptosystems
Data Steganography for Optical Color Image Cryptosystems
CSCJournals
 
J04401066071
J04401066071J04401066071
J04401066071
ijceronline
 
Making Robots Learn
Making Robots LearnMaking Robots Learn
Making Robots Learn
inside-BigData.com
 
Computational model for artificial learning using formal concept analysis
Computational model for artificial learning using formal concept analysisComputational model for artificial learning using formal concept analysis
Computational model for artificial learning using formal concept analysisAboul Ella Hassanien
 
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr TeterwakLearn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
PyData
 
A R T I F I C I A L N E U R A L N E T W O R K S J N T U M O D E L P A P ...
A R T I F I C I A L  N E U R A L  N E T W O R K S  J N T U  M O D E L  P A P ...A R T I F I C I A L  N E U R A L  N E T W O R K S  J N T U  M O D E L  P A P ...
A R T I F I C I A L N E U R A L N E T W O R K S J N T U M O D E L P A P ...guest3f9c6b
 
Using Deep Learning to Find Similar Dresses
Using Deep Learning to Find Similar DressesUsing Deep Learning to Find Similar Dresses
Using Deep Learning to Find Similar Dresses
HJ van Veen
 

What's hot (18)

Developing Document Image Retrieval System
Developing Document Image Retrieval SystemDeveloping Document Image Retrieval System
Developing Document Image Retrieval System
 
Unsupervised Feature Learning
Unsupervised Feature LearningUnsupervised Feature Learning
Unsupervised Feature Learning
 
Automatic Image Annotation
Automatic Image AnnotationAutomatic Image Annotation
Automatic Image Annotation
 
analysis on concealing information within non secret data
analysis on concealing information within non secret dataanalysis on concealing information within non secret data
analysis on concealing information within non secret data
 
Approximate bounded-knowledge-extractionusing-type-i-fuzzy-logic
Approximate bounded-knowledge-extractionusing-type-i-fuzzy-logicApproximate bounded-knowledge-extractionusing-type-i-fuzzy-logic
Approximate bounded-knowledge-extractionusing-type-i-fuzzy-logic
 
Accelerating Deep Learning Inference 
on Mobile Systems
Accelerating Deep Learning Inference 
on Mobile SystemsAccelerating Deep Learning Inference 
on Mobile Systems
Accelerating Deep Learning Inference 
on Mobile Systems
 
Neural Network Models on the Prediction of Tool Wear in Turning Processes: A ...
Neural Network Models on the Prediction of Tool Wear in Turning Processes: A ...Neural Network Models on the Prediction of Tool Wear in Turning Processes: A ...
Neural Network Models on the Prediction of Tool Wear in Turning Processes: A ...
 
Neural Network Classification and its Applications in Insurance Industry
Neural Network Classification and its Applications in Insurance IndustryNeural Network Classification and its Applications in Insurance Industry
Neural Network Classification and its Applications in Insurance Industry
 
A Novel GA-SVM Model For Vehicles And Pedestrial Classification In Videos
A Novel GA-SVM Model For Vehicles And Pedestrial Classification In VideosA Novel GA-SVM Model For Vehicles And Pedestrial Classification In Videos
A Novel GA-SVM Model For Vehicles And Pedestrial Classification In Videos
 
Image Recognition With the Help of Auto-Associative Neural Network
Image Recognition With the Help of Auto-Associative Neural NetworkImage Recognition With the Help of Auto-Associative Neural Network
Image Recognition With the Help of Auto-Associative Neural Network
 
Piotr Mirowski - Review Autoencoders (Deep Learning) - CIUUK14
Piotr Mirowski - Review Autoencoders (Deep Learning) - CIUUK14Piotr Mirowski - Review Autoencoders (Deep Learning) - CIUUK14
Piotr Mirowski - Review Autoencoders (Deep Learning) - CIUUK14
 
Data Steganography for Optical Color Image Cryptosystems
Data Steganography for Optical Color Image CryptosystemsData Steganography for Optical Color Image Cryptosystems
Data Steganography for Optical Color Image Cryptosystems
 
J04401066071
J04401066071J04401066071
J04401066071
 
Making Robots Learn
Making Robots LearnMaking Robots Learn
Making Robots Learn
 
Computational model for artificial learning using formal concept analysis
Computational model for artificial learning using formal concept analysisComputational model for artificial learning using formal concept analysis
Computational model for artificial learning using formal concept analysis
 
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr TeterwakLearn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
 
A R T I F I C I A L N E U R A L N E T W O R K S J N T U M O D E L P A P ...
A R T I F I C I A L  N E U R A L  N E T W O R K S  J N T U  M O D E L  P A P ...A R T I F I C I A L  N E U R A L  N E T W O R K S  J N T U  M O D E L  P A P ...
A R T I F I C I A L N E U R A L N E T W O R K S J N T U M O D E L P A P ...
 
Using Deep Learning to Find Similar Dresses
Using Deep Learning to Find Similar DressesUsing Deep Learning to Find Similar Dresses
Using Deep Learning to Find Similar Dresses
 

Viewers also liked

Eşle/İndirge Yöntemi Kullanılarak Destek Vektör Makinesi Algoritması ile Yü...
Eşle/İndirge Yöntemi Kullanılarak Destek Vektör Makinesi Algoritması ile Yü...Eşle/İndirge Yöntemi Kullanılarak Destek Vektör Makinesi Algoritması ile Yü...
Eşle/İndirge Yöntemi Kullanılarak Destek Vektör Makinesi Algoritması ile Yü...
Ferhat Ozgur Catak
 
Rassal Bölümlenmiş Veri Üzerinde Aşırı Öğrenme Makinesi ve Topluluk Algoritma...
Rassal Bölümlenmiş Veri Üzerinde Aşırı Öğrenme Makinesi ve Topluluk Algoritma...Rassal Bölümlenmiş Veri Üzerinde Aşırı Öğrenme Makinesi ve Topluluk Algoritma...
Rassal Bölümlenmiş Veri Üzerinde Aşırı Öğrenme Makinesi ve Topluluk Algoritma...
Ferhat Ozgur Catak
 
Korelasyon tabanlı nitelik seçimi
Korelasyon tabanlı nitelik seçimiKorelasyon tabanlı nitelik seçimi
Korelasyon tabanlı nitelik seçimi
Ferhat Ozgur Catak
 
Veritabanı Sızma Testleri - Hafta 3
Veritabanı Sızma Testleri - Hafta 3Veritabanı Sızma Testleri - Hafta 3
Veritabanı Sızma Testleri - Hafta 3
Ferhat Ozgur Catak
 
Veritabanı Sızma Testleri - 2
Veritabanı Sızma Testleri - 2Veritabanı Sızma Testleri - 2
Veritabanı Sızma Testleri - 2
Ferhat Ozgur Catak
 
Sosyal muhendislik 1
Sosyal muhendislik 1Sosyal muhendislik 1
Sosyal muhendislik 1
Ferhat Ozgur Catak
 
Kesif ve Zafiyet Tarama
Kesif ve Zafiyet TaramaKesif ve Zafiyet Tarama
Kesif ve Zafiyet Tarama
Ferhat Ozgur Catak
 
Veritabanı Sızma Testleri - Keşif
Veritabanı Sızma Testleri - KeşifVeritabanı Sızma Testleri - Keşif
Veritabanı Sızma Testleri - Keşif
Ferhat Ozgur Catak
 
Ddos Sızma Testleri - 2
Ddos Sızma Testleri - 2Ddos Sızma Testleri - 2
Ddos Sızma Testleri - 2
Ferhat Ozgur Catak
 
Dağıtık Servis Dışı Bırakma Saldırıları
Dağıtık Servis Dışı Bırakma SaldırılarıDağıtık Servis Dışı Bırakma Saldırıları
Dağıtık Servis Dışı Bırakma Saldırıları
Ferhat Ozgur Catak
 
Sızma Testi ve Güvenlik Denetlemeleri - Giriş
Sızma Testi ve Güvenlik Denetlemeleri - GirişSızma Testi ve Güvenlik Denetlemeleri - Giriş
Sızma Testi ve Güvenlik Denetlemeleri - Giriş
Ferhat Ozgur Catak
 
Pivoting ve Tunneling
Pivoting ve TunnelingPivoting ve Tunneling
Pivoting ve Tunneling
Ferhat Ozgur Catak
 
Sızma Testi ve Güvenlik Denetlemeleri - Temel Linux Bilgisi
Sızma Testi ve Güvenlik Denetlemeleri - Temel Linux BilgisiSızma Testi ve Güvenlik Denetlemeleri - Temel Linux Bilgisi
Sızma Testi ve Güvenlik Denetlemeleri - Temel Linux Bilgisi
Ferhat Ozgur Catak
 
Temel Linux Kullanımı ve Komutları
Temel Linux Kullanımı ve KomutlarıTemel Linux Kullanımı ve Komutları
Temel Linux Kullanımı ve Komutları
Ahmet Gürel
 
PAROLA KIRMA SALDIRILARI
PAROLA KIRMA SALDIRILARIPAROLA KIRMA SALDIRILARI
PAROLA KIRMA SALDIRILARI
BGA Cyber Security
 
Fuzzy Analytic Hierarchy Based DBMS Selection In Turkish National Identity Ca...
Fuzzy Analytic Hierarchy Based DBMS Selection In Turkish National Identity Ca...Fuzzy Analytic Hierarchy Based DBMS Selection In Turkish National Identity Ca...
Fuzzy Analytic Hierarchy Based DBMS Selection In Turkish National Identity Ca...
Ferhat Ozgur Catak
 
MapReduce based SVM
MapReduce based SVMMapReduce based SVM
MapReduce based SVM
Ferhat Ozgur Catak
 
Berezha Security
Berezha SecurityBerezha Security
Berezha Security
Vlad Styran
 
Windows hacking 1
Windows hacking 1Windows hacking 1
Windows hacking 1
Ferhat Ozgur Catak
 
Dağıtık Sistemler İçin Mahremiyet Korumalı Uç Öğrenme Makinesi Sınıflandırma ...
Dağıtık Sistemler İçin Mahremiyet Korumalı Uç Öğrenme Makinesi Sınıflandırma ...Dağıtık Sistemler İçin Mahremiyet Korumalı Uç Öğrenme Makinesi Sınıflandırma ...
Dağıtık Sistemler İçin Mahremiyet Korumalı Uç Öğrenme Makinesi Sınıflandırma ...
Ferhat Ozgur Catak
 

Viewers also liked (20)

Eşle/İndirge Yöntemi Kullanılarak Destek Vektör Makinesi Algoritması ile Yü...
Eşle/İndirge Yöntemi Kullanılarak Destek Vektör Makinesi Algoritması ile Yü...Eşle/İndirge Yöntemi Kullanılarak Destek Vektör Makinesi Algoritması ile Yü...
Eşle/İndirge Yöntemi Kullanılarak Destek Vektör Makinesi Algoritması ile Yü...
 
Rassal Bölümlenmiş Veri Üzerinde Aşırı Öğrenme Makinesi ve Topluluk Algoritma...
Rassal Bölümlenmiş Veri Üzerinde Aşırı Öğrenme Makinesi ve Topluluk Algoritma...Rassal Bölümlenmiş Veri Üzerinde Aşırı Öğrenme Makinesi ve Topluluk Algoritma...
Rassal Bölümlenmiş Veri Üzerinde Aşırı Öğrenme Makinesi ve Topluluk Algoritma...
 
Korelasyon tabanlı nitelik seçimi
Korelasyon tabanlı nitelik seçimiKorelasyon tabanlı nitelik seçimi
Korelasyon tabanlı nitelik seçimi
 
Veritabanı Sızma Testleri - Hafta 3
Veritabanı Sızma Testleri - Hafta 3Veritabanı Sızma Testleri - Hafta 3
Veritabanı Sızma Testleri - Hafta 3
 
Veritabanı Sızma Testleri - 2
Veritabanı Sızma Testleri - 2Veritabanı Sızma Testleri - 2
Veritabanı Sızma Testleri - 2
 
Sosyal muhendislik 1
Sosyal muhendislik 1Sosyal muhendislik 1
Sosyal muhendislik 1
 
Kesif ve Zafiyet Tarama
Kesif ve Zafiyet TaramaKesif ve Zafiyet Tarama
Kesif ve Zafiyet Tarama
 
Veritabanı Sızma Testleri - Keşif
Veritabanı Sızma Testleri - KeşifVeritabanı Sızma Testleri - Keşif
Veritabanı Sızma Testleri - Keşif
 
Ddos Sızma Testleri - 2
Ddos Sızma Testleri - 2Ddos Sızma Testleri - 2
Ddos Sızma Testleri - 2
 
Dağıtık Servis Dışı Bırakma Saldırıları
Dağıtık Servis Dışı Bırakma SaldırılarıDağıtık Servis Dışı Bırakma Saldırıları
Dağıtık Servis Dışı Bırakma Saldırıları
 
Sızma Testi ve Güvenlik Denetlemeleri - Giriş
Sızma Testi ve Güvenlik Denetlemeleri - GirişSızma Testi ve Güvenlik Denetlemeleri - Giriş
Sızma Testi ve Güvenlik Denetlemeleri - Giriş
 
Pivoting ve Tunneling
Pivoting ve TunnelingPivoting ve Tunneling
Pivoting ve Tunneling
 
Sızma Testi ve Güvenlik Denetlemeleri - Temel Linux Bilgisi
Sızma Testi ve Güvenlik Denetlemeleri - Temel Linux BilgisiSızma Testi ve Güvenlik Denetlemeleri - Temel Linux Bilgisi
Sızma Testi ve Güvenlik Denetlemeleri - Temel Linux Bilgisi
 
Temel Linux Kullanımı ve Komutları
Temel Linux Kullanımı ve KomutlarıTemel Linux Kullanımı ve Komutları
Temel Linux Kullanımı ve Komutları
 
PAROLA KIRMA SALDIRILARI
PAROLA KIRMA SALDIRILARIPAROLA KIRMA SALDIRILARI
PAROLA KIRMA SALDIRILARI
 
Fuzzy Analytic Hierarchy Based DBMS Selection In Turkish National Identity Ca...
Fuzzy Analytic Hierarchy Based DBMS Selection In Turkish National Identity Ca...Fuzzy Analytic Hierarchy Based DBMS Selection In Turkish National Identity Ca...
Fuzzy Analytic Hierarchy Based DBMS Selection In Turkish National Identity Ca...
 
MapReduce based SVM
MapReduce based SVMMapReduce based SVM
MapReduce based SVM
 
Berezha Security
Berezha SecurityBerezha Security
Berezha Security
 
Windows hacking 1
Windows hacking 1Windows hacking 1
Windows hacking 1
 
Dağıtık Sistemler İçin Mahremiyet Korumalı Uç Öğrenme Makinesi Sınıflandırma ...
Dağıtık Sistemler İçin Mahremiyet Korumalı Uç Öğrenme Makinesi Sınıflandırma ...Dağıtık Sistemler İçin Mahremiyet Korumalı Uç Öğrenme Makinesi Sınıflandırma ...
Dağıtık Sistemler İçin Mahremiyet Korumalı Uç Öğrenme Makinesi Sınıflandırma ...
 

Similar to Robust Ensemble Classifier Combination Based on Noise Removal with One-Class SVM

Applying Deep Learning with Weak and Noisy labels
Applying Deep Learning with Weak and Noisy labelsApplying Deep Learning with Weak and Noisy labels
Applying Deep Learning with Weak and Noisy labels
Darian Frajberg
 
NSL KDD Cup 99 dataset Anomaly Detection using Machine Learning Technique
NSL KDD Cup 99 dataset Anomaly Detection using Machine Learning Technique NSL KDD Cup 99 dataset Anomaly Detection using Machine Learning Technique
NSL KDD Cup 99 dataset Anomaly Detection using Machine Learning Technique
Sujeet Suryawanshi
 
Testing survey by_directions
Testing survey by_directionsTesting survey by_directions
Testing survey by_directionsTao He
 
IRJET-Performance Enhancement in Machine Learning System using Hybrid Bee Col...
IRJET-Performance Enhancement in Machine Learning System using Hybrid Bee Col...IRJET-Performance Enhancement in Machine Learning System using Hybrid Bee Col...
IRJET-Performance Enhancement in Machine Learning System using Hybrid Bee Col...
IRJET Journal
 
Noise-robust classification with hypergraph neural network
Noise-robust classification with hypergraph neural networkNoise-robust classification with hypergraph neural network
Noise-robust classification with hypergraph neural network
nooriasukmaningtyas
 
Performance Evaluation: A Comparative Study of Various Classifiers
Performance Evaluation: A Comparative Study of Various ClassifiersPerformance Evaluation: A Comparative Study of Various Classifiers
Performance Evaluation: A Comparative Study of Various Classifiers
amreshkr19
 
Acoustic Scene Classification by using Combination of MODWPT and Spectral Fea...
Acoustic Scene Classification by using Combination of MODWPT and Spectral Fea...Acoustic Scene Classification by using Combination of MODWPT and Spectral Fea...
Acoustic Scene Classification by using Combination of MODWPT and Spectral Fea...
ijtsrd
 
A Stacked Generalization Ensemble Approach for Improved Intrusion Detection
A Stacked Generalization Ensemble Approach for Improved Intrusion DetectionA Stacked Generalization Ensemble Approach for Improved Intrusion Detection
A Stacked Generalization Ensemble Approach for Improved Intrusion Detection
IJCSIS Research Publications
 
A novel automatic voice recognition system based on text-independent in a noi...
A novel automatic voice recognition system based on text-independent in a noi...A novel automatic voice recognition system based on text-independent in a noi...
A novel automatic voice recognition system based on text-independent in a noi...
IJECEIAES
 
Using the Machine to predict Testability
Using the Machine to predict TestabilityUsing the Machine to predict Testability
Using the Machine to predict Testability
Miguel Lopez
 
Time Series Classification with Deep Learning | Marco Del Pra
Time Series Classification with Deep Learning | Marco Del PraTime Series Classification with Deep Learning | Marco Del Pra
Time Series Classification with Deep Learning | Marco Del Pra
Data Science Milan
 
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
adil raja
 
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
adil raja
 
Eswc2009
Eswc2009Eswc2009
Eswc2009fanizzi
 
NEURAL Network Design Training
NEURAL Network Design  TrainingNEURAL Network Design  Training
NEURAL Network Design TrainingESCOM
 
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Data mining
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Data miningIEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Data mining
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Data mining
sunda2011
 
Service Classification through Machine Learning: Aiding in the Efficient Ide...
 Service Classification through Machine Learning: Aiding in the Efficient Ide... Service Classification through Machine Learning: Aiding in the Efficient Ide...
Service Classification through Machine Learning: Aiding in the Efficient Ide...
SEAA 2022
 

Similar to Robust Ensemble Classifier Combination Based on Noise Removal with One-Class SVM (20)

Applying Deep Learning with Weak and Noisy labels
Applying Deep Learning with Weak and Noisy labelsApplying Deep Learning with Weak and Noisy labels
Applying Deep Learning with Weak and Noisy labels
 
NSL KDD Cup 99 dataset Anomaly Detection using Machine Learning Technique
NSL KDD Cup 99 dataset Anomaly Detection using Machine Learning Technique NSL KDD Cup 99 dataset Anomaly Detection using Machine Learning Technique
NSL KDD Cup 99 dataset Anomaly Detection using Machine Learning Technique
 
Testing survey by_directions
Testing survey by_directionsTesting survey by_directions
Testing survey by_directions
 
IRJET-Performance Enhancement in Machine Learning System using Hybrid Bee Col...
IRJET-Performance Enhancement in Machine Learning System using Hybrid Bee Col...IRJET-Performance Enhancement in Machine Learning System using Hybrid Bee Col...
IRJET-Performance Enhancement in Machine Learning System using Hybrid Bee Col...
 
Noise-robust classification with hypergraph neural network
Noise-robust classification with hypergraph neural networkNoise-robust classification with hypergraph neural network
Noise-robust classification with hypergraph neural network
 
Performance Evaluation: A Comparative Study of Various Classifiers
Performance Evaluation: A Comparative Study of Various ClassifiersPerformance Evaluation: A Comparative Study of Various Classifiers
Performance Evaluation: A Comparative Study of Various Classifiers
 
Acoustic Scene Classification by using Combination of MODWPT and Spectral Fea...
Acoustic Scene Classification by using Combination of MODWPT and Spectral Fea...Acoustic Scene Classification by using Combination of MODWPT and Spectral Fea...
Acoustic Scene Classification by using Combination of MODWPT and Spectral Fea...
 
A Stacked Generalization Ensemble Approach for Improved Intrusion Detection
A Stacked Generalization Ensemble Approach for Improved Intrusion DetectionA Stacked Generalization Ensemble Approach for Improved Intrusion Detection
A Stacked Generalization Ensemble Approach for Improved Intrusion Detection
 
.doc
.doc.doc
.doc
 
.doc
.doc.doc
.doc
 
.doc
.doc.doc
.doc
 
A novel automatic voice recognition system based on text-independent in a noi...
A novel automatic voice recognition system based on text-independent in a noi...A novel automatic voice recognition system based on text-independent in a noi...
A novel automatic voice recognition system based on text-independent in a noi...
 
Using the Machine to predict Testability
Using the Machine to predict TestabilityUsing the Machine to predict Testability
Using the Machine to predict Testability
 
Time Series Classification with Deep Learning | Marco Del Pra
Time Series Classification with Deep Learning | Marco Del PraTime Series Classification with Deep Learning | Marco Del Pra
Time Series Classification with Deep Learning | Marco Del Pra
 
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
 
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
 
Eswc2009
Eswc2009Eswc2009
Eswc2009
 
NEURAL Network Design Training
NEURAL Network Design  TrainingNEURAL Network Design  Training
NEURAL Network Design Training
 
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Data mining
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Data miningIEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Data mining
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Data mining
 
Service Classification through Machine Learning: Aiding in the Efficient Ide...
 Service Classification through Machine Learning: Aiding in the Efficient Ide... Service Classification through Machine Learning: Aiding in the Efficient Ide...
Service Classification through Machine Learning: Aiding in the Efficient Ide...
 

Recently uploaded

做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
Nanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdfNanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdf
eddie19851
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Enterprise Wired
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
mzpolocfi
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 

Recently uploaded (20)

做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
Nanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdfNanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdf
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 

Robust Ensemble Classifier Combination Based on Noise Removal with One-Class SVM

  • 1. Introduction Preliminaries Proposed Approach Experiments Conclusions Robust Ensemble Classifier Combination Based on Noise Removal with One-Class SVM Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, Turkey Nov. 10 2015 22nd International Conference on Neural Information Processing (ICONIP2015) Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 2. Introduction Preliminaries Proposed Approach Experiments Conclusions Table of Contents 1 Introduction Noise Contributions 2 Preliminaries One-Class SVM AdaBoost Data Partitioning Strategies 3 Proposed Approach Basic Idea Analysis of the proposed algorithm 4 Experiments Experimental Setup Effect of Noise Removing on Input Matrix Simulation Results 5 Conclusions Conclusions Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 3. Introduction Preliminaries Proposed Approach Experiments Conclusions Noise Contributions Table of Contents 1 Introduction Noise Contributions 2 Preliminaries One-Class SVM AdaBoost Data Partitioning Strategies 3 Proposed Approach Basic Idea Analysis of the proposed algorithm 4 Experiments Experimental Setup Effect of Noise Removing on Input Matrix Simulation Results 5 Conclusions Conclusions Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 4. Introduction Preliminaries Proposed Approach Experiments Conclusions Noise Contributions Noise in Machine Learning I Definition Noise: random error or variance in a measured variable Noise is ”irrelevant or meaningless data” Real-world data are affected by several components; the presence of noise is a key factor. Data in the real world is noisy, mainly due to: With Big Data technologies, we store every type of data Messages, images, tweets etc. 80% of the information generated today is of an unstructured variety 1 . Noisy data unnecessarily increases the amount of storage space required and can also adversely affect the results of any machine learning algorithm. Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 5. Introduction Preliminaries Proposed Approach Experiments Conclusions Noise Contributions Noise in Machine Learning II The performance of the classifiers will heavily depend on Quality of the training data Robustness against noise of the classifier itself Classification Maximum Accuracy Data Quality Corruptions Learning Algorithm Figure: The performance of a classifier 1Forbes/Tech, http://www.forbes.com/sites/ciocentral/2013/02/04/big-data-big-hype/ Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 6. Introduction Preliminaries Proposed Approach Experiments Conclusions Noise Contributions Contributions Our Aim : Partition the entire data set into sub-datasets to reduce the training time The detection and removal of noise that is the result of an imperfect data collection process In literature, One-Class SVM is a solution to create Representational Model of a Data Set. Improve the classification accuracy using ensemble method (AdaBoost) For many classification algorithms, AdaBoost results in dramatic improvements in performance. Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 7. Introduction Preliminaries Proposed Approach Experiments Conclusions One-Class SVM AdaBoost Data Partitioning Strategies Table of Contents 1 Introduction Noise Contributions 2 Preliminaries One-Class SVM AdaBoost Data Partitioning Strategies 3 Proposed Approach Basic Idea Analysis of the proposed algorithm 4 Experiments Experimental Setup Effect of Noise Removing on Input Matrix Simulation Results 5 Conclusions Conclusions Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 8. Introduction Preliminaries Proposed Approach Experiments Conclusions One-Class SVM AdaBoost Data Partitioning Strategies One-Class SVM I SVM method is used to find classification models using the maximum margin separating hyper plane. Sch¨olkopf et al. proposed a training methodology that handles only one class classification called as ”one-class” classification. Basically, The method finds soft boundaries of the data set Then, model determines whether new instance belongs to this dataset or not −1 0 1 2 3 4 5 X1 −1 0 1 2 3 4 5 X2 Origin -1 +1 Decision Boundary One-Class SVM Decision Boundary Training Instances Figure: One-class SVM. The origin, (0, 0), is the single instance with label −1. Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 9. Introduction Preliminaries Proposed Approach Experiments Conclusions One-Class SVM AdaBoost Data Partitioning Strategies One-Class SVM II Input instances : S = {(xi , yi )|xi ∈ Rn , y ∈ {1, . . . , K}}m i=1 Kernel function : φ : X → H minw,ξ,ρ 1 2 ||w||2 + 1 mC i ξi − ρ subject to (w.φ(xi )) ≥ ρ − ξi ξi ≥ 0, ∀i = 1, . . . , m (1) Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 10. Introduction Preliminaries Proposed Approach Experiments Conclusions One-Class SVM AdaBoost Data Partitioning Strategies AdaBoost AdaBoost AdaBoost (adaptive boosting) is an ensemble learning algorithm that can be used for classification or regression Adaboost improves the accuracy performance of learning algorithms Given a space of feature vectors X and two possible class labels, y ∈ {−1, +1}, AdaBoost goal is to learn a strong classifier H(x) as a weighted ensemble of weak classifiers ht(x) predicting the label of any instance x ∈ X. AdaBoost is an algorithm for constructing a strong classifier as linear combination H(x) = T t=1 αtht(x) of simple weak classifiers ht(x) ht(x) : weak classifier/hypothesis H(x) = sign(f (x)) : strong or final classifier/hypothesis Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 11. Introduction Preliminaries Proposed Approach Experiments Conclusions One-Class SVM AdaBoost Data Partitioning Strategies Data Partitioning Strategies Data Partitioning In explatory data analysis, partition the large-scale data sets is a general need. Computationally infeasible data sets. Data partitioning reasons: Diversity : Uncorrelated base classifiers 2 3 Complexity : reducing the input complexity of large-scale data sets 4 Specific : build classifier models for the specific part of the input instances 5 . 2Kuncheva, Ludmila I. ”Using diversity measures for generating error-correcting output codes in classifier ensembles.” Pattern Recognition Letters 26.1 (2005): 83-90. 3Dara, Rozita, Masoud Makrehchi, and Mohamed S. Kamel. ”Filter-based data partitioning for training multiple classifier systems.” Knowledge and Data Engineering, IEEE Transactions on 22.4 (2010): 508-522. 4Chawla, Nitesh V., et al. ”Distributed learning with bagging-like performance.” Pattern recognition letters 24.1 (2003): 455-471. 5Woods, Kevin, Kevin Bowyer, and W. Philip Kegelmeyer Jr. ”Combination of multiple classifiers using local accuracy estimates.” Computer Vision and Pattern Recognition, 1996. Proceedings CVPR’96, 1996 IEEE Computer Society Conference on. IEEE, 1996. Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 12. Introduction Preliminaries Proposed Approach Experiments Conclusions Basic Idea Analysis of the proposed algorithm Table of Contents 1 Introduction Noise Contributions 2 Preliminaries One-Class SVM AdaBoost Data Partitioning Strategies 3 Proposed Approach Basic Idea Analysis of the proposed algorithm 4 Experiments Experimental Setup Effect of Noise Removing on Input Matrix Simulation Results 5 Conclusions Conclusions Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 13. Introduction Preliminaries Proposed Approach Experiments Conclusions Basic Idea Analysis of the proposed algorithm Basic Idea Main Objective Partition the input data set into sub-data sets, (Xm, Ym) Pre-processing: Noise removing is applied to each individual sub-data set. Create local classifier ensembles for each sub data chunk Classifier combination: Weighted voting method is used to combine the each ensemble classifier Sub data set: X1 Sub data set: X2 Sub data set: Xm ... X1 → X1,clean X2 → X2,clean Xm → Xm,clean H(X1,clean) → Y H(X2,clean) → Y H(Xm,clean) → Y Ensemble Model Building: ... ... InputDataSet: S={(xi,yi)|xi∈Rn ,y∈{1,...,K}}m i=1 EnsembleModelsCombination Noise Removing Figure: Overall of proposed approach. Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 14. Introduction Preliminaries Proposed Approach Experiments Conclusions Basic Idea Analysis of the proposed algorithm Analysis of the proposed algorithm Ensemble methods of neural networks gets better accuracy performance over unseen examples 6 Proposed method : For each sub-data set, there is a set of classifier functions (ensemble classifier), H(m) H(m) (x) = arg max k M t=1 αtht(x) (2) combined into one single classification model, ˆH(x) ˆH(x) = arg max k m i=1 βmH(m) (x) (3) where βm is the accuracy of the ensemble classifier m. 6Krogh, Anders, and Jesper Vedelsby. ”Neural network ensembles, cross validation, and active learning.” Advances in neural information processing systems 7 (1995): 231-238. Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 15. Introduction Preliminaries Proposed Approach Experiments Conclusions Experimental Setup Effect of Noise Removing on Input Matrix Simulation Results Table of Contents 1 Introduction Noise Contributions 2 Preliminaries One-Class SVM AdaBoost Data Partitioning Strategies 3 Proposed Approach Basic Idea Analysis of the proposed algorithm 4 Experiments Experimental Setup Effect of Noise Removing on Input Matrix Simulation Results 5 Conclusions Conclusions Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 16. Introduction Preliminaries Proposed Approach Experiments Conclusions Experimental Setup Effect of Noise Removing on Input Matrix Simulation Results Experimental Setup Our approach is applied to five different data sets to verify its model effectivity and efficiency. Table: Description of the testing data sets used in the experiments. Data set #Train #Test #Classes #Attributes cod-rna 59,535 157,413 2 8 ijcnn1 49,990 91,701 2 22 letter 15,000 5,000 26 16 shuttle 43,500 14,500 7 9 SensIT Vehicle 78,823 19,705 3 100 Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 17. Introduction Preliminaries Proposed Approach Experiments Conclusions Experimental Setup Effect of Noise Removing on Input Matrix Simulation Results Effect of Noise Removing on Input Matrix I Gini Gini impurity is a measure of how often a randomly chosen element from the set would be incorrectly labeled if it were randomly labeled according to the distribution of labels in the subset. Gini Impurity is used to measure the quality of procedure. Gini approaches deal appropriately with data diversity of a data. The Gini measures the class distribution of variable y = {y1, · · · , ym} g = 1 − k p2 j (4) where pj is the probability of class k, in data set D. Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 18. Introduction Preliminaries Proposed Approach Experiments Conclusions Experimental Setup Effect of Noise Removing on Input Matrix Simulation Results Effect of Noise Removing on Input Matrix II 0 0.1 0.2 0.3 0.4 0.5 0.6 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Noise Removing Percantage GiniImpurity cod−rna SensIT Vehicle ijcnn1 letter shuttle (a) Cleaned part. 0 0.1 0.2 0.3 0.4 0.5 0.6 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Noise Removing Percantage GiniImpurity cod−rna SensIT Vehicle ijcnn1 letter shuttle (b) Removed part. Figure: The impact of one-class SVM on the performance in terms of Gini impurity. The Gini impurity value decreases with the cleaning of noisy instances from the data, and increases on separated noisy data. Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 19. Introduction Preliminaries Proposed Approach Experiments Conclusions Experimental Setup Effect of Noise Removing on Input Matrix Simulation Results Effect of Noise Removing on Input Matrix III Aim is to minimizing the Gini impurity on clean data set Xclean, maximizing the value on noisy data set Xnoisy . Split Percantage = arg min p Gini(Xclean, p) Gini(Xnoisy , p) (5) Table: The best noise removal percentages of each data sets. Data Sets Percentage Gini(Xclean) Gini(Xnoisy ) Gini(Xclean) Gini(Xnoisy ) cod-rna 0,55 0,331344656 0,56831775 0,583027112 SensIT Vehicle 0,60 0,461155751 0,568847982 0,810683638 ijcnn1 0,60 0,167652825 0,179397223 0,934534117 letter 0,60 0,9039108 0,926430562 0,975691905 shuttle 0,30 0,214142833 0,506938229 0,422423918 Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 20. Introduction Preliminaries Proposed Approach Experiments Conclusions Experimental Setup Effect of Noise Removing on Input Matrix Simulation Results Simulation Results Table: Classification performance on example datasets using One-Class SVM noise removing and without removing for the proposed learning algorithm. Extra trees K-nn SVM Data set All Clean All Clean All Clean cod-rna 0,75929 0,78652 0,91955 0,93513 0,88553 0,89806 ijcnn1 0,69758 0,72175 0,75533 0,77833 0,82989 0,84326 letter 0,91255 0,90853 0,90594 0,90318 0,92121 0,9199 shuttle 0,47801 0,44792 0,20262 0,26974 0,62301 0,63939 SensIT Vehicle 0,9602 0,90558 0,87904 0,88266 0,99232 0,99174 Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 21. Introduction Preliminaries Proposed Approach Experiments Conclusions Conclusions Table of Contents 1 Introduction Noise Contributions 2 Preliminaries One-Class SVM AdaBoost Data Partitioning Strategies 3 Proposed Approach Basic Idea Analysis of the proposed algorithm 4 Experiments Experimental Setup Effect of Noise Removing on Input Matrix Simulation Results 5 Conclusions Conclusions Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 22. Introduction Preliminaries Proposed Approach Experiments Conclusions Conclusions Conclusions Classification Performance Improving classification performance Removing the noisy instances using one-class SVM Find best noise removing ratio with Gini impurity value Experiments Experimental results show that Input complexity of the classification algorithms reduced remarkably The accuracy increased by using the noise removal process Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S
  • 23. Introduction Preliminaries Proposed Approach Experiments Conclusions Conclusions Thank You Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr Ferhat ¨Ozg¨ur C¸atak ozgur.catak@tubitak.gov.tr T¨UB˙ITAK - B˙ILGEM - Cyber Security Institute Kocaeli, TurkeyRobust Ensemble Classifier Combination Based on Noise Removal with One-Class S