SlideShare a Scribd company logo
1 of 42
Download to read offline
Improving Adaptive Bagging Methods
for Evolving Data Streams
A. Bifet, G. Holmes, B. Pfahringer, and R. Gavaldà
University of Waikato
Hamilton, New Zealand
Laboratory for Relational Algorithmics, Complexity and Learning LARCA
UPC-Barcelona Tech, Catalonia
Nanjing, 4 November 2009
1st Asian Conference on Machine Learning (ACML’09)
Motivation
MOA Software for Mining Data Streams
Build a useful software mining for massive data sets
Bagging for data streams
Improve accuracy on classification methods for data
streams
2 / 26
Data stream classification cycle
1 Process an example at a time,
and inspect it only once (at
most)
2 Use a limited amount of
memory
3 Work in a limited amount of
time
4 Be ready to predict at any
point
3 / 26
Realtime analytics: from Databases to Dataflows
Data streams
Data streams are ordered datasets
Not all datasets are data streams
All dataset may be processed incrementally as a data
stream
MOA: Massive Online Analysis
Faster Mining Software using less resources
Instant mining: more for less
4 / 26
What is MOA?
{M}assive {O}nline {A}nalysis is a framework for online learning
from data streams.
It is closely related to WEKA
It includes a collection of offline and online as well as tools
for evaluation:
boosting and bagging
Hoeffding Trees
with and without Naïve Bayes classifiers at the leaves.
5 / 26
WEKA
Waikato Environment for Knowledge Analysis
Collection of state-of-the-art machine learning algorithms
and data processing tools implemented in Java
Released under the GPL
Support for the whole process of experimental data mining
Preparation of input data
Statistical evaluation of learning schemes
Visualization of input data and the result of learning
Used for education, research and applications
Complements “Data Mining” by Witten & Frank
6 / 26
WEKA: the bird
7 / 26
MOA: the bird
The Moa (another native NZ bird) is not only flightless, like the
Weka, but also extinct.
8 / 26
MOA: the bird
The Moa (another native NZ bird) is not only flightless, like the
Weka, but also extinct.
8 / 26
MOA: the bird
The Moa (another native NZ bird) is not only flightless, like the
Weka, but also extinct.
8 / 26
MOA: the bird
The Moa (another native NZ bird) is not only flightless, like the
Weka, but also extinct.
8 / 26
Concept Drift Framework
t
f(t) f(t)
α
α
t0
W
0.5
1
Definition
Given two data streams a, b, we define c = a⊕W
t0
b as the data
stream built joining the two data streams a and b
Pr[c(t) = b(t)] = 1/(1+e−4(t−t0)/W ).
Pr[c(t) = a(t)] = 1−Pr[c(t) = b(t)]
9 / 26
Concept Drift Framework
t
f(t) f(t)
α
α
t0
W
0.5
1
Example
(((a⊕W0
t0
b)⊕W1
t1
c)⊕W2
t2
d)...
(((SEA9 ⊕W
t0
SEA8)⊕W
2t0
SEA7)⊕W
3t0
SEA9.5)
CovPokElec = (CoverType⊕5,000
581,012 Poker)⊕5,000
1,000,000 ELEC2
9 / 26
New Ensemble Methods For Evolving Data Streams
New Ensemble Methods For Evolving Streams (KDD’09)
a new experimental data stream framework for studying
concept drift
two new variants of Bagging:
ADWIN Bagging
Adaptive-Size Hoeffding Tree (ASHT) Bagging.
an evaluation study on synthetic and real-world datasets
10 / 26
Outline
1 Adaptive-Size Hoeffding Tree bagging
2 ADWIN Bagging
3 Empirical evaluation
11 / 26
Adaptive-Size Hoeffding Tree
T1 T2 T3 T4
Ensemble of trees of different size
each tree has a maximum size
after one node splits, it deletes some nodes to reduce its
size if the size of the tree is higher than the maximum value
12 / 26
Adaptive-Size Hoeffding Tree
T1 T2 T3 T4
Ensemble of trees of different size
smaller trees adapt more quickly to changes,
larger trees do better during periods with little change
diversity
12 / 26
Adaptive-Size Hoeffding Tree
0,2
0,21
0,22
0,23
0,24
0,25
0,26
0,27
0,28
0,29
0,3
0 0,1 0,2 0,3 0,4 0,5 0,6
Kappa
Error
0,25
0,255
0,26
0,265
0,27
0,275
0,28
0,1 0,12 0,14 0,16 0,18 0,2 0,22 0,24 0,26 0,28 0,3
Kappa
Error
Figure: Kappa-Error diagrams for ASHT bagging (left) and bagging
(right) on dataset RandomRBF with drift, plotting 90 pairs of
classifiers.
13 / 26
Improvement for ASHT Bagging Method
Improvement for ASHT Bagging ensemble method
Bagging using trees of different size
add a change detector for each tree in the ensemble
DDM: Gama et al.
EDDM: Baena, del Campo, Fidalgo et al.
14 / 26
Outline
1 Adaptive-Size Hoeffding Tree bagging
2 ADWIN Bagging
3 Empirical evaluation
15 / 26
ADWIN Bagging
ADWIN
An adaptive sliding window whose size is recomputed online
according to the rate of change observed.
ADWIN has rigorous guarantees (theorems)
On ratio of false positives and negatives
On the relation of the size of the current window and
change rates
ADWIN Bagging
When a change is detected, the worst classifier is removed and
a new classifier is added.
16 / 26
Optimal Change Detector and Predictor
ADWIN
High accuracy
Fast detection of change
Low false positives and false negatives ratios
Low computational cost: minimum space and time needed
Theoretical guarantees
No parameters needed
Estimator with Memory and Change Detector
17 / 26
Algorithm ADaptive Sliding WINdow
ADWIN
Example
W= 101010110111111
W0= 1
ADWIN: ADAPTIVE WINDOWING ALGORITHM
1 Initialize Window W
2 for each t > 0
3 do W ← W ∪{xt } (i.e., add xt to the head of W)
4 repeat Drop elements from the tail of W
5 until |µ̂W0
− µ̂W1
| < εc holds
6 for every split of W into W = W0 ·W1
7 Output µ̂W
18 / 26
Algorithm ADaptive Sliding WINdow
ADWIN
Example
W= 101010110111111
W0= 1 W1 = 01010110111111
ADWIN: ADAPTIVE WINDOWING ALGORITHM
1 Initialize Window W
2 for each t > 0
3 do W ← W ∪{xt } (i.e., add xt to the head of W)
4 repeat Drop elements from the tail of W
5 until |µ̂W0
− µ̂W1
| < εc holds
6 for every split of W into W = W0 ·W1
7 Output µ̂W
18 / 26
Algorithm ADaptive Sliding WINdow
ADWIN
Example
W= 101010110111111
W0= 10 W1 = 1010110111111
ADWIN: ADAPTIVE WINDOWING ALGORITHM
1 Initialize Window W
2 for each t > 0
3 do W ← W ∪{xt } (i.e., add xt to the head of W)
4 repeat Drop elements from the tail of W
5 until |µ̂W0
− µ̂W1
| < εc holds
6 for every split of W into W = W0 ·W1
7 Output µ̂W
18 / 26
Algorithm ADaptive Sliding WINdow
ADWIN
Example
W= 101010110111111
W0= 101 W1 = 010110111111
ADWIN: ADAPTIVE WINDOWING ALGORITHM
1 Initialize Window W
2 for each t > 0
3 do W ← W ∪{xt } (i.e., add xt to the head of W)
4 repeat Drop elements from the tail of W
5 until |µ̂W0
− µ̂W1
| < εc holds
6 for every split of W into W = W0 ·W1
7 Output µ̂W
18 / 26
Algorithm ADaptive Sliding WINdow
ADWIN
Example
W= 101010110111111
W0= 1010 W1 = 10110111111
ADWIN: ADAPTIVE WINDOWING ALGORITHM
1 Initialize Window W
2 for each t > 0
3 do W ← W ∪{xt } (i.e., add xt to the head of W)
4 repeat Drop elements from the tail of W
5 until |µ̂W0
− µ̂W1
| < εc holds
6 for every split of W into W = W0 ·W1
7 Output µ̂W
18 / 26
Algorithm ADaptive Sliding WINdow
ADWIN
Example
W= 101010110111111
W0= 10101 W1 = 0110111111
ADWIN: ADAPTIVE WINDOWING ALGORITHM
1 Initialize Window W
2 for each t > 0
3 do W ← W ∪{xt } (i.e., add xt to the head of W)
4 repeat Drop elements from the tail of W
5 until |µ̂W0
− µ̂W1
| < εc holds
6 for every split of W into W = W0 ·W1
7 Output µ̂W
18 / 26
Algorithm ADaptive Sliding WINdow
ADWIN
Example
W= 101010110111111
W0= 101010 W1 = 110111111
ADWIN: ADAPTIVE WINDOWING ALGORITHM
1 Initialize Window W
2 for each t > 0
3 do W ← W ∪{xt } (i.e., add xt to the head of W)
4 repeat Drop elements from the tail of W
5 until |µ̂W0
− µ̂W1
| < εc holds
6 for every split of W into W = W0 ·W1
7 Output µ̂W
18 / 26
Algorithm ADaptive Sliding WINdow
ADWIN
Example
W= 101010110111111
W0= 1010101 W1 = 10111111
ADWIN: ADAPTIVE WINDOWING ALGORITHM
1 Initialize Window W
2 for each t > 0
3 do W ← W ∪{xt } (i.e., add xt to the head of W)
4 repeat Drop elements from the tail of W
5 until |µ̂W0
− µ̂W1
| < εc holds
6 for every split of W into W = W0 ·W1
7 Output µ̂W
18 / 26
Algorithm ADaptive Sliding WINdow
ADWIN
Example
W= 101010110111111
W0= 10101011 W1 = 0111111
ADWIN: ADAPTIVE WINDOWING ALGORITHM
1 Initialize Window W
2 for each t > 0
3 do W ← W ∪{xt } (i.e., add xt to the head of W)
4 repeat Drop elements from the tail of W
5 until |µ̂W0
− µ̂W1
| < εc holds
6 for every split of W into W = W0 ·W1
7 Output µ̂W
18 / 26
Algorithm ADaptive Sliding WINdow
ADWIN
Example
W= 101010110111111 |µ̂W0
− µ̂W1
| ≥ εc : CHANGE DET.!
W0= 101010110 W1 = 111111
ADWIN: ADAPTIVE WINDOWING ALGORITHM
1 Initialize Window W
2 for each t > 0
3 do W ← W ∪{xt } (i.e., add xt to the head of W)
4 repeat Drop elements from the tail of W
5 until |µ̂W0
− µ̂W1
| < εc holds
6 for every split of W into W = W0 ·W1
7 Output µ̂W
18 / 26
Algorithm ADaptive Sliding WINdow
ADWIN
Example
W= 101010110111111 Drop elements from the tail of W
W0= 101010110 W1 = 111111
ADWIN: ADAPTIVE WINDOWING ALGORITHM
1 Initialize Window W
2 for each t > 0
3 do W ← W ∪{xt } (i.e., add xt to the head of W)
4 repeat Drop elements from the tail of W
5 until |µ̂W0
− µ̂W1
| < εc holds
6 for every split of W into W = W0 ·W1
7 Output µ̂W
18 / 26
Algorithm ADaptive Sliding WINdow
ADWIN
Example
W= 01010110111111 Drop elements from the tail of W
W0= 101010110 W1 = 111111
ADWIN: ADAPTIVE WINDOWING ALGORITHM
1 Initialize Window W
2 for each t > 0
3 do W ← W ∪{xt } (i.e., add xt to the head of W)
4 repeat Drop elements from the tail of W
5 until |µ̂W0
− µ̂W1
| < εc holds
6 for every split of W into W = W0 ·W1
7 Output µ̂W
18 / 26
Algorithm ADaptive Sliding WINdow
ADWIN
Theorem
At every time step we have:
1 (False positive rate bound). If µt remains constant within
W, the probability that ADWIN shrinks the window at this
step is at most δ.
2 (False negative rate bound). Suppose that for some
partition of W in two parts W0W1 (where W1 contains the
most recent items) we have |µW0
− µW1
| > 2εc. Then with
probability 1−δ ADWIN shrinks W to W1, or shorter.
ADWIN tunes itself to the data stream at hand, with no need for
the user to hardwire or precompute parameters.
19 / 26
Algorithm ADaptive Sliding WINdow
ADWIN
ADWIN using a Data Stream Sliding Window Model,
can provide the exact counts of 1’s in O(1) time per point.
tries O(logW) cutpoints
uses O(1
ε logW) memory words
the processing time per example is O(logW) (amortized
and worst-case).
Sliding Window Model
1010101 101 11 1 1
Content: 4 2 2 1 1
Capacity: 7 3 2 1 1
20 / 26
ADWIN bagging using Hoeffding Adaptive Trees
Decision Trees: Hoeffding Adaptive Tree
CVFDT: Hulten, Spencer and Domingos
No theoretical guarantees on the error rate of CVFDT
Parameters needed : size of window, number of
examples,...
Hoeffding Adaptive Tree:
replace frequency statistics counters by estimators
don’t need a window to store examples
use a change detector with theoretical guarantees to
substitute trees
Advantages:
1 Theoretical guarantees
2 No Parameters
21 / 26
Outline
1 Adaptive-Size Hoeffding Tree bagging
2 ADWIN Bagging
3 Empirical evaluation
22 / 26
Empirical evaluation
Dataset Most Accurate Method
Hyperplane Drift 0.0001 Bag10 ASHT W+R
Hyperplane Drift 0.001 DDM Bag10 ASHT W
SEA W = 50 BagADWIN 10 HAT
SEA W = 50000 BagADWIN 10 HAT
RandomRBF No Drift 50 centers Bag 10 HT
RandomRBF Drift .0001 50 centers BagADWIN 10 HAT
RandomRBF Drift .001 50 centers DDM Bag10 ASHT W
RandomRBF Drift .001 10 centers BagADWIN 10 HAT
Cover Type DDM Bag10 ASHT W
Poker BagADWIN 10 HAT
Electricity DDM Bag10 ASHT W
CovPokElec BagADWIN 10 HAT
23 / 26
Empirical evaluation
SEA
W= 50000
Time Acc. Mem.
BagADWIN 10 HAT 154.91 88.88 ± 0.05 2.35
DDM Bag10 ASHT W 44.02 88.72 ± 0.05 0.65
NaiveBayes 5.52 84.60 ± 0.03 0.00
NBADWIN 12.40 87.83 ± 0.07 0.02
HT 7.20 85.02 ± 0.11 0.33
HT DDM 7.88 88.17 ± 0.18 0.16
HAT 20.96 88.40 ± 0.07 0.18
BagADWIN 10 HT 53.15 88.58 ± 0.10 0.88
Bag10 HT 30.88 85.38 ± 0.06 3.36
Bag10 ASHT W+R 33.56 88.51 ± 0.06 0.84
24 / 26
Empirical evaluation
Accuracy
71,4
71,9
72,4
72,9
73,4
73,9
10.000 140.000 270.000 400.000 530.000 660.000 790.000 920.000
Instances
Accuracy
(%)
BagAdwin HAT
DDM BagHAST
EDDM BagHast
DDM HT
EDDM HT
BagAdwin HT
BagHAST
Figure: Accuracy on dataset LED with three concept drifts.
25 / 26
Summary
http://www.cs.waikato.ac.nz/∼abifet/MOA/
Conclusions
New improvements for ensemble bagging methods:
Adaptive-Size Hoeffding Tree bagging using change
detection methods
ADWIN bagging using Hoeffding Adaptive Trees
MOA is easy to use and extend
Future Work
Extend MOA to more data mining and learning methods.
26 / 26

More Related Content

Similar to OzaBagAdwin (OBA) slide [7].pdf

Vogel's Approximation Method
Vogel's Approximation MethodVogel's Approximation Method
Vogel's Approximation MethodUsharaniRavikumar
 
171894616 slickline-calculations
171894616 slickline-calculations171894616 slickline-calculations
171894616 slickline-calculationsHardev Malhi
 
A High Speed Wallace Tree Multiplier Using Modified Booth Algorithm for Fast ...
A High Speed Wallace Tree Multiplier Using Modified Booth Algorithm for Fast ...A High Speed Wallace Tree Multiplier Using Modified Booth Algorithm for Fast ...
A High Speed Wallace Tree Multiplier Using Modified Booth Algorithm for Fast ...Kumar Goud
 
Ee w05.1 m_ 2. electricity generation _ part 4 (generation technologies)
Ee  w05.1 m_ 2. electricity generation _ part 4 (generation technologies)Ee  w05.1 m_ 2. electricity generation _ part 4 (generation technologies)
Ee w05.1 m_ 2. electricity generation _ part 4 (generation technologies)Silvester Van Koten
 
The TOU project - Test 1
The TOU project - Test 1The TOU project - Test 1
The TOU project - Test 1chr1stopher
 
MIDTERM-EXAM-ANSWER-EE443.pptx
MIDTERM-EXAM-ANSWER-EE443.pptxMIDTERM-EXAM-ANSWER-EE443.pptx
MIDTERM-EXAM-ANSWER-EE443.pptxgreensun
 
Stahl Isolators (ISpac) – ATEX Zone 1 Zone 2 Hazardous Area Isolators (Selec...
Stahl Isolators (ISpac) – ATEX Zone 1 Zone 2  Hazardous Area Isolators (Selec...Stahl Isolators (ISpac) – ATEX Zone 1 Zone 2  Hazardous Area Isolators (Selec...
Stahl Isolators (ISpac) – ATEX Zone 1 Zone 2 Hazardous Area Isolators (Selec...Thorne & Derrick International
 
Lect2 up330 (100328)
Lect2 up330 (100328)Lect2 up330 (100328)
Lect2 up330 (100328)aicdesign
 
Artificial Neural Network Controller for Reducing the Total Harmonic Distorti...
Artificial Neural Network Controller for Reducing the Total Harmonic Distorti...Artificial Neural Network Controller for Reducing the Total Harmonic Distorti...
Artificial Neural Network Controller for Reducing the Total Harmonic Distorti...IJAEMSJORNAL
 
ATE_MAO_2010_Jun
ATE_MAO_2010_JunATE_MAO_2010_Jun
ATE_MAO_2010_JunMDO_Lab
 
Solucionário Fundamentos Circuitos Elétricos 5ª Ed.pdf
Solucionário Fundamentos Circuitos Elétricos 5ª Ed.pdfSolucionário Fundamentos Circuitos Elétricos 5ª Ed.pdf
Solucionário Fundamentos Circuitos Elétricos 5ª Ed.pdfpedro199229
 
Fundamentals_of_Electric_Circuits_Instructor_Solutions_Manual_by.pdf
Fundamentals_of_Electric_Circuits_Instructor_Solutions_Manual_by.pdfFundamentals_of_Electric_Circuits_Instructor_Solutions_Manual_by.pdf
Fundamentals_of_Electric_Circuits_Instructor_Solutions_Manual_by.pdfissackmohamed3
 
Making TIAM-MACRO-SA an Integrated Assessment Model
Making TIAM-MACRO-SA an Integrated Assessment ModelMaking TIAM-MACRO-SA an Integrated Assessment Model
Making TIAM-MACRO-SA an Integrated Assessment ModelIEA-ETSAP
 
A novel four wire inverter system using SVPWM technique for ups applications
A novel four wire inverter system using SVPWM technique for ups applicationsA novel four wire inverter system using SVPWM technique for ups applications
A novel four wire inverter system using SVPWM technique for ups applicationsIRJET Journal
 

Similar to OzaBagAdwin (OBA) slide [7].pdf (20)

Vogel's Approximation Method
Vogel's Approximation MethodVogel's Approximation Method
Vogel's Approximation Method
 
171894616 slickline-calculations
171894616 slickline-calculations171894616 slickline-calculations
171894616 slickline-calculations
 
5.1 greedy 03
5.1 greedy 035.1 greedy 03
5.1 greedy 03
 
A High Speed Wallace Tree Multiplier Using Modified Booth Algorithm for Fast ...
A High Speed Wallace Tree Multiplier Using Modified Booth Algorithm for Fast ...A High Speed Wallace Tree Multiplier Using Modified Booth Algorithm for Fast ...
A High Speed Wallace Tree Multiplier Using Modified Booth Algorithm for Fast ...
 
Ee w05.1 m_ 2. electricity generation _ part 4 (generation technologies)
Ee  w05.1 m_ 2. electricity generation _ part 4 (generation technologies)Ee  w05.1 m_ 2. electricity generation _ part 4 (generation technologies)
Ee w05.1 m_ 2. electricity generation _ part 4 (generation technologies)
 
10a--EDPA_PerUnit.pptx
10a--EDPA_PerUnit.pptx10a--EDPA_PerUnit.pptx
10a--EDPA_PerUnit.pptx
 
The TOU project - Test 1
The TOU project - Test 1The TOU project - Test 1
The TOU project - Test 1
 
MIDTERM-EXAM-ANSWER-EE443.pptx
MIDTERM-EXAM-ANSWER-EE443.pptxMIDTERM-EXAM-ANSWER-EE443.pptx
MIDTERM-EXAM-ANSWER-EE443.pptx
 
Stahl Isolators (ISpac) – ATEX Zone 1 Zone 2 Hazardous Area Isolators (Selec...
Stahl Isolators (ISpac) – ATEX Zone 1 Zone 2  Hazardous Area Isolators (Selec...Stahl Isolators (ISpac) – ATEX Zone 1 Zone 2  Hazardous Area Isolators (Selec...
Stahl Isolators (ISpac) – ATEX Zone 1 Zone 2 Hazardous Area Isolators (Selec...
 
Lect2 up330 (100328)
Lect2 up330 (100328)Lect2 up330 (100328)
Lect2 up330 (100328)
 
P1111145969
P1111145969P1111145969
P1111145969
 
Artificial Neural Network Controller for Reducing the Total Harmonic Distorti...
Artificial Neural Network Controller for Reducing the Total Harmonic Distorti...Artificial Neural Network Controller for Reducing the Total Harmonic Distorti...
Artificial Neural Network Controller for Reducing the Total Harmonic Distorti...
 
ATE_MAO_2010_Jun
ATE_MAO_2010_JunATE_MAO_2010_Jun
ATE_MAO_2010_Jun
 
Frederik Zahle - Design of an Aeroelastically Tailored 10 MW Wind Turbine Rotor
Frederik Zahle - Design of an Aeroelastically Tailored 10 MW Wind Turbine RotorFrederik Zahle - Design of an Aeroelastically Tailored 10 MW Wind Turbine Rotor
Frederik Zahle - Design of an Aeroelastically Tailored 10 MW Wind Turbine Rotor
 
Max 232
Max 232Max 232
Max 232
 
Solucionário Fundamentos Circuitos Elétricos 5ª Ed.pdf
Solucionário Fundamentos Circuitos Elétricos 5ª Ed.pdfSolucionário Fundamentos Circuitos Elétricos 5ª Ed.pdf
Solucionário Fundamentos Circuitos Elétricos 5ª Ed.pdf
 
Fundamentals_of_Electric_Circuits_Instructor_Solutions_Manual_by.pdf
Fundamentals_of_Electric_Circuits_Instructor_Solutions_Manual_by.pdfFundamentals_of_Electric_Circuits_Instructor_Solutions_Manual_by.pdf
Fundamentals_of_Electric_Circuits_Instructor_Solutions_Manual_by.pdf
 
Making TIAM-MACRO-SA an Integrated Assessment Model
Making TIAM-MACRO-SA an Integrated Assessment ModelMaking TIAM-MACRO-SA an Integrated Assessment Model
Making TIAM-MACRO-SA an Integrated Assessment Model
 
Mep pres
Mep presMep pres
Mep pres
 
A novel four wire inverter system using SVPWM technique for ups applications
A novel four wire inverter system using SVPWM technique for ups applicationsA novel four wire inverter system using SVPWM technique for ups applications
A novel four wire inverter system using SVPWM technique for ups applications
 

More from KoayFT

11-PLDs.pdf
11-PLDs.pdf11-PLDs.pdf
11-PLDs.pdfKoayFT
 
17117908_PA, Miller Effect, Cascaded Amplifiers.ppt
17117908_PA, Miller Effect, Cascaded Amplifiers.ppt17117908_PA, Miller Effect, Cascaded Amplifiers.ppt
17117908_PA, Miller Effect, Cascaded Amplifiers.pptKoayFT
 
17117908.ppt
17117908.ppt17117908.ppt
17117908.pptKoayFT
 
Slide_the-operational-amplifier.pdf
Slide_the-operational-amplifier.pdfSlide_the-operational-amplifier.pdf
Slide_the-operational-amplifier.pdfKoayFT
 
13185501.ppt
13185501.ppt13185501.ppt
13185501.pptKoayFT
 
15028152.ppt
15028152.ppt15028152.ppt
15028152.pptKoayFT
 
combinationallogiccircuit.ppt
combinationallogiccircuit.pptcombinationallogiccircuit.ppt
combinationallogiccircuit.pptKoayFT
 

More from KoayFT (7)

11-PLDs.pdf
11-PLDs.pdf11-PLDs.pdf
11-PLDs.pdf
 
17117908_PA, Miller Effect, Cascaded Amplifiers.ppt
17117908_PA, Miller Effect, Cascaded Amplifiers.ppt17117908_PA, Miller Effect, Cascaded Amplifiers.ppt
17117908_PA, Miller Effect, Cascaded Amplifiers.ppt
 
17117908.ppt
17117908.ppt17117908.ppt
17117908.ppt
 
Slide_the-operational-amplifier.pdf
Slide_the-operational-amplifier.pdfSlide_the-operational-amplifier.pdf
Slide_the-operational-amplifier.pdf
 
13185501.ppt
13185501.ppt13185501.ppt
13185501.ppt
 
15028152.ppt
15028152.ppt15028152.ppt
15028152.ppt
 
combinationallogiccircuit.ppt
combinationallogiccircuit.pptcombinationallogiccircuit.ppt
combinationallogiccircuit.ppt
 

Recently uploaded

Statistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbersStatistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numberssuginr1
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraGovindSinghDasila
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangeThinkInnovation
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...gajnagarg
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...gajnagarg
 
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...HyderabadDolls
 
Top profile Call Girls In Rohtak [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Rohtak [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Rohtak [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Rohtak [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...vershagrag
 
20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdfkhraisr
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...HyderabadDolls
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabiaahmedjiabur940
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制vexqp
 
Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...
Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...
Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...HyderabadDolls
 
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...kumargunjan9515
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...kumargunjan9515
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.pptibrahimabdi22
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowgargpaaro
 

Recently uploaded (20)

Statistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbersStatistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbers
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
 
Top profile Call Girls In Rohtak [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Rohtak [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Rohtak [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Rohtak [ 7014168258 ] Call Me For Genuine Models We...
 
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
 
20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...
Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...
Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...
 
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 

OzaBagAdwin (OBA) slide [7].pdf

  • 1. Improving Adaptive Bagging Methods for Evolving Data Streams A. Bifet, G. Holmes, B. Pfahringer, and R. Gavaldà University of Waikato Hamilton, New Zealand Laboratory for Relational Algorithmics, Complexity and Learning LARCA UPC-Barcelona Tech, Catalonia Nanjing, 4 November 2009 1st Asian Conference on Machine Learning (ACML’09)
  • 2. Motivation MOA Software for Mining Data Streams Build a useful software mining for massive data sets Bagging for data streams Improve accuracy on classification methods for data streams 2 / 26
  • 3. Data stream classification cycle 1 Process an example at a time, and inspect it only once (at most) 2 Use a limited amount of memory 3 Work in a limited amount of time 4 Be ready to predict at any point 3 / 26
  • 4. Realtime analytics: from Databases to Dataflows Data streams Data streams are ordered datasets Not all datasets are data streams All dataset may be processed incrementally as a data stream MOA: Massive Online Analysis Faster Mining Software using less resources Instant mining: more for less 4 / 26
  • 5. What is MOA? {M}assive {O}nline {A}nalysis is a framework for online learning from data streams. It is closely related to WEKA It includes a collection of offline and online as well as tools for evaluation: boosting and bagging Hoeffding Trees with and without Naïve Bayes classifiers at the leaves. 5 / 26
  • 6. WEKA Waikato Environment for Knowledge Analysis Collection of state-of-the-art machine learning algorithms and data processing tools implemented in Java Released under the GPL Support for the whole process of experimental data mining Preparation of input data Statistical evaluation of learning schemes Visualization of input data and the result of learning Used for education, research and applications Complements “Data Mining” by Witten & Frank 6 / 26
  • 8. MOA: the bird The Moa (another native NZ bird) is not only flightless, like the Weka, but also extinct. 8 / 26
  • 9. MOA: the bird The Moa (another native NZ bird) is not only flightless, like the Weka, but also extinct. 8 / 26
  • 10. MOA: the bird The Moa (another native NZ bird) is not only flightless, like the Weka, but also extinct. 8 / 26
  • 11. MOA: the bird The Moa (another native NZ bird) is not only flightless, like the Weka, but also extinct. 8 / 26
  • 12. Concept Drift Framework t f(t) f(t) α α t0 W 0.5 1 Definition Given two data streams a, b, we define c = a⊕W t0 b as the data stream built joining the two data streams a and b Pr[c(t) = b(t)] = 1/(1+e−4(t−t0)/W ). Pr[c(t) = a(t)] = 1−Pr[c(t) = b(t)] 9 / 26
  • 13. Concept Drift Framework t f(t) f(t) α α t0 W 0.5 1 Example (((a⊕W0 t0 b)⊕W1 t1 c)⊕W2 t2 d)... (((SEA9 ⊕W t0 SEA8)⊕W 2t0 SEA7)⊕W 3t0 SEA9.5) CovPokElec = (CoverType⊕5,000 581,012 Poker)⊕5,000 1,000,000 ELEC2 9 / 26
  • 14. New Ensemble Methods For Evolving Data Streams New Ensemble Methods For Evolving Streams (KDD’09) a new experimental data stream framework for studying concept drift two new variants of Bagging: ADWIN Bagging Adaptive-Size Hoeffding Tree (ASHT) Bagging. an evaluation study on synthetic and real-world datasets 10 / 26
  • 15. Outline 1 Adaptive-Size Hoeffding Tree bagging 2 ADWIN Bagging 3 Empirical evaluation 11 / 26
  • 16. Adaptive-Size Hoeffding Tree T1 T2 T3 T4 Ensemble of trees of different size each tree has a maximum size after one node splits, it deletes some nodes to reduce its size if the size of the tree is higher than the maximum value 12 / 26
  • 17. Adaptive-Size Hoeffding Tree T1 T2 T3 T4 Ensemble of trees of different size smaller trees adapt more quickly to changes, larger trees do better during periods with little change diversity 12 / 26
  • 18. Adaptive-Size Hoeffding Tree 0,2 0,21 0,22 0,23 0,24 0,25 0,26 0,27 0,28 0,29 0,3 0 0,1 0,2 0,3 0,4 0,5 0,6 Kappa Error 0,25 0,255 0,26 0,265 0,27 0,275 0,28 0,1 0,12 0,14 0,16 0,18 0,2 0,22 0,24 0,26 0,28 0,3 Kappa Error Figure: Kappa-Error diagrams for ASHT bagging (left) and bagging (right) on dataset RandomRBF with drift, plotting 90 pairs of classifiers. 13 / 26
  • 19. Improvement for ASHT Bagging Method Improvement for ASHT Bagging ensemble method Bagging using trees of different size add a change detector for each tree in the ensemble DDM: Gama et al. EDDM: Baena, del Campo, Fidalgo et al. 14 / 26
  • 20. Outline 1 Adaptive-Size Hoeffding Tree bagging 2 ADWIN Bagging 3 Empirical evaluation 15 / 26
  • 21. ADWIN Bagging ADWIN An adaptive sliding window whose size is recomputed online according to the rate of change observed. ADWIN has rigorous guarantees (theorems) On ratio of false positives and negatives On the relation of the size of the current window and change rates ADWIN Bagging When a change is detected, the worst classifier is removed and a new classifier is added. 16 / 26
  • 22. Optimal Change Detector and Predictor ADWIN High accuracy Fast detection of change Low false positives and false negatives ratios Low computational cost: minimum space and time needed Theoretical guarantees No parameters needed Estimator with Memory and Change Detector 17 / 26
  • 23. Algorithm ADaptive Sliding WINdow ADWIN Example W= 101010110111111 W0= 1 ADWIN: ADAPTIVE WINDOWING ALGORITHM 1 Initialize Window W 2 for each t > 0 3 do W ← W ∪{xt } (i.e., add xt to the head of W) 4 repeat Drop elements from the tail of W 5 until |µ̂W0 − µ̂W1 | < εc holds 6 for every split of W into W = W0 ·W1 7 Output µ̂W 18 / 26
  • 24. Algorithm ADaptive Sliding WINdow ADWIN Example W= 101010110111111 W0= 1 W1 = 01010110111111 ADWIN: ADAPTIVE WINDOWING ALGORITHM 1 Initialize Window W 2 for each t > 0 3 do W ← W ∪{xt } (i.e., add xt to the head of W) 4 repeat Drop elements from the tail of W 5 until |µ̂W0 − µ̂W1 | < εc holds 6 for every split of W into W = W0 ·W1 7 Output µ̂W 18 / 26
  • 25. Algorithm ADaptive Sliding WINdow ADWIN Example W= 101010110111111 W0= 10 W1 = 1010110111111 ADWIN: ADAPTIVE WINDOWING ALGORITHM 1 Initialize Window W 2 for each t > 0 3 do W ← W ∪{xt } (i.e., add xt to the head of W) 4 repeat Drop elements from the tail of W 5 until |µ̂W0 − µ̂W1 | < εc holds 6 for every split of W into W = W0 ·W1 7 Output µ̂W 18 / 26
  • 26. Algorithm ADaptive Sliding WINdow ADWIN Example W= 101010110111111 W0= 101 W1 = 010110111111 ADWIN: ADAPTIVE WINDOWING ALGORITHM 1 Initialize Window W 2 for each t > 0 3 do W ← W ∪{xt } (i.e., add xt to the head of W) 4 repeat Drop elements from the tail of W 5 until |µ̂W0 − µ̂W1 | < εc holds 6 for every split of W into W = W0 ·W1 7 Output µ̂W 18 / 26
  • 27. Algorithm ADaptive Sliding WINdow ADWIN Example W= 101010110111111 W0= 1010 W1 = 10110111111 ADWIN: ADAPTIVE WINDOWING ALGORITHM 1 Initialize Window W 2 for each t > 0 3 do W ← W ∪{xt } (i.e., add xt to the head of W) 4 repeat Drop elements from the tail of W 5 until |µ̂W0 − µ̂W1 | < εc holds 6 for every split of W into W = W0 ·W1 7 Output µ̂W 18 / 26
  • 28. Algorithm ADaptive Sliding WINdow ADWIN Example W= 101010110111111 W0= 10101 W1 = 0110111111 ADWIN: ADAPTIVE WINDOWING ALGORITHM 1 Initialize Window W 2 for each t > 0 3 do W ← W ∪{xt } (i.e., add xt to the head of W) 4 repeat Drop elements from the tail of W 5 until |µ̂W0 − µ̂W1 | < εc holds 6 for every split of W into W = W0 ·W1 7 Output µ̂W 18 / 26
  • 29. Algorithm ADaptive Sliding WINdow ADWIN Example W= 101010110111111 W0= 101010 W1 = 110111111 ADWIN: ADAPTIVE WINDOWING ALGORITHM 1 Initialize Window W 2 for each t > 0 3 do W ← W ∪{xt } (i.e., add xt to the head of W) 4 repeat Drop elements from the tail of W 5 until |µ̂W0 − µ̂W1 | < εc holds 6 for every split of W into W = W0 ·W1 7 Output µ̂W 18 / 26
  • 30. Algorithm ADaptive Sliding WINdow ADWIN Example W= 101010110111111 W0= 1010101 W1 = 10111111 ADWIN: ADAPTIVE WINDOWING ALGORITHM 1 Initialize Window W 2 for each t > 0 3 do W ← W ∪{xt } (i.e., add xt to the head of W) 4 repeat Drop elements from the tail of W 5 until |µ̂W0 − µ̂W1 | < εc holds 6 for every split of W into W = W0 ·W1 7 Output µ̂W 18 / 26
  • 31. Algorithm ADaptive Sliding WINdow ADWIN Example W= 101010110111111 W0= 10101011 W1 = 0111111 ADWIN: ADAPTIVE WINDOWING ALGORITHM 1 Initialize Window W 2 for each t > 0 3 do W ← W ∪{xt } (i.e., add xt to the head of W) 4 repeat Drop elements from the tail of W 5 until |µ̂W0 − µ̂W1 | < εc holds 6 for every split of W into W = W0 ·W1 7 Output µ̂W 18 / 26
  • 32. Algorithm ADaptive Sliding WINdow ADWIN Example W= 101010110111111 |µ̂W0 − µ̂W1 | ≥ εc : CHANGE DET.! W0= 101010110 W1 = 111111 ADWIN: ADAPTIVE WINDOWING ALGORITHM 1 Initialize Window W 2 for each t > 0 3 do W ← W ∪{xt } (i.e., add xt to the head of W) 4 repeat Drop elements from the tail of W 5 until |µ̂W0 − µ̂W1 | < εc holds 6 for every split of W into W = W0 ·W1 7 Output µ̂W 18 / 26
  • 33. Algorithm ADaptive Sliding WINdow ADWIN Example W= 101010110111111 Drop elements from the tail of W W0= 101010110 W1 = 111111 ADWIN: ADAPTIVE WINDOWING ALGORITHM 1 Initialize Window W 2 for each t > 0 3 do W ← W ∪{xt } (i.e., add xt to the head of W) 4 repeat Drop elements from the tail of W 5 until |µ̂W0 − µ̂W1 | < εc holds 6 for every split of W into W = W0 ·W1 7 Output µ̂W 18 / 26
  • 34. Algorithm ADaptive Sliding WINdow ADWIN Example W= 01010110111111 Drop elements from the tail of W W0= 101010110 W1 = 111111 ADWIN: ADAPTIVE WINDOWING ALGORITHM 1 Initialize Window W 2 for each t > 0 3 do W ← W ∪{xt } (i.e., add xt to the head of W) 4 repeat Drop elements from the tail of W 5 until |µ̂W0 − µ̂W1 | < εc holds 6 for every split of W into W = W0 ·W1 7 Output µ̂W 18 / 26
  • 35. Algorithm ADaptive Sliding WINdow ADWIN Theorem At every time step we have: 1 (False positive rate bound). If µt remains constant within W, the probability that ADWIN shrinks the window at this step is at most δ. 2 (False negative rate bound). Suppose that for some partition of W in two parts W0W1 (where W1 contains the most recent items) we have |µW0 − µW1 | > 2εc. Then with probability 1−δ ADWIN shrinks W to W1, or shorter. ADWIN tunes itself to the data stream at hand, with no need for the user to hardwire or precompute parameters. 19 / 26
  • 36. Algorithm ADaptive Sliding WINdow ADWIN ADWIN using a Data Stream Sliding Window Model, can provide the exact counts of 1’s in O(1) time per point. tries O(logW) cutpoints uses O(1 ε logW) memory words the processing time per example is O(logW) (amortized and worst-case). Sliding Window Model 1010101 101 11 1 1 Content: 4 2 2 1 1 Capacity: 7 3 2 1 1 20 / 26
  • 37. ADWIN bagging using Hoeffding Adaptive Trees Decision Trees: Hoeffding Adaptive Tree CVFDT: Hulten, Spencer and Domingos No theoretical guarantees on the error rate of CVFDT Parameters needed : size of window, number of examples,... Hoeffding Adaptive Tree: replace frequency statistics counters by estimators don’t need a window to store examples use a change detector with theoretical guarantees to substitute trees Advantages: 1 Theoretical guarantees 2 No Parameters 21 / 26
  • 38. Outline 1 Adaptive-Size Hoeffding Tree bagging 2 ADWIN Bagging 3 Empirical evaluation 22 / 26
  • 39. Empirical evaluation Dataset Most Accurate Method Hyperplane Drift 0.0001 Bag10 ASHT W+R Hyperplane Drift 0.001 DDM Bag10 ASHT W SEA W = 50 BagADWIN 10 HAT SEA W = 50000 BagADWIN 10 HAT RandomRBF No Drift 50 centers Bag 10 HT RandomRBF Drift .0001 50 centers BagADWIN 10 HAT RandomRBF Drift .001 50 centers DDM Bag10 ASHT W RandomRBF Drift .001 10 centers BagADWIN 10 HAT Cover Type DDM Bag10 ASHT W Poker BagADWIN 10 HAT Electricity DDM Bag10 ASHT W CovPokElec BagADWIN 10 HAT 23 / 26
  • 40. Empirical evaluation SEA W= 50000 Time Acc. Mem. BagADWIN 10 HAT 154.91 88.88 ± 0.05 2.35 DDM Bag10 ASHT W 44.02 88.72 ± 0.05 0.65 NaiveBayes 5.52 84.60 ± 0.03 0.00 NBADWIN 12.40 87.83 ± 0.07 0.02 HT 7.20 85.02 ± 0.11 0.33 HT DDM 7.88 88.17 ± 0.18 0.16 HAT 20.96 88.40 ± 0.07 0.18 BagADWIN 10 HT 53.15 88.58 ± 0.10 0.88 Bag10 HT 30.88 85.38 ± 0.06 3.36 Bag10 ASHT W+R 33.56 88.51 ± 0.06 0.84 24 / 26
  • 41. Empirical evaluation Accuracy 71,4 71,9 72,4 72,9 73,4 73,9 10.000 140.000 270.000 400.000 530.000 660.000 790.000 920.000 Instances Accuracy (%) BagAdwin HAT DDM BagHAST EDDM BagHast DDM HT EDDM HT BagAdwin HT BagHAST Figure: Accuracy on dataset LED with three concept drifts. 25 / 26
  • 42. Summary http://www.cs.waikato.ac.nz/∼abifet/MOA/ Conclusions New improvements for ensemble bagging methods: Adaptive-Size Hoeffding Tree bagging using change detection methods ADWIN bagging using Hoeffding Adaptive Trees MOA is easy to use and extend Future Work Extend MOA to more data mining and learning methods. 26 / 26