Frequent Pattern Growth Algorithm (FP growth method)

FP Algorithm
Ashis Kumar Chanda
Department of Computer Science and Engineering
University of Dhaka

Key concepts
o Introduction
o Idea of FP
o FP-construction
o Analysis of FP

Introduction
Apriori is the classic, best-known algorithm for frequent pattern mining
But it has a bottleneck
Bottleneck: candidate generation and test
So a question arises:
Can we avoid candidate generation?
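
To make the bottleneck concrete, here is a minimal sketch that lists the 2-item candidates Apriori would have to generate and test from a set of frequent single items. The function name `candidate_pairs` is an illustrative assumption, not something from the slides.

```python
from itertools import combinations

def candidate_pairs(frequent_items):
    """Return every 2-item candidate built from the frequent single items."""
    return list(combinations(sorted(frequent_items), 2))

items = ["I1", "I2", "I3", "I4", "I5"]
pairs = candidate_pairs(items)

# n frequent items give n * (n - 1) / 2 candidate pairs,
# and each pair must then be counted against the database.
print(len(pairs))
```

The number of candidates grows quadratically at this level alone, which is the cost FP-growth is designed to avoid.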

Idea of FP
Frequent pattern growth adopts a divide-and-conquer strategy
It scans the database only twice and generates no candidate sets
FP-construction is defined in two parts: building the FP-tree (Step-1) and mining it (Step-2)

FP-construction
Step-1:
1. Scan the database once and count the frequency (support count) of each item
2. Sort the items in descending order of support count
3. Create a tree whose root is null
4. Scan the database a second time and sort the items of each transaction in descending order of support count
Before sorting: T100: I1, I2, I5
After sorting: T100: I2, I1, I5
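
The first scan and the reordering of each transaction can be sketched as follows. This is a minimal illustration, not the presenter's code: the sample rows are taken from the slides' example, while `MIN_SUPPORT`, the function names, and the tie-breaking rule (by item name) are assumptions.

```python
from collections import Counter

# Sample rows from the slides' example (only a few transactions).
transactions = {
    "T100": ["I1", "I2", "I5"],
    "T200": ["I2", "I4"],
    "T300": ["I2", "I3"],
    "T400": ["I1", "I3"],
}
MIN_SUPPORT = 1  # assumed threshold; the slides do not state one

def count_support(db):
    """First scan: count in how many transactions each item occurs."""
    counts = Counter()
    for items in db.values():
        counts.update(set(items))
    return counts

def reorder(items, counts, min_support):
    """Drop infrequent items, then sort by descending support count."""
    kept = [i for i in items if counts[i] >= min_support]
    # Ties are broken by item name so the result is deterministic.
    return sorted(kept, key=lambda i: (-counts[i], i))

support = count_support(transactions)
ordered = {tid: reorder(items, support, MIN_SUPPORT)
           for tid, items in transactions.items()}
print(ordered["T100"])
```

Because I2 has the highest support in these rows, T100 is rewritten with I2 first, matching the reordered transaction shown on the slide.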

FP-construction
TID List of items
T100 I2,I1,I5
T200 I2,I4
T300 I2,I3
T400 I2,I1,I4
---- ---
---- ---

FP-tree
Transactions: T100 I2,I1,I5
null

FP-tree
Transactions: T100 I2,I1,I5
null
  I2:1
Header table (Item | Head of link): I2, I1, I5, I3, I4

FP-tree
Transactions: T100 I2,I1,I5
null
  I2:1
    I1:1
Header table (Item | Head of link): I2, I1, I5, I3, I4

FP-tree
Transactions: T100 I2,I1,I5
null
  I2:1
    I1:1
      I5:1
Header table (Item | Head of link): I2, I1, I5, I3, I4

FP-tree
Transactions: T100 I2,I1,I5; T200 I2,I4 (about to be inserted)
null
  I2:1
    I1:1
      I5:1
Header table (Item | Head of link): I2, I1, I5, I3, I4

FP-tree
Transactions: T100 I2,I1,I5; T200 I2,I4
null
  I2:2
    I1:1
      I5:1
    I4:1
Header table (Item | Head of link): I2, I1, I5, I3, I4

FP-tree
Transactions: T100 I2,I1,I5; T200 I2,I4; T300 I2,I3
null
  I2:3
    I1:1
      I5:1
    I4:1
    I3:1
Header table (Item | Head of link): I2, I1, I5, I3, I4

FP-tree
Transactions: T100 I2,I1,I5; T200 I2,I4; T300 I2,I3; T400 I1,I3
null
  I2:3
    I1:1
      I5:1
    I4:1
    I3:1
  I1:1
    I3:1
Header table (Item | Head of link): I2, I1, I5, I3, I4
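
The insertion procedure walked through on the FP-tree slides can be sketched in a few lines. This is a minimal illustration under assumptions: the class name `FPNode`, the function `build_fp_tree`, and the dictionary-based header table are hypothetical choices, and the input transactions are assumed to be already sorted by descending support.

```python
class FPNode:
    """One FP-tree node: an item, its count, and links to related nodes."""
    def __init__(self, item, parent=None):
        self.item = item
        self.count = 0
        self.parent = parent
        self.children = {}   # item -> child FPNode
        self.link = None     # next node in the tree carrying the same item

def build_fp_tree(ordered_transactions):
    root = FPNode(None)      # the null root
    header = {}              # item -> first node of its node-link chain
    for items in ordered_transactions:
        node = root
        for item in items:
            child = node.children.get(item)
            if child is None:
                # No shared prefix here: create a node and chain it
                # to the end of the header table's node-links.
                child = FPNode(item, parent=node)
                node.children[item] = child
                if item not in header:
                    header[item] = child
                else:
                    last = header[item]
                    while last.link is not None:
                        last = last.link
                    last.link = child
            child.count += 1  # shared prefix: only the count grows
            node = child
    return root, header

ordered = [["I2", "I1", "I5"], ["I2", "I4"], ["I2", "I3"], ["I1", "I3"]]
root, header = build_fp_tree(ordered)
print(root.children["I2"].count)
```

After the four transactions, the I2 node under the root has count 3 and a second branch starts at I1, as in the last frame of the slides.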

Final FP-tree
[Figure: the completed FP-tree]

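FP-construction
• Step-2:
• Mine the FP-tree recursively: for each item (taken as a suffix pattern), build its conditional pattern base and its conditional FP-tree
• Concatenate the suffix pattern with the frequent patterns found in the conditional FP-tree
• The result is the set of generated frequent patterns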
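
The recursion in Step-2 can be sketched as below. Note the simplification: this version works on plain lists of prefix paths instead of a linked FP-tree, so it shows the suffix-growing recursion but not the tree compression. The function name `fp_growth`, the `(items, count)` representation, and the threshold of 2 are assumptions for illustration.

```python
from collections import defaultdict

def fp_growth(pattern_base, min_support, suffix=()):
    """pattern_base: list of (ordered item list, count) pairs.

    Returns {frozenset(pattern): support count}.
    Assumes every item list follows the same global item order.
    """
    counts = defaultdict(int)
    for items, count in pattern_base:
        for item in items:
            counts[item] += count

    patterns = {}
    for item, support in counts.items():
        if support < min_support:
            continue
        # Concatenate the item with the current suffix pattern.
        new_suffix = (item,) + suffix
        patterns[frozenset(new_suffix)] = support
        # Conditional pattern base: the prefix that precedes the item.
        conditional = []
        for items, count in pattern_base:
            if item in items:
                prefix = items[:items.index(item)]
                if prefix:
                    conditional.append((prefix, count))
        patterns.update(fp_growth(conditional, min_support, new_suffix))
    return patterns

# Ordered sample transactions from the slides, each with count 1.
db = [(["I2", "I1", "I5"], 1), (["I2", "I4"], 1),
      (["I2", "I3"], 1), (["I1", "I3"], 1)]
frequent = fp_growth(db, min_support=2)
print(frequent)
```

With only these four sample rows and a threshold of 2, just the single items I2, I1 and I3 are frequent; longer patterns appear once more transactions share prefixes.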


Step-2
Conditional pattern base:
I2 I1
I2 I1 I3
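
A conditional pattern base is the set of prefix paths that lead to a given item. The sketch below collects those prefixes directly from ordered transactions; a real FP-tree would follow the item's node-links and walk up the parent pointers instead, which yields the same paths. The three transactions are hypothetical, chosen only so that the base for I5 matches the two paths listed above.

```python
from collections import Counter

def conditional_pattern_base(ordered_transactions, item):
    """Collect the prefix that precedes `item` in each ordered transaction."""
    base = Counter()
    for items in ordered_transactions:
        if item in items:
            prefix = tuple(items[:items.index(item)])
            if prefix:
                base[prefix] += 1
    return base

# Hypothetical ordered transactions (not the slides' full database).
ordered = [["I2", "I1", "I5"], ["I2", "I1", "I3", "I5"], ["I2", "I4"]]
base = conditional_pattern_base(ordered, "I5")
print(base)
```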


Step-2
Fig. 1 (after the first path): I2:1, I1:1
Fig. 2 (after the second path): I2:2, I1:2, I3:1
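
The two figures correspond to inserting the two prefix paths one after the other into a conditional FP-tree. A minimal sketch, assuming a nested-dictionary tree of the form `{item: [count, children]}` (a hypothetical representation chosen for brevity):

```python
import copy

def insert_path(tree, path, count):
    """Add one prefix path to a nested-dict tree: {item: [count, children]}."""
    node = tree
    for item in path:
        entry = node.setdefault(item, [0, {}])
        entry[0] += count   # shared nodes accumulate the path count
        node = entry[1]

tree = {}
insert_path(tree, ["I2", "I1"], 1)
after_first = copy.deepcopy(tree)    # state shown in Fig. 1
insert_path(tree, ["I2", "I1", "I3"], 1)
after_second = tree                  # state shown in Fig. 2
print(after_first)
print(after_second)
```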

Step-2
Conditional pattern base of I5
Conditional pattern base of I1,I5
Frequent patterns are: I2 I1 I5, I1 I5, I2 I5
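
When a conditional FP-tree is a single path, its frequent patterns are every combination of the frequent nodes on that path, each concatenated with the suffix item. The sketch below applies this to the I5 example; the minimum support of 2 and the function name are assumptions, since the slides do not state a threshold.

```python
from itertools import combinations

def patterns_from_single_path(path, suffix, min_support):
    """path: list of (item, count) from root to leaf of a single-path tree."""
    frequent = [(item, count) for item, count in path if count >= min_support]
    patterns = {}
    for size in range(1, len(frequent) + 1):
        for combo in combinations(frequent, size):
            items = tuple(item for item, _ in combo) + (suffix,)
            # The support of a combination is its smallest node count.
            patterns[items] = min(count for _, count in combo)
    return patterns

path = [("I2", 2), ("I1", 2), ("I3", 1)]   # conditional FP-tree of I5
result = patterns_from_single_path(path, "I5", min_support=2)
print(result)
```

I3 falls below the assumed threshold and is dropped, which leaves exactly the three patterns listed on the slide.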

Final Output

Analysis of FP
• No candidate generation
• Scans the database only twice
• Does not need a huge amount of memory
• Efficient for mining both long and short frequent patterns

Complexity
The complexity of searching through all paths is bounded by
O(header_count^2 * depth of tree)
The creation of each new conditional FP-tree (cFP-tree) also adds to the cost

FP-Tree size
The FP-tree is usually smaller than the original database
– Best case: all transactions contain the same set of items
• 1 path in the FP-tree
– Worst case: every transaction has a unique set of items (no items in common)
The size of the FP-tree depends on how the items are ordered
Ordering by decreasing support is typically used, but it does not always lead to the smallest tree (it is a heuristic)
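
The best case, the worst case, and the effect of item order can be checked by counting tree nodes. This is a minimal sketch with made-up transactions; `count_nodes` is a hypothetical helper that builds a plain prefix tree and ignores counts and node-links.

```python
def count_nodes(ordered_transactions):
    """Count FP-tree nodes (excluding the null root) using a nested-dict trie."""
    root = {}
    nodes = 0
    for items in ordered_transactions:
        node = root
        for item in items:
            if item not in node:
                node[item] = {}
                nodes += 1
            node = node[item]
    return nodes

# Best case: identical transactions collapse into a single path.
best_nodes = count_nodes([["a", "b", "c"]] * 4)

# Worst case: no items in common, so every item gets its own node.
worst_nodes = count_nodes([["a", "b"], ["c", "d"], ["e", "f"], ["g", "h"]])

# Item order matters: the shared item "a" only saves a node when it comes first.
shared_first = count_nodes([["a", "b"], ["a", "c"]])
shared_last = count_nodes([["b", "a"], ["c", "a"]])
print(best_nodes, worst_nodes, shared_first, shared_last)
```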

