SlideShare a Scribd company logo
A One-Pass Triclustering Approach: Is There any Room 
for Big Data? 
Dmitry V. Gnatyshak1 Dmitry I. Ignatov1 Sergei O. Kuznetsov1 Lhouari 
Nourine2 
National Research University Higher School of Economics, Russian Federation 
Blaise Pascal University, LIMOS, CNRS, France 
10.10.2014 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 1 / 26
Outline 
1 Motivation 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 2 / 26
Outline 
1 Motivation 
2 Prime OAC-triclustering 
Formal concept analysis 
Basic algorithm 
Online version of the algorithm 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 2 / 26
Outline 
1 Motivation 
2 Prime OAC-triclustering 
Formal concept analysis 
Basic algorithm 
Online version of the algorithm 
3 Experiments 
Description of the experiments 
Datasets 
Results 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 2 / 26
Outline 
1 Motivation 
2 Prime OAC-triclustering 
Formal concept analysis 
Basic algorithm 
Online version of the algorithm 
3 Experiments 
Description of the experiments 
Datasets 
Results 
4 Conclusion 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 2 / 26
Outline 
1 Motivation 
2 Prime OAC-triclustering 
Formal concept analysis 
Basic algorithm 
Online version of the algorithm 
3 Experiments 
Description of the experiments 
Datasets 
Results 
4 Conclusion 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 3 / 26
Motiation 
Big amount of multimodal data: 
Gene expression data 
Folksonomies 
. . . 
Non-binary data can be scaled (possibly increasing the dimensionality) 
Increasing amount of big data: fast algorithms required (linear or sublinear, 
one-pass) 
Existing methods — finding all p-clusters satisfying some conditions (often 
exponential number) 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 4 / 26
Outline 
1 Motivation 
2 Prime OAC-triclustering 
Formal concept analysis 
Basic algorithm 
Online version of the algorithm 
3 Experiments 
Description of the experiments 
Datasets 
Results 
4 Conclusion 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 5 / 26
Prime OAC-triclustering 
Formal concept analysis: triadic case 
Definition 
Let G, M, B be some sets. Let the ternary relation I be a subset of their cartesian 
product: I ⊆ G × M × B. Then the tuple K = {G,M,B, I } is called a triadic 
formal context. 
G — a set of objects, M — a set of attributes, B — a set of conditions. 
GM m1 m2 m3 m1 m2 m3 m1 m2 m3 
g1 x x x x x x x x 
g2 x x x x x 
g3 x x x x 
g4 x x x x x x 
B b1 b2 b3 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 6 / 26
Prime OAC-triclustering 
Formal concept analysis: triadic case 
Definition 
Galois operators (prime operators) are defined the same way as in dyadic case: 
2G → 2M × 2B 
2M → 2G × 2B 
2B → 2G × 2M 
2G × 2M → 2B 
2G × 2B → 2M 
2M × 2B → 2G 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 6 / 26
Prime OAC-triclustering 
Formal concept analysis: triadic case 
GM m1 m2 m3 m1 m2 m3 m1 m2 m3 
g1 x x x x x x x x 
g2 x x x x x 
g3 x x x x 
g4 x x x x x x 
B b1 b2 b3 
({g1, g2}, {m1,m2})′ = {b1, b3} 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 6 / 26
Prime OAC-triclustering 
Formal concept analysis: triadic case 
GM m1 m2 m3 m1 m2 m3 m1 m2 m3 
g1 x x x x x x x x 
g2 x x x x x 
g3 x x x x 
g4 x x x x x x 
B b1 b2 b3 
m′2 
= {(g1, b1), (g2, b1), (g3, b1), (g1, b2), (g1, b3), (g2, b3), (g4, b3)} 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 6 / 26
Prime OAC-triclustering 
Formal concept analysis: triadic case 
Definition 
The triple (X,Y , Z) is called triadic formal concept of the context 
K = (G,M,B, I ), if X ⊆ G,Y ⊆ M, Z ⊆ B, (X,Y )′ = Z, (X, Z)′ = Y , 
(Y , Z)′ = X. 
X is called (formal) extent, Y — (formal) intent, Z — (formal) modus. 
GM m1 m2 m3 m1 m2 m3 m1 m2 m3 
g1 x x x x x x x x 
g2 x x x x x 
g3 x x x x 
g4 x x x x x x 
B b1 b2 b3 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 6 / 26
Prime OAC-triclustering 
Basic algorithm 
This method uses the following types of prime operators (for the context 
K = (G,M,B, I )): 
(g,m)′ = {b ∈ B | (g,m, b) ∈ I }, 
(g, b)′ = {m ∈ M | (g,m, b) ∈ I }, 
(m, b)′ = {g ∈ G | (g,m, b) ∈ I } 
Definition 
Then the triple T = ((m, b)′, (g, b)′, (g,m)′) is called prime OAC-tricluster based 
on triple (g,m, b) ∈ I . The sets of tricluster are called, respectively, extent, 
intent, and modus. Triple (g,m, b) is called a generating triple of the tricluster T. 
Definition 
Density of a tricluster: ρ(X,Y , Z) = |I∩(X×Y×Z)| 
|X||Y||Z| 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 7 / 26
Prime OAC-triclustering 
Basic algorithm 
An example of a tricluster based on triple (eg,em 
,eb 
): 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 8 / 26
Prime OAC-triclustering 
Basic algorithm 
Require: K = (G,M, B, I ) — triadic context; 
ρmin — density threshold 
Ensure: T = {T = (X, Y, Z)} 
1: T := ∅ 
2: for all (g,m) : g ∈ G,m ∈ M do 
3: PrimesObjAttr [g,m] = (g,m)′ 
4: end for 
5: for all (g, b) : g ∈ G,b ∈ B do 
6: PrimesObjCond[g, b] = (g, b)′ 
7: end for 
8: for all (m, b) : m ∈ M,b ∈ B do 
9: PrimesAttrCond[m, b] = (m, b)′ 
10: end for 
11: for all (g,m, b) ∈ I do 
12: T = (PrimesAttrCond[m, b], PrimesObjCond[g, b], PrimesObjAttr [g,m]) 
13: Tkey = hash(T) 
14: if Tkey̸∈ T .keys ∧ ρ(T) ≥ ρmin then 
15: T [Tkey] := T 
16: end if 
17: end for 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 9 / 26
Prime OAC-triclustering 
Online version of the algorithm 
Let K = (G,M,B, I ) be a triadic context. We do not know G, M, B, I , or their 
cardinalities. 
Input on each iteration: {(g,m, b)} = J ⊆ I . 
Goal — maintain an updated version of the results and efficiently update them 
when new triples are received. 
We need to keep in memory the results of prime operators’ application (prime 
sets): 
PrimesObjAttr — dictionary with elements of type ((g,m), {b ∈ B}), g ∈ G, 
m ∈ M; 
PrimesObjCond — dictionary with elements of type ((g, b), {m ∈ M}), 
g ∈ G, b ∈ B; 
PrimesAttrCond — dictionary with elements of type ((m, b), {g ∈ G}), 
m ∈ M, b ∈ B. 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 10 / 26
Prime OAC-triclustering 
Online version of the algorithm 
Remark 
In this case we need to consider triclusters based on different triples different, even 
if their extents, intents, and modi are equal. 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 11 / 26
Prime OAC-triclustering 
Online version of the algorithm 
Algorithm of triples addition (standard): 
Require: J — a set of triples to add; 
T = {T = (∗X, ∗Y , ∗Z)} — current tricluster set; 
PrimesObjAttr , PrimesObjCond, PrimesAttrCond; 
Ensure: T = {T = (∗X, ∗Y , ∗Z)}; 
PrimesObjAttr , PrimesObjCond, PrimesAttrCond; 
1: for all (g,m, b) ∈ J do 
2: PrimesObjAttr [g,m] := PrimesObjAttr [g,m] ∪ b 
3: PrimesObjCond[g, b] := PrimesObjCond[g, b] ∪ m 
4: PrimesAttrCond[m, b] := PrimesAttrCond[m, b] ∪ g 
5: T := 
T ∪ (&PrimesAttrCond[m, b],&PrimesObjCond[g, b],&PrimesObjAttr [g,m]) 
6: end for 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 12 / 26
Prime OAC-triclustering 
Online version of the algorithm 
Algorithm of triples removal (optional): 
Triclusters must be kept in the dictionary with generating triples being keys. 
Require: J — a set of triples to remove; 
T = {(key(T),T = (∗X, ∗Y , ∗Z))} — current tricluster dictionary; 
PrimesObjAttr , PrimesObjCond, PrimesAttrCond; 
Ensure: T = {T = (∗X, ∗Y , ∗Z)}; 
PrimesObjAttr , PrimesObjCond, PrimesAttrCond; 
1: for all (g,m, b) ∈ J do 
2: T := T  T [(g,m, b)] 
3: PrimesObjAttr [g,m] := PrimesObjAttr [g,m]  b 
4: PrimesObjCond[g, b] := PrimesObjCond[g, b]  m 
5: PrimesAttrCond[m, b] := PrimesAttrCond[m, b]  g 
6: end for 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 13 / 26
Prime OAC-triclustering 
Online version of the algorithm 
If a user have asked for an output, we may need to remove the triclusters with the 
same extent, intent and modi at the post-processing stage. At this stage we can 
also check various conditions (for instance, minimal density condition). 
Require: T = {T = (∗X, ∗Y , ∗Z)} — current tricluster set; 
Ensure: T = {T = (∗X, ∗Y , ∗Z)} — processed tricluster hash-set; 
1: for all T ∈ T do 
2: Compute hash(T) 
3: if hash(T)̸∈ T then 
4: T := T ∪ T 
5: end if 
6: end for 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 14 / 26
Prime OAC-triclustering 
Online version of the algorithm 
Remark 1 
To allow an efficient access to the prime sets dictionaries PrimesObjAttr , 
PrimesObjCond, and PrimesAttrCond must be implemented as hash tables. 
Remark 2 
For an efficient computation of triclusters’ hash values we can keep hash values of 
prime sets along with prime sets. Then the calculation of the triclusters’ hash 
values will require to find a value of some function of the prime sets’ hash values 
(multiplied by non-repeating coefficients, for instance). 
It is important not to use LHS hash-function (Locality Sensitive Hashing). 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 15 / 26
Prime OAC-triclustering 
Online version of the algorithm 
Complexities: 
Time complexity: O(|I |) (as there is a constant number of operations on 
each step); 
More precisely: 8|I | operations in total; 
1 Modification of 3 prime sets (3); 
2 Creation of a new tricluster (1); 
3 Addition of pointers to its extent, intent, and modus (3); 
4 Addition of the tricluster to the set of all triclusters (1). 
Memory complexity: O(|I |) (as we need to keep in memory only prime sets, 
|I | elements in each dictionary + keys). 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 16 / 26
Prime OAC-triclustering 
Online version of the algorithm 
Example: 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
Prime OAC-triclustering 
Online version of the algorithm 
→ (g1,m1, b1) 
1 PrimesObjAttr = {((g1,m1), {b1})} 
2 PrimesObjCond = {((g1, b1), {m1})} 
3 PrimesAttrCond = {((m1, b1), {g1})} 
4 T := T ∪ {PrimesAttrCond[m1, b1], PrimesObjCond[g1, b1], PrimesObjAttr [g1,m1]} 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
Prime OAC-triclustering 
Online version of the algorithm 
→ (g1,m2, b1) 
1 PrimesObjAttr = {((g1,m1), {b1}), ((g1,m2), {b1})} 
2 PrimesObjCond = {((g1, b1), {m1,m2})} 
3 PrimesAttrCond = {((m1, b1), {g1}), ((m2, b1), {g1})} 
4 T := T ∪ {PrimesAttrCond[m2, b1], PrimesObjCond[g1, b1], PrimesObjAttr [g1,m2]} 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
Prime OAC-triclustering 
Online version of the algorithm 
→ (g2,m1, b1) 
1 PrimesObjAttr = {((g1,m1), {b1}), ((g1,m2), {b1}), ((g2,m1), {b1})} 
2 PrimesObjCond = {((g1, b1), {m1,m2}), ((g2, b1), {m1})} 
3 PrimesAttrCond = {((m1, b1), {g1, g2}), ((m2, b1), {g1})} 
4 T := T ∪ {PrimesAttrCond[m1, b1], PrimesObjCond[g2, b1], PrimesObjAttr [g2,m1]} 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
Prime OAC-triclustering 
Online version of the algorithm 
→ (g2,m2, b1) 
1 PrimesObjAttr = {((g1,m1), {b1}), ((g1,m2), {b1}), ((g2,m1), {b1}), ((g2,m2), {b1})} 
2 PrimesObjCond = {((g1, b1), {m1,m2}), ((g2, b1), {m1,m2})} 
3 PrimesAttrCond = {((m1, b1), {g1, g2}), ((m2, b1), {g1, g2})} 
4 T := T ∪ {PrimesAttrCond[m2, b1], PrimesObjCond[g2, b1], PrimesObjAttr [g2,m2]} 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
Prime OAC-triclustering 
Online version of the algorithm 
→ (g3,m3, b1) 
1 PrimesObjAttr = 
{((g1,m1), {b1}), ((g1,m2), {b1}), ((g2,m1), {b1}), ((g2,m2), {b1}), ((g3,m3), {b1})} 
2 PrimesObjCond = {((g1, b1), {m1,m2}), ((g2, b1), {m1,m2}), ((g3, b1), {m3})} 
3 PrimesAttrCond = {((m1, b1), {g1, g2}), ((m2, b1), {g1, g2}), ((m3, b1), {g3})} 
4 T := T ∪ {PrimesAttrCond[m3, b1], PrimesObjCond[g3, b1], PrimesObjAttr [g3,m3]} 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
Prime OAC-triclustering 
Online version of the algorithm 
→ (g1,m2, b2) 
1 PrimesObjAttr = {((g1,m1), {b1}), ((g1,m2), {b1, b2}), ((g2,m1), 
{b1}), ((g2,m2), {b1}), ((g3,m3), {b1})} 
2 PrimesObjCond = {((g1, b1), {m1,m2}), ((g2, b1), {m1,m2}), ((g3, b1), 
{m3}), ((g1, b2), {m2})} 
3 PrimesAttrCond = {((m1, b1), {g1, g2}), ((m2, b1), {g1, g2}), ((m3, b1), 
{g3}), ((m2, b2), {g1})} 
4 T := T ∪ {PrimesAttrCond[m2, b2], PrimesObjCond[g1, b2], PrimesObjAttr [g1,m2]} 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
Prime OAC-triclustering 
Online version of the algorithm 
→ (g2,m1, b2) 
1 PrimesObjAttr = {((g1,m1), {b1}), ((g1,m2), {b1, b2}), ((g2,m1), {b1, b2}), 
((g2,m2), {b1}), ((g3,m3), {b1})} 
2 PrimesObjCond = {((g1, b1), {m1,m2}), ((g2, b1), {m1,m2}), ((g3, b1), {m3}), 
((g1, b2), {m2}), ((g2, b2), {m1})} 
3 PrimesAttrCond = {((m1, b1), {g1, g2}), ((m2, b1), {g1, g2}), ((m3, b1), {g3}), 
((m2, b2), {g1}), ((m1, b2), {g2})} 
4 T := T ∪ {PrimesAttrCond[m1, b2], PrimesObjCond[g2, b2], PrimesObjAttr [g2,m1]} 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
Prime OAC-triclustering 
Online version of the algorithm 
→ (g2,m2, b2) 
1 PrimesObjAttr = {((g1,m1), {b1}), ((g1,m2), {b1, b2}), ((g2,m1), {b1, b2}), 
((g2,m2), {b1, b2}), ((g3,m3), {b1})} 
2 PrimesObjCond = {((g1, b1), {m1,m2}), ((g2, b1), {m1,m2}), ((g3, b1), {m3}), 
((g1, b2), {m2}), ((g2, b2), {m1,m2})} 
3 PrimesAttrCond = {((m1, b1), {g1, g2}), ((m2, b1), {g1, g2}), ((m3, b1), {g3}), 
((m2, b2), {g1, g2}), ((m1, b2), {g2})} 
4 T := T ∪ {PrimesAttrCond[m2, b2], PrimesObjCond[g2, b2], PrimesObjAttr [g2,m2]} 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
Prime OAC-triclustering 
Online version of the algorithm 
→ (g3,m3, b2) 
1 PrimesObjAttr = {((g1,m1), {b1}), ((g1,m2), {b1, b2}), ((g2,m1), {b1, b2}), ((g2,m2), 
{b1, b2}), ((g3,m3), {b1, b2})} 
2 PrimesObjCond = {((g1, b1), {m1,m2}), ((g2, b1), {m1,m2}), ((g3, b1), {m3}), ((g1, b2), 
{m2}), ((g2, b2), {m1,m2}), ((g3, b2), {m3})} 
3 PrimesAttrCond = {((m1, b1), {g1, g2}), ((m2, b1), {g1, g2}), ((m3, b1), {g3}), 
((m2, b2), {g1, g2}), ((m1, b2), {g2}), ((m3, b2), {g3})} 
4 T := T ∪ {PrimesAttrCond[m3, b2], PrimesObjCond[g3, b2], PrimesObjAttr [g3,m3]} 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
Prime OAC-triclustering 
Online version of the algorithm 
Postprocessing: 
1 T(g1,m1,b1) = (g1, g2,m1,m2, b1) ← add 
2 T(g1,m2,b1) = (g1, g2,m1,m2, b1, b2) ← add 
3 T(g2,m1,b1) = (g1, g2,m1,m2, b1, b2) ← the same as T(g1,m2,b1), skip 
4 T(g2,m2,b1) = (g1, g2,m1,m2, b1, b2) ← the same as T(g1,m2,b1), skip 
5 T(g3,m3,b1) = (g3,m3, b1, b2) ← add 
6 T(g1,m2,b2) = (g1, g2,m2, b1, b2) ← add 
7 T(g2,m1,b2) = (g2,m1,m2, b1, b2) ← add 
8 T(g2,m2,b2) = (g1, g2,m1,m2, b1, b2) ← the same as T(g1,m2,b1), skip 
9 T(g3,m3,b2) = (g3,m3, b1, b2) ← the same as T(g3,m3,b1), skip 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
Prime OAC-triclustering 
Online version of the algorithm 
The final output set of triclusters: 
1 T1 = ({g1, g2}, {m1,m2}, {b1}) 
2 T2 = ({g1, g2}, {m1,m2}, {b1, b2}) 
3 T3 = ({g3}, {m3}, {b1, b2}) 
4 T4 = ({g1, g2}, {m2}, {b1, b2}) 
5 T5 = ({g2}, {m1,m2}, {b1, b2}) 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
Outline 
1 Motivation 
2 Prime OAC-triclustering 
Formal concept analysis 
Basic algorithm 
Online version of the algorithm 
3 Experiments 
Description of the experiments 
Datasets 
Results 
4 Conclusion 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 18 / 26
Experiments 
Description of the experiments 
Goals: 
Show that the online algorithm of p-dimensional attribute clustering 
outperforms the basic algorithm 
Confirm the complexity estimations 
For each dataset for each version of the algorithm 11 experiments were conducted: 
for each there were different density threshold (from 0 to 1 with 0.1 intervals). To 
evaluate the time more precisely, for each context there were 5 runs of the 
algorithms with the average result recorded. 
Additional tests to check the performance on big datasets and confirm linearity of 
the online algorithm. 
All experiments were conducted on the computer with Intel Core i7-351U 2.40 
GHz processor, 8 GB RAM, Windows 8 operating system. 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 19 / 26
Experiments 
Datasets 
5 pseudo-random uniform contexts 50 × 50 × 50. Probability of each 
quadruple’s presence varied from 0.02 to 0.1 with 0.02 interval 
10 pseudo-random uniform contexts with average density equal to 0.001. 
Cardinalities of sets varied from 100 to 1000 with 100 interval 
Top-250 list of IMDB (Internet Movie Database) (triples: (movie, tag, 
genre)) 
Sample of 3000 triples of the first 100 000 triples of Bibsonomy.org dataset 
(triples: (user, bookmark, tag)) 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 20 / 26
Experiments 
Datasets 
Context |G| |M| |B| # triples Density 
Synthetic1, 0.02 50 50 50 2530 0.02024 
Synthetic1, 0.04 50 50 50 5001 0.04001 
Synthetic1, 0.06 50 50 50 7454 0.05963 
Synthetic1, 0.08 50 50 50 10046 0.08037 
Synthetic1, 0.1 50 50 50 12462 0.09970 
Synthetic2, 100 100 100 100 996 0.001 
Synthetic2, 200 200 200 200 7995 0.001 
Synthetic2, 300 300 300 300 27161 0.001 
Synthetic2, 400 400 400 400 63921 0.001 
Synthetic2, 500 500 500 500 125104 0.001 
Synthetic2, 600 600 600 600 216021 0.001 
Synthetic2, 700 700 700 700 343157 0.001 
Synthetic2, 800 800 800 800 512097 0.001 
Synthetic2, 900 900 900 900 729395 0.001 
Synthetic2, 1000 1000 1000 1000 1000589 0.001 
IMDB 250 795 22 3818 0.00087 
BibSonomy 51 924 2844 3000 0.000022 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 20 / 26
Experiments 
Results 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 21 / 26
Experiments 
Results 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 21 / 26
Experiments 
Results 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 22 / 26
Experiments 
Results 
Densities for the contexts: 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 23 / 26
Experiments 
Results 
Densities for the contexts: 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 23 / 26
Outline 
1 Motivation 
2 Prime OAC-triclustering 
Formal concept analysis 
Basic algorithm 
Online version of the algorithm 
3 Experiments 
Description of the experiments 
Datasets 
Results 
4 Conclusion 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 24 / 26
Conclusion 
Prime OAC-triclustering algorithm was described 
One-pass linear online version of its basic algorithm was proposed 
Efficiency of the online algorithm and complexities of both algorithms were 
confirmed. 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 25 / 26
Thank you! 
Questions? 
Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 26 / 26

More Related Content

What's hot

SIAM - Minisymposium on Guaranteed numerical algorithms
SIAM - Minisymposium on Guaranteed numerical algorithmsSIAM - Minisymposium on Guaranteed numerical algorithms
SIAM - Minisymposium on Guaranteed numerical algorithms
Jagadeeswaran Rathinavel
 
ABC with Wasserstein distances
ABC with Wasserstein distancesABC with Wasserstein distances
ABC with Wasserstein distances
Christian Robert
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
The Statistical and Applied Mathematical Sciences Institute
 
Kolev skalna2018 article-exact_solutiontoa_parametricline
Kolev skalna2018 article-exact_solutiontoa_parametriclineKolev skalna2018 article-exact_solutiontoa_parametricline
Kolev skalna2018 article-exact_solutiontoa_parametricline
Alina Barbulescu
 
Subquad multi ff
Subquad multi ffSubquad multi ff
Subquad multi ff
Fabian Velazquez
 
Adaptive Three Operator Splitting
Adaptive Three Operator SplittingAdaptive Three Operator Splitting
Adaptive Three Operator Splitting
Fabian Pedregosa
 
Unbiased Bayes for Big Data
Unbiased Bayes for Big DataUnbiased Bayes for Big Data
Unbiased Bayes for Big Data
Christian Robert
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
The Statistical and Applied Mathematical Sciences Institute
 
Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)
Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)
Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)
MeetupDataScienceRoma
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
The Statistical and Applied Mathematical Sciences Institute
 
prior selection for mixture estimation
prior selection for mixture estimationprior selection for mixture estimation
prior selection for mixture estimation
Christian Robert
 
MCMC and likelihood-free methods
MCMC and likelihood-free methodsMCMC and likelihood-free methods
MCMC and likelihood-free methods
Christian Robert
 
20 sameer sharma 186-206
20 sameer sharma 186-20620 sameer sharma 186-206
20 sameer sharma 186-206
Alexander Decker
 
Fast Identification of Heavy Hitters by Cached and Packed Group Testing
Fast Identification of Heavy Hitters by Cached and Packed Group TestingFast Identification of Heavy Hitters by Cached and Packed Group Testing
Fast Identification of Heavy Hitters by Cached and Packed Group Testing
Rakuten Group, Inc.
 
Faster Practical Block Compression for Rank/Select Dictionaries
Faster Practical Block Compression for Rank/Select DictionariesFaster Practical Block Compression for Rank/Select Dictionaries
Faster Practical Block Compression for Rank/Select Dictionaries
Rakuten Group, Inc.
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
The Statistical and Applied Mathematical Sciences Institute
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
The Statistical and Applied Mathematical Sciences Institute
 
Solving the energy problem of helium final report
Solving the energy problem of helium final reportSolving the energy problem of helium final report
Solving the energy problem of helium final report
JamesMa54
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
The Statistical and Applied Mathematical Sciences Institute
 
Unbiased Markov chain Monte Carlo
Unbiased Markov chain Monte CarloUnbiased Markov chain Monte Carlo
Unbiased Markov chain Monte Carlo
JeremyHeng10
 

What's hot (20)

SIAM - Minisymposium on Guaranteed numerical algorithms
SIAM - Minisymposium on Guaranteed numerical algorithmsSIAM - Minisymposium on Guaranteed numerical algorithms
SIAM - Minisymposium on Guaranteed numerical algorithms
 
ABC with Wasserstein distances
ABC with Wasserstein distancesABC with Wasserstein distances
ABC with Wasserstein distances
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
 
Kolev skalna2018 article-exact_solutiontoa_parametricline
Kolev skalna2018 article-exact_solutiontoa_parametriclineKolev skalna2018 article-exact_solutiontoa_parametricline
Kolev skalna2018 article-exact_solutiontoa_parametricline
 
Subquad multi ff
Subquad multi ffSubquad multi ff
Subquad multi ff
 
Adaptive Three Operator Splitting
Adaptive Three Operator SplittingAdaptive Three Operator Splitting
Adaptive Three Operator Splitting
 
Unbiased Bayes for Big Data
Unbiased Bayes for Big DataUnbiased Bayes for Big Data
Unbiased Bayes for Big Data
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
 
Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)
Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)
Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
 
prior selection for mixture estimation
prior selection for mixture estimationprior selection for mixture estimation
prior selection for mixture estimation
 
MCMC and likelihood-free methods
MCMC and likelihood-free methodsMCMC and likelihood-free methods
MCMC and likelihood-free methods
 
20 sameer sharma 186-206
20 sameer sharma 186-20620 sameer sharma 186-206
20 sameer sharma 186-206
 
Fast Identification of Heavy Hitters by Cached and Packed Group Testing
Fast Identification of Heavy Hitters by Cached and Packed Group TestingFast Identification of Heavy Hitters by Cached and Packed Group Testing
Fast Identification of Heavy Hitters by Cached and Packed Group Testing
 
Faster Practical Block Compression for Rank/Select Dictionaries
Faster Practical Block Compression for Rank/Select DictionariesFaster Practical Block Compression for Rank/Select Dictionaries
Faster Practical Block Compression for Rank/Select Dictionaries
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
 
Solving the energy problem of helium final report
Solving the energy problem of helium final reportSolving the energy problem of helium final report
Solving the energy problem of helium final report
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
 
Unbiased Markov chain Monte Carlo
Unbiased Markov chain Monte CarloUnbiased Markov chain Monte Carlo
Unbiased Markov chain Monte Carlo
 

Viewers also liked

A lattice-based consensus clustering
A lattice-based consensus clusteringA lattice-based consensus clustering
A lattice-based consensus clustering
Dmitrii Ignatov
 
A lattice-based consensus clustering
A lattice-based consensus clusteringA lattice-based consensus clustering
A lattice-based consensus clustering
Dmitrii Ignatov
 
Введение в рекомендательные системы. 3 case-study без NetFlix.
Введение в рекомендательные системы. 3 case-study без NetFlix.Введение в рекомендательные системы. 3 case-study без NetFlix.
Введение в рекомендательные системы. 3 case-study без NetFlix.
Dmitrii Ignatov
 
AIST 2016 Opening Slides
AIST 2016 Opening SlidesAIST 2016 Opening Slides
AIST 2016 Opening Slides
Dmitrii Ignatov
 
Putting OAC-triclustering on MapReduce
Putting OAC-triclustering on MapReducePutting OAC-triclustering on MapReduce
Putting OAC-triclustering on MapReduce
Dmitrii Ignatov
 
Experimental Economics and Machine Learning workshop
Experimental Economics and Machine Learning workshopExperimental Economics and Machine Learning workshop
Experimental Economics and Machine Learning workshop
Dmitrii Ignatov
 
On the Family of Concept Forming Operators in Polyadic FCA
On the Family of Concept Forming Operators in Polyadic FCAOn the Family of Concept Forming Operators in Polyadic FCA
On the Family of Concept Forming Operators in Polyadic FCA
Dmitrii Ignatov
 
Boolean matrix factorisation for collaborative filtering
Boolean matrix factorisation for collaborative filteringBoolean matrix factorisation for collaborative filtering
Boolean matrix factorisation for collaborative filtering
Dmitrii Ignatov
 
NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016
NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016
NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016
Dmitrii Ignatov
 
Pattern-based classification of demographic sequences
Pattern-based classification of demographic sequencesPattern-based classification of demographic sequences
Pattern-based classification of demographic sequences
Dmitrii Ignatov
 
Intro to Data Mining and Machine Learning
Intro to Data Mining and Machine LearningIntro to Data Mining and Machine Learning
Intro to Data Mining and Machine Learning
Dmitrii Ignatov
 
Sequence mining
Sequence miningSequence mining
Sequence mining
Dmitrii Ignatov
 
Поиск частых множеств признаков (товаров) и ассоциативные правила
Поиск частых множеств признаков (товаров) и ассоциативные правилаПоиск частых множеств признаков (товаров) и ассоциативные правила
Поиск частых множеств признаков (товаров) и ассоциативные правила
Dmitrii Ignatov
 
RAPS: A Recommender Algorithm Based on Pattern Structures
RAPS: A Recommender Algorithm Based on Pattern StructuresRAPS: A Recommender Algorithm Based on Pattern Structures
RAPS: A Recommender Algorithm Based on Pattern Structures
Dmitrii Ignatov
 
Pattern Mining and Machine Learning for Demographic Sequences
Pattern Mining and Machine Learning for Demographic SequencesPattern Mining and Machine Learning for Demographic Sequences
Pattern Mining and Machine Learning for Demographic Sequences
Dmitrii Ignatov
 
Online Recommender System for Radio Station Hosting: Experimental Results Rev...
Online Recommender System for Radio Station Hosting: Experimental Results Rev...Online Recommender System for Radio Station Hosting: Experimental Results Rev...
Online Recommender System for Radio Station Hosting: Experimental Results Rev...
Dmitrii Ignatov
 
Searching for optimal patterns in Boolean tensors
Searching for optimal patterns in Boolean tensorsSearching for optimal patterns in Boolean tensors
Searching for optimal patterns in Boolean tensors
Dmitrii Ignatov
 

Viewers also liked (17)

A lattice-based consensus clustering
A lattice-based consensus clusteringA lattice-based consensus clustering
A lattice-based consensus clustering
 
A lattice-based consensus clustering
A lattice-based consensus clusteringA lattice-based consensus clustering
A lattice-based consensus clustering
 
Введение в рекомендательные системы. 3 case-study без NetFlix.
Введение в рекомендательные системы. 3 case-study без NetFlix.Введение в рекомендательные системы. 3 case-study без NetFlix.
Введение в рекомендательные системы. 3 case-study без NetFlix.
 
AIST 2016 Opening Slides
AIST 2016 Opening SlidesAIST 2016 Opening Slides
AIST 2016 Opening Slides
 
Putting OAC-triclustering on MapReduce
Putting OAC-triclustering on MapReducePutting OAC-triclustering on MapReduce
Putting OAC-triclustering on MapReduce
 
Experimental Economics and Machine Learning workshop
Experimental Economics and Machine Learning workshopExperimental Economics and Machine Learning workshop
Experimental Economics and Machine Learning workshop
 
On the Family of Concept Forming Operators in Polyadic FCA
On the Family of Concept Forming Operators in Polyadic FCAOn the Family of Concept Forming Operators in Polyadic FCA
On the Family of Concept Forming Operators in Polyadic FCA
 
Boolean matrix factorisation for collaborative filtering
Boolean matrix factorisation for collaborative filteringBoolean matrix factorisation for collaborative filtering
Boolean matrix factorisation for collaborative filtering
 
NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016
NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016
NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016
 
Pattern-based classification of demographic sequences
Pattern-based classification of demographic sequencesPattern-based classification of demographic sequences
Pattern-based classification of demographic sequences
 
Intro to Data Mining and Machine Learning
Intro to Data Mining and Machine LearningIntro to Data Mining and Machine Learning
Intro to Data Mining and Machine Learning
 
Sequence mining
Sequence miningSequence mining
Sequence mining
 
Поиск частых множеств признаков (товаров) и ассоциативные правила
Поиск частых множеств признаков (товаров) и ассоциативные правилаПоиск частых множеств признаков (товаров) и ассоциативные правила
Поиск частых множеств признаков (товаров) и ассоциативные правила
 
RAPS: A Recommender Algorithm Based on Pattern Structures
RAPS: A Recommender Algorithm Based on Pattern StructuresRAPS: A Recommender Algorithm Based on Pattern Structures
RAPS: A Recommender Algorithm Based on Pattern Structures
 
Pattern Mining and Machine Learning for Demographic Sequences
Pattern Mining and Machine Learning for Demographic SequencesPattern Mining and Machine Learning for Demographic Sequences
Pattern Mining and Machine Learning for Demographic Sequences
 
Online Recommender System for Radio Station Hosting: Experimental Results Rev...
Online Recommender System for Radio Station Hosting: Experimental Results Rev...Online Recommender System for Radio Station Hosting: Experimental Results Rev...
Online Recommender System for Radio Station Hosting: Experimental Results Rev...
 
Searching for optimal patterns in Boolean tensors
Searching for optimal patterns in Boolean tensorsSearching for optimal patterns in Boolean tensors
Searching for optimal patterns in Boolean tensors
 

Similar to A One-Pass Triclustering Approach: Is There any Room for Big Data?

Turning Krimp into a Triclustering Technique on Sets of Attribute-Condition P...
Turning Krimp into a Triclustering Technique on Sets of Attribute-Condition P...Turning Krimp into a Triclustering Technique on Sets of Attribute-Condition P...
Turning Krimp into a Triclustering Technique on Sets of Attribute-Condition P...
Dmitrii Ignatov
 
Extracting biclusters of similar values with Triadic Concept Analysis
Extracting biclusters of similar values with Triadic Concept AnalysisExtracting biclusters of similar values with Triadic Concept Analysis
Extracting biclusters of similar values with Triadic Concept Analysis
INSA Lyon - L'Institut National des Sciences Appliquées de Lyon
 
Presentation.pdf
Presentation.pdfPresentation.pdf
Presentation.pdf
Chiheb Ben Hammouda
 
Traffic flow modeling on road networks using Hamilton-Jacobi equations
Traffic flow modeling on road networks using Hamilton-Jacobi equationsTraffic flow modeling on road networks using Hamilton-Jacobi equations
Traffic flow modeling on road networks using Hamilton-Jacobi equations
Guillaume Costeseque
 
Triggering patterns of topology changes in dynamic attributed graphs
Triggering patterns of topology changes in dynamic attributed graphsTriggering patterns of topology changes in dynamic attributed graphs
Triggering patterns of topology changes in dynamic attributed graphs
INSA Lyon - L'Institut National des Sciences Appliquées de Lyon
 
Adaptive Restore algorithm & importance Monte Carlo
Adaptive Restore algorithm & importance Monte CarloAdaptive Restore algorithm & importance Monte Carlo
Adaptive Restore algorithm & importance Monte Carlo
Christian Robert
 
Simple representations for learning: factorizations and similarities
Simple representations for learning: factorizations and similarities Simple representations for learning: factorizations and similarities
Simple representations for learning: factorizations and similarities
Gael Varoquaux
 
MUMS Opening Workshop - An Overview of Reduced-Order Models and Emulators (ED...
MUMS Opening Workshop - An Overview of Reduced-Order Models and Emulators (ED...MUMS Opening Workshop - An Overview of Reduced-Order Models and Emulators (ED...
MUMS Opening Workshop - An Overview of Reduced-Order Models and Emulators (ED...
The Statistical and Applied Mathematical Sciences Institute
 
Smart Multitask Bregman Clustering
Smart Multitask Bregman ClusteringSmart Multitask Bregman Clustering
Smart Multitask Bregman Clustering
Venkat Sai Sharath Mudhigonda
 
Slides: Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPC...
Slides: Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPC...Slides: Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPC...
Slides: Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPC...
Frank Nielsen
 
Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPCO 2009)
Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPCO 2009)Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPCO 2009)
Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPCO 2009)
Frank Nielsen
 
Unit 3
Unit 3Unit 3
Unit 3
guna287176
 
Introduction to HMM
Introduction to HMMIntroduction to HMM
Introduction to HMM
Hyoseop Lee
 
Cmb part3
Cmb part3Cmb part3
Common fixed point theorems with continuously subcompatible mappings in fuzz...
 Common fixed point theorems with continuously subcompatible mappings in fuzz... Common fixed point theorems with continuously subcompatible mappings in fuzz...
Common fixed point theorems with continuously subcompatible mappings in fuzz...
Alexander Decker
 
CDT 22 slides.pdf
CDT 22 slides.pdfCDT 22 slides.pdf
CDT 22 slides.pdf
Christian Robert
 
Bayesian hybrid variable selection under generalized linear models
Bayesian hybrid variable selection under generalized linear modelsBayesian hybrid variable selection under generalized linear models
Bayesian hybrid variable selection under generalized linear models
Caleb (Shiqiang) Jin
 
Tensor train to solve stochastic PDEs
Tensor train to solve stochastic PDEsTensor train to solve stochastic PDEs
Tensor train to solve stochastic PDEs
Alexander Litvinenko
 
Ijmet 10 01_046
Ijmet 10 01_046Ijmet 10 01_046
Ijmet 10 01_046
IAEME Publication
 

Similar to A One-Pass Triclustering Approach: Is There any Room for Big Data? (20)

Turning Krimp into a Triclustering Technique on Sets of Attribute-Condition P...
Turning Krimp into a Triclustering Technique on Sets of Attribute-Condition P...Turning Krimp into a Triclustering Technique on Sets of Attribute-Condition P...
Turning Krimp into a Triclustering Technique on Sets of Attribute-Condition P...
 
Extracting biclusters of similar values with Triadic Concept Analysis
Extracting biclusters of similar values with Triadic Concept AnalysisExtracting biclusters of similar values with Triadic Concept Analysis
Extracting biclusters of similar values with Triadic Concept Analysis
 
Presentation.pdf
Presentation.pdfPresentation.pdf
Presentation.pdf
 
Traffic flow modeling on road networks using Hamilton-Jacobi equations
Traffic flow modeling on road networks using Hamilton-Jacobi equationsTraffic flow modeling on road networks using Hamilton-Jacobi equations
Traffic flow modeling on road networks using Hamilton-Jacobi equations
 
Triggering patterns of topology changes in dynamic attributed graphs
Triggering patterns of topology changes in dynamic attributed graphsTriggering patterns of topology changes in dynamic attributed graphs
Triggering patterns of topology changes in dynamic attributed graphs
 
Adaptive Restore algorithm & importance Monte Carlo
Adaptive Restore algorithm & importance Monte CarloAdaptive Restore algorithm & importance Monte Carlo
Adaptive Restore algorithm & importance Monte Carlo
 
Simple representations for learning: factorizations and similarities
Simple representations for learning: factorizations and similarities Simple representations for learning: factorizations and similarities
Simple representations for learning: factorizations and similarities
 
MUMS Opening Workshop - An Overview of Reduced-Order Models and Emulators (ED...
MUMS Opening Workshop - An Overview of Reduced-Order Models and Emulators (ED...MUMS Opening Workshop - An Overview of Reduced-Order Models and Emulators (ED...
MUMS Opening Workshop - An Overview of Reduced-Order Models and Emulators (ED...
 
Smart Multitask Bregman Clustering
Smart Multitask Bregman ClusteringSmart Multitask Bregman Clustering
Smart Multitask Bregman Clustering
 
Slides: Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPC...
Slides: Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPC...Slides: Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPC...
Slides: Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPC...
 
Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPCO 2009)
Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPCO 2009)Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPCO 2009)
Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPCO 2009)
 
Unit 3
Unit 3Unit 3
Unit 3
 
Unit 3
Unit 3Unit 3
Unit 3
 
Introduction to HMM
Introduction to HMMIntroduction to HMM
Introduction to HMM
 
Cmb part3
Cmb part3Cmb part3
Cmb part3
 
Common fixed point theorems with continuously subcompatible mappings in fuzz...
 Common fixed point theorems with continuously subcompatible mappings in fuzz... Common fixed point theorems with continuously subcompatible mappings in fuzz...
Common fixed point theorems with continuously subcompatible mappings in fuzz...
 
CDT 22 slides.pdf
CDT 22 slides.pdfCDT 22 slides.pdf
CDT 22 slides.pdf
 
Bayesian hybrid variable selection under generalized linear models
Bayesian hybrid variable selection under generalized linear modelsBayesian hybrid variable selection under generalized linear models
Bayesian hybrid variable selection under generalized linear models
 
Tensor train to solve stochastic PDEs
Tensor train to solve stochastic PDEsTensor train to solve stochastic PDEs
Tensor train to solve stochastic PDEs
 
Ijmet 10 01_046
Ijmet 10 01_046Ijmet 10 01_046
Ijmet 10 01_046
 

More from Dmitrii Ignatov

Interpretable Concept-Based Classification with Shapley Values
Interpretable Concept-Based Classification with Shapley ValuesInterpretable Concept-Based Classification with Shapley Values
Interpretable Concept-Based Classification with Shapley Values
Dmitrii Ignatov
 
AIST2019 – opening slides
AIST2019 – opening slidesAIST2019 – opening slides
AIST2019 – opening slides
Dmitrii Ignatov
 
Personal Experiences of Publishing with Springer from both Editor and Author ...
Personal Experiences of Publishing with Springer from both Editor and Author ...Personal Experiences of Publishing with Springer from both Editor and Author ...
Personal Experiences of Publishing with Springer from both Editor and Author ...
Dmitrii Ignatov
 
Aist2014
Aist2014Aist2014
Aist2014
Dmitrii Ignatov
 
Social Learning in Networks: Extraction Deterministic Rules
Social Learning in Networks: Extraction Deterministic RulesSocial Learning in Networks: Extraction Deterministic Rules
Social Learning in Networks: Extraction Deterministic Rules
Dmitrii Ignatov
 
Orpailleur -- triclustering talk
Orpailleur -- triclustering talkOrpailleur -- triclustering talk
Orpailleur -- triclustering talk
Dmitrii Ignatov
 
CoClus ICDM Workshop talk
CoClus ICDM Workshop talkCoClus ICDM Workshop talk
CoClus ICDM Workshop talk
Dmitrii Ignatov
 
Pseudo-triclustering
Pseudo-triclusteringPseudo-triclustering
Pseudo-triclustering
Dmitrii Ignatov
 
Radio recommender system for FMHost
Radio recommender system for FMHostRadio recommender system for FMHost
Radio recommender system for FMHost
Dmitrii Ignatov
 
CrowDM system
CrowDM systemCrowDM system
CrowDM system
Dmitrii Ignatov
 

More from Dmitrii Ignatov (10)

Interpretable Concept-Based Classification with Shapley Values
Interpretable Concept-Based Classification with Shapley ValuesInterpretable Concept-Based Classification with Shapley Values
Interpretable Concept-Based Classification with Shapley Values
 
AIST2019 – opening slides
AIST2019 – opening slidesAIST2019 – opening slides
AIST2019 – opening slides
 
Personal Experiences of Publishing with Springer from both Editor and Author ...
Personal Experiences of Publishing with Springer from both Editor and Author ...Personal Experiences of Publishing with Springer from both Editor and Author ...
Personal Experiences of Publishing with Springer from both Editor and Author ...
 
Aist2014
Aist2014Aist2014
Aist2014
 
Social Learning in Networks: Extraction Deterministic Rules
Social Learning in Networks: Extraction Deterministic RulesSocial Learning in Networks: Extraction Deterministic Rules
Social Learning in Networks: Extraction Deterministic Rules
 
Orpailleur -- triclustering talk
Orpailleur -- triclustering talkOrpailleur -- triclustering talk
Orpailleur -- triclustering talk
 
CoClus ICDM Workshop talk
CoClus ICDM Workshop talkCoClus ICDM Workshop talk
CoClus ICDM Workshop talk
 
Pseudo-triclustering
Pseudo-triclusteringPseudo-triclustering
Pseudo-triclustering
 
Radio recommender system for FMHost
Radio recommender system for FMHostRadio recommender system for FMHost
Radio recommender system for FMHost
 
CrowDM system
CrowDM systemCrowDM system
CrowDM system
 

Recently uploaded

PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
ChetanK57
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
muralinath2
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
Columbia Weather Systems
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Ana Luísa Pinho
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
muralinath2
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
AADYARAJPANDEY1
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
muralinath2
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
Scintica Instrumentation
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
Comparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebratesComparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebrates
sachin783648
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
muralinath2
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
sonaliswain16
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
NathanBaughman3
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
Areesha Ahmad
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
Sérgio Sacani
 
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
ssuserbfdca9
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
IvanMallco1
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Sérgio Sacani
 

Recently uploaded (20)

PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
Comparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebratesComparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebrates
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
 
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
 

A One-Pass Triclustering Approach: Is There any Room for Big Data?

  • 1. A One-Pass Triclustering Approach: Is There any Room for Big Data? Dmitry V. Gnatyshak1 Dmitry I. Ignatov1 Sergei O. Kuznetsov1 Lhouari Nourine2 National Research University Higher School of Economics, Russian Federation Blaise Pascal University, LIMOS, CNRS, France 10.10.2014 Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 1 / 26
  • 2. Outline 1 Motivation Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 2 / 26
  • 3. Outline 1 Motivation 2 Prime OAC-triclustering Formal concept analysis Basic algorithm Online version of the algorithm Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 2 / 26
  • 4. Outline 1 Motivation 2 Prime OAC-triclustering Formal concept analysis Basic algorithm Online version of the algorithm 3 Experiments Description of the experiments Datasets Results Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 2 / 26
  • 5. Outline 1 Motivation 2 Prime OAC-triclustering Formal concept analysis Basic algorithm Online version of the algorithm 3 Experiments Description of the experiments Datasets Results 4 Conclusion Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 2 / 26
  • 6. Outline 1 Motivation 2 Prime OAC-triclustering Formal concept analysis Basic algorithm Online version of the algorithm 3 Experiments Description of the experiments Datasets Results 4 Conclusion Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 3 / 26
  • 7. Motiation Big amount of multimodal data: Gene expression data Folksonomies . . . Non-binary data can be scaled (possibly increasing the dimensionality) Increasing amount of big data: fast algorithms required (linear or sublinear, one-pass) Existing methods — finding all p-clusters satisfying some conditions (often exponential number) Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 4 / 26
  • 8. Outline 1 Motivation 2 Prime OAC-triclustering Formal concept analysis Basic algorithm Online version of the algorithm 3 Experiments Description of the experiments Datasets Results 4 Conclusion Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 5 / 26
  • 9. Prime OAC-triclustering Formal concept analysis: triadic case Definition Let G, M, B be some sets. Let the ternary relation I be a subset of their cartesian product: I ⊆ G × M × B. Then the tuple K = {G,M,B, I } is called a triadic formal context. G — a set of objects, M — a set of attributes, B — a set of conditions. GM m1 m2 m3 m1 m2 m3 m1 m2 m3 g1 x x x x x x x x g2 x x x x x g3 x x x x g4 x x x x x x B b1 b2 b3 Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 6 / 26
  • 10. Prime OAC-triclustering Formal concept analysis: triadic case Definition Galois operators (prime operators) are defined the same way as in dyadic case: 2G → 2M × 2B 2M → 2G × 2B 2B → 2G × 2M 2G × 2M → 2B 2G × 2B → 2M 2M × 2B → 2G Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 6 / 26
  • 11. Prime OAC-triclustering Formal concept analysis: triadic case GM m1 m2 m3 m1 m2 m3 m1 m2 m3 g1 x x x x x x x x g2 x x x x x g3 x x x x g4 x x x x x x B b1 b2 b3 ({g1, g2}, {m1,m2})′ = {b1, b3} Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 6 / 26
  • 12. Prime OAC-triclustering Formal concept analysis: triadic case GM m1 m2 m3 m1 m2 m3 m1 m2 m3 g1 x x x x x x x x g2 x x x x x g3 x x x x g4 x x x x x x B b1 b2 b3 m′2 = {(g1, b1), (g2, b1), (g3, b1), (g1, b2), (g1, b3), (g2, b3), (g4, b3)} Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 6 / 26
  • 13. Prime OAC-triclustering Formal concept analysis: triadic case Definition The triple (X,Y , Z) is called triadic formal concept of the context K = (G,M,B, I ), if X ⊆ G,Y ⊆ M, Z ⊆ B, (X,Y )′ = Z, (X, Z)′ = Y , (Y , Z)′ = X. X is called (formal) extent, Y — (formal) intent, Z — (formal) modus. GM m1 m2 m3 m1 m2 m3 m1 m2 m3 g1 x x x x x x x x g2 x x x x x g3 x x x x g4 x x x x x x B b1 b2 b3 Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 6 / 26
  • 14. Prime OAC-triclustering Basic algorithm This method uses the following types of prime operators (for the context K = (G,M,B, I )): (g,m)′ = {b ∈ B | (g,m, b) ∈ I }, (g, b)′ = {m ∈ M | (g,m, b) ∈ I }, (m, b)′ = {g ∈ G | (g,m, b) ∈ I } Definition Then the triple T = ((m, b)′, (g, b)′, (g,m)′) is called prime OAC-tricluster based on triple (g,m, b) ∈ I . The sets of tricluster are called, respectively, extent, intent, and modus. Triple (g,m, b) is called a generating triple of the tricluster T. Definition Density of a tricluster: ρ(X,Y , Z) = |I∩(X×Y×Z)| |X||Y||Z| Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 7 / 26
  • 15. Prime OAC-triclustering Basic algorithm An example of a tricluster based on triple (eg,em ,eb ): Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 8 / 26
  • 16. Prime OAC-triclustering Basic algorithm Require: K = (G,M, B, I ) — triadic context; ρmin — density threshold Ensure: T = {T = (X, Y, Z)} 1: T := ∅ 2: for all (g,m) : g ∈ G,m ∈ M do 3: PrimesObjAttr [g,m] = (g,m)′ 4: end for 5: for all (g, b) : g ∈ G,b ∈ B do 6: PrimesObjCond[g, b] = (g, b)′ 7: end for 8: for all (m, b) : m ∈ M,b ∈ B do 9: PrimesAttrCond[m, b] = (m, b)′ 10: end for 11: for all (g,m, b) ∈ I do 12: T = (PrimesAttrCond[m, b], PrimesObjCond[g, b], PrimesObjAttr [g,m]) 13: Tkey = hash(T) 14: if Tkey̸∈ T .keys ∧ ρ(T) ≥ ρmin then 15: T [Tkey] := T 16: end if 17: end for Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 9 / 26
  • 17. Prime OAC-triclustering Online version of the algorithm Let K = (G,M,B, I ) be a triadic context. We do not know G, M, B, I , or their cardinalities. Input on each iteration: {(g,m, b)} = J ⊆ I . Goal — maintain an updated version of the results and efficiently update them when new triples are received. We need to keep in memory the results of prime operators’ application (prime sets): PrimesObjAttr — dictionary with elements of type ((g,m), {b ∈ B}), g ∈ G, m ∈ M; PrimesObjCond — dictionary with elements of type ((g, b), {m ∈ M}), g ∈ G, b ∈ B; PrimesAttrCond — dictionary with elements of type ((m, b), {g ∈ G}), m ∈ M, b ∈ B. Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 10 / 26
  • 18. Prime OAC-triclustering Online version of the algorithm Remark In this case we need to consider triclusters based on different triples different, even if their extents, intents, and modi are equal. Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 11 / 26
  • 19. Prime OAC-triclustering Online version of the algorithm Algorithm of triples addition (standard): Require: J — a set of triples to add; T = {T = (∗X, ∗Y , ∗Z)} — current tricluster set; PrimesObjAttr , PrimesObjCond, PrimesAttrCond; Ensure: T = {T = (∗X, ∗Y , ∗Z)}; PrimesObjAttr , PrimesObjCond, PrimesAttrCond; 1: for all (g,m, b) ∈ J do 2: PrimesObjAttr [g,m] := PrimesObjAttr [g,m] ∪ b 3: PrimesObjCond[g, b] := PrimesObjCond[g, b] ∪ m 4: PrimesAttrCond[m, b] := PrimesAttrCond[m, b] ∪ g 5: T := T ∪ (&PrimesAttrCond[m, b],&PrimesObjCond[g, b],&PrimesObjAttr [g,m]) 6: end for Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 12 / 26
  • 20. Prime OAC-triclustering Online version of the algorithm Algorithm of triples removal (optional): Triclusters must be kept in the dictionary with generating triples being keys. Require: J — a set of triples to remove; T = {(key(T),T = (∗X, ∗Y , ∗Z))} — current tricluster dictionary; PrimesObjAttr , PrimesObjCond, PrimesAttrCond; Ensure: T = {T = (∗X, ∗Y , ∗Z)}; PrimesObjAttr , PrimesObjCond, PrimesAttrCond; 1: for all (g,m, b) ∈ J do 2: T := T T [(g,m, b)] 3: PrimesObjAttr [g,m] := PrimesObjAttr [g,m] b 4: PrimesObjCond[g, b] := PrimesObjCond[g, b] m 5: PrimesAttrCond[m, b] := PrimesAttrCond[m, b] g 6: end for Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 13 / 26
  • 21. Prime OAC-triclustering Online version of the algorithm If a user have asked for an output, we may need to remove the triclusters with the same extent, intent and modi at the post-processing stage. At this stage we can also check various conditions (for instance, minimal density condition). Require: T = {T = (∗X, ∗Y , ∗Z)} — current tricluster set; Ensure: T = {T = (∗X, ∗Y , ∗Z)} — processed tricluster hash-set; 1: for all T ∈ T do 2: Compute hash(T) 3: if hash(T)̸∈ T then 4: T := T ∪ T 5: end if 6: end for Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 14 / 26
  • 22. Prime OAC-triclustering Online version of the algorithm Remark 1 To allow an efficient access to the prime sets dictionaries PrimesObjAttr , PrimesObjCond, and PrimesAttrCond must be implemented as hash tables. Remark 2 For an efficient computation of triclusters’ hash values we can keep hash values of prime sets along with prime sets. Then the calculation of the triclusters’ hash values will require to find a value of some function of the prime sets’ hash values (multiplied by non-repeating coefficients, for instance). It is important not to use LHS hash-function (Locality Sensitive Hashing). Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 15 / 26
  • 23. Prime OAC-triclustering Online version of the algorithm Complexities: Time complexity: O(|I |) (as there is a constant number of operations on each step); More precisely: 8|I | operations in total; 1 Modification of 3 prime sets (3); 2 Creation of a new tricluster (1); 3 Addition of pointers to its extent, intent, and modus (3); 4 Addition of the tricluster to the set of all triclusters (1). Memory complexity: O(|I |) (as we need to keep in memory only prime sets, |I | elements in each dictionary + keys). Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 16 / 26
  • 24. Prime OAC-triclustering Online version of the algorithm Example: Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
  • 25. Prime OAC-triclustering Online version of the algorithm → (g1,m1, b1) 1 PrimesObjAttr = {((g1,m1), {b1})} 2 PrimesObjCond = {((g1, b1), {m1})} 3 PrimesAttrCond = {((m1, b1), {g1})} 4 T := T ∪ {PrimesAttrCond[m1, b1], PrimesObjCond[g1, b1], PrimesObjAttr [g1,m1]} Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
  • 26. Prime OAC-triclustering Online version of the algorithm → (g1,m2, b1) 1 PrimesObjAttr = {((g1,m1), {b1}), ((g1,m2), {b1})} 2 PrimesObjCond = {((g1, b1), {m1,m2})} 3 PrimesAttrCond = {((m1, b1), {g1}), ((m2, b1), {g1})} 4 T := T ∪ {PrimesAttrCond[m2, b1], PrimesObjCond[g1, b1], PrimesObjAttr [g1,m2]} Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
  • 27. Prime OAC-triclustering Online version of the algorithm → (g2,m1, b1) 1 PrimesObjAttr = {((g1,m1), {b1}), ((g1,m2), {b1}), ((g2,m1), {b1})} 2 PrimesObjCond = {((g1, b1), {m1,m2}), ((g2, b1), {m1})} 3 PrimesAttrCond = {((m1, b1), {g1, g2}), ((m2, b1), {g1})} 4 T := T ∪ {PrimesAttrCond[m1, b1], PrimesObjCond[g2, b1], PrimesObjAttr [g2,m1]} Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
  • 28. Prime OAC-triclustering Online version of the algorithm → (g2,m2, b1) 1 PrimesObjAttr = {((g1,m1), {b1}), ((g1,m2), {b1}), ((g2,m1), {b1}), ((g2,m2), {b1})} 2 PrimesObjCond = {((g1, b1), {m1,m2}), ((g2, b1), {m1,m2})} 3 PrimesAttrCond = {((m1, b1), {g1, g2}), ((m2, b1), {g1, g2})} 4 T := T ∪ {PrimesAttrCond[m2, b1], PrimesObjCond[g2, b1], PrimesObjAttr [g2,m2]} Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
  • 29. Prime OAC-triclustering Online version of the algorithm → (g3,m3, b1) 1 PrimesObjAttr = {((g1,m1), {b1}), ((g1,m2), {b1}), ((g2,m1), {b1}), ((g2,m2), {b1}), ((g3,m3), {b1})} 2 PrimesObjCond = {((g1, b1), {m1,m2}), ((g2, b1), {m1,m2}), ((g3, b1), {m3})} 3 PrimesAttrCond = {((m1, b1), {g1, g2}), ((m2, b1), {g1, g2}), ((m3, b1), {g3})} 4 T := T ∪ {PrimesAttrCond[m3, b1], PrimesObjCond[g3, b1], PrimesObjAttr [g3,m3]} Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
  • 30. Prime OAC-triclustering Online version of the algorithm → (g1,m2, b2) 1 PrimesObjAttr = {((g1,m1), {b1}), ((g1,m2), {b1, b2}), ((g2,m1), {b1}), ((g2,m2), {b1}), ((g3,m3), {b1})} 2 PrimesObjCond = {((g1, b1), {m1,m2}), ((g2, b1), {m1,m2}), ((g3, b1), {m3}), ((g1, b2), {m2})} 3 PrimesAttrCond = {((m1, b1), {g1, g2}), ((m2, b1), {g1, g2}), ((m3, b1), {g3}), ((m2, b2), {g1})} 4 T := T ∪ {PrimesAttrCond[m2, b2], PrimesObjCond[g1, b2], PrimesObjAttr [g1,m2]} Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
  • 31. Prime OAC-triclustering Online version of the algorithm → (g2,m1, b2) 1 PrimesObjAttr = {((g1,m1), {b1}), ((g1,m2), {b1, b2}), ((g2,m1), {b1, b2}), ((g2,m2), {b1}), ((g3,m3), {b1})} 2 PrimesObjCond = {((g1, b1), {m1,m2}), ((g2, b1), {m1,m2}), ((g3, b1), {m3}), ((g1, b2), {m2}), ((g2, b2), {m1})} 3 PrimesAttrCond = {((m1, b1), {g1, g2}), ((m2, b1), {g1, g2}), ((m3, b1), {g3}), ((m2, b2), {g1}), ((m1, b2), {g2})} 4 T := T ∪ {PrimesAttrCond[m1, b2], PrimesObjCond[g2, b2], PrimesObjAttr [g2,m1]} Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
  • 32. Prime OAC-triclustering Online version of the algorithm → (g2,m2, b2) 1 PrimesObjAttr = {((g1,m1), {b1}), ((g1,m2), {b1, b2}), ((g2,m1), {b1, b2}), ((g2,m2), {b1, b2}), ((g3,m3), {b1})} 2 PrimesObjCond = {((g1, b1), {m1,m2}), ((g2, b1), {m1,m2}), ((g3, b1), {m3}), ((g1, b2), {m2}), ((g2, b2), {m1,m2})} 3 PrimesAttrCond = {((m1, b1), {g1, g2}), ((m2, b1), {g1, g2}), ((m3, b1), {g3}), ((m2, b2), {g1, g2}), ((m1, b2), {g2})} 4 T := T ∪ {PrimesAttrCond[m2, b2], PrimesObjCond[g2, b2], PrimesObjAttr [g2,m2]} Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
  • 33. Prime OAC-triclustering Online version of the algorithm → (g3,m3, b2) 1 PrimesObjAttr = {((g1,m1), {b1}), ((g1,m2), {b1, b2}), ((g2,m1), {b1, b2}), ((g2,m2), {b1, b2}), ((g3,m3), {b1, b2})} 2 PrimesObjCond = {((g1, b1), {m1,m2}), ((g2, b1), {m1,m2}), ((g3, b1), {m3}), ((g1, b2), {m2}), ((g2, b2), {m1,m2}), ((g3, b2), {m3})} 3 PrimesAttrCond = {((m1, b1), {g1, g2}), ((m2, b1), {g1, g2}), ((m3, b1), {g3}), ((m2, b2), {g1, g2}), ((m1, b2), {g2}), ((m3, b2), {g3})} 4 T := T ∪ {PrimesAttrCond[m3, b2], PrimesObjCond[g3, b2], PrimesObjAttr [g3,m3]} Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
  • 34. Prime OAC-triclustering Online version of the algorithm Postprocessing: 1 T(g1,m1,b1) = (g1, g2,m1,m2, b1) ← add 2 T(g1,m2,b1) = (g1, g2,m1,m2, b1, b2) ← add 3 T(g2,m1,b1) = (g1, g2,m1,m2, b1, b2) ← the same as T(g1,m2,b1), skip 4 T(g2,m2,b1) = (g1, g2,m1,m2, b1, b2) ← the same as T(g1,m2,b1), skip 5 T(g3,m3,b1) = (g3,m3, b1, b2) ← add 6 T(g1,m2,b2) = (g1, g2,m2, b1, b2) ← add 7 T(g2,m1,b2) = (g2,m1,m2, b1, b2) ← add 8 T(g2,m2,b2) = (g1, g2,m1,m2, b1, b2) ← the same as T(g1,m2,b1), skip 9 T(g3,m3,b2) = (g3,m3, b1, b2) ← the same as T(g3,m3,b1), skip Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
  • 35. Prime OAC-triclustering Online version of the algorithm The final output set of triclusters: 1 T1 = ({g1, g2}, {m1,m2}, {b1}) 2 T2 = ({g1, g2}, {m1,m2}, {b1, b2}) 3 T3 = ({g3}, {m3}, {b1, b2}) 4 T4 = ({g1, g2}, {m2}, {b1, b2}) 5 T5 = ({g2}, {m1,m2}, {b1, b2}) Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 17 / 26
  • 36. Outline 1 Motivation 2 Prime OAC-triclustering Formal concept analysis Basic algorithm Online version of the algorithm 3 Experiments Description of the experiments Datasets Results 4 Conclusion Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 18 / 26
  • 37. Experiments Description of the experiments Goals: Show that the online algorithm of p-dimensional attribute clustering outperforms the basic algorithm Confirm the complexity estimations For each dataset for each version of the algorithm 11 experiments were conducted: for each there were different density threshold (from 0 to 1 with 0.1 intervals). To evaluate the time more precisely, for each context there were 5 runs of the algorithms with the average result recorded. Additional tests to check the performance on big datasets and confirm linearity of the online algorithm. All experiments were conducted on the computer with Intel Core i7-351U 2.40 GHz processor, 8 GB RAM, Windows 8 operating system. Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 19 / 26
  • 38. Experiments Datasets 5 pseudo-random uniform contexts 50 × 50 × 50. Probability of each quadruple’s presence varied from 0.02 to 0.1 with 0.02 interval 10 pseudo-random uniform contexts with average density equal to 0.001. Cardinalities of sets varied from 100 to 1000 with 100 interval Top-250 list of IMDB (Internet Movie Database) (triples: (movie, tag, genre)) Sample of 3000 triples of the first 100 000 triples of Bibsonomy.org dataset (triples: (user, bookmark, tag)) Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 20 / 26
  • 39. Experiments Datasets Context |G| |M| |B| # triples Density Synthetic1, 0.02 50 50 50 2530 0.02024 Synthetic1, 0.04 50 50 50 5001 0.04001 Synthetic1, 0.06 50 50 50 7454 0.05963 Synthetic1, 0.08 50 50 50 10046 0.08037 Synthetic1, 0.1 50 50 50 12462 0.09970 Synthetic2, 100 100 100 100 996 0.001 Synthetic2, 200 200 200 200 7995 0.001 Synthetic2, 300 300 300 300 27161 0.001 Synthetic2, 400 400 400 400 63921 0.001 Synthetic2, 500 500 500 500 125104 0.001 Synthetic2, 600 600 600 600 216021 0.001 Synthetic2, 700 700 700 700 343157 0.001 Synthetic2, 800 800 800 800 512097 0.001 Synthetic2, 900 900 900 900 729395 0.001 Synthetic2, 1000 1000 1000 1000 1000589 0.001 IMDB 250 795 22 3818 0.00087 BibSonomy 51 924 2844 3000 0.000022 Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 20 / 26
  • 40. Experiments Results Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 21 / 26
  • 41. Experiments Results Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 21 / 26
  • 42. Experiments Results Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 22 / 26
  • 43. Experiments Results Densities for the contexts: Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 23 / 26
  • 44. Experiments Results Densities for the contexts: Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 23 / 26
  • 45. Outline 1 Motivation 2 Prime OAC-triclustering Formal concept analysis Basic algorithm Online version of the algorithm 3 Experiments Description of the experiments Datasets Results 4 Conclusion Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 24 / 26
  • 46. Conclusion Prime OAC-triclustering algorithm was described One-pass linear online version of its basic algorithm was proposed Efficiency of the online algorithm and complexities of both algorithms were confirmed. Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 25 / 26
  • 47. Thank you! Questions? Dmitry V. Gnatyshak et al. A One-Pass Triclustering Approach 10.10.2014 26 / 26