Clustering technique
for conceptual cluster
Brice Govin
Clustering technique for
conceptual cluster
• Clustering ?
• Concept?
• An iteration to rule them all
• And in an approach bind them
• And come the hobbits and their issues
Clustering?
Clustering?
Clustering?
Clustering?
Concept
?
Concept?
Conceptual cluster
Clustering concepts
An iteration to rule them all
Extracted graph
A
E
I
B
D
F
H
G
C
D Packages
Imports
An iteration to rule them all
Kernel Selection
A
E
I
B
D
F
H
G
C
An iteration to rule them all
Navigating the graph
A
E
I
B
D
F
H
G
C
An iteration to rule them all
Navigating the graph
A
E
I
B
D
F
H
G
C
An iteration to rule them all
Navigating the graph
A
E
I
B
D
F
H
G
C
An iteration to rule them all
Navigating the graph
A
E
I
B
D
F
H
G
C
G
An iteration to rule them all
Difference between traditional methods
and ours
A
E
I
B
D
F
H
G
C
dist(A,F)
similarity(A,H)
And in an approach
bind them
And in an approach bind them
Why iterative?
Because hierarchy of clusters
Cluster
1
Cluster
1.3
Cluster
2.1
Cluster
3.1
Cluster
3.2
Cluster
2
Cluster
3
Cluster
1.2
Cluster
1.1
Cluster
2.2
And in an approach bind them
How it works?
First instance, packages clustering
Packages
Subprograms
And in an approach bind them
How it works?
First instance, packages clustering
Packages
Cluster
1
Cluster
2
Cluster
3
And in an approach bind them
How it works?
2nd instance, subprograms clustering
Packages
Subprograms
And in an approach bind them
How it works?
2nd instance, subprograms clustering
Subprograms
Cluster
1.3
Cluster
2.1
Cluster
3.1
Cluster
3.2
Cluster
1.2
Cluster
1.1
Cluster
2.2
And come the hobbits and their issues
Cyclic dependencies
A
E
I
B
D
F
H
G
C
And come the hobbits and their issues
Cyclic dependencies
Keep track of navigated nodes
A
E
I
B
D
F
H
G
C
And come the hobbits and their issues
Multiple allocation
A
E
I
B
D
F
H
G
C
Talk about numbers
1st Instance: packages clustering
2nd Instance: subprograms clustering
Precision: 98%
Recall: 88%
Precision: ~60%
Recall: ~50%%
What next?

Clustering technique for conceptual clusters

  • 1.
  • 2.
    Clustering technique for conceptualcluster • Clustering ? • Concept? • An iteration to rule them all • And in an approach bind them • And come the hobbits and their issues
  • 3.
  • 4.
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
    An iteration torule them all Extracted graph A E I B D F H G C D Packages Imports
  • 11.
    An iteration torule them all Kernel Selection A E I B D F H G C
  • 12.
    An iteration torule them all Navigating the graph A E I B D F H G C
  • 13.
    An iteration torule them all Navigating the graph A E I B D F H G C
  • 14.
    An iteration torule them all Navigating the graph A E I B D F H G C
  • 15.
    An iteration torule them all Navigating the graph A E I B D F H G C G
  • 16.
    An iteration torule them all Difference between traditional methods and ours A E I B D F H G C dist(A,F) similarity(A,H)
  • 17.
    And in anapproach bind them
  • 18.
    And in anapproach bind them Why iterative? Because hierarchy of clusters Cluster 1 Cluster 1.3 Cluster 2.1 Cluster 3.1 Cluster 3.2 Cluster 2 Cluster 3 Cluster 1.2 Cluster 1.1 Cluster 2.2
  • 19.
    And in anapproach bind them How it works? First instance, packages clustering Packages Subprograms
  • 20.
    And in anapproach bind them How it works? First instance, packages clustering Packages Cluster 1 Cluster 2 Cluster 3
  • 21.
    And in anapproach bind them How it works? 2nd instance, subprograms clustering Packages Subprograms
  • 22.
    And in anapproach bind them How it works? 2nd instance, subprograms clustering Subprograms Cluster 1.3 Cluster 2.1 Cluster 3.1 Cluster 3.2 Cluster 1.2 Cluster 1.1 Cluster 2.2
  • 23.
    And come thehobbits and their issues Cyclic dependencies A E I B D F H G C
  • 24.
    And come thehobbits and their issues Cyclic dependencies Keep track of navigated nodes A E I B D F H G C
  • 25.
    And come thehobbits and their issues Multiple allocation A E I B D F H G C
  • 26.
    Talk about numbers 1stInstance: packages clustering 2nd Instance: subprograms clustering Precision: 98% Recall: 88% Precision: ~60% Recall: ~50%%
  • 27.