The document proposes a lattice-based approach for consensus clustering. It introduces the consensus clustering problem and existing approaches. It then describes a least-squares criterion for ensemble and combined consensus clustering. A lattice-based algorithm is presented that finds a consensus partition by identifying an antichain of concepts in the lattice formed from a partition context. Computational experiments on synthetic datasets are used to evaluate the lattice-based approach and compare it to state-of-the-art algorithms, using adjusted rand index to measure similarity between partitions.
Pattern-based classification of demographic sequencesDmitrii Ignatov
We have proposed prefix-based gapless sequential patterns for classification of demographic sequences. In comparison to black-box machine learning techniques, this one provides interpretable patterns suitable for treatment by professional demographers. As for the language, we have used Pattern Structures as an extension of Formal Concept Analysis for the case of complex data like sequences, graphs, intervals, etc.
This paper presents an interesting idea how to compute a consensus of several k-partitions of a set by means of finding an antichain in the concept lattice of an appropriate formal context.
Total Dominating Color Transversal Number of Graphs And Graph Operationsinventionjournals
Total Dominating Color Transversal Set of a graph is a Total Dominating Set of the graph which is also Transversal of Some 휒 - Partition of the graph. Here 휒 is the Chromatic number of the graph. Total Dominating Color Transversal number of a graph is the cardinality of a Total Dominating Color Transversal Set which has minimum cardinality among all such sets that the graph admits. In this paper, we consider the well known graph operations Join, Corona, Strong product and Lexicographic product of graphs and determine Total Dominating Color Transversal number of the resultant graphs.
We study QPT (quasi-polynomial tractability) in the worst case setting of linear tensor product problems defined over Hilbert spaces. We prove QPT for algorithms that use only function values under three assumptions'
1. the minimal errors for the univariate case decay polynomially fast to zero,
2. the largest singular value for the univariate case is simple,
3. the eigenfunction corresponding to the largest singular value is a multiple of the function value at some point.
The first two assumptions are necessary for QPT. The third assumption is necessary for QPT for some Hilbert spaces.
Joint work with Erich Novak
Pattern-based classification of demographic sequencesDmitrii Ignatov
We have proposed prefix-based gapless sequential patterns for classification of demographic sequences. In comparison to black-box machine learning techniques, this one provides interpretable patterns suitable for treatment by professional demographers. As for the language, we have used Pattern Structures as an extension of Formal Concept Analysis for the case of complex data like sequences, graphs, intervals, etc.
This paper presents an interesting idea how to compute a consensus of several k-partitions of a set by means of finding an antichain in the concept lattice of an appropriate formal context.
Total Dominating Color Transversal Number of Graphs And Graph Operationsinventionjournals
Total Dominating Color Transversal Set of a graph is a Total Dominating Set of the graph which is also Transversal of Some 휒 - Partition of the graph. Here 휒 is the Chromatic number of the graph. Total Dominating Color Transversal number of a graph is the cardinality of a Total Dominating Color Transversal Set which has minimum cardinality among all such sets that the graph admits. In this paper, we consider the well known graph operations Join, Corona, Strong product and Lexicographic product of graphs and determine Total Dominating Color Transversal number of the resultant graphs.
We study QPT (quasi-polynomial tractability) in the worst case setting of linear tensor product problems defined over Hilbert spaces. We prove QPT for algorithms that use only function values under three assumptions'
1. the minimal errors for the univariate case decay polynomially fast to zero,
2. the largest singular value for the univariate case is simple,
3. the eigenfunction corresponding to the largest singular value is a multiple of the function value at some point.
The first two assumptions are necessary for QPT. The third assumption is necessary for QPT for some Hilbert spaces.
Joint work with Erich Novak
To describe the dynamics taking place in networks that structurally change over time, we propose an approach to search for attributes whose value changes impact the topology of the graph. In several applications, it appears that the variations of a group of attributes are often followed by some structural changes in the graph that one may assume they generate. We formalize the triggering pattern discovery problem as a method jointly rooted in sequence mining and graph analysis. We apply our approach on three real-world dynamic graphs of different natures - a co-authoring network, an airline network, and a social bookmarking system - assessing the relevancy of the triggering pattern mining approach.
Abstract Quadripartitioned single valued neutrosophic (QSVN) set is a powerful structure where we have four components Truth-T, Falsity-F, Unknown-U and Contradiction-C. And also it generalizes the concept of fuzzy, initutionstic and single valued neutrosophic set. In this paper we have proposed the concept of K-algebras on QSVN, level subset of QSVN and studied some of the results. In addition to this we have also investigated the characteristics of QSVN Ksubalgebras under homomorphism.
On maximal and variational Fourier restrictionVjekoslavKovac1
Workshop talk slides, Follow-up workshop to trimester program "Harmonic Analysis and Partial Differential Equations", Hausdorff Institute, Bonn, May 2019.
My talk in the International Conference on Computational Finance 2019 (ICCF2019). The talk is about designing new efficient methods for option pricing under the rough Bergomi model.
Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...Chiheb Ben Hammouda
Conference talk at the SIAM Conference on Financial Mathematics and Engineering, held in virtual format, June 1-4 2021, about our recently published work "Hierarchical adaptive sparse grids and quasi-Monte Carlo for option pricing under the rough Bergomi model".
- Link of the paper: https://www.tandfonline.com/doi/abs/10.1080/14697688.2020.1744700
Experimental Economics and Machine Learning workshopDmitrii Ignatov
This presentation summarises recent activities on EEML workshop organisation. In fact, this is a successful event which attracts economists and computers scientists who would like to use recent advances in machine learning and data mining to understand human behavior in different domains related to Economics and Social Science.
NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016Dmitrii Ignatov
Some photo impressions from NIPS & ICDM 2016 in Barcelona mixed with workshops like Learning with Tensors (http://tensor-learn.org/) and related stuff.
To describe the dynamics taking place in networks that structurally change over time, we propose an approach to search for attributes whose value changes impact the topology of the graph. In several applications, it appears that the variations of a group of attributes are often followed by some structural changes in the graph that one may assume they generate. We formalize the triggering pattern discovery problem as a method jointly rooted in sequence mining and graph analysis. We apply our approach on three real-world dynamic graphs of different natures - a co-authoring network, an airline network, and a social bookmarking system - assessing the relevancy of the triggering pattern mining approach.
Abstract Quadripartitioned single valued neutrosophic (QSVN) set is a powerful structure where we have four components Truth-T, Falsity-F, Unknown-U and Contradiction-C. And also it generalizes the concept of fuzzy, initutionstic and single valued neutrosophic set. In this paper we have proposed the concept of K-algebras on QSVN, level subset of QSVN and studied some of the results. In addition to this we have also investigated the characteristics of QSVN Ksubalgebras under homomorphism.
On maximal and variational Fourier restrictionVjekoslavKovac1
Workshop talk slides, Follow-up workshop to trimester program "Harmonic Analysis and Partial Differential Equations", Hausdorff Institute, Bonn, May 2019.
My talk in the International Conference on Computational Finance 2019 (ICCF2019). The talk is about designing new efficient methods for option pricing under the rough Bergomi model.
Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...Chiheb Ben Hammouda
Conference talk at the SIAM Conference on Financial Mathematics and Engineering, held in virtual format, June 1-4 2021, about our recently published work "Hierarchical adaptive sparse grids and quasi-Monte Carlo for option pricing under the rough Bergomi model".
- Link of the paper: https://www.tandfonline.com/doi/abs/10.1080/14697688.2020.1744700
Experimental Economics and Machine Learning workshopDmitrii Ignatov
This presentation summarises recent activities on EEML workshop organisation. In fact, this is a successful event which attracts economists and computers scientists who would like to use recent advances in machine learning and data mining to understand human behavior in different domains related to Economics and Social Science.
NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016Dmitrii Ignatov
Some photo impressions from NIPS & ICDM 2016 in Barcelona mixed with workshops like Learning with Tensors (http://tensor-learn.org/) and related stuff.
A short introduction into Sequential Pattern Mining in Russia. We consider frequent and frequent closed sequences along with two algorithms (SPADE and PrefixSpan). A demographic case study is provided as well. One can find links and references to relevant literature and software. We mainly follow Han & Kamber Data Mining book (2nd edition, Chapter 8.3).
Краткое введение в Sequential Pattern Mining на русском языке. Рассматриваются алгоритмы для поиска частых и частых замкнутых последовательностей (SPADE и PrefixSpan) Кейс-стади на примере демографических последовательностей. Приведены ссылки на библиотеки и реализации некоторых базовых алгоритмов. Основное изложение по мотивам учебника Джиавея Хана и Мишелин Камбер.
Context-Aware Recommender System Based on Boolean Matrix FactorisationDmitrii Ignatov
In this work we propose and study an approach for collaborative filtering, which is based on Boolean matrix factorisation and exploits additional (context) information about users and items. To avoid similarity loss in case of Boolean representation we use an adjusted type of projection of a target user to the obtained factor space.
We have compared the proposed method with SVD-based approach on the MovieLens dataset. The experiments demonstrate that the proposed method has better MAE and Precision and comparable Recall and F-measure. We also report an increase of quality in the context information presence.
Поиск частых множеств признаков (товаров) и ассоциативные правилаDmitrii Ignatov
Краткое введение в анализ ассоциативных правил в терминах Анализа Формальных Понятий. Примеры задач: поиск документов почти-дубликатов, анализ посещаемости сайтов, контекстная реклама.
On the Family of Concept Forming Operators in Polyadic FCADmitrii Ignatov
Triadic Formal Concept Analysis (3FCA) was introduced by Lehman and Wille almost two decades ago. And many researchers work in Data Mining and Formal Concept Analysis using the notions of closed sets, Galois and closure operators, closure systems. However, up-to-date even though that different researchers actively work on mining triadic and n-ary relations, a proper closure operator for enumeration of triconcepts, i.e. maximal triadic cliques of tripartite hypergaphs, was not introduced. In this talk we show that the previously introduced operators for obtaining triconcepts are not always consistent, describe their family and study their properties. We also introduce the notion of maximal switching generator to explain why such concept-forming operators are not closure operators due to violation of monotonicity property.
Pattern Mining and Machine Learning for Demographic SequencesDmitrii Ignatov
In this talk, we present the results of our first studies in application of pattern mining and machine learning to analysis of demographic sequences in Russia based on data of 11 generations from 1930 till 1984. The main goal is not prediction and data mining methods themselves, but rather extraction of interesting patterns and knowledge acquisition from substantial datasets of demographic data. We use decision trees as techniques for demographic events prediction and emerging patterns for searching significant and potentially useful sequences.
In our previous work an efficient one-pass online algorithm for triclustering of binary data (triadic formal contexts) was proposed. This algorithm is a modified version of the basic algorithm for OAC- triclustering approach; it has linear time and memory complexities. In this paper we parallelise it via map-reduce framework in order to make it suitable for big datasets. The results of computer experiments show the efficiency of the proposed algorithm; for example, it outperforms the online counterpart on Bibsonomy dataset with ≈ 800, 000 triples.
AIST is a scientific conference on Analysis of Images, Social Networks, and Texts. The conference is intended for computer scientists and practitioners whose research interests involve Internet mathematics and other related fields of data science. Similar to the previous year, the conference will be focused on applications of data mining and machine learning techniques to various problem domains: image processing, analysis of social networks, and natural language processing. We hope that the participants will benefit from the interdisciplinary nature of the conference and exchange experience.
Searching for optimal patterns in Boolean tensorsDmitrii Ignatov
This is our slides for a spotlight talk at Learning with Tensors workshop at NIPS 2016. We have shortly summarise comparison of five different triclustering algorithms (TRIAS, TriBox, OACPrime, OACBox, and SpecTric).
RAPS: A Recommender Algorithm Based on Pattern StructuresDmitrii Ignatov
We propose a new algorithm for recommender systems with numeric
ratings which is based on Pattern Structures (RAPS). As the input the algorithm
takes rating matrix, e.g., such that it contains movies rated by users. For a target
user, the algorithm returns a rated list of items (movies) based on its previous ratings
and ratings of other users.We compare the results of the proposed algorithm
in terms of precision and recall measures with Slope One, one of the state-of-the-art
item-based algorithms, on Movie Lens dataset and RAPS demonstrates the
best or comparable quality.
A One-Pass Triclustering Approach: Is There any Room for Big Data?Dmitrii Ignatov
An efficient one-pass online algorithm for triclustering of binary data (triadic formal contexts) is proposed. This algorithm is a modified version of the basic algorithm for OAC-triclustering approach, but it has linear time and memory complexities with respect to the cardinality
of the underlying ternary relation and can be easily parallelized in order to be applied for the analysis of big datasets. The results of computer experiments show the efficiency of the proposed algorithm.
Boolean matrix factorisation for collaborative filteringDmitrii Ignatov
We propose a new approach for Collaborative filtering which
is based on Boolean Matrix Factorisation (BMF) and Formal Concept
Analysis. In a series of experiments on real data (MovieLens dataset) we
compare the approach with an SVD-based one in terms of Mean Average
Error (MAE). One of the experimental consequences is that it is enough
to have a binary-scaled rating data to obtain almost the same quality
in terms of MAE by BMF as for the SVD-based algorithm in case of
non-scaled data.
Online Recommender System for Radio Station Hosting: Experimental Results Rev...Dmitrii Ignatov
We present a new recommender system developed for the Russian interactive radio network FMhost based on a previously proposed model. The underlying model combines a collaborative user-based approach with information from tags of listened tracks in order to match user and radio station profiles.
It follows an adaptive online learning strategy based on the user history. We compare the proposed algorithms and an industry standard technique based on singular value decomposition (SVD)
in terms of precision, recall, and NDCG measures; experiments show that in our case the fusion-based approach shows the best results.
Jiawei Han, Micheline Kamber and Jian Pei
Data Mining: Concepts and Techniques, 3rd ed.
The Morgan Kaufmann Series in Data Management Systems
Morgan Kaufmann Publishers, July 2011. ISBN 978-0123814791
In this paper, we solve a semi-supervised regression
problem. Due to the luck of knowledge about the
data structure and the presence of random noise, the considered data model is uncertain. We propose a method which combines graph Laplacian regularization and cluster ensemble methodologies. The co-association matrix of the ensemble is calculated on both labeled and unlabeled data; this matrix is used as a similarity matrix in the regularization framework to derive the predicted outputs. We use the low-rank decomposition of the co-association matrix to significantly speedup calculations and reduce memory. Two clustering problem examples are presented.
Full version is here https://arxiv.org/abs/1901.03919
A Matrix Based Approach for Weighted Argumentation FrameworksCarlo Taticchi
The assignment of weights to attacks in a classical Argumentation Framework allows to compute semantics by taking into account the different importance of each argument. We represent a Weighted Argumentation Framework by a non-binary matrix, and we characterise the basic extensions (such as w-admissible, w-stable, w-complete) by analysing sub-blocks of this matrix. Also, we show how to reduce the matrix into another one of smaller size, that is equivalent to the original one for the determination of extensions. Furthermore, we provide two algorithms that allow to build incrementally w-grounded and w-preferred extensions starting from a w-admissible extension.
We propose a regularized method for multivariate linear regression when the number of predictors may exceed the sample size. This method is designed to strengthen the estimation and the selection of the relevant input features with three ingredients: it takes advantage of the dependency pattern between the responses by estimating the residual covariance; it performs selection on direct links between predictors and responses; and selection is driven by prior structural information. To this end, we build on a recent reformulation of the multivariate linear regression model to a conditional Gaussian graphical model and propose a new regularization scheme accompanied with an efficient optimization procedure. On top of showing very competitive performance on artificial and real data sets, our method demonstrates capabilities for fine interpretation of its parameters, as illustrated in applications to genetics, genomics and spectroscopy.
Approximate Bayesian computation for the Ising/Potts modelMatt Moores
Bayes’ formula involves the likelihood function, p(y|theta), which is a problem when the likelihood is unavailable in closed form. ABC is a method for approximating the posterior p(theta|y) without evaluating the likelihood. Instead, pseudo-data is simulated from a generative model and compared with the observations. This talk will give an introduction to ABC algorithms: rejection sampling, ABC-MCMC and ABC-SMC. Application of these algorithms to image analysis will be presented as an illustrative example. These methods have been implemented in the R package bayesImageS.
This is joint work with Christian Robert (Warwick/Dauphine), Kerrie Mengersen and Christopher Drovandi (QUT).
Similar to A lattice-based consensus clustering (20)
Interpretable Concept-Based Classification with Shapley ValuesDmitrii Ignatov
The slides contain our talk on Shapley values as an interpretable Machine learning technique for JSM-method, a rule-based classification and reasoning technique, for ranking particular attributes of an undetermined example under classification.
https://doi.org/10.1007/978-3-030-57855-8_7
These are opening slides of the 8th International Conference on Analysis of Images, Social Networks and Texts (AIST 2019). We summarise general facts on AIST conf. series. See http://aistconf.org website for more details.
Turning Krimp into a Triclustering Technique on Sets of Attribute-Condition P...Dmitrii Ignatov
Mining ternary relations or triadic Boolean tensors is one of the recent trends in knowledge discovery that allows one to take into account various modalities of input object-attribute data.
For example, in movie databases like IMBD, an analyst may find not only movies grouped by specific genres but see their common keywords. In the so called folksonomies, users can be grouped according to their shared resources and used tags. In gene expression analysis, genes can be grouped along with samples of tissues and time intervals providing comprehensible patterns. However, pattern explosion effects even with one more dimension are seriously aggravated. In this paper, we continue our previous study on searching for a smaller collection of ``optimal'' patterns in triadic data with respect to a set of quality criteria such as patterns' cardinality, density, diversity, coverage, etc. We show how a simple data preprocessing has enabled us to use the frequent itemset mining algorithm.
Social Learning in Networks: Extraction Deterministic RulesDmitrii Ignatov
In this talk, we want to introduce experimental
economics to the field of data mining and vice versa. It continues
related work on mining deterministic behavior rules of human
subjects in data gathered from experiments. Game-theoretic
predictions partially fail to work with this data. Equilibria also
known as game-theoretic predictions solely succeed with experienced
subjects in specific games – conditions, which are rarely
given. Contemporary experimental economics offers a number of
alternative models apart from game theory. In relevant literature,
these models are always biased by philosophical plausibility
considerations and are claimed to fit the data. An agnostic
data mining approach to the problem is introduced in this
paper – the philosophical plausibility considerations follow after
the correlations are found. No other biases are regarded apart
from determinism. The dataset of the paper “Social Learning in
Networks” by Choi et al 2012 is taken for evaluation. As a result,
we come up with new findings. As future work, the design of a
new infrastructure is discussed.
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...University of Maribor
Slides from:
11th International Conference on Electrical, Electronics and Computer Engineering (IcETRAN), Niš, 3-6 June 2024
Track: Artificial Intelligence
https://www.etran.rs/2024/en/home-english/
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptxRASHMI M G
Abnormal or anomalous secondary growth in plants. It defines secondary growth as an increase in plant girth due to vascular cambium or cork cambium. Anomalous secondary growth does not follow the normal pattern of a single vascular cambium producing xylem internally and phloem externally.
Richard's aventures in two entangled wonderlandsRichard Gill
Since the loophole-free Bell experiments of 2020 and the Nobel prizes in physics of 2022, critics of Bell's work have retreated to the fortress of super-determinism. Now, super-determinism is a derogatory word - it just means "determinism". Palmer, Hance and Hossenfelder argue that quantum mechanics and determinism are not incompatible, using a sophisticated mathematical construction based on a subtle thinning of allowed states and measurements in quantum mechanics, such that what is left appears to make Bell's argument fail, without altering the empirical predictions of quantum mechanics. I think however that it is a smoke screen, and the slogan "lost in math" comes to my mind. I will discuss some other recent disproofs of Bell's theorem using the language of causality based on causal graphs. Causal thinking is also central to law and justice. I will mention surprising connections to my work on serial killer nurse cases, in particular the Dutch case of Lucia de Berk and the current UK case of Lucy Letby.
Nucleophilic Addition of carbonyl compounds.pptxSSR02
Nucleophilic addition is the most important reaction of carbonyls. Not just aldehydes and ketones, but also carboxylic acid derivatives in general.
Carbonyls undergo addition reactions with a large range of nucleophiles.
Comparing the relative basicity of the nucleophile and the product is extremely helpful in determining how reversible the addition reaction is. Reactions with Grignards and hydrides are irreversible. Reactions with weak bases like halides and carboxylates generally don’t happen.
Electronic effects (inductive effects, electron donation) have a large impact on reactivity.
Large groups adjacent to the carbonyl will slow the rate of reaction.
Neutral nucleophiles can also add to carbonyls, although their additions are generally slower and more reversible. Acid catalysis is sometimes employed to increase the rate of addition.
1. A Lattice-based Consensus Clustering
Algorithm
Artem Bocharov, Dmitry Gnatyshak, Dmitry Ignatov, Boris Mirkin,
Andrey Shestakov
Computer Science Faculty, Dept. of Data Analysis and Artificial Intelligence, HSE, Moscow
The 13th International Conference on Concept Lattices and Their Applications
July 21, 2016
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 1 / 45
2. Objectives
1 Propose lattice-based consensus criteria and algorithms
2 Experimentally compare least-squares consensus clustering results
with those by recent algorithms for consensus clustering
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 2 / 45
5. Clustering results: different partitions
[Reigner, 1965], [Mirkin, 1969]
Figure 1 : Four clusterings at the same datasetCLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 5 / 45
6. Consensus Problem
Figure 2 : Clustering ensemble (on the left) and consensus clustering result (on
the right)
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 6 / 45
7. Approaches
Probabilistic
Bayesian Cluster Ensembles [Wang, 2009]
Mixture Model [Topchy, 2004]
Direct
Cumulative Voting [Dimitriadou, 2002], [Ayad, 2010]
Graph Partitioning [Ghosh, 2002]
Consensus matrix (Pairwise Similarity) [Guenoche, 2011]
A = (aij ), aij is the number of partitions in which objects yi and yj are
in the same cluster
Least Squares Consensus Clustering [Muchnik, Mirkin, 1981], [Mirkin,
2012]
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 7 / 45
9. Basic Definitions
Def.1
A partition of a nonempty set A is a set of its subsets σ = {B | B ⊆ A}
such that
B∈σ
B = A and B ∩ C = ∅ for all B, C ∈ σ.
Every element of σ is called block.
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 9 / 45
11. Two goals for consensus clustering
[Mirkin & Muchnik, 1981], [Mirkin, 2012]
Given partitions R1, R2, ..., RT find a consensus partition S so that:
Ensemble consensus: S is good for recovering Rt, t = 1, 2, . . . T
Combined consensus: Rt, t = 1, 2, . . . T are good for describing S
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 11 / 45
13. Equivalent reformulations of the
least-squares criteria
Ensemble consensus clustering
g(S) =
K
k=1 i,j∈Sk
aij /|Sk|
where A = (aij ) — ensemble consensus matrix of R = {R1, . . . , RT }.
Combined consensus clustering
f (S) =
K
k=1 i,j∈Sk
(pij − T/N)
where P = (pij ) — summary projection matrix, and N — number of
objects.
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 13 / 45
15. Basic Definitions
Def.2
A partition lattice of set A is an ordered set (Part(A), ∨, ∧) where Part(A)
is a set of all possible partitions of A and for all partitions σ and ρ
supremum and infimum are defined as follows:
σ ∨ ρ = {Nρ(B) ∪
C∈Nρ(B)
Nσ(C)|B ∈ σ},
σ ∧ ρ = {B ∩ C | for all B ∈ σ, C ∈ ρ, and B ∩ C ̸= ∅},
where
Nρ(B) = {C | C ∈ ρ and B ∩ C ̸= ∅}.
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 15 / 45
18. Basic Definitions
Def.3
Let A be a set and let ρ, σ ∈ Part(A). The partition ρ is finer than the
partition σ if every block B of σ is a union of blocks of ρ, that is ρ ≤ σ.
Equivalently one can use traditional connection between supremum,
infimum and partial order in the lattice: ρ ≤ σ iff ρ ∨ σ = σ (ρ ∧ σ = ρ).
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 18 / 45
19. Isomorphism of Lattices
Theorem 1 (Ganter&Wille)
For a given partially ordered set P = (P, ≤) the concept lattice of the
formal context K = (J(P), M(P), ≤) is isomorphic to the
Dedekind–MacNeille completion of P, where J(P) and M(P) are set of
join-irreducible and meet-irreducible elements of P.
Theorem 2
For a given partition lattice L = (Part(A), ∨, ∧) there exist a formal
context K = (P2, A2, I), where P2 = {{a, b} | a, b ∈ A and a ̸= b},
A2 = {σ | σ ∈ Part(A) and |σ| = 2} and {a, b}Iσ when a and b belong to
the same block of σ. The concept lattice B(P2, A2, I) is isomorphic to the
initial lattice (Part(A), ∨, ∧).
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 19 / 45
20. Isomorphism of Lattices
There is a correspondence between elements of L = (Part(A), ∨, ∧)
and formal concepts of B(P2, A2, I).
Every (C, D) ∈ B(P2, A2, I) corresponds to σ = D and every pair
{i, j} from C is in one of σ blocks, where σ ∈ Part(A).
Every (C, D) ∈ BDM(J(L), M(L), ≤) corresponds to
σ = D = C.
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 20 / 45
21. Concept Lattice
Figure 6 : The diagram of the concept lattice isomorphic to the partition lattice
of four elements
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 21 / 45
22. Partition context
Def.4
Let us call KR = (G, ⊔Mt, I ⊆ G × ⊔Mt) a partition context, where G is
a set of objects, t = 1, . . . , T, and each Mt consists of labels of all clusters
in the t-th k-means partition from the ensemble.
For example, gImt1 means that the object g has been clustered to the first
cluster by t-th clustering algorithm in the ensemble.
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 22 / 45
23. The idea of the algorithm
Our consensus algorithm looks for S, an antichain of concepts of KR,
such that for every (A, B) and (C, D) the condition A ∩ C = ∅ is
fulfilled.
The concept extent A corresponds to one of the resulting clusters,
and its intent contains all labels of the ensemble members that voted
for the objects from A being in one cluster.
It is a reasonable consensus hypothesis that at least ⌈T/2⌉ should
vote for a set of objects to be in one cluster.
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 23 / 45
24. An example of the algorithm execution
Figure 7 : An example from A. Bocharov’s thesis
The anticahin: S = {({o1, o2, o3, o6}, {a1, b1}), ({o4, o5, o7}, {b2, c2})}.
The orphan object: o8. o′
8 = {a2, b2, c1}.
The resulting partition: σ = {{o1, o2, o3, o6}, {o4, o5, o7, o8}}.
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 24 / 45
25. Perfect Recovery Condition
Theorem 3
In the concept lattice of a partition context
KR = (G, ⊔Mt, I ⊆ G × ⊔Mt), there is the antichain of concepts S such
that all extents of its concepts Ai coincide with Si from λ, the true
partition, if and only if S′′
i = Si where i = 1, . . . K.
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 25 / 45
27. Gaussian cluster generation
Generated partition
300 five-dimensional objects comprising three randomly generated
spherical Gaussian clusters.
The variance of each cluster lies in 0.1 − 0.3
The center components are independently generated from N(0, 0.7).
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 27 / 45
29. Experiments
Let us denote thus generated partition as λ with kλ clusters. The
profile of partitions R = {ρ1, ρ2, . . . , ρT } for consensus algorithms is
constructed as a result of T runs of k-means clustering algorithm
starting from random k centers.
We carry out the experiments in four settings (next slides).
The size of an ensemble T = 100 for all our experiments.
10 runs for every of 10 generated datasets.
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 29 / 45
30. Experiment 1
Investigation of influence of the number of clusters kλ ∈ {2, 3, 5, 9} under
various numbers of minimal votes
a) two clusters case kλ = 2, k ∈ {2, 3, 4, 5},
b) three clusters case kλ = 3, k ∈ {2, 3},
c) five clusters case kλ = 5, k ∈ {2, 5},
d) nine clusters case kλ = 9, k ∈ {2, 3, 4, 5, 6, 7, 8, 9};
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 30 / 45
31. Experiments 2 & 3
2 Investigation of the numbers of clusters of ensemble clusterers with
fixed number of true clusters kλ = 5
a) k = 2,
b) k ∈ {2, 3, 4, 5},
c) k ∈ {5},
d) k ∈ {5, 6, 7, 8, 9}
e) k = 9;
3 Investigation of the number of objects N ∈ {100, 300, 500, 1000};
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 31 / 45
32. Experiment 4
Comparison with the state-of-the-art algorithms
a) two clusters case kλ = 2, k = 2,
b) three clusters case kλ = 3, k ∈ {2, 3},
c) five clusters case kλ = 5, k ∈ {2, 3, 4, 5},
d) nine clusters case kλ = 9, k ∈ {2, 3, 4, 5, 6, 7, 8, 9}.
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 32 / 45
33. Similarity between partitions
ARI measure
Adjusted Rand Index (Hubert, Arabie 1986)
Given two partitions ρa = {Ra
1 , . . . , Ra
ka
} and ρb = {Rb
1 , . . . , Rb
kb
}, where
Na
h = |Ra
h|, Nhm = |Ra
h Rb
m|, N is the number of objects,
Ca =
h
Na
h
2
=
h
Na
h (Na
h −1)
2 .
φARI
(ρa
, ρb
) = hm
Nhm
2
− CaCb
N
2
1
2(Ca + Cb) − CaCb
N
2
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 33 / 45
34. Algorithms under comparison
AddRemAdd (Mirkin 2011; Mirkin and Shestakov, 2013)
Voting Scheme (Dimitriadou, Weingessel and Hornik, 2002)
cVote (Ayad, 2010)
Condorcet and Borda Consensus (Dominguez, Carrie and Pujol, 2008)
Meta-CLustering Algorithm (Strehl and Ghosh, 2002)
Hyper Graph Partitioning Algorithm
Cluster-based Similarity Partitioning Algorithm
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 34 / 45
36. Computational Experiments
0 10% 20% 30% 40% 50% 60% 70%
0.3
0.4
0.5
0.6
0.7
0.8
0.9
Minimal voting threshold
ARI
Two cluters
Three clusters
Five clusters
Nine clusters
Figure 9 : Influence of minimal voting threshold to ARI for different number of
true clusters
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 36 / 45
37. Computational Experiments
1 2 3 4 5 6 7 8 9 10
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Dataset no.
ARI
2
2–5
5
5–9
9
Figure 10 : ARI for different numbers of clusters of the ensemble clusterers with
kλ = 5 (each point is averaged over 10 datasets)
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 37 / 45
38. Computational Experiments
1 2 3 4 5 6 7 8 9 10
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Dataset no.
ARI
100
300
500
1000
Figure 11 : Influence of different numbers of objects to ARI
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 38 / 45
41. Conclusion
Optimal voting threshold in terms of minimal intent size for the
resulting anticahin of concepts is not constant; moreover, it is not
always majority of votes of ensemble members.
Our FCA-based consensus clustering method works better if set the
number of blocks for the ensemble clusterers to be equal to the size
of the original (true) partition.
ARI depends on the number of objects: The higher the number, the
lower ARI.
For two (and almost for all three) true clusters our method beats the
compared algorithms and in some cases consensus clustering task was
solved with 100% accuracy.
For larger number of clusters, our method is median among the
compared methods.
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 41 / 45
42. Future prospects
■ Upper and lower semi-lattices in the Pattern Structures framework
(Ganter, Kuznetsov, 2001) as a search space.
■ Experiments with real data and applications.
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 42 / 45
44. References
B. Mirkin
Clustering: A Data Recovery Approach, 2012
found E. Dimitriadou, A. Weingessel and K. Hornik
A Combination Scheme for Fuzzy Clustering
In International Journal of Pattern Recognition and Artificial Intelligence,
2002.
H. Ayad, M. Kamel
On voting-based consensus of cluster ensembles
Pattern Recognition, pp. 1943-1953, 2010
A. Guenoche.
Consensus of partitions : a constructive approach
Adv. Data Analysis and Classification 5, pp. 215-229, 2011.
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 44 / 45
45. References
X. Sevillano Dominguez, J. C. Socoro Carrie and
F. Alias Pujol.
Fuzzy clusterers combination by positional voting for robust document
clustering
Procesamiento del lenguaje natural, 43, pp. 245-253.
A. Strehl, J. Ghosh
Cluster ensembles – a knowledge reuse framework for combining multiple
partitions
Journal on Machine Learning Research, 2002.
CLA 2016 (HSE) Lattice-Based Consensus clustering 21.07.2016 45 / 45