Clustering by Maximizing Mutual Information Across Views
Kien Do, Truyen Tran, Svetha Venkatesh
Applied AI Institute (A2I2), Deakin University, Australia
Image Clustering Problem
The explosion of unlabelled data has led to a growing demand for unsupervised clustering.
Clustering Assumptions
Inter-cluster distance should be large.
Intra-cluster distance should be small.
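One common way to make these two criteria concrete is via cluster centroids; this particular formalization is an illustrative choice, not taken from the slides:

\text{intra}(C_j) = \frac{1}{|C_j|}\sum_{x \in C_j}\lVert x - \mu_j\rVert \quad (\text{small}), \qquad \text{inter}(C_j, C_l) = \lVert \mu_j - \mu_l\rVert \quad (\text{large for } j \neq l),

where \mu_j denotes the centroid of cluster C_j.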
Existing Clustering Methods
(Figure: DCN [1] diagram with encoder (Enc), decoder (Dec), and clustering applied to the latent code)
Autoencoder-based methods (e.g., DCN, VaDE, DGG) cluster the latent code of an autoencoder, pulling samples of the same cluster closer in the latent space of the AE.
Assumption: the latent code should capture only semantic information from the input.
[1] Towards k-means-friendly spaces: Simultaneous deep learning and clustering, Yang et al., ICML 2017
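For concreteness, a sketch of a DCN-style objective in the spirit of [1]: a reconstruction term plus a k-means-style penalty in the latent space. The notation below is paraphrased rather than copied from the paper:

\min_{\theta,\;\{\mu_k\},\;\{s_i\}} \;\sum_i \Big( \big\lVert x_i - g_\theta\big(f_\theta(x_i)\big)\big\rVert^2 \;+\; \lambda\,\big\lVert f_\theta(x_i) - \mu_{s_i}\big\rVert^2 \Big),

where f_\theta is the encoder, g_\theta the decoder, \mu_{s_i} the centroid assigned to x_i, and \lambda balances reconstruction against clustering.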
Existing Clustering Methods (cont.)
(Figure: IIC [1])
Methods that only use the cluster-assignment probability (e.g., IIC, PICA).
Problem: they may not capture enough useful information from the data, so over-clustering is often required.
[1] Invariant Information Clustering for Unsupervised Image Classification and Segmentation, Ji et al., ICCV 2019
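For comparison, IIC [1] maximizes the mutual information between the cluster assignments of two views of the same image; in sketch form, with \Phi the cluster-assignment network and (x, x') a pair of views:

\max_{\Phi}\; I\big(\Phi(x),\, \Phi(x')\big),

where the joint distribution over cluster pairs is estimated by averaging \Phi(x_i)\,\Phi(x_i')^{\top} over a batch.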
Motivation
• We need a method that can model the cluster-level and the instance-level semantics.
• The InfoMax/Contrastive Learning principle can be applied to this scenario.
Overview of InfoMax/Contrastive Learning
• A principle for learning view-invariant representations; these representations often capture the data semantics.
• The idea is to maximize the mutual information (MI) between two different views.
• Since direct computation of the MI is hard, we maximize a variational lower bound of it instead.
The InfoNCE bound
• InfoNCE [1] is a lower bound of the MI.
• It is biased but has low variance.
• Maximizing InfoNCE is equivalent to minimizing a contrastive loss (written out below).
[1] On Variational Bounds of Mutual Information, Poole et al., ICML 2019
A "critic" function (written f below) measures the similarity between the two views.
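For reference, a standard statement of the bound and the corresponding contrastive loss, following the form used by Poole et al. [1]; here (x_i, y_i) are paired views in a batch of size K and f is the critic:

I(X; Y) \;\ge\; \log K \;+\; \mathbb{E}\!\left[\frac{1}{K}\sum_{i=1}^{K}\log\frac{e^{f(x_i, y_i)}}{\sum_{j=1}^{K} e^{f(x_i, y_j)}}\right],

so maximizing the bound amounts to minimizing the contrastive loss

\mathcal{L}_{\text{contrast}} \;=\; -\frac{1}{K}\sum_{i=1}^{K}\log\frac{e^{f(x_i, y_i)}}{\sum_{j=1}^{K} e^{f(x_i, y_j)}}.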
Contrastive Representation Learning and Clustering (CRLC)
(Figure: for each view, the model outputs an image representation vector and a cluster-assignment probability vector)
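To make the two outputs concrete, a minimal PyTorch sketch of a two-head network in this spirit; the backbone, layer sizes, head names, and default dimensions below are illustrative assumptions, not the authors' exact architecture:

import torch
import torch.nn as nn
import torch.nn.functional as F

class TwoHeadNet(nn.Module):
    """A backbone with a feature head and a cluster head (illustrative sketch)."""
    def __init__(self, backbone, backbone_dim, feat_dim=128, num_clusters=10):
        super().__init__()
        self.backbone = backbone                          # any image encoder
        self.feature_head = nn.Linear(backbone_dim, feat_dim)
        self.cluster_head = nn.Linear(backbone_dim, num_clusters)

    def forward(self, x):
        h = self.backbone(x)                              # shared representation
        z = F.normalize(self.feature_head(h), dim=1)      # image representation vector
        p = F.softmax(self.cluster_head(h), dim=1)        # cluster-assignment probability vector
        return z, p

# Example with a toy backbone on 32x32 RGB images:
# net = TwoHeadNet(nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 512), nn.ReLU()), 512)
# z, p = net(torch.randn(4, 3, 32, 32))   # z: (4, 128), p: (4, 10)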
Training Loss
(Slide equation: the overall training objective, followed by the definitions of its terms)
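As a rough sketch only: assuming the objective combines an InfoNCE-style contrastive term over cluster-assignment probability vectors with a feature-level contrastive term weighted by \lambda (the exact terms, critics, and any regularizers are those defined in the paper, not this sketch):

\mathcal{L} \;=\; \mathcal{L}_{\text{contrast}}^{\text{cluster}} \;+\; \lambda\,\mathcal{L}_{\text{contrast}}^{\text{feature}}, \qquad \mathcal{L}_{\text{contrast}} \;=\; -\frac{1}{K}\sum_{i=1}^{K}\log\frac{e^{f(a_i, b_i)}}{\sum_{j=1}^{K} e^{f(a_i, b_j)}},

where (a_i, b_i) are the two views' head outputs (probability vectors for the cluster term, representation vectors for the feature term) and f is the corresponding critic.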
Choosing an optimal critic
• A critic is optimal if it leads to the tightest InfoNCE bound.
• It can be shown that the optimal critic is determined by the log density ratio between the two views (up to terms that cancel inside InfoNCE).
• In continuous cases, cosine similarity is the optimal critic.
• In discrete cases, the "log-of-dot-product" critic is optimal (both critics are sketched in code below).
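A minimal PyTorch sketch of the two critics named above, plugged into an InfoNCE-style contrastive loss; the epsilon and the temperature in the usage comment are illustrative choices:

import torch
import torch.nn.functional as F

def cosine_critic(u, v):
    """Continuous case: cosine similarity between all pairs of feature vectors."""
    return F.normalize(u, dim=1) @ F.normalize(v, dim=1).T     # (K, K) score matrix

def log_dot_critic(p, q, eps=1e-8):
    """Discrete case: log of the dot product between probability vectors."""
    return torch.log(p @ q.T + eps)                             # (K, K) score matrix

def contrastive_loss(scores):
    """InfoNCE-style loss: the i-th row's positive pair is its diagonal entry."""
    targets = torch.arange(scores.size(0), device=scores.device)
    return F.cross_entropy(scores, targets)

# Usage with K paired views (features z_a, z_b and cluster probabilities p_a, p_b);
# the temperature 0.1 is an illustrative choice:
# loss = contrastive_loss(cosine_critic(z_a, z_b) / 0.1) + contrastive_loss(log_dot_critic(p_a, p_b))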
A Simple Extension to Semi-supervised Learning
Assume that we also have access to some labeled set. The training loss extends the unsupervised objective with a supervised term on the labeled samples (see the sketch below).
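A rough sketch of such a combined objective, assuming a cross-entropy term over the labeled set (denoted D_L here for illustration) with a balancing weight \eta; the exact form is the one given in the paper:

\mathcal{L}_{\text{semi}} \;=\; \mathcal{L}_{\text{CRLC}} \;+\; \eta \cdot \frac{1}{|D_L|}\sum_{(x, y) \in D_L} \mathrm{CE}\big(p(x),\, y\big),

where p(x) is the cluster-assignment probability vector for x and CE is the cross-entropy with the ground-truth label y.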
Results on Clustering
Results w.r.t. different critics
Learned Representation Visualization
(Panels: CRLC vs. SimCLR)
In CRLC, the learned representations are better separated than in SimCLR.
Results on SSL
Comparison with FixMatch
CRLC-semi is much more stable and converges much faster than FixMatch when only a few labeled samples are available.
Thank you for your attention!
