This document presents a non-convex optimization approach to correlation clustering. Correlation clustering partitions the vertices of a graph with positive and negative edge weights so as to maximize the total weight of edges within clusters. Previous approximation algorithms either did not scale or lacked practical implementations. The proposed non-convex relaxation, solved with the Frank-Wolfe algorithm, provides theoretical guarantees while outperforming other methods in runtime and solution quality on synthetic and real-world datasets. The work emphasizes combining the theoretical guarantees of algorithms research with practical implementations and testing on large datasets.
A Non-convex Optimization Approach to Correlation Clustering
1.
A non-convex optimization approach
to correlation clustering
Erik Thiel, Morteza Haghir Chehreghani, Devdatt Dubhashi
Chalmers University of Technology, Sweden
2. Correlation Clustering
• Input: A graph 𝐺 = (𝑉, 𝐸) with positive or
negative weights 𝑤(𝑒), 𝑒 ∈ 𝐸
• Output: A clustering of the vertices to
maximize the sum of the weights of edges
within each cluster.
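The objective above can be sketched in a few lines of Python. This is an illustrative helper, not the authors' code; the function name and edge representation are assumptions.

```python
# Sketch of the correlation clustering objective (illustrative):
# maximize the total weight of edges that stay inside a cluster.

def cc_objective(edges, assignment):
    """edges: list of (u, v, w) with w positive or negative;
    assignment: dict mapping each vertex to a cluster id."""
    return sum(w for u, v, w in edges if assignment[u] == assignment[v])

edges = [(0, 1, 2.0), (1, 2, -1.5), (0, 2, 0.5)]
# Separating vertex 1 keeps only the +0.5 edge within a cluster.
print(cc_objective(edges, {0: 0, 1: 1, 2: 0}))  # 0.5
# Merging everything keeps all three edges: 2.0 - 1.5 + 0.5.
print(cc_objective(edges, {0: 0, 1: 0, 2: 0}))  # 1.0
```

Note that the second clustering scores higher here, illustrating how the optimal number of clusters emerges from the objective itself.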
3. Difference from usual Clustering
• Weights can be positive or negative!
• It is contentious what counts as a ”good”
quality clustering
• But correlation clustering has an
unambiguous objective
• The number of clusters need not be specified;
it emerges from optimizing the objective.
5. Approximation Algorithms
• Bansal, Blum, Chawla (2004): PTAS on
complete graphs
• Charikar, Guruswami, Wirth (2005): APX-hard
on general graphs
• Charikar et al. (2005), Swamy (2004): 0.76
approximation
• Giotis, Guruswami (2006): PTAS with fixed
number of clusters
8. However …
• No implementation, no code …
• Doesn’t work in practice …
9. A Tale of Two Cultures
Algorithms Theory:
• Deep, elegant theory
• “Polynomial time”
• No implementation
• No experiments on data sets
• Does not work in practice or scale
• Beamer/LaTeX
Machine Learning:
• Sometimes theory
• Linear or sub-linear time
• Well-engineered implementation
• Extensive testing on data sets
• Must work in practice, scale to “Big Data”
• Powerpoint
11. Tightness of Relaxation
• The non-convex relaxation is tight: there is no
gap between the continuous and discrete
problems; a simple proof follows by
randomized rounding.
• In contrast, the SDP relaxation is not tight.
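The tightness claim can be illustrated numerically. In the sketch below (an assumed formulation, not the authors' code) each row of X is a point on the probability simplex over k clusters; rounding each vertex independently to cluster k with probability X[u][k] yields a clustering whose expected objective equals the continuous objective, by multilinearity.

```python
import random

# Illustration of tightness: Monte-Carlo average of the rounded objective
# matches the continuous (relaxed) objective value.

edges = [(0, 1, 2.0), (1, 2, -1.5), (0, 2, 0.5)]
X = [[0.7, 0.3], [0.2, 0.8], [0.5, 0.5]]  # fractional cluster memberships

def relaxed(X):
    # sum over edges of w_uv * Pr[u and v land in the same cluster]
    return sum(w * sum(X[u][k] * X[v][k] for k in range(len(X[0])))
               for u, v, w in edges)

def round_once(X):
    # assign each vertex to a cluster drawn from its row of X
    labels = [random.choices(range(len(row)), weights=row)[0] for row in X]
    return sum(w for u, v, w in edges if labels[u] == labels[v])

random.seed(0)
samples = [round_once(X) for _ in range(20000)]
print(relaxed(X))                   # continuous objective value
print(sum(samples) / len(samples))  # rounded objective, close in expectation
```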
17. Non-convex Convergence Theory
• For a differentiable (but not necessarily
convex) function, the convergence rate of FW
is 𝑂(1/√𝑇).
• If the function is multilinear, the convergence
rate is 𝑂(1/𝑇).
• Note that our correlation clustering objective
is indeed multilinear!
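A minimal Frank-Wolfe sketch for the multilinear relaxation f(X) = Σ_(u,v) w_uv ⟨X_u, X_v⟩, with each row of X on the simplex. The function names, step-size schedule, and example graph are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def frank_wolfe(W, k, iters=200, seed=0):
    """W: symmetric (n, n) signed weight matrix with zero diagonal.
    Hypothetical FW sketch; maximizes sum_{u,v} W[u,v] <X_u, X_v> / 2."""
    rng = np.random.default_rng(seed)
    n = W.shape[0]
    X = rng.dirichlet(np.ones(k), size=n)           # random start in the domain
    for t in range(iters):
        grad = W @ X                                # grad[u, c] = sum_v W[u,v] X[v, c]
        S = np.zeros_like(X)                        # linear maximizer over the
        S[np.arange(n), grad.argmax(axis=1)] = 1.0  # product of simplices
        gamma = 2.0 / (t + 2)                       # standard decaying step size
        X = X + gamma * (S - X)
    return X.argmax(axis=1)                         # read off a clustering

# Two positively tied pairs joined by negative cross edges.
W = np.array([[ 0.,  1., -1., -1.],
              [ 1.,  0., -1., -1.],
              [-1., -1.,  0.,  1.],
              [-1., -1.,  1.,  0.]])
labels = frank_wolfe(W, k=2)
print(labels)
```

The linear minimization oracle over a product of simplices is just a per-row argmax, which is what makes each FW iteration cheap here.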
19. Synthetic Data: Generative Model
• Planted model with k clusters and noise p
• With probability (1 − p), an edge within a
cluster gets a high positive weight and an edge
across clusters a high negative weight; with
probability p, the edge gets an arbitrary weight
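The planted model above might be generated as follows. This is a hypothetical generator: the weight scale `hi` and the uniform noise distribution are illustrative guesses, not parameters stated in the slides.

```python
import random

def planted_graph(n, k, p, hi=10.0, seed=0):
    """Planted model: k hidden clusters, noise level p."""
    rng = random.Random(seed)
    truth = [rng.randrange(k) for _ in range(n)]   # hidden ground-truth clustering
    edges = []
    for u in range(n):
        for v in range(u + 1, n):
            if rng.random() < p:
                w = rng.uniform(-hi, hi)           # noisy, arbitrary weight
            elif truth[u] == truth[v]:
                w = hi                             # strong "same cluster" signal
            else:
                w = -hi                            # strong "different cluster" signal
            edges.append((u, v, w))
    return truth, edges

truth, edges = planted_graph(n=6, k=2, p=0.1)
print(len(edges))  # 15 edges for n = 6 (one per vertex pair)
```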
21. SDP yields very slow and low-quality results:
e.g., 15 hours vs. a couple of seconds for n = 200.
See also [Elsner and Schudy 2009]
29. Correlation Clustering Colours
• Vertices are the Munsell tiles
• The edge between tiles x and y has weight
sim(x, y) − 1/2, where sim is the CIELAB
similarity (between 0 and 1).
• Thus edges between similar tiles have positive
weights and edges between dissimilar tiles
have negative weights.
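The shift by 1/2 is what turns a similarity into a signed weight. A tiny sketch (the tile names and similarity values are made up for illustration):

```python
# Similarities in [0, 1] shifted by 1/2: sim > 0.5 gives a positive edge
# (attract), sim < 0.5 a negative edge (repel).
sim = {("tile_a", "tile_b"): 0.75, ("tile_a", "tile_c"): 0.25}
weights = {pair: s - 0.5 for pair, s in sim.items()}
print(weights)  # {('tile_a', 'tile_b'): 0.25, ('tile_a', 'tile_c'): -0.25}
```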
31. Summary
• A non-convex relaxation solved with Frank-
Wolfe yields an algorithm with guarantees
that handily beats all other methods in both
runtime and quality.
• Combine the theory and rigour of algorithms
research with well-engineered
implementations and extensive testing on
data.