1. Mikael Kågebäck, Chalmers CSE, 11/5/2018
Credit: Aldebaran
Correlation Clustering: A Tale of Two Cultures
Erik Thiel, Morteza Chehreghani, Devdatt Dubhashi
Chalmers University of Technology, Sweden
4. Colour Partitions
• Ask human subjects in a language group to label tiles with colour terms, then aggregate all results into a partition for that language.
• Take the CIELAB colour coordinates to define a similarity between colour tiles and form a partition based on these similarities.
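The second bullet can be sketched in code. A minimal example, assuming Euclidean distance in CIELAB space and a simple linear threshold to turn distances into signed weights (the threshold `tau` and the linear form are illustrative assumptions, not the paper's construction):

```python
import numpy as np

def signed_similarities(lab, tau=30.0):
    """Turn CIELAB coordinates into signed edge weights: small CIELAB
    (Euclidean) distance -> positive weight, large distance -> negative.
    tau is an illustrative cutoff, not taken from the talk."""
    lab = np.asarray(lab, dtype=float)
    d = np.linalg.norm(lab[:, None, :] - lab[None, :, :], axis=-1)
    W = tau - d                    # > 0 for similar tiles, < 0 for dissimilar
    np.fill_diagonal(W, 0.0)       # no self-edges
    return W

# three tiles: two reds close together in CIELAB, one distant blue-ish tile
tiles = [[50, 60, 40], [52, 58, 42], [40, -30, -40]]
W = signed_similarities(tiles)
```

Signed weights of this kind are exactly the input that correlation clustering (next slides) expects.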
6. Regier T, Kemp C, Kay P. 2015. Word meanings across languages support efficient communication. In The Handbook of Language Emergence, ed. B MacWhinney, W O'Grady.
7. Correlation Clustering
• Input: Graph G = (V, E) with positive or negative weights w(e), e ∈ E.
• Output: A clustering of the vertices that maximizes the sum of the weights of edges within each cluster.
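The objective is easy to state in code. A minimal sketch, assuming the graph is given as a symmetric weight matrix (the helper name `cc_objective` is hypothetical, not from the talk):

```python
import numpy as np

def cc_objective(W, labels):
    """Sum of edge weights within clusters -- the quantity correlation
    clustering maximizes. W: symmetric (n x n) weight matrix with
    positive/negative entries and zero diagonal; labels: cluster id
    per vertex. Illustrative helper, not from the talk."""
    labels = np.asarray(labels)
    same = labels[:, None] == labels[None, :]   # True where i, j share a cluster
    return W[np.triu(same, k=1)].sum()          # count each undirected edge once

# toy graph: vertices 0 and 1 attract (+2), both repel vertex 2
W = np.array([[0.,  2., -1.],
              [2.,  0., -3.],
              [-1., -3., 0.]])
print(cc_objective(W, [0, 0, 1]))   # 2.0: only the +2 edge lies within a cluster
```

Note that negative cross-cluster edges simply drop out of the sum, so cutting them never hurts the objective.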
9. Difference from Usual Clustering
• Weights can be positive or negative!
• In usual clustering, what counts as "good" quality is contentious.
• In correlation clustering there is an unambiguous objective.
• The number of clusters need not be specified; it emerges from optimizing the objective.
11. Approximation Algorithms
• Bansal, Blum, Chawla (2004): PTAS on complete graphs
• Charikar, Guruswami, Wirth (2005): APX-hard on general graphs
• Charikar et al. (2005), Swamy (2004): 0.76 approximation
• Giotis, Guruswami (2006): PTAS with a fixed number of clusters
14. However …
• No implementation, no code …
• Doesn’t work in practice …
15. A Tale of Two Cultures
Algorithms Theory:
• Deep elegant theory
• "Polynomial time"
• No implementation
• No experiments on data sets
• Does not work in practice or scale
• Beamer/LaTeX
Machine Learning:
• Sometimes theory
• Linear or sub-linear
• Well engineered implementation
• Extensive testing on data sets
• Must work in practice, scale to "Big Data"
• Powerpoint
17. Tightness of Relaxation
• The non-convex relaxation is tight: there is no gap between the continuous and discrete problems, with a simple proof by randomized rounding.
• In contrast, the SDP relaxation is not tight.
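The randomized rounding behind the tightness proof is simple. A sketch, assuming the relaxation parameterises each vertex by a probability distribution over k clusters (row-stochastic matrix X); the function name is hypothetical:

```python
import numpy as np

def round_assignment(X, rng):
    """Sample a hard clustering from soft assignments X (each row a
    distribution over k clusters). Because the relaxed objective is
    multilinear in X, the expected objective of the rounded solution
    equals the continuous value -- the essence of the no-gap argument.
    Sketch under the row-stochastic parameterisation, an assumption."""
    n, k = X.shape
    return np.array([rng.choice(k, p=X[i]) for i in range(n)])

rng = np.random.default_rng(0)
X = np.array([[1.0, 0.0],      # vertex 0 fully in cluster 0
              [0.0, 1.0]])     # vertex 1 fully in cluster 1
print(round_assignment(X, rng))  # [0 1]: one-hot rows round deterministically
```

On fractional rows the sample is random, but linearity of expectation in each row carries the continuous objective value over to the discrete one.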
20. Non-convex Convergence Theory
• For a differentiable (but not necessarily convex) function, the FW algorithm converges in O(1/√T) steps.
• If the function is multilinear, then it converges in O(1/T) steps.
• Note that our correlation clustering objective is indeed multilinear!
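A Frank-Wolfe iteration on this relaxation is a few lines: compute the gradient, call the linear maximization oracle (here, the best vertex of each row's simplex), and take a convex-combination step. A minimal sketch of the approach, assuming the relaxed objective f(X) = Σ_{i<j} w_ij ⟨x_i, x_j⟩ over row-stochastic X; the step size and initialisation are generic FW defaults, not necessarily the paper's settings:

```python
import numpy as np

def frank_wolfe_cc(W, k, T=200, seed=0):
    """Frank-Wolfe on the relaxed correlation clustering objective
    f(X) = sum_{i<j} w_ij <x_i, x_j>, X an (n x k) row-stochastic soft
    assignment. Sketch only; hyperparameters are assumptions."""
    rng = np.random.default_rng(seed)
    n = W.shape[0]
    X = rng.dirichlet(np.ones(k), size=n)        # random soft assignment
    for t in range(T):
        G = W @ X                                 # gradient: df/dx_i = sum_j w_ij x_j
        S = np.zeros_like(X)                      # linear maximization oracle:
        S[np.arange(n), G.argmax(axis=1)] = 1.0   # best corner of each simplex
        gamma = 2.0 / (t + 2.0)                   # standard FW step size
        X = X + gamma * (S - X)                   # stay inside the feasible set
    return X.argmax(axis=1)                       # round to a hard clustering
```

Because the oracle is just a per-row argmax, each iteration costs one matrix product, which is what makes the method scale.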
22. Synthetic Data: Generative Model
• Planted model with k clusters and noise level p.
• With probability 1 − p, an edge gets a high positive weight within a cluster and a high negative weight across clusters; with probability p, it gets an arbitrary weight.
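The generative model above can be sketched directly. A minimal version, assuming "high weight" means ±w and "arbitrary" means uniform in [−w, w] (parameter names and these choices are illustrative, not from the paper):

```python
import numpy as np

def planted_weights(n, k, p, w=5.0, seed=0):
    """Planted-partition weight matrix: with prob. 1-p an edge gets +w
    within a cluster and -w across; with prob. p an arbitrary weight,
    here uniform in [-w, w] (an assumption). Returns the symmetric
    matrix and the ground-truth labels."""
    rng = np.random.default_rng(seed)
    labels = rng.integers(k, size=n)              # planted ground truth
    same = labels[:, None] == labels[None, :]
    W = np.where(same, w, -w).astype(float)       # noiseless signal
    noise = rng.random((n, n)) < p                # corrupt entries with prob. p
    W[noise] = rng.uniform(-w, w, size=noise.sum())
    W = np.triu(W, 1)                             # keep one copy per edge,
    return W + W.T, labels                        # symmetrize, zero diagonal
```

Sweeping p from 0 toward 1 interpolates between a trivially recoverable instance and pure noise, which is what the experiments vary.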
32. Summary
• The non-convex relaxation solved with Frank-Wolfe yields an algorithm with guarantees that beats all other methods handily in both runtime and quality.
• Combine the theory and rigour of algorithms research with the engineering of good implementations and extensive testing on data.