SlideShare a Scribd company logo
Cluster Labeling with Double
Application of SOM

http://www.ad-exchange.fr/

Vahid Moosavi
Researcher at Future Cities Laboratory
PhD Student at Chair for Computer Aided Architectural Design (CAAD), ETH Zurich,
Ludger Hovestadt

1 November 2013

SEC

svm@arch.ethz.ch
Outline

•
•
•
•

How to explain clustering and clusters?
Cluster labeling problem
Current methods and the proposed method
The Case: Finding Thematic Research Areas within FCL
How to explain clustering and clusters?
• Conceptually Clusters are
–
–
–
–

Meant to show temporal and evolving identities
To show emergent concepts from lower level concepts
Bottom up from instances Vs. external references
Decoupling identities from objects, things and instances and to
create new dimensions in between
•
•
•
•

Brands,
Spoken languages and dialects
Academic disciplines
Genres of movies, music classes , ….
3
How to explain clustering and clusters?
• And Technically Clusters
– Show the direction of eigenvectors of data matrix
– Are the result of transformations, which can be linear or nonlinear

• And what is happening with new phenomena like Big Data and information
sharing is that externally defined (some times imposed!) references are not
sufficient any more.
– Academic disciplines
– Software industries : Google Android Vs. Microsoft windows XP!
The goal: How to make the process of clustering and concept generation,
computationally practical for individuals?
4
Cluster labeling problem
• Topic Modeling in Natural Language
Processing
– Document Clustering
– Automatic Sentiment Analysis

• Market Segmentation and Customer
Clustering
– CRM data
– City Call center Data (Mood of the City)

• Enterprise Knowledge Modeling
Using Text Archives
Topic Modeling in Natural Language Processing
The Expression of Emotions in 20th Century Books
Alberto Acerbi, et. al. 2013

6
Topic Modeling in Natural Language Processing
The Expression of Emotions in 20th Century Books
Alberto Acerbi, et. al. 2013

7
Clustering and Cluster Labeling
In terms of Geography-Andre Skupin (2005)
Clustering and Cluster Labeling
A Semantic
Landscape of the
Last.fm Music
Current Methods

• Differential Cluster Labeling
– Mutual Information
– Chi-Squared Selection

• Cluster-Internal Labeling
– Centroid Labels
– Title Labels
– External knowledge labels
The Proposed Method

• Use of SOM as a nonlinear Data-Clustering (transformation)
and visualization technique
• Use of the concept of tensor to produce required data
matrices for SOM
The Proposed Method

Tensors
(multi-aspect data representation)
Aspect A Features

Objects

Objects

Aspect A Features

• Wavelet Decomposition
• One original object (one signal ) is
decomposed to several aspects (different
scales or frequencies)

13
The Proposed Method

SOM
(as a nonlinear data transformation: here used for clustering and visualization)
SOM is a Generic Machine works normally with Matrices of
data

10 records, 100+ dimensions
200+ records, 100+ dimensions
But with clear clusters

200+ records, 100+ dimensions

15
The Proposed Method
Features

Clusters Vector

Y

Tensor

Objects

Objects

X

SOM Clustering

Original Data set

A second Order Tensor
Clusters Vector

Features

Objects

Features

Z

SOM

Visualization of the
main concepts
(potential labels) within
each cluster
The Case: Finding Thematic Research Areas within FCL

17
Finding Thematic Research Areas within FCL

Each row vector shows one
persons interest related to
those selected features

Objects

Features

X
18
Finding Thematic Research Areas within FCL

First plot
(each curve is one person)

19
Finding Thematic Research Areas within FCL

SOM

20
Finding Thematic Research Areas within FCL
SOM + K means clustering

Now…
What are the main
concepts within
each cluster?
How to label these
clusters?

Objects

Clusters Vector

Y

5 clusters detected
21
Finding Thematic Research Areas within FCL

Tensor based transformation

Y

Clusters Vector

Features

Objects

Clusters Vector

Objects

Features

X

Z
A simple visualization of each cluster
regarding to all the features

22
Finding Thematic Research Areas within FCL

Tensor based transformation
+ another SOM

23
Finding Thematic Research Areas within FCL

Tensor based transformation
+ another SOM

24
Finding Thematic Research Areas within FCL

Tensor based transformation
+ another SOM

25
Finding Thematic Research Areas within FCL

Tensor based transformation
+ another SOM

26
Finding Thematic Research Areas within FCL

Tensor based transformation
+ another SOM

27
Finding Thematic Research Areas within FCL

Tensor based transformation
+ another SOM

28
Thanks!

More Related Content

Viewers also liked

System identification and Surrogate Modeling
System identification and Surrogate ModelingSystem identification and Surrogate Modeling
System identification and Surrogate Modeling
Vahid Moosavi
 
On Optimization Problems
On Optimization ProblemsOn Optimization Problems
On Optimization Problems
Vahid Moosavi
 
Cultivation presentation
Cultivation presentationCultivation presentation
Cultivation presentation
charlottesweb23
 
Bi g data_urban modeling_21082013
Bi g data_urban modeling_21082013Bi g data_urban modeling_21082013
Bi g data_urban modeling_21082013Vahid Moosavi
 
ResilientCities
ResilientCitiesResilientCities
ResilientCities
Lucia Tilio
 
Pre-Specific Modeling - Computational Machines in Coexistence with Urban Data...
Pre-Specific Modeling - Computational Machines in Coexistence with Urban Data...Pre-Specific Modeling - Computational Machines in Coexistence with Urban Data...
Pre-Specific Modeling - Computational Machines in Coexistence with Urban Data...
Vahid Moosavi
 

Viewers also liked (6)

System identification and Surrogate Modeling
System identification and Surrogate ModelingSystem identification and Surrogate Modeling
System identification and Surrogate Modeling
 
On Optimization Problems
On Optimization ProblemsOn Optimization Problems
On Optimization Problems
 
Cultivation presentation
Cultivation presentationCultivation presentation
Cultivation presentation
 
Bi g data_urban modeling_21082013
Bi g data_urban modeling_21082013Bi g data_urban modeling_21082013
Bi g data_urban modeling_21082013
 
ResilientCities
ResilientCitiesResilientCities
ResilientCities
 
Pre-Specific Modeling - Computational Machines in Coexistence with Urban Data...
Pre-Specific Modeling - Computational Machines in Coexistence with Urban Data...Pre-Specific Modeling - Computational Machines in Coexistence with Urban Data...
Pre-Specific Modeling - Computational Machines in Coexistence with Urban Data...
 

Similar to Cluster labeling fcl_weeklymeeting30102013

introduction to pattern recognition
introduction to pattern recognitionintroduction to pattern recognition
introduction to pattern recognition
MohammadMoattar2
 
M01_OO_Intro.ppt
M01_OO_Intro.pptM01_OO_Intro.ppt
M01_OO_Intro.ppt
RojaPogul1
 
Courzon iskouk09 jun2010
Courzon iskouk09 jun2010Courzon iskouk09 jun2010
Courzon iskouk09 jun2010
ISKOUK
 
social.pptx
social.pptxsocial.pptx
social.pptx
NISHASOMSCS113
 
Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)
Kira
 
Learn about Your Location (Using ALL Your Data)
Learn about Your Location (Using ALL Your Data)Learn about Your Location (Using ALL Your Data)
Learn about Your Location (Using ALL Your Data)
Commonwealth Computer Research, Inc.
 
Semantic technology in nutshell 2013. Semantic! are you a linguist?
Semantic technology in nutshell 2013. Semantic! are you a linguist?Semantic technology in nutshell 2013. Semantic! are you a linguist?
Semantic technology in nutshell 2013. Semantic! are you a linguist?
Heimo Hänninen
 
Linked Open Data Visualization
Linked Open Data VisualizationLinked Open Data Visualization
Linked Open Data Visualization
Laura Po
 
2015 10-08 - additive manufacturing software 1
2015 10-08 - additive manufacturing software  12015 10-08 - additive manufacturing software  1
2015 10-08 - additive manufacturing software 1
Biofabrication Group at University of Pisa
 
Editors l21 l24
Editors l21 l24Editors l21 l24
Editors l21 l24
Neha Pachauri
 
Building and using ontologies (2015)
Building and using ontologies (2015)Building and using ontologies (2015)
Building and using ontologies (2015)
Elena Simperl
 
FRBR and TMS: Applying a Conceptual Organizational Model for Cataloguing Pho...
FRBR and TMS:Applying a Conceptual Organizational Model for Cataloguing Pho...FRBR and TMS:Applying a Conceptual Organizational Model for Cataloguing Pho...
FRBR and TMS: Applying a Conceptual Organizational Model for Cataloguing Pho...
Sarah Gillis
 
Hierarchical clustering in Python and beyond
Hierarchical clustering in Python and beyondHierarchical clustering in Python and beyond
Hierarchical clustering in Python and beyond
Frank Kelly
 
Information theoritic analysis of entity dynamics on the linked open data cloud
Information theoritic analysis of entity dynamics on the linked open data cloudInformation theoritic analysis of entity dynamics on the linked open data cloud
Information theoritic analysis of entity dynamics on the linked open data cloud
MOVING Project
 
RDF Data and Image Annotations in ResearchSpace (slides)
RDF Data and Image Annotations in ResearchSpace (slides)RDF Data and Image Annotations in ResearchSpace (slides)
RDF Data and Image Annotations in ResearchSpace (slides)
Vladimir Alexiev, PhD, PMP
 
Improving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log AnalysisImproving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log Analysis
Stuart Wrigley
 
Bekas for cognitive_speaker_series
Bekas for cognitive_speaker_seriesBekas for cognitive_speaker_series
Bekas for cognitive_speaker_series
diannepatricia
 
Bekas for cognitive_speaker_series
Bekas for cognitive_speaker_seriesBekas for cognitive_speaker_series
Bekas for cognitive_speaker_series
diannepatricia
 
MIT302 Lesson 2_Advanced Database Systems.pptx
MIT302 Lesson 2_Advanced Database Systems.pptxMIT302 Lesson 2_Advanced Database Systems.pptx
MIT302 Lesson 2_Advanced Database Systems.pptx
elsagalgao
 

Similar to Cluster labeling fcl_weeklymeeting30102013 (20)

introduction to pattern recognition
introduction to pattern recognitionintroduction to pattern recognition
introduction to pattern recognition
 
M01_OO_Intro.ppt
M01_OO_Intro.pptM01_OO_Intro.ppt
M01_OO_Intro.ppt
 
Courzon iskouk09 jun2010
Courzon iskouk09 jun2010Courzon iskouk09 jun2010
Courzon iskouk09 jun2010
 
social.pptx
social.pptxsocial.pptx
social.pptx
 
Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)
 
Learn about Your Location (Using ALL Your Data)
Learn about Your Location (Using ALL Your Data)Learn about Your Location (Using ALL Your Data)
Learn about Your Location (Using ALL Your Data)
 
Semantic technology in nutshell 2013. Semantic! are you a linguist?
Semantic technology in nutshell 2013. Semantic! are you a linguist?Semantic technology in nutshell 2013. Semantic! are you a linguist?
Semantic technology in nutshell 2013. Semantic! are you a linguist?
 
Linked Open Data Visualization
Linked Open Data VisualizationLinked Open Data Visualization
Linked Open Data Visualization
 
2015 10-08 - additive manufacturing software 1
2015 10-08 - additive manufacturing software  12015 10-08 - additive manufacturing software  1
2015 10-08 - additive manufacturing software 1
 
Editors l21 l24
Editors l21 l24Editors l21 l24
Editors l21 l24
 
Building and using ontologies (2015)
Building and using ontologies (2015)Building and using ontologies (2015)
Building and using ontologies (2015)
 
FRBR and TMS: Applying a Conceptual Organizational Model for Cataloguing Pho...
FRBR and TMS:Applying a Conceptual Organizational Model for Cataloguing Pho...FRBR and TMS:Applying a Conceptual Organizational Model for Cataloguing Pho...
FRBR and TMS: Applying a Conceptual Organizational Model for Cataloguing Pho...
 
Hierarchical clustering in Python and beyond
Hierarchical clustering in Python and beyondHierarchical clustering in Python and beyond
Hierarchical clustering in Python and beyond
 
Information theoritic analysis of entity dynamics on the linked open data cloud
Information theoritic analysis of entity dynamics on the linked open data cloudInformation theoritic analysis of entity dynamics on the linked open data cloud
Information theoritic analysis of entity dynamics on the linked open data cloud
 
RDF Data and Image Annotations in ResearchSpace (slides)
RDF Data and Image Annotations in ResearchSpace (slides)RDF Data and Image Annotations in ResearchSpace (slides)
RDF Data and Image Annotations in ResearchSpace (slides)
 
Improving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log AnalysisImproving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log Analysis
 
Lecture 1 (bce-7)
Lecture   1 (bce-7)Lecture   1 (bce-7)
Lecture 1 (bce-7)
 
Bekas for cognitive_speaker_series
Bekas for cognitive_speaker_seriesBekas for cognitive_speaker_series
Bekas for cognitive_speaker_series
 
Bekas for cognitive_speaker_series
Bekas for cognitive_speaker_seriesBekas for cognitive_speaker_series
Bekas for cognitive_speaker_series
 
MIT302 Lesson 2_Advanced Database Systems.pptx
MIT302 Lesson 2_Advanced Database Systems.pptxMIT302 Lesson 2_Advanced Database Systems.pptx
MIT302 Lesson 2_Advanced Database Systems.pptx
 

More from Vahid Moosavi

Toward of a Theory of Modeling
Toward of a Theory of ModelingToward of a Theory of Modeling
Toward of a Theory of Modeling
Vahid Moosavi
 
Markovian Modeling of Urban Traffic Flows in Coexistence With Urban Data Streams
Markovian Modeling of Urban Traffic Flows in Coexistence With Urban Data StreamsMarkovian Modeling of Urban Traffic Flows in Coexistence With Urban Data Streams
Markovian Modeling of Urban Traffic Flows in Coexistence With Urban Data Streams
Vahid Moosavi
 
Finding Candidate Locations for Aerosol Pollution Monitoring at Street Level ...
Finding Candidate Locations for Aerosol Pollution Monitoring at Street Level ...Finding Candidate Locations for Aerosol Pollution Monitoring at Street Level ...
Finding Candidate Locations for Aerosol Pollution Monitoring at Street Level ...
Vahid Moosavi
 
Modeling in Coexistence with Data: Toward a Generic Notion of Space
Modeling in Coexistence with Data: Toward a Generic Notion of SpaceModeling in Coexistence with Data: Toward a Generic Notion of Space
Modeling in Coexistence with Data: Toward a Generic Notion of Space
Vahid Moosavi
 
Data Driven Modeling Beyond Idealization
Data Driven Modeling Beyond IdealizationData Driven Modeling Beyond Idealization
Data Driven Modeling Beyond Idealization
Vahid Moosavi
 
Bi g data_urban modeling_applications_23092013
Bi g data_urban modeling_applications_23092013Bi g data_urban modeling_applications_23092013
Bi g data_urban modeling_applications_23092013Vahid Moosavi
 

More from Vahid Moosavi (6)

Toward of a Theory of Modeling
Toward of a Theory of ModelingToward of a Theory of Modeling
Toward of a Theory of Modeling
 
Markovian Modeling of Urban Traffic Flows in Coexistence With Urban Data Streams
Markovian Modeling of Urban Traffic Flows in Coexistence With Urban Data StreamsMarkovian Modeling of Urban Traffic Flows in Coexistence With Urban Data Streams
Markovian Modeling of Urban Traffic Flows in Coexistence With Urban Data Streams
 
Finding Candidate Locations for Aerosol Pollution Monitoring at Street Level ...
Finding Candidate Locations for Aerosol Pollution Monitoring at Street Level ...Finding Candidate Locations for Aerosol Pollution Monitoring at Street Level ...
Finding Candidate Locations for Aerosol Pollution Monitoring at Street Level ...
 
Modeling in Coexistence with Data: Toward a Generic Notion of Space
Modeling in Coexistence with Data: Toward a Generic Notion of SpaceModeling in Coexistence with Data: Toward a Generic Notion of Space
Modeling in Coexistence with Data: Toward a Generic Notion of Space
 
Data Driven Modeling Beyond Idealization
Data Driven Modeling Beyond IdealizationData Driven Modeling Beyond Idealization
Data Driven Modeling Beyond Idealization
 
Bi g data_urban modeling_applications_23092013
Bi g data_urban modeling_applications_23092013Bi g data_urban modeling_applications_23092013
Bi g data_urban modeling_applications_23092013
 

Recently uploaded

Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
Peter Spielvogel
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.
ViralQR
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 

Recently uploaded (20)

Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 

Cluster labeling fcl_weeklymeeting30102013

  • 1. Cluster Labeling with Double Application of SOM http://www.ad-exchange.fr/ Vahid Moosavi Researcher at Future Cities Laboratory PhD Student at Chair for Computer Aided Architectural Design (CAAD), ETH Zurich, Ludger Hovestadt 1 November 2013 SEC svm@arch.ethz.ch
  • 2. Outline • • • • How to explain clustering and clusters? Cluster labeling problem Current methods and the proposed method The Case: Finding Thematic Research Areas within FCL
  • 3. How to explain clustering and clusters? • Conceptually Clusters are – – – – Meant to show temporal and evolving identities To show emergent concepts from lower level concepts Bottom up from instances Vs. external references Decoupling identities from objects, things and instances and to create new dimensions in between • • • • Brands, Spoken languages and dialects Academic disciplines Genres of movies, music classes , …. 3
  • 4. How to explain clustering and clusters? • And Technically Clusters – Show the direction of eigenvectors of data matrix – Are the result of transformations, which can be linear or nonlinear • And what is happening with new phenomena like Big Data and information sharing is that externally defined (some times imposed!) references are not sufficient any more. – Academic disciplines – Software industries : Google Android Vs. Microsoft windows XP! The goal: How to make the process of clustering and concept generation, computationally practical for individuals? 4
  • 5. Cluster labeling problem • Topic Modeling in Natural Language Processing – Document Clustering – Automatic Sentiment Analysis • Market Segmentation and Customer Clustering – CRM data – City Call center Data (Mood of the City) • Enterprise Knowledge Modeling Using Text Archives
  • 6. Topic Modeling in Natural Language Processing The Expression of Emotions in 20th Century Books Alberto Acerbi, et. al. 2013 6
  • 7. Topic Modeling in Natural Language Processing The Expression of Emotions in 20th Century Books Alberto Acerbi, et. al. 2013 7
  • 8. Clustering and Cluster Labeling In terms of Geography-Andre Skupin (2005)
  • 9. Clustering and Cluster Labeling A Semantic Landscape of the Last.fm Music
  • 10. Current Methods • Differential Cluster Labeling – Mutual Information – Chi-Squared Selection • Cluster-Internal Labeling – Centroid Labels – Title Labels – External knowledge labels
  • 11. The Proposed Method • Use of SOM as a nonlinear Data-Clustering (transformation) and visualization technique • Use of the concept of tensor to produce required data matrices for SOM
  • 13. Aspect A Features Objects Objects Aspect A Features • Wavelet Decomposition • One original object (one signal ) is decomposed to several aspects (different scales or frequencies) 13
  • 14. The Proposed Method SOM (as a nonlinear data transformation: here used for clustering and visualization)
  • 15. SOM is a Generic Machine works normally with Matrices of data 10 records, 100+ dimensions 200+ records, 100+ dimensions But with clear clusters 200+ records, 100+ dimensions 15
  • 16. The Proposed Method Features Clusters Vector Y Tensor Objects Objects X SOM Clustering Original Data set A second Order Tensor Clusters Vector Features Objects Features Z SOM Visualization of the main concepts (potential labels) within each cluster
  • 17. The Case: Finding Thematic Research Areas within FCL 17
  • 18. Finding Thematic Research Areas within FCL Each row vector shows one persons interest related to those selected features Objects Features X 18
  • 19. Finding Thematic Research Areas within FCL First plot (each curve is one person) 19
  • 20. Finding Thematic Research Areas within FCL SOM 20
  • 21. Finding Thematic Research Areas within FCL SOM + K means clustering Now… What are the main concepts within each cluster? How to label these clusters? Objects Clusters Vector Y 5 clusters detected 21
  • 22. Finding Thematic Research Areas within FCL Tensor based transformation Y Clusters Vector Features Objects Clusters Vector Objects Features X Z A simple visualization of each cluster regarding to all the features 22
  • 23. Finding Thematic Research Areas within FCL Tensor based transformation + another SOM 23
  • 24. Finding Thematic Research Areas within FCL Tensor based transformation + another SOM 24
  • 25. Finding Thematic Research Areas within FCL Tensor based transformation + another SOM 25
  • 26. Finding Thematic Research Areas within FCL Tensor based transformation + another SOM 26
  • 27. Finding Thematic Research Areas within FCL Tensor based transformation + another SOM 27
  • 28. Finding Thematic Research Areas within FCL Tensor based transformation + another SOM 28

Editor's Notes

  1. CRM Data