SlideShare a Scribd company logo
1 of 23
Download to read offline
Social event detection using multimodal
clustering and integrating supervisory signals
Georgios Petkos, Symeon Papadopoulos, Yiannis Kompatsiaris
Centre for Research and Technology Hellas, Information Technologies Institute (CERTH-ITI)




ACM International Conference on Multimedia Retrieval
Hong Kong, Jun 5-8, 2012
Social Events in Multimedia
Event detection in multimedia:
• Real-world events  attendants taking photos 
  captured photos are shared in social networks
• Multimedia collection  find groups of images
  depicting the real-world events

 soccer




  music



 mklab.iti.gr           #2              socialsensor.eu
Problem Setting & Formulation

• Collection of images + metadata
   – Metadata typically include tags, geotagging information,
     timestamp, owner
   – Metadata can be noisy or missing
   – A set of feature vectors can be extracted from each image
     and its metadata
• Problem:
   – Find groups of images such that each group depict a
     unique social event

         Essentially, an image clustering problem.

 mklab.iti.gr                #3                  socialsensor.eu
The Role of Different Features

• Visual similarity: Images look similar

• Spatial-temporal context: Images were captured at
  approximately the same location and time

• Tags: Users have annotated images using similar tags

• Same owner: Photos captured by the same person

     PROBLEM: We don’t know what matters most

 mklab.iti.gr             #4               socialsensor.eu
Heuristics-based Approaches
• Rely on online sources and text metadata [Ruocco &
  Ramampiaro, 2011; Liu et al., 2011b]
   – structured data about events may not be available in
     online sources
   – for many images, text metadata can be of low quality
• Use heuristics [Liu et al., 2011a; Papadopoulos et al., 2011]
  (e.g. “all photos taken by the same user at the same day 
  same event”)
   – such heuristics are manually constructed in ad hoc ways




 mklab.iti.gr                 #5                  socialsensor.eu
Multimodal Clustering Approaches
Existing approaches:
• May utilize early/late fusion strategies. The final result will
  depend heavily on the fusion weights [Cai et al., 2011]
   – It may be difficult to determine appropriate weights, either manually
     or using a search procedure.
• May attempt to estimate generative models or minimize the
  disagreement between the clusterings according to different
  modalities [Bekkerman & Jeon, 2007; Khalidov et al., 2011]
   – Some modalities are more important than others when desired
     clusters correspond to specific concepts.

  In order to create clusters that correspond to semantically
  different concepts, will require putting more emphasis on the
  appropriate features.


 mklab.iti.gr                     #6                       socialsensor.eu
Baseline Multimodal Clustering




                early fusion




 mklab.iti.gr        #7          socialsensor.eu
Rationale of Proposed Approach

• What if during the clustering procedure we take into
  account a relevant example clustering?

• This would essentially integrate a supervisory signal
  in the multimodal clustering procedure.
  How to do this?

• Essentially, we want to define what it means for two
  items expressed in multiple modalities to belong in
  the same cluster, and then, try to learn this from
  example clusterings.

 mklab.iti.gr             #8                socialsensor.eu
Proposed Approach
1. For the items in the input clustering for our task, compute the
   distances between all pairs of items for all modalities.
2. For each pair of items compile the distances (for all
   modalities) in a vector. For pairs of items, assign a +ve label
   (same cluster) and –ve (different cluster)
3. Train a classifier to predict a “same cluster” relationship for
   pairs of items.
4. For each item in the test set to be clustered compute the
   “same cluster” relationship using that classifier.
5. Form an “indicator vector” for each item to be clustered 
   summarizes the “same cluster” relationship to the other
   items to be clustered.
6. Cluster indicator vectors (e.g. using k-means) to determine
   the final multimodal clustering.

 mklab.iti.gr                  #9                   socialsensor.eu
Overview of Proposed Approach

                1
                    2


                                   5 6
                          supervised fusion

                        3-4
 mklab.iti.gr       #10                  socialsensor.eu
Indicator Vectors
Indicator vectors of items that correspond to the same cluster
   should be more similar to each other than to indicator vectors
   of items that do not correspond to the same cluster.




 mklab.iti.gr                #11                   socialsensor.eu
Evaluation - Dataset/Features

• Benchmark dataset:
     MediaEval Social Event Detection 2011

• 36 social events of two types (soccer, music)
  comprising 2,074 Flickr images
• Features [distance]:
   –   SIFT BoW [cosine similarity]
   –   Time uploaded [absolute difference in hours]
   –   Tags [cosine similarity]
   –   Geo-location (for ~20% of images) [geodesic distance]


 mklab.iti.gr                 #12                  socialsensor.eu
Evaluation - Protocol

• Split set of event in two 50-50 random sets. One set
  used for training the classifier, other used for testing
  clustering accuracy.
• Evaluated against a multimodal spectral clustering
  approach that uses a short of early fusion strategy.
  Search in the space of fusion parameters executed.
• 10 random runs were executed: in each run, a
  separate random subset of the events was used for
  training and the rest was used for testing.



 mklab.iti.gr             #13                 socialsensor.eu
Evaluation - Results (1)

• Best NMI achieved by proposed approach




 mklab.iti.gr         #14                  socialsensor.eu
Evaluation - Results (2)

• Average and std. deviation of NMI achieved by
  tested methods




 mklab.iti.gr           #15               socialsensor.eu
Example Results (1)
Event: CE Sabadell - Real Unión de Irún, 31 May 2009




• Proposed method: Correctly found three photos
• Baseline: Apart from the three photos, it also included
  irrelevant ones, e.g. (other soccer events, concert)




 mklab.iti.gr                #16                   socialsensor.eu
Example Results (2)
Event: Barcelona FC triple celebration, 28 May 2009




• Proposed method: Failed to include all relevant photos to a
  single cluster (it split them to three), but at least each of the
  three clusters contained only relevant ones.
• Baseline method: Not only split the photos into three clusters,
  but also included many irrelevant ones in each cluster.


 mklab.iti.gr                 #17                    socialsensor.eu
Conclusions
Proposed approach for multimodal clustering with an
  application on event detection in multimedia.
Advantages
• Does not rely on ad-hoc fusion strategies.
• Matches implicit semantics of example clusterings.
• Naturally handles missing modalities.
Disadvantages
• Computationally expensive:
   – computation of N2 “same cluster relationships”
   – clustering of N dimensional vectors

 mklab.iti.gr               #18                  socialsensor.eu
Future Work

• Study how larger-scale training (last.fm, upcoming,
  eventful) affects performance

• Reduce “same-cluster” feature space (to K << N2)
   – Representative image selection
   – Dimensionality reduction

• Integrate event selection step in the proposed
  approach (currently it considers all images as
  belonging to events).

• Participate in MediaEval SED 2012!
 mklab.iti.gr              #19             socialsensor.eu
Acknowledgement




mklab.iti.gr      #20   socialsensor.eu
Questions




Further contact:   gpetkos@iti.gr / papadop@iti.gr


Follow:            @socialsensor_ip
                   @sympapadopoulos
                   @kompats



   mklab.iti.gr             #21                      socialsensor.eu
Previous Work (1)
• Multimodal spectral clustering
   X. Cai, F. Nie, H. Huang, F. Kamangar (2011) Heterogeneous image
      feature integration via multi-modal spectral clustering. In IEEE conf. on
      Computer Vision and Pattern Recognition (CVPR), pp. 1977-1984

• Probabilistic Bayesian network approach
   V. Khalidov, F. Forbes, R.P. Horaud (2011) Conjugate mixture models for
      clustering multimodal data. In Neural Computation, 23(2):517–557

• Combinatorial Markov Random Fields
   R. Bekkerman, J. Jeon (2007) Multi-modal clustering for multimedia
      collections. In IEEE conference on Computer Vision and Pattern
      Recognition (CVPR), pp. 1-8




 mklab.iti.gr                      #22                        socialsensor.eu
Previous Work (2)
• MediaEval SED 2011
   M. Brenner, E. Izquierdo (2011) Mediaeval benchmark: Social event
      detection in collaborative photo collections. In MediaEval SED.
   X. Liu, B. Huet, R. Troncy (2011) Eurecom @ MediaEval 2011 social event
      detection task. In MediaEval SED.
   S. Papadopoulos, C. Zigkolis, Y. Kompatsiaris, A. Vakali (2011) CERTH @
      MediaEval 2011 social event detection task. In MediaEval SED.
   M. Ruocco, H. Ramampiaro (2011) NTNU @ MediaEval 2011 social event
      detection task. In MediaEval SED.
   Y. Wang, L. Xie, H. Sundaram (2011) Social event detection with
      clustering and filtering. In MediaEval SED.




 mklab.iti.gr                    #23                       socialsensor.eu

More Related Content

Similar to Social Event Detection using Multimodal Clustering and Integrating Supervisory Signals

Graph-based multimodal clustering for social event detection in large collect...
Graph-based multimodal clustering for social event detection in large collect...Graph-based multimodal clustering for social event detection in large collect...
Graph-based multimodal clustering for social event detection in large collect...Symeon Papadopoulos
 
Leveraging social media for training object detectors
Leveraging social media for training object detectorsLeveraging social media for training object detectors
Leveraging social media for training object detectorsManish Kumar
 
benchmarking image retrieval diversification techniques for social media
benchmarking image retrieval diversification techniques for social mediabenchmarking image retrieval diversification techniques for social media
benchmarking image retrieval diversification techniques for social mediaVenkat Projects
 
Goal Dynamics_From System Dynamics to Implementation
Goal Dynamics_From System Dynamics to ImplementationGoal Dynamics_From System Dynamics to Implementation
Goal Dynamics_From System Dynamics to ImplementationAmjad Adib
 
CERTH @ MediaEval 2012 Social Event Detection Task
CERTH @ MediaEval 2012 Social Event Detection TaskCERTH @ MediaEval 2012 Social Event Detection Task
CERTH @ MediaEval 2012 Social Event Detection TaskMediaEval2012
 
benchmarking image retrieval diversification techniques for social media
benchmarking image retrieval diversification techniques for social mediabenchmarking image retrieval diversification techniques for social media
benchmarking image retrieval diversification techniques for social mediaVenkat Projects
 
Gephi icwsm-tutorial
Gephi icwsm-tutorialGephi icwsm-tutorial
Gephi icwsm-tutorialcsedays
 
CERTH @ MediaEval 2014 Social Event Detection Task
CERTH @ MediaEval 2014 Social Event Detection TaskCERTH @ MediaEval 2014 Social Event Detection Task
CERTH @ MediaEval 2014 Social Event Detection Taskmultimediaeval
 
IRJET- Fusion Method for Image Reranking and Similarity Finding based on Topi...
IRJET- Fusion Method for Image Reranking and Similarity Finding based on Topi...IRJET- Fusion Method for Image Reranking and Similarity Finding based on Topi...
IRJET- Fusion Method for Image Reranking and Similarity Finding based on Topi...IRJET Journal
 
CERTH @ MediaEval 2012 Social Event Detection Task
CERTH @ MediaEval 2012 Social Event Detection TaskCERTH @ MediaEval 2012 Social Event Detection Task
CERTH @ MediaEval 2012 Social Event Detection TaskSymeon Papadopoulos
 
Jose Antonio presentation at WSDL
Jose Antonio presentation at WSDLJose Antonio presentation at WSDL
Jose Antonio presentation at WSDLJose Antonio Olvera
 
Ph.D. Thesis: A Methodology for the Development of Autonomic and Cognitive In...
Ph.D. Thesis: A Methodology for the Development of Autonomic and Cognitive In...Ph.D. Thesis: A Methodology for the Development of Autonomic and Cognitive In...
Ph.D. Thesis: A Methodology for the Development of Autonomic and Cognitive In...Universita della Calabria,
 
Enhancing a Social Science Model-building Workflow with Interactive Visualisa...
Enhancing a Social Science Model-building Workflow with Interactive Visualisa...Enhancing a Social Science Model-building Workflow with Interactive Visualisa...
Enhancing a Social Science Model-building Workflow with Interactive Visualisa...Cagatay Turkay
 
IEEE 2014 JAVA DATA MINING PROJECTS Multi comm finding community structure in...
IEEE 2014 JAVA DATA MINING PROJECTS Multi comm finding community structure in...IEEE 2014 JAVA DATA MINING PROJECTS Multi comm finding community structure in...
IEEE 2014 JAVA DATA MINING PROJECTS Multi comm finding community structure in...IEEEFINALYEARSTUDENTPROJECTS
 
2014 IEEE JAVA DATA MINING PROJECT Multi comm finding community structure in ...
2014 IEEE JAVA DATA MINING PROJECT Multi comm finding community structure in ...2014 IEEE JAVA DATA MINING PROJECT Multi comm finding community structure in ...
2014 IEEE JAVA DATA MINING PROJECT Multi comm finding community structure in ...IEEEMEMTECHSTUDENTSPROJECTS
 
Interactive Machine Learning Appendix
Interactive  Machine Learning AppendixInteractive  Machine Learning Appendix
Interactive Machine Learning AppendixZitao Liu
 
ICSME 2016 keynote: An ecosystemic and socio-technical view on software maint...
ICSME 2016 keynote: An ecosystemic and socio-technical view on software maint...ICSME 2016 keynote: An ecosystemic and socio-technical view on software maint...
ICSME 2016 keynote: An ecosystemic and socio-technical view on software maint...Tom Mens
 
Socially Shared Images with Automated Annotation Process by Using Improved Us...
Socially Shared Images with Automated Annotation Process by Using Improved Us...Socially Shared Images with Automated Annotation Process by Using Improved Us...
Socially Shared Images with Automated Annotation Process by Using Improved Us...IJERA Editor
 

Similar to Social Event Detection using Multimodal Clustering and Integrating Supervisory Signals (20)

Graph-based multimodal clustering for social event detection in large collect...
Graph-based multimodal clustering for social event detection in large collect...Graph-based multimodal clustering for social event detection in large collect...
Graph-based multimodal clustering for social event detection in large collect...
 
Paper 153
Paper 153Paper 153
Paper 153
 
Leveraging social media for training object detectors
Leveraging social media for training object detectorsLeveraging social media for training object detectors
Leveraging social media for training object detectors
 
benchmarking image retrieval diversification techniques for social media
benchmarking image retrieval diversification techniques for social mediabenchmarking image retrieval diversification techniques for social media
benchmarking image retrieval diversification techniques for social media
 
Goal Dynamics_From System Dynamics to Implementation
Goal Dynamics_From System Dynamics to ImplementationGoal Dynamics_From System Dynamics to Implementation
Goal Dynamics_From System Dynamics to Implementation
 
CERTH @ MediaEval 2012 Social Event Detection Task
CERTH @ MediaEval 2012 Social Event Detection TaskCERTH @ MediaEval 2012 Social Event Detection Task
CERTH @ MediaEval 2012 Social Event Detection Task
 
benchmarking image retrieval diversification techniques for social media
benchmarking image retrieval diversification techniques for social mediabenchmarking image retrieval diversification techniques for social media
benchmarking image retrieval diversification techniques for social media
 
Gephi icwsm-tutorial
Gephi icwsm-tutorialGephi icwsm-tutorial
Gephi icwsm-tutorial
 
CERTH @ MediaEval 2014 Social Event Detection Task
CERTH @ MediaEval 2014 Social Event Detection TaskCERTH @ MediaEval 2014 Social Event Detection Task
CERTH @ MediaEval 2014 Social Event Detection Task
 
IRJET- Fusion Method for Image Reranking and Similarity Finding based on Topi...
IRJET- Fusion Method for Image Reranking and Similarity Finding based on Topi...IRJET- Fusion Method for Image Reranking and Similarity Finding based on Topi...
IRJET- Fusion Method for Image Reranking and Similarity Finding based on Topi...
 
CERTH @ MediaEval 2012 Social Event Detection Task
CERTH @ MediaEval 2012 Social Event Detection TaskCERTH @ MediaEval 2012 Social Event Detection Task
CERTH @ MediaEval 2012 Social Event Detection Task
 
WWW2015 PHD Symposium
WWW2015 PHD SymposiumWWW2015 PHD Symposium
WWW2015 PHD Symposium
 
Jose Antonio presentation at WSDL
Jose Antonio presentation at WSDLJose Antonio presentation at WSDL
Jose Antonio presentation at WSDL
 
Ph.D. Thesis: A Methodology for the Development of Autonomic and Cognitive In...
Ph.D. Thesis: A Methodology for the Development of Autonomic and Cognitive In...Ph.D. Thesis: A Methodology for the Development of Autonomic and Cognitive In...
Ph.D. Thesis: A Methodology for the Development of Autonomic and Cognitive In...
 
Enhancing a Social Science Model-building Workflow with Interactive Visualisa...
Enhancing a Social Science Model-building Workflow with Interactive Visualisa...Enhancing a Social Science Model-building Workflow with Interactive Visualisa...
Enhancing a Social Science Model-building Workflow with Interactive Visualisa...
 
IEEE 2014 JAVA DATA MINING PROJECTS Multi comm finding community structure in...
IEEE 2014 JAVA DATA MINING PROJECTS Multi comm finding community structure in...IEEE 2014 JAVA DATA MINING PROJECTS Multi comm finding community structure in...
IEEE 2014 JAVA DATA MINING PROJECTS Multi comm finding community structure in...
 
2014 IEEE JAVA DATA MINING PROJECT Multi comm finding community structure in ...
2014 IEEE JAVA DATA MINING PROJECT Multi comm finding community structure in ...2014 IEEE JAVA DATA MINING PROJECT Multi comm finding community structure in ...
2014 IEEE JAVA DATA MINING PROJECT Multi comm finding community structure in ...
 
Interactive Machine Learning Appendix
Interactive  Machine Learning AppendixInteractive  Machine Learning Appendix
Interactive Machine Learning Appendix
 
ICSME 2016 keynote: An ecosystemic and socio-technical view on software maint...
ICSME 2016 keynote: An ecosystemic and socio-technical view on software maint...ICSME 2016 keynote: An ecosystemic and socio-technical view on software maint...
ICSME 2016 keynote: An ecosystemic and socio-technical view on software maint...
 
Socially Shared Images with Automated Annotation Process by Using Improved Us...
Socially Shared Images with Automated Annotation Process by Using Improved Us...Socially Shared Images with Automated Annotation Process by Using Improved Us...
Socially Shared Images with Automated Annotation Process by Using Improved Us...
 

More from Symeon Papadopoulos

DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...Symeon Papadopoulos
 
Deepfakes: An Emerging Internet Threat and their Detection
Deepfakes: An Emerging Internet Threat and their DetectionDeepfakes: An Emerging Internet Threat and their Detection
Deepfakes: An Emerging Internet Threat and their DetectionSymeon Papadopoulos
 
Knowledge-based Fusion for Image Tampering Localization
Knowledge-based Fusion for Image Tampering LocalizationKnowledge-based Fusion for Image Tampering Localization
Knowledge-based Fusion for Image Tampering LocalizationSymeon Papadopoulos
 
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...Symeon Papadopoulos
 
COVID-19 Infodemic vs Contact Tracing
COVID-19 Infodemic vs Contact TracingCOVID-19 Infodemic vs Contact Tracing
COVID-19 Infodemic vs Contact TracingSymeon Papadopoulos
 
Similarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia contentSimilarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia contentSymeon Papadopoulos
 
Twitter-based Sensing of City-level Air Quality
Twitter-based Sensing of City-level Air QualityTwitter-based Sensing of City-level Air Quality
Twitter-based Sensing of City-level Air QualitySymeon Papadopoulos
 
Aggregating and Analyzing the Context of Social Media Content
Aggregating and Analyzing the Context of Social Media ContentAggregating and Analyzing the Context of Social Media Content
Aggregating and Analyzing the Context of Social Media ContentSymeon Papadopoulos
 
Verifying Multimedia Content on the Internet
Verifying Multimedia Content on the InternetVerifying Multimedia Content on the Internet
Verifying Multimedia Content on the InternetSymeon Papadopoulos
 
A Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering DetectionA Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering DetectionSymeon Papadopoulos
 
Learning to detect Misleading Content on Twitter
Learning to detect Misleading Content on TwitterLearning to detect Misleading Content on Twitter
Learning to detect Misleading Content on TwitterSymeon Papadopoulos
 
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersNear-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersSymeon Papadopoulos
 
Verifying Multimedia Use at MediaEval 2016
Verifying Multimedia Use at MediaEval 2016Verifying Multimedia Use at MediaEval 2016
Verifying Multimedia Use at MediaEval 2016Symeon Papadopoulos
 
Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...Symeon Papadopoulos
 
In-depth Exploration of Geotagging Performance
In-depth Exploration of Geotagging PerformanceIn-depth Exploration of Geotagging Performance
In-depth Exploration of Geotagging PerformanceSymeon Papadopoulos
 
Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...Symeon Papadopoulos
 
Web and Social Media Image Forensics for News Professionals
Web and Social Media Image Forensics for News ProfessionalsWeb and Social Media Image Forensics for News Professionals
Web and Social Media Image Forensics for News ProfessionalsSymeon Papadopoulos
 
Predicting News Popularity by Mining Online Discussions
Predicting News Popularity by Mining Online DiscussionsPredicting News Popularity by Mining Online Discussions
Predicting News Popularity by Mining Online DiscussionsSymeon Papadopoulos
 
Finding Diverse Social Images at MediaEval 2015
Finding Diverse Social Images at MediaEval 2015Finding Diverse Social Images at MediaEval 2015
Finding Diverse Social Images at MediaEval 2015Symeon Papadopoulos
 

More from Symeon Papadopoulos (20)

DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
 
Deepfakes: An Emerging Internet Threat and their Detection
Deepfakes: An Emerging Internet Threat and their DetectionDeepfakes: An Emerging Internet Threat and their Detection
Deepfakes: An Emerging Internet Threat and their Detection
 
Knowledge-based Fusion for Image Tampering Localization
Knowledge-based Fusion for Image Tampering LocalizationKnowledge-based Fusion for Image Tampering Localization
Knowledge-based Fusion for Image Tampering Localization
 
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
 
COVID-19 Infodemic vs Contact Tracing
COVID-19 Infodemic vs Contact TracingCOVID-19 Infodemic vs Contact Tracing
COVID-19 Infodemic vs Contact Tracing
 
Similarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia contentSimilarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia content
 
Twitter-based Sensing of City-level Air Quality
Twitter-based Sensing of City-level Air QualityTwitter-based Sensing of City-level Air Quality
Twitter-based Sensing of City-level Air Quality
 
Aggregating and Analyzing the Context of Social Media Content
Aggregating and Analyzing the Context of Social Media ContentAggregating and Analyzing the Context of Social Media Content
Aggregating and Analyzing the Context of Social Media Content
 
Verifying Multimedia Content on the Internet
Verifying Multimedia Content on the InternetVerifying Multimedia Content on the Internet
Verifying Multimedia Content on the Internet
 
A Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering DetectionA Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering Detection
 
Learning to detect Misleading Content on Twitter
Learning to detect Misleading Content on TwitterLearning to detect Misleading Content on Twitter
Learning to detect Misleading Content on Twitter
 
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersNear-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
 
Verifying Multimedia Use at MediaEval 2016
Verifying Multimedia Use at MediaEval 2016Verifying Multimedia Use at MediaEval 2016
Verifying Multimedia Use at MediaEval 2016
 
Multimedia Privacy
Multimedia PrivacyMultimedia Privacy
Multimedia Privacy
 
Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...
 
In-depth Exploration of Geotagging Performance
In-depth Exploration of Geotagging PerformanceIn-depth Exploration of Geotagging Performance
In-depth Exploration of Geotagging Performance
 
Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...
 
Web and Social Media Image Forensics for News Professionals
Web and Social Media Image Forensics for News ProfessionalsWeb and Social Media Image Forensics for News Professionals
Web and Social Media Image Forensics for News Professionals
 
Predicting News Popularity by Mining Online Discussions
Predicting News Popularity by Mining Online DiscussionsPredicting News Popularity by Mining Online Discussions
Predicting News Popularity by Mining Online Discussions
 
Finding Diverse Social Images at MediaEval 2015
Finding Diverse Social Images at MediaEval 2015Finding Diverse Social Images at MediaEval 2015
Finding Diverse Social Images at MediaEval 2015
 

Recently uploaded

08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 

Recently uploaded (20)

08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 

Social Event Detection using Multimodal Clustering and Integrating Supervisory Signals

  • 1. Social event detection using multimodal clustering and integrating supervisory signals Georgios Petkos, Symeon Papadopoulos, Yiannis Kompatsiaris Centre for Research and Technology Hellas, Information Technologies Institute (CERTH-ITI) ACM International Conference on Multimedia Retrieval Hong Kong, Jun 5-8, 2012
  • 2. Social Events in Multimedia Event detection in multimedia: • Real-world events  attendants taking photos  captured photos are shared in social networks • Multimedia collection  find groups of images depicting the real-world events soccer music mklab.iti.gr #2 socialsensor.eu
  • 3. Problem Setting & Formulation • Collection of images + metadata – Metadata typically include tags, geotagging information, timestamp, owner – Metadata can be noisy or missing – A set of feature vectors can be extracted from each image and its metadata • Problem: – Find groups of images such that each group depict a unique social event Essentially, an image clustering problem. mklab.iti.gr #3 socialsensor.eu
  • 4. The Role of Different Features • Visual similarity: Images look similar • Spatial-temporal context: Images were captured at approximately the same location and time • Tags: Users have annotated images using similar tags • Same owner: Photos captured by the same person PROBLEM: We don’t know what matters most mklab.iti.gr #4 socialsensor.eu
  • 5. Heuristics-based Approaches • Rely on online sources and text metadata [Ruocco & Ramampiaro, 2011; Liu et al., 2011b] – structured data about events may not be available in online sources – for many images, text metadata can be of low quality • Use heuristics [Liu et al., 2011a; Papadopoulos et al., 2011] (e.g. “all photos taken by the same user at the same day  same event”) – such heuristics are manually constructed in ad hoc ways mklab.iti.gr #5 socialsensor.eu
  • 6. Multimodal Clustering Approaches Existing approaches: • May utilize early/late fusion strategies. The final result will depend heavily on the fusion weights [Cai et al., 2011] – It may be difficult to determine appropriate weights, either manually or using a search procedure. • May attempt to estimate generative models or minimize the disagreement between the clusterings according to different modalities [Bekkerman & Jeon, 2007; Khalidov et al., 2011] – Some modalities are more important than others when desired clusters correspond to specific concepts. In order to create clusters that correspond to semantically different concepts, will require putting more emphasis on the appropriate features. mklab.iti.gr #6 socialsensor.eu
  • 7. Baseline Multimodal Clustering early fusion mklab.iti.gr #7 socialsensor.eu
  • 8. Rationale of Proposed Approach • What if during the clustering procedure we take into account a relevant example clustering? • This would essentially integrate a supervisory signal in the multimodal clustering procedure. How to do this? • Essentially, we want to define what it means for two items expressed in multiple modalities to belong in the same cluster, and then, try to learn this from example clusterings. mklab.iti.gr #8 socialsensor.eu
  • 9. Proposed Approach 1. For the items in the input clustering for our task, compute the distances between all pairs of items for all modalities. 2. For each pair of items compile the distances (for all modalities) in a vector. For pairs of items, assign a +ve label (same cluster) and –ve (different cluster) 3. Train a classifier to predict a “same cluster” relationship for pairs of items. 4. For each item in the test set to be clustered compute the “same cluster” relationship using that classifier. 5. Form an “indicator vector” for each item to be clustered  summarizes the “same cluster” relationship to the other items to be clustered. 6. Cluster indicator vectors (e.g. using k-means) to determine the final multimodal clustering. mklab.iti.gr #9 socialsensor.eu
  • 10. Overview of Proposed Approach 1 2 5 6 supervised fusion 3-4 mklab.iti.gr #10 socialsensor.eu
  • 11. Indicator Vectors Indicator vectors of items that correspond to the same cluster should be more similar to each other than to indicator vectors of items that do not correspond to the same cluster. mklab.iti.gr #11 socialsensor.eu
  • 12. Evaluation - Dataset/Features • Benchmark dataset: MediaEval Social Event Detection 2011 • 36 social events of two types (soccer, music) comprising 2,074 Flickr images • Features [distance]: – SIFT BoW [cosine similarity] – Time uploaded [absolute difference in hours] – Tags [cosine similarity] – Geo-location (for ~20% of images) [geodesic distance] mklab.iti.gr #12 socialsensor.eu
  • 13. Evaluation - Protocol • Split set of event in two 50-50 random sets. One set used for training the classifier, other used for testing clustering accuracy. • Evaluated against a multimodal spectral clustering approach that uses a short of early fusion strategy. Search in the space of fusion parameters executed. • 10 random runs were executed: in each run, a separate random subset of the events was used for training and the rest was used for testing. mklab.iti.gr #13 socialsensor.eu
  • 14. Evaluation - Results (1) • Best NMI achieved by proposed approach mklab.iti.gr #14 socialsensor.eu
  • 15. Evaluation - Results (2) • Average and std. deviation of NMI achieved by tested methods mklab.iti.gr #15 socialsensor.eu
  • 16. Example Results (1) Event: CE Sabadell - Real Unión de Irún, 31 May 2009 • Proposed method: Correctly found three photos • Baseline: Apart from the three photos, it also included irrelevant ones, e.g. (other soccer events, concert) mklab.iti.gr #16 socialsensor.eu
  • 17. Example Results (2) Event: Barcelona FC triple celebration, 28 May 2009 • Proposed method: Failed to include all relevant photos to a single cluster (it split them to three), but at least each of the three clusters contained only relevant ones. • Baseline method: Not only split the photos into three clusters, but also included many irrelevant ones in each cluster. mklab.iti.gr #17 socialsensor.eu
  • 18. Conclusions Proposed approach for multimodal clustering with an application on event detection in multimedia. Advantages • Does not rely on ad-hoc fusion strategies. • Matches implicit semantics of example clusterings. • Naturally handles missing modalities. Disadvantages • Computationally expensive: – computation of N2 “same cluster relationships” – clustering of N dimensional vectors mklab.iti.gr #18 socialsensor.eu
  • 19. Future Work • Study how larger-scale training (last.fm, upcoming, eventful) affects performance • Reduce “same-cluster” feature space (to K << N2) – Representative image selection – Dimensionality reduction • Integrate event selection step in the proposed approach (currently it considers all images as belonging to events). • Participate in MediaEval SED 2012! mklab.iti.gr #19 socialsensor.eu
  • 20. Acknowledgement mklab.iti.gr #20 socialsensor.eu
  • 21. Questions Further contact: gpetkos@iti.gr / papadop@iti.gr Follow: @socialsensor_ip @sympapadopoulos @kompats mklab.iti.gr #21 socialsensor.eu
  • 22. Previous Work (1) • Multimodal spectral clustering X. Cai, F. Nie, H. Huang, F. Kamangar (2011) Heterogeneous image feature integration via multi-modal spectral clustering. In IEEE conf. on Computer Vision and Pattern Recognition (CVPR), pp. 1977-1984 • Probabilistic Bayesian network approach V. Khalidov, F. Forbes, R.P. Horaud (2011) Conjugate mixture models for clustering multimodal data. In Neural Computation, 23(2):517–557 • Combinatorial Markov Random Fields R. Bekkerman, J. Jeon (2007) Multi-modal clustering for multimedia collections. In IEEE conference on Computer Vision and Pattern Recognition (CVPR), pp. 1-8 mklab.iti.gr #22 socialsensor.eu
  • 23. Previous Work (2) • MediaEval SED 2011 M. Brenner, E. Izquierdo (2011) Mediaeval benchmark: Social event detection in collaborative photo collections. In MediaEval SED. X. Liu, B. Huet, R. Troncy (2011) Eurecom @ MediaEval 2011 social event detection task. In MediaEval SED. S. Papadopoulos, C. Zigkolis, Y. Kompatsiaris, A. Vakali (2011) CERTH @ MediaEval 2011 social event detection task. In MediaEval SED. M. Ruocco, H. Ramampiaro (2011) NTNU @ MediaEval 2011 social event detection task. In MediaEval SED. Y. Wang, L. Xie, H. Sundaram (2011) Social event detection with clustering and filtering. In MediaEval SED. mklab.iti.gr #23 socialsensor.eu