CERTH @ MediaEval 2012 Social Event Detection Task

•

0 likes•4,594 views

Symeon Papadopoulos

The problem
• Identify social events in tagged photos collections:
– Challenge 1: Technical Events @ Germany
– Challenge 2: Soccer matches @ Madrid, Hamburg
– Challenge3: Indignados protest @ Madrid

• Alternative formulation:
– Represent a collection of photos as a graph, where items
with high probability to belong to the same event are
connected.
– Each event forms a dense sub-graph in it.
– Points to community detection as method to address the
problem.

2

Graph Creation (1)

• Graph creation is based on the use of “Same
Class” model
– A classifier which predicts whether two images
belong to the same event or not
– Support Vector Machine classifier trained with the
data of the 2011 challenge
– Input features: dissimilarities across user, title, tags,
description, time taken, GIST, SURF/VLAD

4

Graph Creation (2)

• Use the same class model to connect the items
of the collection that belong to the same event
• Retrieve candidate neighbours (~350) to
reduce computational cost
– 50 with respect to textual features
– 150 with respect to time
– 50 with respect to location (when it exists)
– 100 with respect to visual features

5

Event Partitioning and Expansion (1)
• Event partitioning
– The nodes of the graph are clustered into
candidate events by using the Structural Clustering
Algorithm for Networks (SCAN).
– The items clustered together by SCAN are used to
obtain an aggregate representation of each
candidate social event.
– Split the candidate events that exceed a
predefined time range into shorter events.

6

Event Partitioning and Expansion (2)
• Expansion of the candidate events set
– Each image that does not belong to any event
forms a single-item event.
– Merge these single-item events into larger clusters
by checking location and time.
– Add the new events in the set of the candidate
events

7

Event Filtering (1)
• Filter in two ways:
– By using geo-location (if exists)
– By using tag-based models
• Geo-location Filtering
– Discard events that don’t contained into the
bounding box of the specific challenge
– 30% of candidate events are discarded

8

Event Filtering (2)
• Tag-based filtering
– Build term models by finding the 500 dominant
terms for the specific locations and event types.
– we collect images from Flickr that are relevant to
the location or the type of event of interest.
– Images for Madrid, Hamburg and Germany
– Images for indignados, soccer and technical
events

9

Event Filtering (3)
• Tag-based filtering
– Probability of appearance

– We compute the ratio of the probability of
appearance in the focus set over the probability of
appearance in the reference set.
– Keep the 500 terms with the highest ratio
– Jaccard similarity between a tag model and events
terms

10

Evaluation

Notation
Run 1: Same class model trained with 10000 pairs of images.
Run 2: Same class model trained with 30000 pairs of images.
Run 3: Same class model of run 1 with post processing step

11

Discussion (1)
• Moving from a smaller (run 1) to a larger (run
2) training dataset does not seem to improve
most of the performance  over fitting
• Method fails in challenge 1 because these
events are different from these of the training
dataset
• A good tag model has to be used for
classification in post-filtering step

12

Discussion (2)
• Future actions:
– train the same class model with a richer set of
data
– explore different graph construction strategies
and community detection algorithms.
• Ways to improve:
– better topic classification methods
– more sophisticated methods for location
estimation

13

Similar to CERTH @ MediaEval 2012 Social Event Detection Task

Searching Images: Recent research at SouthamptonJonathon Hare

ICTER 2014 Invited Talk: Large Scale Data Processing in the Real World: from ...Srinath Perera

8th semester syllabus b sc csit-pawan kaflePAWAN KAFLE

Pathways Overview For Open House 19 Sep2010jmorriso

YU CS Summer 2021 Project | TensorFlow Street Image Classification and Object...JacobSilbiger1

Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...Maurice Nsabimana

acmsigtalkshare-121023190142-phpapp01.pptxdongchangim30

Introduction to machine learning with GPUsCarol McDonald

JRs presentation-few-shot-learning-overview @ AI4Media WP5 workshopHannes Fassold

Deep Learning for Stock PredictionLim Zhi Yuan (Zane)

Toward Easy Export of Imagery Products and Feature Classes as Training Data f...Dawn Wright

01 chapterRexcy Asha

Comparison GWAP Mechanical TurkElena Simperl

DM_clustering.pptnandhini manoharan

Event Mining in Social MultimediaSymeon Papadopoulos

Object Based Image Analysis Kabir Uddin

[RSS2023] Local Object Crop Collision Network for Efficient SimulationDongwonSon1

Object RecognitionEman Abed AlWahhab

Modern recommender system in large content websiteCyrus Chien-Ching Chiu

C19013010 the tutorial to build shared ai services session 1Bill Liu

Similar to CERTH @ MediaEval 2012 Social Event Detection Task (20)

Searching Images: Recent research at Southampton

ICTER 2014 Invited Talk: Large Scale Data Processing in the Real World: from ...

8th semester syllabus b sc csit-pawan kafle

Pathways Overview For Open House 19 Sep2010

YU CS Summer 2021 Project | TensorFlow Street Image Classification and Object...

Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...

acmsigtalkshare-121023190142-phpapp01.pptx

Introduction to machine learning with GPUs

JRs presentation-few-shot-learning-overview @ AI4Media WP5 workshop

Deep Learning for Stock Prediction

Toward Easy Export of Imagery Products and Feature Classes as Training Data f...

01 chapter

Comparison GWAP Mechanical Turk

DM_clustering.ppt

Event Mining in Social Multimedia

Object Based Image Analysis

[RSS2023] Local Object Crop Collision Network for Efficient Simulation

Object Recognition

Modern recommender system in large content website

C19013010 the tutorial to build shared ai services session 1

More from Symeon Papadopoulos

DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...Symeon Papadopoulos

Deepfakes: An Emerging Internet Threat and their DetectionSymeon Papadopoulos

Knowledge-based Fusion for Image Tampering LocalizationSymeon Papadopoulos

Deepfake Detection: The Importance of Training Data Preprocessing and Practic...Symeon Papadopoulos

COVID-19 Infodemic vs Contact TracingSymeon Papadopoulos

Similarity-based retrieval of multimedia contentSymeon Papadopoulos

Twitter-based Sensing of City-level Air QualitySymeon Papadopoulos

Aggregating and Analyzing the Context of Social Media ContentSymeon Papadopoulos

Verifying Multimedia Content on the InternetSymeon Papadopoulos

A Web-based Service for Image Tampering DetectionSymeon Papadopoulos

Learning to detect Misleading Content on TwitterSymeon Papadopoulos

Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersSymeon Papadopoulos

Verifying Multimedia Use at MediaEval 2016Symeon Papadopoulos

Multimedia PrivacySymeon Papadopoulos

Placing Images with Refined Language Models and Similarity Search with PCA-re...Symeon Papadopoulos

In-depth Exploration of Geotagging PerformanceSymeon Papadopoulos

Perceived versus Actual Predictability of Personal Information in Social Netw...Symeon Papadopoulos

Web and Social Media Image Forensics for News ProfessionalsSymeon Papadopoulos

Predicting News Popularity by Mining Online DiscussionsSymeon Papadopoulos

Finding Diverse Social Images at MediaEval 2015Symeon Papadopoulos

More from Symeon Papadopoulos (20)

DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...

Deepfakes: An Emerging Internet Threat and their Detection

Knowledge-based Fusion for Image Tampering Localization

Deepfake Detection: The Importance of Training Data Preprocessing and Practic...

COVID-19 Infodemic vs Contact Tracing

Similarity-based retrieval of multimedia content

Twitter-based Sensing of City-level Air Quality

Aggregating and Analyzing the Context of Social Media Content

Verifying Multimedia Content on the Internet

A Web-based Service for Image Tampering Detection

Learning to detect Misleading Content on Twitter

Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers

Verifying Multimedia Use at MediaEval 2016

Multimedia Privacy

Placing Images with Refined Language Models and Similarity Search with PCA-re...

In-depth Exploration of Geotagging Performance

Perceived versus Actual Predictability of Personal Information in Social Netw...

Web and Social Media Image Forensics for News Professionals

Predicting News Popularity by Mining Online Discussions

Finding Diverse Social Images at MediaEval 2015

CERTH @ MediaEval 2012 Social Event Detection Task

1. CERTH @ MediaEval 2012 Social Event Detection Task Manos Schinas, Georgios Petkos, Symeon Papadopoulos, Yiannis Kompatsiaris Pisa, 4-5 October 2012

2. The problem • Identify social events in tagged photos collections: – Challenge 1: Technical Events @ Germany – Challenge 2: Soccer matches @ Madrid, Hamburg – Challenge3: Indignados protest @ Madrid • Alternative formulation: – Represent a collection of photos as a graph, where items with high probability to belong to the same event are connected. – Each event forms a dense sub-graph in it. – Points to community detection as method to address the problem. 2

3. Approach Step 1 Step 2 Step 3 3

4. Graph Creation (1) • Graph creation is based on the use of “Same Class” model – A classifier which predicts whether two images belong to the same event or not – Support Vector Machine classifier trained with the data of the 2011 challenge – Input features: dissimilarities across user, title, tags, description, time taken, GIST, SURF/VLAD 4

5. Graph Creation (2) • Use the same class model to connect the items of the collection that belong to the same event • Retrieve candidate neighbours (~350) to reduce computational cost – 50 with respect to textual features – 150 with respect to time – 50 with respect to location (when it exists) – 100 with respect to visual features 5

6. Event Partitioning and Expansion (1) • Event partitioning – The nodes of the graph are clustered into candidate events by using the Structural Clustering Algorithm for Networks (SCAN). – The items clustered together by SCAN are used to obtain an aggregate representation of each candidate social event. – Split the candidate events that exceed a predefined time range into shorter events. 6

7. Event Partitioning and Expansion (2) • Expansion of the candidate events set – Each image that does not belong to any event forms a single-item event. – Merge these single-item events into larger clusters by checking location and time. – Add the new events in the set of the candidate events 7

8. Event Filtering (1) • Filter in two ways: – By using geo-location (if exists) – By using tag-based models • Geo-location Filtering – Discard events that don’t contained into the bounding box of the specific challenge – 30% of candidate events are discarded 8

9. Event Filtering (2) • Tag-based filtering – Build term models by finding the 500 dominant terms for the specific locations and event types. – we collect images from Flickr that are relevant to the location or the type of event of interest. – Images for Madrid, Hamburg and Germany – Images for indignados, soccer and technical events 9

10. Event Filtering (3) • Tag-based filtering – Probability of appearance – We compute the ratio of the probability of appearance in the focus set over the probability of appearance in the reference set. – Keep the 500 terms with the highest ratio – Jaccard similarity between a tag model and events terms 10

11. Evaluation Notation Run 1: Same class model trained with 10000 pairs of images. Run 2: Same class model trained with 30000 pairs of images. Run 3: Same class model of run 1 with post processing step 11

12. Discussion (1) • Moving from a smaller (run 1) to a larger (run 2) training dataset does not seem to improve most of the performance  over fitting • Method fails in challenge 1 because these events are different from these of the training dataset • A good tag model has to be used for classification in post-filtering step 12

13. Discussion (2) • Future actions: – train the same class model with a richer set of data – explore different graph construction strategies and community detection algorithms. • Ways to improve: – better topic classification methods – more sophisticated methods for location estimation 13

14. Questions 14

Editor's Notes

But if not possible to match with any city, then don’t filter out the photo (bias towards higher recall).

CERTH @ MediaEval 2012 Social Event Detection Task

Recommended

Recommended

More Related Content

Similar to CERTH @ MediaEval 2012 Social Event Detection Task

Similar to CERTH @ MediaEval 2012 Social Event Detection Task (20)

More from Symeon Papadopoulos

More from Symeon Papadopoulos (20)

CERTH @ MediaEval 2012 Social Event Detection Task

Editor's Notes