TUB-IRML at the MediaEval 2014 Violent Scenes Detection Task

•Download as PPTX, PDF•

0 likes•77,394 views

Esra Açar

The presentation of our method at the MediaEval workshop of 2014.

Data & Analytics

TUB-IRML at MediaEval 2014 Violent Scenes Detection
Task: Violence Modeling through Feature Space Partitioning
Esra Acar, Sahin Albayrak
Competence Center Information Retrieval & Machine Learning

Outline
►The Violence Detection Method
Video Representation
 Violence Detection Model
►Results & Discussion
►Conclusions & Future Work
16 October 2014 TUB-IRML at MediaEval 2014 Violent Scenes Detection Task 2

The Violence Detection Method
►The two main components of our method are:
 (1) the representation of video segments, and
 (2) the learning of a violence model.
16 October 2014 TUB-IRML at MediaEval 2014 Violent Scenes Detection Task 3

Video Representation (1)
The generation process of sparse coding based audio and visual representations for video segments.
16 October 2014 TUB-IRML at MediaEval 2014 Violent Scenes Detection Task 4

Video Representation (2)
The generation of audio and visual dictionaries with sparse coding.
16 October 2014 TUB-IRML at MediaEval 2014 Violent Scenes Detection Task 5

Video Representation (3)
► In addition to the mid-level audio and visual representations,
we use low-level features which are:
Motion-related descriptors – Violent Flow (ViF) which is a
descriptor proposed for real-time detection of violent crowd
behaviors, and
 Static content representations – Affect-related color
descriptors such as statistics on saturation, brightness and
hue in the HSL color space, and colorfulness.
16 October 2014 TUB-IRML at MediaEval 2014 Violent Scenes Detection Task 6

Violence Detection Model
►Violence is a concept which can audio-visually be expressed in
diverse manners.
►We learn multiple models for the violence concept instead of a
unique model.
 Feature space partitioning by clustering video segments in
the training dataset, and
 Learn a different model for each violence sub-concept.
►We perform a classifier selection to solve the classifier
combination issue.
16 October 2014 TUB-IRML at MediaEval 2014 Violent Scenes Detection Task 7

Results & Discussion
The MAP2014 and MAP@100 of our method with different representations
Method MAP2014 –
Movies
MAP@100 –
Movies
MAP2014 –
Web videos
MAP@100 –
Web videos
Run1 0.169 0.368 0.517 0.582
Run2 0.139 0.284 0.371 0.478
Run3 0.080 0.208 0.477 0.495
Run4 0.172 0.409 0.489 0.586
Run5 0.170 0.406 0.479 0.567
SVM-based
0.093 0.302 - -
unique model
Run1  MFCC-based mid-level audio representations
Run2  HoG- and HoF-based mid-level features and ViF
Run3  Affect-related color features
Run4  Audio and visual features (except color)
Run5  All audio-visual representations are linearly fused at the decision level
16 October 2014 TUB-IRML at MediaEval 2014 Violent Scenes Detection Task 8

Conclusions & Future Work
►The mid-level audio representation based on MFCC and
sparse coding
 provides promising performance in terms of MAP2014 and
MAP@100 metrics, and
 also outperforms our visual representations.
► As a future work, we need to
 extend/improve our visual representation set, and
 further investigate the feature space partitioning concept.
16 October 2014 TUB-IRML at MediaEval 2014 Violent Scenes Detection Task 10

M.Sc.
Competence Center Information Retrieval &
Machine Learning
www.dai-labor.de
Fon
Fax
+49 (0) 30 / 314 – 74
+49 (0) 30 / 314 – 74 003
DAI-Labor
Technische Universität Berlin
Fakultät IV – Elektrontechnik & Informatik
Sekretariat TEL 14
Ernst-Reuter-Platz 7
10587 Berlin, Deutschland
11
Esra Acar
Researcher
esra.acar@tu-berlin.de
Thanks!
013
TUB-IRML at MediaEval 16 October 2014 2014 Violent Scenes Detection Task

Similar to TUB-IRML at the MediaEval 2014 Violent Scenes Detection Task

Mining the Web for Multimedia-based Enriching - Multimedia Hyperlinking and ...Benoit HUET

MediaEval 2015 - RFA at MediaEval 2015 Affective Impact of Movies Task: A Mul...multimediaeval

A Linked Data Recommender System using a Neighborhood-based Graph KernelVito Ostuni

Handy P@rking Overviewhandyparking

Fehlmann and Kranich - Measuring tests using cosmicInternational Software Benchmarking Standards Group (ISBSG)

Chances and Challenges in Comparing Cross-Language Retrieval ToolsGiovanna Roda

Managing a Software Ecosystem Using a Multiple Software Product Line: a Case ...Simon Urli

IWSM2014 MEGSUS14 - software sustainability - a broader perspective (Luigi ...Nesma

Activity Recognition using RGBDnazlitemu

Semantic Multimedia Remixing - MediaEval 2013 Search and Hyperlinking TaskMediaMixerCommunity

Ubiquitous participation karim al-yafiBPCW10

Engineering Students - Idea Submission Template.pptxJAYAPRIYAR7

Obj reportManish Raghav

CVrahul.budhiraja

CV_LahiruKRasnayakeLahiru Rasnayake

LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...IRJET Journal

Quantifying the Impact of OSS Adoption Risks with the help of i* ModelsGESSI UPC

Cvrahul.budhiraja

A survey on Enhancements in Speech RecognitionIRJET Journal

PaperEmre Külah

Similar to TUB-IRML at the MediaEval 2014 Violent Scenes Detection Task (20)

Mining the Web for Multimedia-based Enriching - Multimedia Hyperlinking and ...

MediaEval 2015 - RFA at MediaEval 2015 Affective Impact of Movies Task: A Mul...

A Linked Data Recommender System using a Neighborhood-based Graph Kernel

Handy P@rking Overview

Fehlmann and Kranich - Measuring tests using cosmic

Chances and Challenges in Comparing Cross-Language Retrieval Tools

Managing a Software Ecosystem Using a Multiple Software Product Line: a Case ...

IWSM2014 MEGSUS14 - software sustainability - a broader perspective (Luigi ...

Activity Recognition using RGBD

Semantic Multimedia Remixing - MediaEval 2013 Search and Hyperlinking Task

Ubiquitous participation karim al-yafi

Engineering Students - Idea Submission Template.pptx

Obj report

CV_LahiruKRasnayake

LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...

Quantifying the Impact of OSS Adoption Risks with the help of i* Models

A survey on Enhancements in Speech Recognition

Paper

Recently uploaded

BabyOno dropshipping via API with DroFx.pptxolyaivanovalion

Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila

Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service9953056974 Low Rate Call Girls In Saket, Delhi NCR

BigBuy dropshipping via API with DroFx.pptxolyaivanovalion

Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823

FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg

Anomaly detection and data imputation within time seriesParis Women in Machine Learning and Data Science

ALSO dropshipping via API with DroFx.pptxolyaivanovalion

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823

Sampling (random) method and Non random.pptDr. Soumendra Kumar Patra

Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal

Probability Grade 10 Third Quarter LessonsJoseMangaJr1

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE9953056974 Low Rate Call Girls In Saket, Delhi NCR

(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7Call Girls in Nagpur High Profile Call Girls

Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...amitlee9823

Smarteg dropshipping via API with DroFx.pptxolyaivanovalion

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...amitlee9823

Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann

Week-01-2.ppt BBB human Computer interactionfulawalesam

Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823

Recently uploaded (20)

BabyOno dropshipping via API with DroFx.pptx

Accredited-Transport-Cooperatives-Jan-2021-Web.pdf

Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service

BigBuy dropshipping via API with DroFx.pptx

Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...

FESE Capital Markets Fact Sheet 2024 Q1.pdf

Anomaly detection and data imputation within time series

ALSO dropshipping via API with DroFx.pptx

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...

Sampling (random) method and Non random.ppt

Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure

Probability Grade 10 Third Quarter Lessons

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE

(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7

Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...

Smarteg dropshipping via API with DroFx.pptx

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...

Generative AI on Enterprise Cloud with NiFi and Milvus

Week-01-2.ppt BBB human Computer interaction

Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...

TUB-IRML at the MediaEval 2014 Violent Scenes Detection Task

1. TUB-IRML at MediaEval 2014 Violent Scenes Detection Task: Violence Modeling through Feature Space Partitioning Esra Acar, Sahin Albayrak Competence Center Information Retrieval & Machine Learning

2. Outline ►The Violence Detection Method Video Representation  Violence Detection Model ►Results & Discussion ►Conclusions & Future Work 16 October 2014 TUB-IRML at MediaEval 2014 Violent Scenes Detection Task 2

3. The Violence Detection Method ►The two main components of our method are:  (1) the representation of video segments, and  (2) the learning of a violence model. 16 October 2014 TUB-IRML at MediaEval 2014 Violent Scenes Detection Task 3

4. Video Representation (1) The generation process of sparse coding based audio and visual representations for video segments. 16 October 2014 TUB-IRML at MediaEval 2014 Violent Scenes Detection Task 4

5. Video Representation (2) The generation of audio and visual dictionaries with sparse coding. 16 October 2014 TUB-IRML at MediaEval 2014 Violent Scenes Detection Task 5

6. Video Representation (3) ► In addition to the mid-level audio and visual representations, we use low-level features which are: Motion-related descriptors – Violent Flow (ViF) which is a descriptor proposed for real-time detection of violent crowd behaviors, and  Static content representations – Affect-related color descriptors such as statistics on saturation, brightness and hue in the HSL color space, and colorfulness. 16 October 2014 TUB-IRML at MediaEval 2014 Violent Scenes Detection Task 6

7. Violence Detection Model ►Violence is a concept which can audio-visually be expressed in diverse manners. ►We learn multiple models for the violence concept instead of a unique model.  Feature space partitioning by clustering video segments in the training dataset, and  Learn a different model for each violence sub-concept. ►We perform a classifier selection to solve the classifier combination issue. 16 October 2014 TUB-IRML at MediaEval 2014 Violent Scenes Detection Task 7

8. Results & Discussion The MAP2014 and MAP@100 of our method with different representations Method MAP2014 – Movies MAP@100 – Movies MAP2014 – Web videos MAP@100 – Web videos Run1 0.169 0.368 0.517 0.582 Run2 0.139 0.284 0.371 0.478 Run3 0.080 0.208 0.477 0.495 Run4 0.172 0.409 0.489 0.586 Run5 0.170 0.406 0.479 0.567 SVM-based 0.093 0.302 - - unique model Run1  MFCC-based mid-level audio representations Run2  HoG- and HoF-based mid-level features and ViF Run3  Affect-related color features Run4  Audio and visual features (except color) Run5  All audio-visual representations are linearly fused at the decision level 16 October 2014 TUB-IRML at MediaEval 2014 Violent Scenes Detection Task 8

9. Conclusions & Future Work ►The mid-level audio representation based on MFCC and sparse coding  provides promising performance in terms of MAP2014 and MAP@100 metrics, and  also outperforms our visual representations. ► As a future work, we need to  extend/improve our visual representation set, and  further investigate the feature space partitioning concept. 16 October 2014 TUB-IRML at MediaEval 2014 Violent Scenes Detection Task 10

10. M.Sc. Competence Center Information Retrieval & Machine Learning www.dai-labor.de Fon Fax +49 (0) 30 / 314 – 74 +49 (0) 30 / 314 – 74 003 DAI-Labor Technische Universität Berlin Fakultät IV – Elektrontechnik & Informatik Sekretariat TEL 14 Ernst-Reuter-Platz 7 10587 Berlin, Deutschland 11 Esra Acar Researcher esra.acar@tu-berlin.de Thanks! 013 TUB-IRML at MediaEval 16 October 2014 2014 Violent Scenes Detection Task

TUB-IRML at the MediaEval 2014 Violent Scenes Detection Task

Recommended

Recommended

More Related Content

Similar to TUB-IRML at the MediaEval 2014 Violent Scenes Detection Task

Similar to TUB-IRML at the MediaEval 2014 Violent Scenes Detection Task (20)

Recently uploaded

Recently uploaded (20)

TUB-IRML at the MediaEval 2014 Violent Scenes Detection Task