Paper: http://ceur-ws.org/Vol-2882/paper52.pdf
Janadhip Jacutprakart, Rukiye Savran Kiziltepe, John Q. Gan, Giorgos Papanastasiou and Alba G. Seco de Herrera: Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task. Proc. of MediaEval 2020, 14-15 December 2020, Online.
In this paper, we present the approach and the main results of the Essex-NLIP team's participation in the MediaEval 2020 Predicting Media Memorability task. The task requires participants to build systems that predict short-term and long-term memorability scores for the real-world video samples provided. Our approach focuses on colour-based visual features as well as the video annotation metadata, and we also explored hyper-parameter tuning. Despite the simplicity of the methodology, our approach achieves competitive results. We investigated the use of different visual features and assessed memorability prediction performance across various regression models, selecting Random Forest regression as our final model for predicting the memorability of videos.
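The final model described in the abstract is a Random Forest regressor fitted on visual feature vectors. A minimal sketch of that setup with scikit-learn is shown below; the synthetic feature matrix, score vector, and the use of Spearman's rank correlation as the validation measure are illustrative assumptions, not the authors' exact pipeline.

```python
# Sketch only: Random Forest regression on per-video feature vectors to
# predict memorability scores. The random data below stands in for real
# features (e.g. flattened colour histograms) and real memorability labels.
import numpy as np
from scipy.stats import spearmanr
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.random((200, 64))   # placeholder feature vectors, one row per video
y = rng.random(200)         # placeholder memorability scores in [0, 1]

X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)
model = RandomForestRegressor(n_estimators=100, random_state=0)
model.fit(X_train, y_train)

# Rank correlation between predicted and held-out scores (an assumed
# evaluation choice here; rankings are what memorability tasks compare).
rho, _ = spearmanr(y_val, model.predict(X_val))
```

Hyper-parameters such as `n_estimators` would be the natural targets of the tuning mentioned above.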
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
1. MediaEval 2020
Predicting Media Memorability
Janadhip Jacutprakart, Rukiye Savran Kiziltepe, John Q. Gan,
Alba García Seco de Herrera, Giorgos Papanastasiou
School of Computer Science and Electronic Engineering
University of Essex
15 December 2020
https://essexnlip.uk/
2. Features
AlexNetFC7
HOG
HSVHist
RGBHist
LBP
VGGFC7
C3D
Descriptive*
Text descriptions
Annotations
Response time
Key press
Video position
Short-term score
Long-term score
5. Conclusion
Achieved the highest score with colour-based features and metadata from the video-position annotation on the development set.
Competitive results can be achieved with a simple approach; the video position and the number of annotations were found to influence the memorability score.
MediaEval is a benchmarking initiative dedicated to evaluating new algorithms for multimedia access and retrieval.
The main purpose of this system is to automatically identify whether a video will remain fresh in our memory for a
period of time. Remembering videos is a key aspect of advertisement, entertainment, and recommendation systems: we
are highly likely to talk about a video that remains fresh in our memory and to share its contents with others.
Creating memorable video content is crucial for generating consumer impact, engaging entertainment, and profitable
marketing campaigns. Understanding and predicting memorability as a function of video features is therefore important
for computational video analysis tasks.
The memorability dataset comprises 10,000 short soundless videos, split into 8,000 videos for the development set and
2,000 videos for the test set. The videos, each 7 seconds long, were extracted from raw footage used by professionals
when creating content.
The features are stored in individual folders per feature type and in individual CSV files per sample. For example, the Features folder contains 7 sub-folders, one per feature:
AlexNetFC7 (image-level feature)
HOG (image-level feature)
HSVHist (image-level feature)
RGBHist (image-level feature)
LBP (image-level feature)
VGGFC7 (image-level feature)
C3D (video-level feature)
For the image-level features, we extract features from 3 frames of each video, each stored in an individual file whose filename is composed as follows: <video_id>-<frame_no>.csv. The 3 frames per video are the first, the middle and the last frame of the movie. For example, for video_id 8 we extract the following AlexNet feature files (please keep in mind that the same structure applies to all the image-level feature folders):
AlexNetFC7/00008-000.csv : AlexNetFC7 feature for video_id = 8, frame_no = 0 (first frame)
AlexNetFC7/00008-098.csv : AlexNetFC7 feature for video_id = 8, frame_no = 98 (middle frame)
AlexNetFC7/00008-195.csv : AlexNetFC7 feature for video_id = 8, frame_no = 195 (last frame)
...
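The image-level filename convention above (zero-padded video id, dash, zero-padded frame number) can be turned into paths programmatically. The helper below is hypothetical, not part of the released dataset tooling; the directory name and frame numbers are taken from the example above.

```python
# Hypothetical helper illustrating the <video_id>-<frame_no>.csv convention
# for image-level feature files; paths are examples, not guaranteed layout.
from pathlib import Path

def frame_feature_paths(feature_dir, video_id, frame_nos):
    """Build the CSV paths for the extracted frames of one video.

    video_id is zero-padded to 5 digits and frame_no to 3 digits,
    matching the filenames listed above.
    """
    return [Path(feature_dir) / f"{video_id:05d}-{frame_no:03d}.csv"
            for frame_no in frame_nos]

# First, middle and last frame of video_id 8, as in the example above.
paths = frame_feature_paths("AlexNetFC7", 8, [0, 98, 195])
```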
For the video-level features, we extract 1 feature per video, where the filenames are composed as follows: <video_id>.mp4.csv. Using the same video_id 8 as an example, we extract the following C3D feature file:
C3D/00008.mp4.csv : C3D features for video_id = 8
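The video-level convention can be handled the same way. The sketch below builds the `<video_id>.mp4.csv` path and reads a one-row feature CSV into floats; both functions are illustrative helpers assumed here, not part of the dataset release.

```python
# Hypothetical helpers for the video-level <video_id>.mp4.csv convention.
import csv
from pathlib import Path

def c3d_feature_path(feature_dir, video_id):
    """Build the path of the video-level feature CSV for one video."""
    return Path(feature_dir) / f"{video_id:05d}.mp4.csv"

def load_feature_vector(path):
    """Read a one-row feature CSV into a list of floats."""
    with open(path, newline="") as f:
        row = next(csv.reader(f))
    return [float(v) for v in row]
```

For video_id 8, `c3d_feature_path("C3D", 8)` yields the `C3D/00008.mp4.csv` path listed above.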