MediaEval 2015 - IRISA at MediaEval 2015: Search and Anchoring in Video Archives Task

•

0 likes•363 views

This paper presents our approach and results in the Search and Anchoring in Video Archives task at MediaEval 2015.The Search part aims at returning a ranked list of video segments that are relevant to a textual user query. The Anchoring part focuses on the automatic selection of video segments, from a list of videos, that can be used as anchors to encourage further exploration within the archive. Our approach consists in structuring each video into a hierarchy of topically focused fragments, to extract salient segments in the videos at different levels of details with precise jump-in points for them. These segments will be leveraged both to answer the queries and to create anchor segments, relying on content based analysis and comparisons. The algorithm deriving the hierarchical structure relies on the burstiness phenomenon in word occurrences which gives an advantage over the classical bag-of-words model. http://ceur-ws.org/Vol-1436/ http://www.multimediaeval.org

Education

IRISA at MediaEval 2015:
Search and Anchoring in Video
Archives Task
Anca Simon, Pascale Sébillot, Guillaume Gravier

System overview
Textual representation of a video cos
similarity
W1 W2 … Wn
V1 V2 … Vm
visual query
textual query
search
Rank all segments
based on the cohesion
score:
anchor
Goal
 Segments of variable length, topically focused at various levels of
details with precise jump-in points
214/09/2015 MediaEval

Leverage the burstiness phenomenon in word occurrences
•Bursty words: characterized by long inter-arrival times followed by short inter-
arrival times;
• Non-bursty words: exhibit inter-arrival times with smaller variance.
Starting point: Kleinberg’s algorithm (Kleinberg, 2002)
Hierarchy of topically focused
fragments (HTFF)
314/09/2015 MediaEval

Hierarchy of topically focused
fragments (HTFF)
Agglomerative
clustering of burst
intervals
For each term in the textual representation of a video compute the hierarchy
of burst intervals.
414/09/2015 MediaEval

Search sub-task
0.01 -> 10.5
(place, ballroom, color, king, royal, rock, tweed, palace, etc.)
0.01 -> 0.10
(ballroom)
0.01- > 1.50
(royal, ballroom, palace, etc.)
0.18 -> 1.50
(royal, site, etc.)
1.33 -> 1.50
(royal, palace)
2.29- > 3.21
(build, prosperous)
7.43- > 8.35
(friend, Picasso)
… …
…
Automatic transcript: Castle in the country (bbctwo)
[start time: 0.01 -> end time: 29.23]
Text query:
history, palace, tudor
(I am looking for documentaries
on British history)
514/09/2015 MediaEval

Search sub-task (contd.)
34.54 -> 39.52
(woman, store, train, roof, racer, mirror, route, cowboy hat, etc.)
36.38- > 36.42
(race car)
36.49 - > 37.1
(train, store, route)
39.3 -> 39.52
(girl, cowboy hat)
…
Visual query:
train
(I am looking for footage
from the history of the
British railways)
…
…
relevant
Visual concepts representation: Comedy map of Britain (bbctwo)
[start time: 0.01 -> end time: 59.55]
614/09/2015 MediaEval

Search results
SEARCH Binned relevance Tolerance to irrelevance
P@5 P@10 P@20 P@5 P@10 P@20 P@5 P@10 P@20
Textual query 0.34 0.31 0.19 0.34 0.31 0.19 0.34 0.31 0.19
Visual query 0.12 0.11 0.06 0.12 0.11 0.06 0.12 0.11 0.06
714/09/2015 MediaEval

Anchoring results
ANCHORING
Precision Recall MRR
LIMSI 0.557 0.435 0.773
MANUAL 0.469 0.38 0.735
- average number of anchors/video: 19.03 (CI at 95% [18.4,19.65] )
814/09/2015 MediaEval

Conclusion and future work
Leveraged a new topical structure for search and anchoring:
Hierarchy of topically focused fragments
-relies on the burstiness phenomenon in word reoccurrences
Future work:
• Study the impact of the granularity levels in the HTFF;
• Combine visual and textual bursts;
• Integrate semantic relations in the burst detection
algorithm.
914/09/2015 MediaEval

Kleinberg’s algorithm
1014/09/2015 MediaEval

Viewers also liked

Patologia manguito rotadores

christinho1994

Practica3 productos

Danni Guzman Sanchez

Der Swissdesktop der Firma futuretek aus der Schweiz ist ein DaaS (Desktop-as-a-Service) Angebot und ermöglicht die Virtualisierung von PC-Arbeitsplätzen. Für unter CHF 100 wird der Arbeitsplatz virtualisiert und die Daten sicher in einer Schweizer Cloud zentralisiert. Selbst Microsoft Software kann temporär und damit im "Pay-what-you-use" Modell kostengünstig und flexibel genutzt werden. Weitere Informationen: http://www.swissdesktop.com.

Virtualized PC workplace as service: Swissdesktop

Chris Peter ⓥ

Rele de seguranca

Luiz Cláudio

Fatigue Analyses of modifications on pressurized aircraft fuselages are both necessary and tedious. Using the Hyperworks software suite and StressCheck, RUAG has developed a fatigue analysis method which streamlines the process from the creation of the spectrum up to the detailed analysis of selected fastener holes and delivers results quickly and efficiently. This method was then used to certify the installation of two large windows in the floor of a single engine turboprop A/C for aerial survey applications. Speakers David Schmid, Manager Structural Analysis, RUAG Schweiz AG

Fatigue Analysis of a Pressurized Aircraft Fuselage Modification using Hyperw...

Altair

Omni-Channel-Marketing

Andreas Jacobs

excavadoras

David Navas Escobar

Resumen un mundo feliz

karla190294

Clasificación de ángulos según su medida

19671966

eSociety Bodensee 2020: Cross-border Cooperation in the Lake Constance Greate...

Hans-Dieter Zimmermann

Y... ¿dónde está la Navidad?

Ana Cobos

[Support de cours] WebMarketing et communication web - IPAC 2014

QWEB.ECO

sampel representatif buku A. Arens, Alvin. Bab 14.trans

Rita Alfian

Magazine Het Ondernemersbelang Noord Veluwe 0212

HetOndernemersBelang

Donor Acquisition in the "New Normal"

Avalon Consulting

Viewers also liked (15)

Patologia manguito rotadores

Practica3 productos

Virtualized PC workplace as service: Swissdesktop

Rele de seguranca

Fatigue Analysis of a Pressurized Aircraft Fuselage Modification using Hyperw...

Omni-Channel-Marketing

excavadoras

Resumen un mundo feliz

Clasificación de ángulos según su medida

eSociety Bodensee 2020: Cross-border Cooperation in the Lake Constance Greate...

Y... ¿dónde está la Navidad?

[Support de cours] WebMarketing et communication web - IPAC 2014

sampel representatif buku A. Arens, Alvin. Bab 14.trans

Magazine Het Ondernemersbelang Noord Veluwe 0212

Donor Acquisition in the "New Normal"

More from multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper62.pdf YouTube: https://youtu.be/gV-rvV3iFDA Pierre-Etienne Martin, Jenny Benois-Pineau, Boris Mansencal, Renaud Péteri and Julien Morlier : Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal CNN for MediaEval 2020. Proc. of MediaEval 2020, 14-15 December 2020, Online. This work presents a method for classifying table tennis strokes using spatio-temporal convolutional neural networks. The fine-grained classification is performed on trimmed video segments recorded at 120 fps with different players performing in natural conditions. From those segments, the frames are extracted, their optical flow is computed and the pose of the player is estimated. From the optical flow amplitude, a region of interest is inferred. A three stream spatio-temporal convolutional neural network using combination of those modalities and 3D attention mechanisms is presented in order to perform classification. Presented by: Pierre-Etienne Martin

Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper50.pdf Hai Nguyen-Truong, San Cao, N. A. Khoa Nguyen, Bang-Dang Pham, Hieu Dao, Minh-Quan Le, Hoang-Phuc Nguyen-Dinh, Hai-Dang Nguyen and Minh-Triet Tran : HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table Tennis Strokes Classification Task. Proc. of MediaEval 2020, 14-15 December 2020, Online. The Sports Video Classification Tasks in the Multimedia Evaluation 2020 Challenge focuses on classifying different types of table tennis strokes in video segments. In this task, we - the HCMUS Team - perform multiple experiments, which includes a combination of models such as SlowFast, Optical Flow, DensePose, R2+1, Channel-Separated Convolutional Networks, to classify 21 types of table tennis strokes from video segments. In total, we submit eight runs corresponding to five different models with different sets of hyper-parameters in each of our models. In addition, we apply some pre-processing techniques on the dataset in order for our model to learn and classify more accurately. According to the evaluation results, one of our team's methods out-performs the other team's. In particular, our best run achieves 31.35\% global accuracy, and all of our methods show potential results in terms of local and global accuracy for action recognition tasks.

HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper2.pdf YouTube: https://youtu.be/-bRL868b8ys Pierre-Etienne Martin, Jenny Benois-Pineau, Boris Mansencal, Renaud Péteri, Laurent Mascarilla, Jordan Calandre and Julien Morlier : Sports Video Classification: Classification of Strokes in Table Tennis for MediaEval 2020. Proc. of MediaEval 2020, 14-15 December 2020, Online. Fine-grained action classification has raised new challenges compared to classical action classification problems. Sport video analysis is a very popular research topic, due to the variety of application areas, ranging from multimedia intelligent devices with user-tailored digests, up to analysis of athletes' performances. Running since 2019 as a part of MediaEval, we offer a task which consists in classifying table tennis strokes from videos recorded in natural conditions at the University of Bordeaux. The aim is to build tools for teachers, coaches and players to analyse table tennis games. Such tools could lead to an automatic profiling of the player and adaptation of his training for improving his/her sport skills more efficiently. Presented by: Pierre-Etienne Martin

Sports Video Classification: Classification of Strokes in Table Tennis for Me...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper61.pdf YouTube: https://youtu.be/brmI4g3jLS4 Ricardo Kleinlein, Cristina Luna-Jiménez, Fernando Fernández-Martínez and Zoraida Callejas : Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention and LSTM Models. Proc. of MediaEval 2020, 14-15 December 2020, Online. This paper reports on the GTH-UPM team experience in the Predicting Media Memorability task at MediaEval 2020. Teams were requested to predict memorability scores at both short-term and long-term, understanding such score as a measure of whether a video was perdurable in a viewer's memory or not. Our proposed system relies on a late fusion of the scores predicted by three sequential models, each trained over a different modality: video captions, aural embeddings and visual optical flow-based vectors. Whereas single-modality models show a low or zero Spearman correlation coefficient value, their combination considerably boosts performance over development data up to 0.2 in the short-term memorability prediction subtask and 0.19 in the long-term subtask. However, performance over test data drops to 0.016 and -0.041, respectively. Presented by: Ricardo Kleinlein

Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper52.pdf Janadhip Jacutprakart, Rukiye Savran Kiziltepe, John Q. Gan, Giorgos Papanastasiou and Alba G. Seco de Herrera : Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task. Proc. of MediaEval 2020, 14-15 December 2020, Online. In this paper, we present the methods of approach and the main results from the Essex NLIP Team’s participation in the MediEval 2020 Predicting Media Memorability task. The task requires participants to build systems that can predict short-term and long-term memorability scores on real-world video samples provided. The focus of our approach is on the use of colour-based visual features as well as the use of the video annotation meta-data. In addition, hyper-parameter tuning was explored. Besides the simplicity of the methodology, our approach achieves competitive results. We investigated the use of different visual features. We assessed the performance of memorability scores through various regression models where Random Forest regression is our final model, to predict the memorability of videos.

Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper6.pdf YouTube: https://youtu.be/ySGGu_4vaxs Alba García Seco De Herrera, Rukiye Savran Kiziltepe, Jon Chamberlain, Mihai Gabriel Constantin, Claire-Hélène Demarty, Faiyaz Doctor, Bogdan Ionescu and Alan F. Smeaton : Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a Video Memorable? Proc. of MediaEval 2020, 14-15 December 2020, Online. This paper describes the MediaEval 2020 Predicting Media Memorability task. After first being proposed at MediaEval 2018, the Predicting Media Memorability task is in its 3rd edition this year, as the prediction of short-term and long-term video memorability (VM) remains a challenging task. In 2020, the format remained the same as in previous editions. This year the videos are a subset of the TRECVid 2019 Video to Text dataset, containing more action rich video content as compare with the 2019 task. In this paper a description of some aspects of this task is provided, including its main characteristics, a description of the collection, the ground truth dataset, evaluation metrics and the requirements for the run submission. Presented by: Rukiye Savran Kiziltepe

Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper45.pdf Benoit Bonnet, Teddy Furon and Patrick Bas : Fooling an Automatic Image Quality Estimator. Proc. of MediaEval 2020, 14-15 December 2020, Online. In this paper we present our work on the 2020 MediaEval task: Pixel "Privacy: Quality Camouflage for Social Images". Blind Image Quality Assessment (BIQA) is a classifier that for any given image will return a quality score. Our task is to modify an image to decrease its BIQA score while maintaining a good perceived quality. Since BIQA is a deep neural network, we worked on an adversarial attack approach of the problem.

Fooling an Automatic Image Quality Estimator

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper16.pdf YouTube: https://youtu.be/ix_b9K7j72w Zhengyu Zhao : Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable Color Filter. Proc. of MediaEval 2020, 14-15 December 2020, Online. This paper presents the submission of our RU-DS team to the Pixel Privacy Task 2020. We propose to fool the blind image quality assessment model by transforming images based on optimizing a human-understandable color filter. In contrast to the common work that relies on small, $L_p$-bounded additive pixel perturbations, our approach yields large yet smooth perturbations. Experimental results demonstrate that in the specific context of this task, our approach is able to achieve strong adversarial effects, but has to sacrifice the image appeal. Presented by: Zhengyu Zhao

Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper77.pdf YouTube: https://youtu.be/8Rr4KknGSac Zhuoran Liu, Zhengyu Zhao, Martha Larson and Laurent Amsaleg : Pixel Privacy: Quality Camouflage for Social Images. Proc. of MediaEval 2020, 14-15 December 2020, Online. High-quality social images shared online can be misappropriated for unauthorized goals, where the quality filtering step is commonly carried out by automatic Blind Image Quality Assessment (BIQA) algorithms. Pixel Privacy benchmarks privacy-protective approaches that protect privacy-sensitive images against unethical computer vision algorithms. In the 2020 task, participants are encouraged to develop camouflage methods that can effectively decrease the BIQA quality score of high-quality images and maintain image appeal. The camouflaged images need to be either imperceptible to the human eye, or it can be a visible enhancement. Presented by: Zhuoran Liu

Pixel Privacy: Quality Camouflage for Social Images

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper73.pdf YouTube: https://youtu.be/TadJ6y7xZeA Thuc Nguyen-Quang, Tuan-Duy Nguyen, Thang-Long Nguyen-Ho, Anh-Kiet Duong, Xuan-Nhat Hoang, Vinh-Thuyen Nguyen-Truong, Hai-Dang Nguyen and Minh-Triet Tran : HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching. Proc. of MediaEval 2020, 14-15 December 2020, Online. Matching text and images based on their semantics has an important role in cross-media retrieval. However, text and images in articles have a complex connection. In the context of MediaEval 2020 Challenge, we propose three multi-modal methods for mapping text and images of news articles to the shared space in order to perform efficient cross-retrieval. Our methods show systemic improvement and validate our hypotheses, while the best-performed method reaches a recall@100 score of 0.2064. Presented by: Thuc Nguyen-Quang

HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper72.pdf Sabarinathan D and Suganya Ramamoorthy : Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attention Unit. Proc. of MediaEval 2020, 14-15 December 2020, Online. Colorectal cancer is the third most common cause of cancer worldwide. In the era of medical Industry, identifying colorectal cancer in its early stages has been a challenging problem. Inspired by these issues, the main objective of this paper is to develop a Multi supervision net algorithm for segmenting polys on a comprehensive dataset. The risk of colorectal cancer could be reduced by early diagnosis of poly during a colonoscopy. The disease and their symptoms are highly varying and always a need for a continuous update of knowledge for the doctors and medical analyst. The diseases fall into different categories and a small variation of symptoms may lead to higher rate of risk. We have taken Medico polyp challenge dataset, which consists of 1000 segmented polyp images from gastrointestinal track. We proposed an efficient Net B4 as a pre-trained architecture in multi-supervision net. The model is trained with multiple output layers. We present quantitative results on colorectal dataset to evaluate the performance and achieved good results in all the performance metrics. The experimental results proved that the proposed model is robust and provides a good level of accuracy in segmenting polyps on a comprehensive dataset for different metrics such as Dice coefficient, Recall, Precision and F2.

Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper47.pdf YouTube: https://youtu.be/vMsM4zg2-JY Tien-Phat Nguyen, Tan-Cong Nguyen, Gia-Han Diep, Minh-Quan Le, Hoang-Phuc Nguyen-Dinh, Hai-Dang Nguyen and Minh-Triet Tran : HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ for Polyps Segmentation. Proc. of MediaEval 2020, 14-15 December 2020, Online. The Medico task, MediaEval 2020, explores the challenge of building accurate and high-performance algorithms to detect all types of polyps in endoscopic images. We proposed different approaches leveraging the advantages of either ResUnet++ or PraNet model to efficiently segment polyps in colonoscopy images, with modifications on the network structure, parameters, and training strategies to tackle various observed characteristics of the given dataset. Our methods outperform the other teams' methods, for both accuracy and efficiency. After the evaluation, we are at top 2 for task 1 (with Jaccard index of 0.777, best Precision and Accuracy scores) and top 1 for task 2 (with 67.52 FPS and Jaccard index of 0.658).

HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper31.pdf Syed Muhammad Faraz Ali, Muhammad Taha Khan, Syed Unaiz Haider, Talha Ahmed, Zeshan Khan and Muhammad Atif Tahir : Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Intestinal Tract. Proc. of MediaEval 2020, 14-15 December 2020, Online. Identification of polyps in endoscopic images is critical for the diagnosis of colon cancer. Finding the exact shape and size of polyps requires the segmentation of endoscopic images. This research explores the advantage of using depth-wise separable convolution in the atrous convolution of the ResUNet++ architecture. Deep atrous spatial pyramid pooling was also implemented on the ResUNet++ architecture. The results show that architecture with separable convolution has a smaller size and fewer GFLOPs without degrading the performance too much.

Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper22.pdf Debapriya Banik and Debotosh Bhattacharjee : Deep Conditional Adversarial learning for polyp Segmentation. Proc. of MediaEval 2020, 14-15 December 2020, Online. This approach has addressed the Medico automatic polyp segmentation challenge which is a part of Mediaeval 2020. We have proposed a deep conditional adversarial learning based network for the automatic polyp segmentation task. The network comprises of two interdependent models namely a generator and a discriminator. The generator network is a FCN employed for the prediction of the polyp mask while the discriminator enforces the segmentation to be as similar as the real segmented mask (ground truth). Our proposed model achieved a comparative result on the test dataset provided by the organizers of the challenge.

Deep Conditional Adversarial learning for polyp Segmentation

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper21.pdf Hwang Maxwell, Wu Cai, Hwang Kao-Shing, Xu Yong Si and Wu Chien-Hsing : A Temporal-Spatial Attention Model for Medical Image Detection. Proc. of MediaEval 2020, 14-15 December 2020, Online. A local region model with attentive temporal-spatial pathways is proposed for automatically learning various target structures. The attentive spatial pathway highlights the salient region to generate bounding boxes and ignores irrelevant regions in an input image. The proposed attention mechanism allows efficient object localization and the overall predictive performance is increased because there are fewer false positives for the object detection task for medical images with manual annotations. The experimental results show that proposed models consistently increase the base architectures' predictive performance for different datasets and training sizes without undue computational efficiency.

A Temporal-Spatial Attention Model for Medical Image Detection

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper20.pdf YouTube: https://youtu.be/CVelQl5Luf0 Quoc-Huy Trinh, Minh-Van Nguyen, Thiet-Gia Huynh and Minh-Triet Tran : HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Network and UNet for Polyps Segmentation. Proc. of MediaEval 2020, 14-15 December 2020, Online. The Medico: Multimedia Task focuses on developing an efficient and accurate framework to computer-aided diagnosis systems for automatic polyp segmentation to detect all types of polyps in endoscopic images of the gastrointestinal (GI) tract. We are HCMUS-team approach a solution, which includes combination Residual module, Inception module, Adaptive Convolutional neural network with Unet model and PraNet to semantic segmentation all types of polyps in endoscopic images. We submit multiple runs with different architecture and parameters in our model. Our methods show potential results in accuracy and efficiency through multiple experiments.

HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper15.pdf Rabindra Khadka : Transfer of Knowledge: Fine-tuning for Polyp Segmentation with Attention. Proc. of MediaEval 2020, 14-15 December 2020, Online. This paper describes how the transfer of prior knowledge can effectively take on segmentation tasks with the help of attention mechanisms. The UNet model pretrained on brain MRI dataset was fine-tuned with the polyp dataset. Attention mechanism was integrated to focus on relevant regions in the input images. The implemented architecture is evaluated on 200 validation images based on intersection over union and dice score between groundtruth and predicted region. The model demonstrates a promising result with computational efciency.

Fine-tuning for Polyp Segmentation with Attention

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper12.pdf Adrian Krenzer and Frank Puppe : Bigger Networks are not Always Better: Deep Convolutional Neural Networks for Automated Polyp Segmentation. Proc. of MediaEval 2020, 14-15 December 2020, Online. This paper presents our team's (AI-JMU) approach to the Medico automated polyp segmentation challenge. We consider deep convolutional neural networks to be well suited for this task. To determine the best architecture we test and compare state of the art backbones and two different heads. Finally we achieve a Jaccard index of 73.74\% on the challenge test set. We further demonstrate that bigger networks do not always perform better. However the growing network size always increases the computational complexity.

Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper51.pdf Amel Ksibi, Amina Salhi, Ala Alluhaidan and Sahar A. El-Rahman : Insights for wellbeing: Predicting Personal Air Quality Index using Regression Approach. Proc. of MediaEval 2020, 14-15 December 2020, Online. Providing air pollution information to individuals enables them to understand the air quality of their living environments. Thus, the association between people’s wellbeing and the properties of the surrounding environment is an essential area of investigation. This paper proposes Air Quality Prediction through harvesting public/open data and leveraging them to get the Personal Air Quality index. These are usually incomplete. To cope with the problem of missing data, we applied the KNN imputation method. To predict Personal Air Quality Index, we apply a voting regression approach based on three base regressors which are Gradient Boosting regressor, Random Forest regressor, and linear regressor. Evaluating the experimental results using the RMSE metric, we got an average score of 35.39 for Walker and 51.16 for Car.

Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper40.pdf YouTube: https://youtu.be/SL5Hvu1mARY Trung-Quan Nguyen, Dang-Hieu Nguyen and Loc Tai Tan Nguyen : Use Visual Features From Surrounding Scenes to Improve Personal Air Quality Data Prediction Performance. Proc. of MediaEval 2020, 14-15 December 2020, Online. In this paper, we propose a method to predict the personal air quality index in an area by using the combination of the levels of the following pollutants: PM2.5, NO2, and O3, measured from the nearby weather stations of that area, and the photos of surrounding scenes taken at that area. Our approach uses the Inverse Distance Weighted (IDW) technique to estimate the missing air pollutant levels and then use regression to integrate visual features from taken photos to optimize the predicted values. After that, we can use those values to calculate the Air Quality Index (AQI). The results show that the proposed method may not improve the performance of the prediction in some cases.

Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...

multimediaeval

More from multimediaeval (20)

Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...

HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...

Sports Video Classification: Classification of Strokes in Table Tennis for Me...

Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...

Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task

Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...

Fooling an Automatic Image Quality Estimator

Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...

Pixel Privacy: Quality Camouflage for Social Images

HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching

Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...

HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...

Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...

Deep Conditional Adversarial learning for polyp Segmentation

A Temporal-Spatial Attention Model for Medical Image Detection

HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...

Fine-tuning for Polyp Segmentation with Attention

Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...

Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...

Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...

Recently uploaded

How to Give a Domain for a Field in Odoo 17

Celine George

2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx

MaritesTamaniVerdade

How to Manage Global Discount in Odoo 17 POS

Celine George

How to setup Pycharm environment for Odoo 17.pptx

Celine George

Graduate Outcomes Presentation Slides - English

neillewis46

Food safety_Challenges food safety laboratories_.pdf

Sherif Taha

Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx

Pooja Bhuva

Wellbeing inclusion and digital dystopias.pptx

Jisc

Sociology 101 Demonstration of Learning Exhibit

jbellavia9

Towards a code of practice for AI in AT.pptx

Jisc

On National Teacher Day, meet the 2024-25 Kenan Fellows

Mebane Rash

The basics of sentences session 3pptx.pptx

heathfieldcps1

HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx

Esquimalt MFRC

Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...

Pooja Bhuva

Application orientated numerical on hev.ppt

RamjanShidvankar

ICT Role in 21st Century Education & its Challenges.pptx

AreebaZafar22

Fostering Friendships - Enhancing Social Bonds in the Classroom

Pooky Knightsmith

Single or Multiple melodic lines structure

dhanjurrannsibayan2

REMIFENTANIL: An Ultra short acting opioid.pptx

Dr. Ravikiran H M Gowda

NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...

Amil baba

Recently uploaded (20)

How to Give a Domain for a Field in Odoo 17

2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx

How to Manage Global Discount in Odoo 17 POS

How to setup Pycharm environment for Odoo 17.pptx

Graduate Outcomes Presentation Slides - English

Food safety_Challenges food safety laboratories_.pdf

Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx

Wellbeing inclusion and digital dystopias.pptx

Sociology 101 Demonstration of Learning Exhibit

Towards a code of practice for AI in AT.pptx

On National Teacher Day, meet the 2024-25 Kenan Fellows

The basics of sentences session 3pptx.pptx

HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx

Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...

Application orientated numerical on hev.ppt

ICT Role in 21st Century Education & its Challenges.pptx

Fostering Friendships - Enhancing Social Bonds in the Classroom

Single or Multiple melodic lines structure

REMIFENTANIL: An Ultra short acting opioid.pptx

NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...

MediaEval 2015 - IRISA at MediaEval 2015: Search and Anchoring in Video Archives Task

1. IRISA at MediaEval 2015: Search and Anchoring in Video Archives Task Anca Simon, Pascale Sébillot, Guillaume Gravier

2. System overview Textual representation of a video cos similarity W1 W2 … Wn V1 V2 … Vm visual query textual query search Rank all segments based on the cohesion score: anchor Goal  Segments of variable length, topically focused at various levels of details with precise jump-in points 214/09/2015 MediaEval

3. Leverage the burstiness phenomenon in word occurrences •Bursty words: characterized by long inter-arrival times followed by short inter- arrival times; • Non-bursty words: exhibit inter-arrival times with smaller variance. Starting point: Kleinberg’s algorithm (Kleinberg, 2002) Hierarchy of topically focused fragments (HTFF) 314/09/2015 MediaEval

4. Hierarchy of topically focused fragments (HTFF) Agglomerative clustering of burst intervals For each term in the textual representation of a video compute the hierarchy of burst intervals. 414/09/2015 MediaEval

5. Search sub-task 0.01 -> 10.5 (place, ballroom, color, king, royal, rock, tweed, palace, etc.) 0.01 -> 0.10 (ballroom) 0.01- > 1.50 (royal, ballroom, palace, etc.) 0.18 -> 1.50 (royal, site, etc.) 1.33 -> 1.50 (royal, palace) 2.29- > 3.21 (build, prosperous) 7.43- > 8.35 (friend, Picasso) … … … Automatic transcript: Castle in the country (bbctwo) [start time: 0.01 -> end time: 29.23] Text query: history, palace, tudor (I am looking for documentaries on British history) 514/09/2015 MediaEval

6. Search sub-task (contd.) 34.54 -> 39.52 (woman, store, train, roof, racer, mirror, route, cowboy hat, etc.) 36.38- > 36.42 (race car) 36.49 - > 37.1 (train, store, route) 39.3 -> 39.52 (girl, cowboy hat) … Visual query: train (I am looking for footage from the history of the British railways) … … relevant Visual concepts representation: Comedy map of Britain (bbctwo) [start time: 0.01 -> end time: 59.55] 614/09/2015 MediaEval

7. Search results SEARCH Binned relevance Tolerance to irrelevance P@5 P@10 P@20 P@5 P@10 P@20 P@5 P@10 P@20 Textual query 0.34 0.31 0.19 0.34 0.31 0.19 0.34 0.31 0.19 Visual query 0.12 0.11 0.06 0.12 0.11 0.06 0.12 0.11 0.06 714/09/2015 MediaEval

8. Anchoring results ANCHORING Precision Recall MRR LIMSI 0.557 0.435 0.773 MANUAL 0.469 0.38 0.735 - average number of anchors/video: 19.03 (CI at 95% [18.4,19.65] ) 814/09/2015 MediaEval

9. Conclusion and future work Leveraged a new topical structure for search and anchoring: Hierarchy of topically focused fragments -relies on the burstiness phenomenon in word reoccurrences Future work: • Study the impact of the granularity levels in the HTFF; • Combine visual and textual bursts; • Integrate semantic relations in the burst detection algorithm. 914/09/2015 MediaEval

10. Kleinberg’s algorithm 1014/09/2015 MediaEval

MediaEval 2015 - IRISA at MediaEval 2015: Search and Anchoring in Video Archives Task

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (15)

More from multimediaeval

More from multimediaeval (20)

Recently uploaded

Recently uploaded (20)

MediaEval 2015 - IRISA at MediaEval 2015: Search and Anchoring in Video Archives Task