Paper: http://ceur-ws.org/Vol-2882/paper16.pdf
YouTube: https://youtu.be/ix_b9K7j72w
Zhengyu Zhao: Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable Color Filter. Proc. of MediaEval 2020, 14-15 December 2020, Online.
This paper presents the submission of our RU-DS team to the Pixel Privacy Task 2020. We propose to fool a blind image quality assessment (BIQA) model by transforming images with a human-understandable color filter whose parameters are optimized adversarially. In contrast to common work that relies on small, $L_p$-bounded additive pixel perturbations, our approach yields large yet smooth perturbations. Experimental results demonstrate that, in the specific context of this task, our approach achieves strong adversarial effects, but at the cost of image appeal.
Presented by: Zhengyu Zhao
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable Color Filter
1. RU-DS @ Pixel Privacy Task 2020:
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable Color Filter
Zhengyu Zhao
Radboud University (Netherlands)
2. Fooling ML models with small, imperceptible perturbations
Szegedy et al. "Intriguing properties of neural networks.", ICLR 2014.
Kurakin et al. "Adversarial examples in the physical world.", ICLR 2017.
[Figure: original image + small additive perturbations → adversarial image, both classified by a CNN]
Vulnerable to image processing (e.g. JPEG compression)
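As a concrete baseline for this family of attacks, here is a minimal $L_\infty$ PGD sketch in PyTorch; the `model`, the cross-entropy loss, and the hyperparameters are illustrative assumptions, not details from the paper:

    import torch
    import torch.nn.functional as nnF

    def pgd_attack(model, x, y, eps=8/255, alpha=2/255, steps=10):
        """L_inf-bounded PGD: small additive pixel perturbations."""
        x_adv = x.clone().detach()
        for _ in range(steps):
            x_adv.requires_grad_(True)
            loss = nnF.cross_entropy(model(x_adv), y)
            grad = torch.autograd.grad(loss, x_adv)[0]
            # Ascend the loss, then project back into the eps-ball around x.
            x_adv = x_adv.detach() + alpha * grad.sign()
            x_adv = torch.min(torch.max(x_adv, x - eps), x + eps).clamp(0, 1)
        return x_adv

Such pixel-level noise is exactly what JPEG compression tends to destroy, which motivates the structured alternatives on the next slides.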
3. Fooling ML models with large yet unnoticeable perturbations
[Figure: examples of [1] semantic manipulation, [2] spatial transformation, and [3] DL-based colorization; predicted labels include "golf-cart" and "trailer truck"]
Limitations: domain-specific; costly and not human-understandable; limited action space
[1] Qiu et al. "SemanticAdv: Generating Adversarial Examples via Attribute-conditioned Image Editing." ECCV 2020.
Eykholt et al. "Robust physical-world attacks on deep learning visual classification." CVPR 2018.
[2] Engstrom et al. "Exploring the landscape of spatial robustness." ICML 2019.
[3] Bhattad et al. "Unrestricted adversarial examples via semantic manipulation." ICLR 2020.
4. Our Adversarial Color Enhancement (ACE) [1]
Simple and human-understandable filter [2]
Generally applicable to all kinds of color images
Relatively large action space (hundreds of parameters)
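To make "human-understandable filter" concrete, here is a minimal sketch of one plausible parameterization in the spirit of [2]: a differentiable, monotonic piecewise-linear tone curve per RGB channel. The function name, tensor shapes, and parameter count are illustrative assumptions, not the paper's exact filter (the formulations below then optimize these curve parameters):

    import torch

    def color_curve_filter(x, theta):
        """Monotonic piecewise-linear tone curve per RGB channel (illustrative).

        x:     images in [0, 1], shape (B, 3, H, W)
        theta: raw parameters, shape (3, K); softmax keeps the per-piece
               slopes positive, so each channel's curve stays monotonic.
        """
        K = theta.shape[1]
        slopes = torch.softmax(theta, dim=1)                 # (3, K), rows sum to 1
        knots = torch.cumsum(slopes, dim=1)                  # curve height at piece ends
        knots = torch.cat([torch.zeros_like(knots[:, :1]), knots], dim=1)  # (3, K+1)
        pos = (x * K).clamp(0, K - 1e-4)                     # position along the curve
        lo = pos.floor().long()                              # index of the active piece
        frac = pos - lo.float()                              # offset within that piece
        out = torch.empty_like(x)
        for c in range(3):                                   # per-channel curve lookup
            out[:, c] = knots[c][lo[:, c]] + frac[:, c] * slopes[c][lo[:, c]]
        return out

With theta = 0 the slopes are uniform and the curve is the identity, so optimization starts from the unmodified image.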
C&W [4]:
ACE-Ins:
2
)
,
(
min
y
x
L
ε
y
x
F
L
s.t.
),
,
)
(
(
min 2
)
(
)
),
(
(
min ins
x
x
F
y
x
F
L
ε
y
x
L
s.t.
),
,
(
min
PGD [3]:
ACE-PGD:
Our Adversarial Color Enhancement (ACE) [1]
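Under the ACE-PGD formulation above, a minimal optimization loop might look as follows. It reuses the hypothetical `color_curve_filter` sketched earlier and assumes a differentiable `model` and a target label; the bound, learning rate, and step count are illustrative, not the paper's settings:

    import torch
    import torch.nn.functional as nnF

    def ace_pgd(model, x, y_target, K=64, eps=0.5, lr=0.05, steps=50):
        """Optimize the filter parameters theta instead of raw pixels."""
        theta0 = torch.zeros(3, K)      # softmax(0): uniform slopes = identity curve
        theta = theta0.clone().requires_grad_(True)
        opt = torch.optim.Adam([theta], lr=lr)
        for _ in range(steps):
            # Targeted objective: minimize the loss toward y_target.
            loss = nnF.cross_entropy(model(color_curve_filter(x, theta)), y_target)
            opt.zero_grad()
            loss.backward()
            opt.step()
            with torch.no_grad():       # L_inf bound around the identity filter
                theta.clamp_(theta0 - eps, theta0 + eps)
        return color_curve_filter(x, theta).detach()

Because the perturbation is expressed as a global tone curve, the result is a large but smooth color shift rather than pixel-level noise.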
5. Results
Half of the images are predicted as low-quality
Robust against JPEG compression
Unacceptable image appeal
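To check the JPEG-robustness claim, one can re-score a protected image after an in-memory JPEG round trip. A small Pillow sketch; the `biqa_score` callable is a hypothetical stand-in for the task's assessment model:

    import io
    from PIL import Image

    def jpeg_roundtrip(img, quality=75):
        """Re-encode as JPEG in memory, as a proxy for a sharing pipeline."""
        buf = io.BytesIO()
        img.save(buf, format="JPEG", quality=quality)
        buf.seek(0)
        return Image.open(buf).convert("RGB")

    # protected = Image.open("protected.png")   # filtered image (illustrative)
    # print(biqa_score(protected), biqa_score(jpeg_roundtrip(protected)))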
7. Examples of fooling an ImageNet classifier
[Figure: adversarial examples from (a) PGD, (b) Colorization, (c) ACE-PGD, and ACE-Ins]
Conjecture:
The BIQA model tends to rely on high-frequency features for quality assessment
→ robust against the low-frequency perturbations produced by ACE
An ImageNet classifier learns both low-frequency (e.g. shape) and high-frequency (e.g. texture) features for object classification
→ less robust against the low-frequency perturbations produced by ACE
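One simple way to probe this conjecture is to measure how much of a perturbation's energy sits at low spatial frequencies. A NumPy sketch; the disc radius defining "low frequency" is an arbitrary assumption:

    import numpy as np

    def low_freq_energy_ratio(x, x_adv, radius_frac=0.1):
        """Fraction of the perturbation's spectral energy inside a
        low-frequency disc around the DC component."""
        delta = (x_adv - x).mean(axis=-1)                 # (H, W): average channels
        spec = np.abs(np.fft.fftshift(np.fft.fft2(delta))) ** 2
        h, w = spec.shape
        yy, xx = np.ogrid[:h, :w]
        low = (yy - h / 2) ** 2 + (xx - w / 2) ** 2 <= (radius_frac * min(h, w)) ** 2
        return spec[low].sum() / spec.sum()

    # Expect a ratio near 1 for ACE's smooth color shifts and a much
    # lower one for L_inf pixel noise such as PGD.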