A Discriminative Feature Learning Approach
for Deep Face Recognition
Yandong Wen, Kaipeng Zhang, Zhifeng Li and Yu Qiao
Paper Seminar @ SK Telecom: Jisung Kim
Separable vs Discriminative
A toy example : What’s wrong with Softmax
1. m = mini-batch size
2. n = the number of classes
3. x_i = feature vector in R^d (d is the feature dimension)
4. W ∈ R^(d×n) (weights of the last fully connected layer), b ∈ R^n (bias)
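For reference, the softmax loss the slide is questioning, in the paper's notation (x_i is the deep feature of the i-th sample, with label y_i):

L_S = -\sum_{i=1}^{m} \log \frac{e^{W_{y_i}^{T} x_i + b_{y_i}}}{\sum_{j=1}^{n} e^{W_{j}^{T} x_i + b_j}}

It only encourages the classes to be separable; nothing in it pulls features of the same class together.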
Is it good for clustering?
The deep features are separable, but not discriminative enough: intra-class variation remains large.
Is it good for clustering?
MNIST toy example : Training Set (50K) / Test Set (10K)
Let’s be discriminative by Center Loss
1. m = mini-batch size
2. c_{y_i} = the y_i-th class center, in R^d
3. x_i = feature vector in R^d (d is the feature dimension)
4. But! c_{y_i} must be updated as the deep features change.
5. Ideally that means averaging the features of each class over the entire training set in every iteration, which is impractical.
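The center loss itself penalizes the distance between each deep feature and its class center:

L_C = \frac{1}{2} \sum_{i=1}^{m} \left\lVert x_i - c_{y_i} \right\rVert_2^2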
Let’s be discriminative by Center Loss
1. Two modifications
a. Update the centers from each mini-batch rather than the whole training set
b. Update the centers with a learning rate α, to damp perturbations from mislabelled samples
2. Total loss = Softmax Loss + λ · Center Loss
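In the paper's notation, the mini-batch center update (with indicator δ) and the joint objective are:

\Delta c_j = \frac{\sum_{i=1}^{m} \delta(y_i = j)\,(c_j - x_i)}{1 + \sum_{i=1}^{m} \delta(y_i = j)}, \qquad c_j^{t+1} = c_j^{t} - \alpha \, \Delta c_j^{t}

L = L_S + \lambda L_C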
Varying λ with Loss = Softmax + λ · Center
The discriminative feature learning algorithm
CNN Architecture → https://github.com/ydwen/caffe-face
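The released implementation is Caffe-based (repo above). As a minimal illustrative sketch only, a PyTorch-style module for the center loss and its manual center update could look like the following; the class and method names are mine, not from the authors' code:

```python
import torch
import torch.nn as nn

class CenterLoss(nn.Module):
    """Sketch of the center loss (Wen et al., ECCV 2016).
    Hypothetical PyTorch port; the official release is Caffe-based."""

    def __init__(self, num_classes: int, feat_dim: int, alpha: float = 0.5):
        super().__init__()
        self.alpha = alpha  # center update rate (the paper's alpha)
        # Class centers c_j; updated by the rule below, not by the optimizer.
        self.register_buffer("centers", torch.zeros(num_classes, feat_dim))

    def forward(self, feats: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
        # L_C = 1/2 * sum_i ||x_i - c_{y_i}||_2^2 over the mini-batch
        return 0.5 * (feats - self.centers[labels]).pow(2).sum()

    @torch.no_grad()
    def update_centers(self, feats: torch.Tensor, labels: torch.Tensor) -> None:
        # Delta c_j = sum_{y_i = j} (c_j - x_i) / (1 + #{i : y_i = j})
        for j in labels.unique():
            mask = labels == j
            delta = (self.centers[j] - feats[mask]).sum(dim=0) / (1 + mask.sum())
            self.centers[j] -= self.alpha * delta
```

A training step would then compute loss = softmax_loss + λ · center(feats, labels), backpropagate, step the optimizer, and finally call center.update_centers(feats.detach(), labels).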
Compared to Siamese and Triplet
1. Dramatic data expansion
a. Must construct (x_i, x_j) pairs or (x_anchor, x_positive, x_negative) triplets, so the effective training set grows combinatorially
2. Hard to mine informative pairs/triplets → hard to keep the loss decreasing → hard to train
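For context, the standard forms of the contrastive (pairwise) and triplet losses being compared; these forms come from the contrastive/triplet-loss literature, not from this slide (d is Euclidean distance, y ∈ {0, 1} marks a genuine pair, and m is a margin):

L_{contrastive} = y \, d(x_i, x_j)^2 + (1 - y) \max\big(0,\; m - d(x_i, x_j)\big)^2

L_{triplet} = \max\big(0,\; \lVert f_a - f_p \rVert_2^2 - \lVert f_a - f_n \rVert_2^2 + m\big)

Both are defined over pairs or triplets, which is why the training data expands combinatorially and why mining matters so much.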
The Devil is in the details.
1. Preprocessing
2. Training Data
3. Detailed Settings in CNNs
4. Detailed Settings in Testing
Implementation Details : Preprocessing
1. Cropped face size = (112x96x3)
a. Subtract 127.5
b. Divide by 128
c. Every pixel value then lies in [-1, 1]
2. Face detection by MTCNN
3. Use 5 landmarks
a. Two Eyes
b. One Nose
c. Two Mouth Corners
4. Alignment : Similarity Transformation.
a. Rotation, Translation, Scaling
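A minimal sketch of the normalization arithmetic above (NumPy; the function name is mine, and MTCNN detection plus similarity-transform alignment are assumed to have produced the 112x96 crop already):

```python
import numpy as np

def normalize_face(face: np.ndarray) -> np.ndarray:
    """Normalize an aligned 112x96x3 face crop as in the paper:
    subtract 127.5, divide by 128, so values land in [-1, 1]."""
    assert face.shape == (112, 96, 3)
    return (face.astype(np.float32) - 127.5) / 128.0
```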
Implementation Details : Training data
1. 17,189 unique persons
2. 0.7 M images
a. CASIA-WebFace : 0.49 M, 10,575
b. CACD2000 : 0.16 M, 2,000
c. Celebrity+ : 0.20 M, 10,177
3. Overlapping identities (across the datasets and with the test sets) are removed!
4. Data augmentation : only horizontal flipping
Implementation Details : Detailed Settings in CNNs
1. Model A : Softmax Loss
2. Model B : Softmax Loss & Contrastive Loss
3. Model C : Softmax Loss & Center Loss
4. Batch Size : 256
5. GPUs : 2 x Titan X
6. Learning rate : starts at 0.1
7. Models A, C
a. Divided by 10 at 16K and 24K iterations
b. Training completes at 28K iterations (roughly 14 hours)
8. Model B
a. Divided by 10 at 24K and 36K iterations
b. Training completes at 42K iterations (roughly 22 hours)
Implementation Details : Detailed Settings in Testing
1. Deep Features : first FC layer
2. Extract two features
a. From the original image
b. From the horizontally flipped image
3. Concatenate the two
4. Reduce dimensionality with PCA
5. Score by cosine distance
6. Identification
a. Nearest Neighbor
7. Verification
a. Threshold Comparison
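A sketch of this test-time pipeline (`net` is a hypothetical callable standing in for the first-FC-layer feature extractor; PCA is omitted for brevity):

```python
import numpy as np

def extract_feature(net, image: np.ndarray) -> np.ndarray:
    """Concatenate features of the image and its horizontal flip,
    per the paper's test protocol."""
    feat = net(image)
    feat_flip = net(image[:, ::-1, :])  # flip along the width axis (H, W, C)
    return np.concatenate([feat, feat_flip])

def cosine_score(a: np.ndarray, b: np.ndarray) -> float:
    # Verification: compare this score against a threshold.
    # Identification: pick the gallery entry with the highest score.
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))
```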
λ & α on LFW
LFW (images) & YTF (videos)
Verification Performance (λ=0.003 & α=0.5)
MegaFace
1. Gallery Set : 690,000 persons / 1 million images (distractors)
2. Probe set
a. FaceScrub : 530 persons / 100K images
b. FGNet : 82 persons / 1,002 images (ages varying from 0 to 69)
3. Small / Large Protocol
a. Small : training set under 0.5M images and under 20K persons
b. Large : training set over 0.5M images or over 20K persons
4. Face Identification
a. From 1 probe vs 10 distractors
b. Up to 1 probe vs 1,000,000 distractors
5. Face Verification
a. 4 billion = 4,000,000,000 negative pairs
MegaFace : Identification (1M distractors)
MegaFace : Verification (10^-6 FAR, 1M distractors)
FaceScrub
1. 530 celebrities
2. 100K images
FGNet
1. 82 persons
2. 1,002 images
MegaFace : Identification (FaceScrub)
MegaFace : Verification (FaceScrub)
MegaFace : Identification (FGNet)
MegaFace : Verification (FGNet)
Conclusions
1. Use center loss to learn discriminative features!
2. Much room remains for performance improvement
a. To meet practical demands
b. Identification : rank-1 accuracy with 1M distractors
c. Verification : 10^-6 FAR with 1M distractors
