SlideShare a Scribd company logo
1 of 38
Download to read offline
Caricature Recognition and
Generation
Introduction
● Caricatures are facial drawings by artists with
exaggeration on certain facial parts. A caricature
can be rendered in two ways: it can be either
exaggerated or oversimplified.
Our Project In a Nutshell:
• Cartoon gender identification: It is binary classification problem
where given a cartoon face the system has to answer whether it is
a male or a female
• Cartoon face recognition: The problem of cartoon face
recognition is to recognize the given cartoon face as one of the C
classes.
• Cartoon face generation: Given a real face of public figure,
generate a caricature in various moods, that has exaggerated
identifiable facial features.
To further improve the accuracy of above tasks, 3 experiments for
landmark extraction of caricature images:
● Dlib facial landmark extractor
● Flood-fill algorithm
● Manual annotation assisted Keypoints extraction of Images
Applications
● Can help visually impaired people to
understand cartoon images or movies.
● Can be used to automatically censor
communal or politically incorrect
cartoons in the social media.
Caricature Generation
Experiments with CNN
Facial Landmark Extraction
Pre-processing
Summary of roles/responsibilities
● Pratik Parwal (20148093): Problem Analysis, Landmark
Detection using manually annotated dataset, report
making.
● Nikhil Agarwal (20144092): Problem Analysis, Face
averaging, Training and Transformation for caricature
generation, report making.
● Richa Pandey (20144126): Problem Analysis,Dataset
parsing, detecting face boundary using flood-fill algorithm,
report making.
● Saurav Jha (20144007): Problem Analysis,Experiments
with CNN modules, Data set augmentation, report making.
● Pramee Chowdhury (20144136):Problem Analysis, Face
detection, alignment, normalization, Java app for manual
annotation of caricature landmarks, report making.
Task 1: Caricature Recognition
Dataset Used
The recently released Cartoon
faces in the Wild (IIIT-CFW) [11] database
contains a total of 8,928 annotated images of
cartoon faces of 100 public figures. The real
human faces of these public figures have also
been provided.
A number of attributes for each cartoon
image provided.
Attributes in XML form
● Type of cartoon: cartoon, cartoon
sketch, caricature
● Pose: frontal, non frontal
● Expression: happy, sad, thoughtful,
seductive, sorrow, angry, serious, fright-
ened, crying, shocked
● Age group: young, old
● Gender: male, female
● Glass: yes, no
Data Pre-processing
● We perform XML parsing for each caricature image
to extract the information about their name and
gender.
● Each image was first converted into grayscale to
convert the pixel values in the range (0,255)
followed by the extraction of the face bounding box
and then re-sized to the 96x96 size.
● Finally, we get a list of pixel values along with their
corresponding classes ((1-100 ) for the face
recognition task and (0, 1) for the gender
identification task).
Facial Landmark Extraction
L1: Face Region Candidate Extraction
using Flood-Fill.
We make the following assumptions :
• The skin color of the image would be nearly even.
• There is proper distinction between a caricature face and
its surrounding.
The algorithm [14] starts from a skin color pixel (generally,
center of the image), and gradually moves to all its
neighboring pixels if the difference between the two pixels is
less than a certain threshold.
L2 : Dlib Face Landmark Detector
Detects a total of 68 facial landmark points
L3: Manual Key-points annotations
of caricature images.
We chose a subset of 20 personalities out of 100 in the
data-set and manually annotated 15 images of each. In order
to do this, a Java swing based application was built to
manually get the (x, y) coordinates of the 15 selected
landmark points.
Then the system was trained to get the
landmark points of an image given its pixel
intensities.
Architecture of the CNN model. Input and output are normalised. Loss function used was
‘mse’ with SGD optimizer.
System Description
● For evaluating the accuracy of the sub-tasks for
gender identification and face verification, we
experimented with different model configurations of
Convolutional Neural Networks(CNNs).
● CNNs [3] have fully-connectedness replaced with
convolutions using filters, they perform much better
for applications that has structured inputs which can
be filtered using such convolutions.
1. System1: 1-D Convolutional Neural Nets
2. System2: 2-D Convolutional Neural Nets
3. System 3: 2-D CNN with Data Augmentation
SYSTEM 1: 1-D CNNs
● Filters moving in just one direction and
are generally used for problems involving
text classification and signal smoothing.
SYSTEM 2: 2-D CNNs
● Believed to be more suited for tasks
requiring inherent assumption of the
two dimensional spatial patterns of
images, such as ours.
2-D CNN based model configuration used by us for the tasks of gender
identification and face verification.
SYSTEM 3: 2D CNN + Data
Augmentation
● We carried out horizontal flips of the
training images followed by a 5 degree
anti-clockwise rotation of the images.
● Horizontal flip preserves the symmetry
of human-like faces along with helping
in doubling the train data size.
RESULTS AND ANALYSIS
● Gender Identification and Face verification
1. Using Only List of Pixel Values
- We report our accuracy on all 100 classes using the entire
list of pixel values.
- All the three systems described above were first trained
on a total of 7138 caricature images (with a validation split
of 0.2) and then tested on 1785 images.
- Number of training instances for male faces being 5242
while that for female faces being 1896, dataset suffers
from class imbalance problem.
- Because of gender class imbalance problem, we show the
results of our systems individually on male and female
classes.
Using Landmark points of L3 combined with
pixel values
● Effect of the landmark points on the accuracy
obtained by four machine learning classifiers, namely
LinearSVC, Random Forest, Gradient Boosting
Classifier and a Voting of the three classifiers.
● Trained and tested on total of 2,339 caricature
images belonging to 20 classes of the dataset.
● First Run: (x,y) coordinates of 15 landmark points
along with 96*96 dimensional list of pixel values fed
to the classifiers, resulting in 2946 features.
● Second Run: Only list of pixel values fed to the
classifiers, resulting in 2916 features.
Accuracy Report
● Face verification on 100 classes:
30.96%
● Gender Identification task on 100
classes:
● Male : 79.33%
● Female : 66.17%
● Face verification using Landmark
points on 20 classes: 33.8%
Task 2: Caricature Generation
Face Alignment and Normalization
For better result, all the images must confine to
certain standards. So, all the original
image-caricature pairs were subsequently aligned
and normalized.
Mesh Warping
Mesh Warping is the module we created to warp image
according to the final landmark points. We used Dlib
landmark detector for obtaining landmark points. We
used these 68 points and 8 points on the boundary of
the original face to calculate a Delaunay Triangulation.
Face Averaging
To calculate the mean face, we first need to calculate
the average of all landmarks coordinates in the real
faces. This is done by simply averaging the x and y
values of the landmarks coordinates. After that we used
our Mesh Warping module to calculate the average face.
Relationships among original image,
corresponding caricature and mean
face
When a caricaturist sees a face, he/she has the ability to
identify the distinctive facial features by comparing it
with the mean face hidden in his/her mind [7].
Artist style capture algorithm
Neural Network Model
The network is capable
to learn from the
training set by
constructing an
input-output
mapping for the
problem automatically.
Result for Task 2:
Initial approach result( not much
exaggeration )
Second approach result by
decreasing training data.
Training Data given as
input to get sad caricature.
Sad face of Smriti Irani obtained after
training on Mulayam’s image.
CONCLUSION
In our project 2-D Convolutional Neural Network based
model fed with augmented data performed the best
pushing the accuracy to 30.96% on the face
verification task, and to 66.17% and 79.33%
respectively, on the identification of female and male
faces. In order to further improve the accuracy, we
conducted three independent experiments to capture
the facial keypoints of the caricature faces and
analyzed the detailed results which were not
satisfactory enough to be used further as feature set of
the tasks and so, we showed them only for the task of
face verification on 20 classes leading to a best
accuracy of 33.8% using
Gradient Boosting Classier.
FUTURE WORKS
The task of landmark detection of cartoon faces has a
substantial scope of improvement based on the
results we achieved. Improvement in face alignment
would also significantly affect the accuracy of the
results. Recent advancements of CNNs such as the
Multi-task Cascaded Convolutional Neural Networks
(MTCNN) can be trained and used for the task of joint
face detection and alignment of such cartoon faces.
References :
● [1] Bharadwaj, S., Bhatt, H. S., Vatsa, M., and Singh, R. Domain specific learning for
newborn face recognition. IEEE Transactions on Information Forensics and Security 11
(2016), 1630{1641.
● [2] Hsu, R.-L., and Jain, A. K. Generating discriminating cartoon faces using interacting
snakes. IEEE Trans. Pattern Anal. Mach. Intell. 25 (2003), 1388{1398.
● [3] J urgen Schmidhuber, journal=Neural networks : the official journal of the
International Neural Network Society, y. v. p. Deep learning in neural networks: An
overview.
● [4] Kingma, D. P., and Ba, J. Adam: A method for stochastic optimization. CoRR
abs/1412.6980 (2014).
● [5] Klare, B., Burge, M., Klontz, J. C., Bruegge, R. W. V., and Jain, A. K. Face recognition
performance: Role of demographic information. IEEE Transactions on Information
Forensics and Security 7 (2012), 1789{1801.
● [6] Kumar, N., Berg, A. C., Belhumeur, P. N., and Nayar, S. K. Attribute and simile
classifiers for face verification. 2009 IEEE 12th International Conference on Computer
Vision (2009), 365{372.[7] Lai, K., Chung, P., and Edirisinghe, E. Novel approach to
neural network based caricature generation.
● [7] Lai, K., Chung, P., and Edirisinghe, E. Novel approach to neural
● network based caricature generation.
● [8] LeCun, Y. Gradient-based learning applied to document recognition.
● [9] Liang, L., Chen, H., Xu, Y.-Q., and Shum, H.-Y. Example-based caricature
generation with exaggeration. In Computer Graphics and Applications,2002.
Proceedings. 10th Pacic Conference on (2002), IEEE, pp. 386{393.
● [10] Liao, S., Jain, A. K., and Li, S. Z. Partial face recognition: Alignment-free approach.
IEEE Transactions on Pattern Analysis and Machine Intelligence 35 (2013), 1193{1205.
● [11] Mishra, A., Rai, S. N., Mishra, A., and Jawahar, C. V. Iiit-cfw: A benchmark database
of cartoon faces in the wild. In ECCV Workshops (2016).
● [12] Shan, C., Gong, S., and McOwan, P. W. Facial expression recognition based on
local binary patterns: A comprehensive study. Image Vision Comput. 27 (2009),
803{816.
● [13] Sun, Y., Liang, D., Wang, X., and Tang, X. Deepid3: Face recognition with very
deep neural networks. CoRR abs/1502.00873 (2015).
● [14] Takayama, K., Johan, H., and Nishita, T. Face detection and face recognition of
cartoon characters using feature extraction. In Image, Electronics and Visual
Computing Workshop (2012), p. 48.

More Related Content

What's hot

fuzzy LBP for face recognition ppt
fuzzy LBP for face recognition pptfuzzy LBP for face recognition ppt
fuzzy LBP for face recognition pptAbdullah Gubbi
 
Face Recognition
Face RecognitionFace Recognition
Face Recognitionlaknatha
 
A STUDY AND ANALYSIS OF DIFFERENT EDGE DETECTION TECHNIQUES
A STUDY AND ANALYSIS OF DIFFERENT EDGE DETECTION TECHNIQUESA STUDY AND ANALYSIS OF DIFFERENT EDGE DETECTION TECHNIQUES
A STUDY AND ANALYSIS OF DIFFERENT EDGE DETECTION TECHNIQUEScscpconf
 
A comparative analysis of retrieval techniques in content based image retrieval
A comparative analysis of retrieval techniques in content based image retrievalA comparative analysis of retrieval techniques in content based image retrieval
A comparative analysis of retrieval techniques in content based image retrievalcsandit
 
Face recogntion using PCA algorithm
Face recogntion using PCA algorithmFace recogntion using PCA algorithm
Face recogntion using PCA algorithmAshwini Awatare
 
Template Matching - Pattern Recognition
Template Matching - Pattern RecognitionTemplate Matching - Pattern Recognition
Template Matching - Pattern RecognitionMustafa Salam
 
Facial Expression Recognition via Python
Facial Expression Recognition via PythonFacial Expression Recognition via Python
Facial Expression Recognition via PythonSaurav Gupta
 
Scene Description From Images To Sentences
Scene Description From Images To SentencesScene Description From Images To Sentences
Scene Description From Images To SentencesIRJET Journal
 
Cartoon Based Image Retrieval : An Indexing Approach
Cartoon Based Image Retrieval : An Indexing ApproachCartoon Based Image Retrieval : An Indexing Approach
Cartoon Based Image Retrieval : An Indexing Approachmlaij
 
A New Approach for Ranking Shadowed Fuzzy Numbers and its Application
A New Approach for Ranking Shadowed Fuzzy Numbers and its ApplicationA New Approach for Ranking Shadowed Fuzzy Numbers and its Application
A New Approach for Ranking Shadowed Fuzzy Numbers and its ApplicationAIRCC Publishing Corporation
 
Gesture Recognition using Principle Component Analysis & Viola-Jones Algorithm
Gesture Recognition using Principle Component Analysis &  Viola-Jones AlgorithmGesture Recognition using Principle Component Analysis &  Viola-Jones Algorithm
Gesture Recognition using Principle Component Analysis & Viola-Jones AlgorithmIJMER
 
Minghui Conference Cross-Validation Talk
Minghui Conference Cross-Validation TalkMinghui Conference Cross-Validation Talk
Minghui Conference Cross-Validation TalkWei Wang
 
Appearance based face recognition by pca and lda
Appearance based face recognition by pca and ldaAppearance based face recognition by pca and lda
Appearance based face recognition by pca and ldaIAEME Publication
 
Machine Learning Explanations: LIME framework
Machine Learning Explanations: LIME framework Machine Learning Explanations: LIME framework
Machine Learning Explanations: LIME framework Deep Learning Italia
 
IMAGE CONTENT DESCRIPTION USING LSTM APPROACH
IMAGE CONTENT DESCRIPTION USING LSTM APPROACHIMAGE CONTENT DESCRIPTION USING LSTM APPROACH
IMAGE CONTENT DESCRIPTION USING LSTM APPROACHcsandit
 

What's hot (19)

fuzzy LBP for face recognition ppt
fuzzy LBP for face recognition pptfuzzy LBP for face recognition ppt
fuzzy LBP for face recognition ppt
 
Face Recognition
Face RecognitionFace Recognition
Face Recognition
 
A STUDY AND ANALYSIS OF DIFFERENT EDGE DETECTION TECHNIQUES
A STUDY AND ANALYSIS OF DIFFERENT EDGE DETECTION TECHNIQUESA STUDY AND ANALYSIS OF DIFFERENT EDGE DETECTION TECHNIQUES
A STUDY AND ANALYSIS OF DIFFERENT EDGE DETECTION TECHNIQUES
 
A comparative analysis of retrieval techniques in content based image retrieval
A comparative analysis of retrieval techniques in content based image retrievalA comparative analysis of retrieval techniques in content based image retrieval
A comparative analysis of retrieval techniques in content based image retrieval
 
Ijcatr04041016
Ijcatr04041016Ijcatr04041016
Ijcatr04041016
 
Face recognition vaishali
Face recognition vaishaliFace recognition vaishali
Face recognition vaishali
 
Face recogntion using PCA algorithm
Face recogntion using PCA algorithmFace recogntion using PCA algorithm
Face recogntion using PCA algorithm
 
Template Matching - Pattern Recognition
Template Matching - Pattern RecognitionTemplate Matching - Pattern Recognition
Template Matching - Pattern Recognition
 
Facial Expression Recognition via Python
Facial Expression Recognition via PythonFacial Expression Recognition via Python
Facial Expression Recognition via Python
 
Scene Description From Images To Sentences
Scene Description From Images To SentencesScene Description From Images To Sentences
Scene Description From Images To Sentences
 
Cartoon Based Image Retrieval : An Indexing Approach
Cartoon Based Image Retrieval : An Indexing ApproachCartoon Based Image Retrieval : An Indexing Approach
Cartoon Based Image Retrieval : An Indexing Approach
 
A New Approach for Ranking Shadowed Fuzzy Numbers and its Application
A New Approach for Ranking Shadowed Fuzzy Numbers and its ApplicationA New Approach for Ranking Shadowed Fuzzy Numbers and its Application
A New Approach for Ranking Shadowed Fuzzy Numbers and its Application
 
Gesture Recognition using Principle Component Analysis & Viola-Jones Algorithm
Gesture Recognition using Principle Component Analysis &  Viola-Jones AlgorithmGesture Recognition using Principle Component Analysis &  Viola-Jones Algorithm
Gesture Recognition using Principle Component Analysis & Viola-Jones Algorithm
 
Minghui Conference Cross-Validation Talk
Minghui Conference Cross-Validation TalkMinghui Conference Cross-Validation Talk
Minghui Conference Cross-Validation Talk
 
Human Emotion Recognition
Human Emotion RecognitionHuman Emotion Recognition
Human Emotion Recognition
 
Appearance based face recognition by pca and lda
Appearance based face recognition by pca and ldaAppearance based face recognition by pca and lda
Appearance based face recognition by pca and lda
 
Machine Learning Explanations: LIME framework
Machine Learning Explanations: LIME framework Machine Learning Explanations: LIME framework
Machine Learning Explanations: LIME framework
 
Term11566
Term11566Term11566
Term11566
 
IMAGE CONTENT DESCRIPTION USING LSTM APPROACH
IMAGE CONTENT DESCRIPTION USING LSTM APPROACHIMAGE CONTENT DESCRIPTION USING LSTM APPROACH
IMAGE CONTENT DESCRIPTION USING LSTM APPROACH
 

Similar to Caricature Recognition and Generation

AGE AND GENDER DETECTION.pptx
AGE AND GENDER DETECTION.pptxAGE AND GENDER DETECTION.pptx
AGE AND GENDER DETECTION.pptxssuserb4a9ba
 
ageandgenderdetection-220802061020-9ee5a2cd.pptx
ageandgenderdetection-220802061020-9ee5a2cd.pptxageandgenderdetection-220802061020-9ee5a2cd.pptx
ageandgenderdetection-220802061020-9ee5a2cd.pptxdhaliwalharsh055
 
Face Recognition Using Gabor features And PCA
Face Recognition Using Gabor features And PCAFace Recognition Using Gabor features And PCA
Face Recognition Using Gabor features And PCAIOSR Journals
 
Identifying Gender from Facial Parts Using Support Vector Machine Classifier
Identifying Gender from Facial Parts Using Support Vector Machine ClassifierIdentifying Gender from Facial Parts Using Support Vector Machine Classifier
Identifying Gender from Facial Parts Using Support Vector Machine ClassifierEditor IJCATR
 
CariGANs : Unpaired Photo-to-Caricature Translation
CariGANs : Unpaired Photo-to-Caricature TranslationCariGANs : Unpaired Photo-to-Caricature Translation
CariGANs : Unpaired Photo-to-Caricature TranslationRazorthink
 
Image–based face-detection-and-recognition-using-matlab
Image–based face-detection-and-recognition-using-matlabImage–based face-detection-and-recognition-using-matlab
Image–based face-detection-and-recognition-using-matlabIjcem Journal
 
PERFORMANCE EVALUATION OF FUZZY LOGIC AND BACK PROPAGATION NEURAL NETWORK FOR...
PERFORMANCE EVALUATION OF FUZZY LOGIC AND BACK PROPAGATION NEURAL NETWORK FOR...PERFORMANCE EVALUATION OF FUZZY LOGIC AND BACK PROPAGATION NEURAL NETWORK FOR...
PERFORMANCE EVALUATION OF FUZZY LOGIC AND BACK PROPAGATION NEURAL NETWORK FOR...ijesajournal
 
Faster Training Algorithms in Neural Network Based Approach For Handwritten T...
Faster Training Algorithms in Neural Network Based Approach For Handwritten T...Faster Training Algorithms in Neural Network Based Approach For Handwritten T...
Faster Training Algorithms in Neural Network Based Approach For Handwritten T...CSCJournals
 
Face Recognition System using Self Organizing Feature Map and Appearance Base...
Face Recognition System using Self Organizing Feature Map and Appearance Base...Face Recognition System using Self Organizing Feature Map and Appearance Base...
Face Recognition System using Self Organizing Feature Map and Appearance Base...ijtsrd
 
Model Based Emotion Detection using Point Clouds
Model Based Emotion Detection using Point CloudsModel Based Emotion Detection using Point Clouds
Model Based Emotion Detection using Point CloudsLakshmi Sarvani Videla
 
Segmentation and recognition of handwritten digit numeral string using a mult...
Segmentation and recognition of handwritten digit numeral string using a mult...Segmentation and recognition of handwritten digit numeral string using a mult...
Segmentation and recognition of handwritten digit numeral string using a mult...ijfcstjournal
 
cvpresentation-190812154654 (1).pptx
cvpresentation-190812154654 (1).pptxcvpresentation-190812154654 (1).pptx
cvpresentation-190812154654 (1).pptxPyariMohanJena
 
ppt 20BET1024.pptx
ppt 20BET1024.pptxppt 20BET1024.pptx
ppt 20BET1024.pptxManeetBali
 
Face Identification Project Abstract 2017
Face Identification Project Abstract 2017Face Identification Project Abstract 2017
Face Identification Project Abstract 2017ioshean
 
A Robust & Fast Face Detection System
A Robust & Fast Face Detection SystemA Robust & Fast Face Detection System
A Robust & Fast Face Detection SystemIDES Editor
 
Image Redundancy and Its Elimination
Image Redundancy and Its EliminationImage Redundancy and Its Elimination
Image Redundancy and Its EliminationIJMERJOURNAL
 
Computer Vision - Real Time Face Recognition using Open CV and Python
Computer Vision - Real Time Face Recognition using Open CV and PythonComputer Vision - Real Time Face Recognition using Open CV and Python
Computer Vision - Real Time Face Recognition using Open CV and PythonAkash Satamkar
 

Similar to Caricature Recognition and Generation (20)

AGE AND GENDER DETECTION.pptx
AGE AND GENDER DETECTION.pptxAGE AND GENDER DETECTION.pptx
AGE AND GENDER DETECTION.pptx
 
ageandgenderdetection-220802061020-9ee5a2cd.pptx
ageandgenderdetection-220802061020-9ee5a2cd.pptxageandgenderdetection-220802061020-9ee5a2cd.pptx
ageandgenderdetection-220802061020-9ee5a2cd.pptx
 
Face Recognition Using Gabor features And PCA
Face Recognition Using Gabor features And PCAFace Recognition Using Gabor features And PCA
Face Recognition Using Gabor features And PCA
 
Identifying Gender from Facial Parts Using Support Vector Machine Classifier
Identifying Gender from Facial Parts Using Support Vector Machine ClassifierIdentifying Gender from Facial Parts Using Support Vector Machine Classifier
Identifying Gender from Facial Parts Using Support Vector Machine Classifier
 
G1802033543
G1802033543G1802033543
G1802033543
 
CariGANs : Unpaired Photo-to-Caricature Translation
CariGANs : Unpaired Photo-to-Caricature TranslationCariGANs : Unpaired Photo-to-Caricature Translation
CariGANs : Unpaired Photo-to-Caricature Translation
 
Image–based face-detection-and-recognition-using-matlab
Image–based face-detection-and-recognition-using-matlabImage–based face-detection-and-recognition-using-matlab
Image–based face-detection-and-recognition-using-matlab
 
PERFORMANCE EVALUATION OF FUZZY LOGIC AND BACK PROPAGATION NEURAL NETWORK FOR...
PERFORMANCE EVALUATION OF FUZZY LOGIC AND BACK PROPAGATION NEURAL NETWORK FOR...PERFORMANCE EVALUATION OF FUZZY LOGIC AND BACK PROPAGATION NEURAL NETWORK FOR...
PERFORMANCE EVALUATION OF FUZZY LOGIC AND BACK PROPAGATION NEURAL NETWORK FOR...
 
Faster Training Algorithms in Neural Network Based Approach For Handwritten T...
Faster Training Algorithms in Neural Network Based Approach For Handwritten T...Faster Training Algorithms in Neural Network Based Approach For Handwritten T...
Faster Training Algorithms in Neural Network Based Approach For Handwritten T...
 
Face Recognition System using Self Organizing Feature Map and Appearance Base...
Face Recognition System using Self Organizing Feature Map and Appearance Base...Face Recognition System using Self Organizing Feature Map and Appearance Base...
Face Recognition System using Self Organizing Feature Map and Appearance Base...
 
Model Based Emotion Detection using Point Clouds
Model Based Emotion Detection using Point CloudsModel Based Emotion Detection using Point Clouds
Model Based Emotion Detection using Point Clouds
 
Segmentation and recognition of handwritten digit numeral string using a mult...
Segmentation and recognition of handwritten digit numeral string using a mult...Segmentation and recognition of handwritten digit numeral string using a mult...
Segmentation and recognition of handwritten digit numeral string using a mult...
 
cvpresentation-190812154654 (1).pptx
cvpresentation-190812154654 (1).pptxcvpresentation-190812154654 (1).pptx
cvpresentation-190812154654 (1).pptx
 
ppt 20BET1024.pptx
ppt 20BET1024.pptxppt 20BET1024.pptx
ppt 20BET1024.pptx
 
Seminar5
Seminar5Seminar5
Seminar5
 
Face Identification Project Abstract 2017
Face Identification Project Abstract 2017Face Identification Project Abstract 2017
Face Identification Project Abstract 2017
 
A Robust & Fast Face Detection System
A Robust & Fast Face Detection SystemA Robust & Fast Face Detection System
A Robust & Fast Face Detection System
 
Image Redundancy and Its Elimination
Image Redundancy and Its EliminationImage Redundancy and Its Elimination
Image Redundancy and Its Elimination
 
H0334749
H0334749H0334749
H0334749
 
Computer Vision - Real Time Face Recognition using Open CV and Python
Computer Vision - Real Time Face Recognition using Open CV and PythonComputer Vision - Real Time Face Recognition using Open CV and Python
Computer Vision - Real Time Face Recognition using Open CV and Python
 

Recently uploaded

HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxhumanexperienceaaa
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 

Recently uploaded (20)

Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 

Caricature Recognition and Generation

  • 2. Introduction ● Caricatures are facial drawings by artists with exaggeration on certain facial parts. A caricature can be rendered in two ways: it can be either exaggerated or oversimplified.
  • 3. Our Project In a Nutshell: • Cartoon gender identification: It is binary classification problem where given a cartoon face the system has to answer whether it is a male or a female • Cartoon face recognition: The problem of cartoon face recognition is to recognize the given cartoon face as one of the C classes. • Cartoon face generation: Given a real face of public figure, generate a caricature in various moods, that has exaggerated identifiable facial features. To further improve the accuracy of above tasks, 3 experiments for landmark extraction of caricature images: ● Dlib facial landmark extractor ● Flood-fill algorithm ● Manual annotation assisted Keypoints extraction of Images
  • 4. Applications ● Can help visually impaired people to understand cartoon images or movies. ● Can be used to automatically censor communal or politically incorrect cartoons in the social media.
  • 5. Caricature Generation Experiments with CNN Facial Landmark Extraction Pre-processing
  • 6. Summary of roles/responsibilities ● Pratik Parwal (20148093): Problem Analysis, Landmark Detection using manually annotated dataset, report making. ● Nikhil Agarwal (20144092): Problem Analysis, Face averaging, Training and Transformation for caricature generation, report making. ● Richa Pandey (20144126): Problem Analysis,Dataset parsing, detecting face boundary using flood-fill algorithm, report making. ● Saurav Jha (20144007): Problem Analysis,Experiments with CNN modules, Data set augmentation, report making. ● Pramee Chowdhury (20144136):Problem Analysis, Face detection, alignment, normalization, Java app for manual annotation of caricature landmarks, report making.
  • 7. Task 1: Caricature Recognition
  • 8. Dataset Used The recently released Cartoon faces in the Wild (IIIT-CFW) [11] database contains a total of 8,928 annotated images of cartoon faces of 100 public figures. The real human faces of these public figures have also been provided. A number of attributes for each cartoon image provided.
  • 9. Attributes in XML form ● Type of cartoon: cartoon, cartoon sketch, caricature ● Pose: frontal, non frontal ● Expression: happy, sad, thoughtful, seductive, sorrow, angry, serious, fright- ened, crying, shocked ● Age group: young, old ● Gender: male, female ● Glass: yes, no
  • 10. Data Pre-processing ● We perform XML parsing for each caricature image to extract the information about their name and gender. ● Each image was first converted into grayscale to convert the pixel values in the range (0,255) followed by the extraction of the face bounding box and then re-sized to the 96x96 size. ● Finally, we get a list of pixel values along with their corresponding classes ((1-100 ) for the face recognition task and (0, 1) for the gender identification task).
  • 11. Facial Landmark Extraction L1: Face Region Candidate Extraction using Flood-Fill. We make the following assumptions : • The skin color of the image would be nearly even. • There is proper distinction between a caricature face and its surrounding. The algorithm [14] starts from a skin color pixel (generally, center of the image), and gradually moves to all its neighboring pixels if the difference between the two pixels is less than a certain threshold.
  • 12. L2 : Dlib Face Landmark Detector Detects a total of 68 facial landmark points
  • 13. L3: Manual Key-points annotations of caricature images. We chose a subset of 20 personalities out of 100 in the data-set and manually annotated 15 images of each. In order to do this, a Java swing based application was built to manually get the (x, y) coordinates of the 15 selected landmark points. Then the system was trained to get the landmark points of an image given its pixel intensities.
  • 14. Architecture of the CNN model. Input and output are normalised. Loss function used was ‘mse’ with SGD optimizer.
  • 15.
  • 16. System Description ● For evaluating the accuracy of the sub-tasks for gender identification and face verification, we experimented with different model configurations of Convolutional Neural Networks(CNNs). ● CNNs [3] have fully-connectedness replaced with convolutions using filters, they perform much better for applications that has structured inputs which can be filtered using such convolutions. 1. System1: 1-D Convolutional Neural Nets 2. System2: 2-D Convolutional Neural Nets 3. System 3: 2-D CNN with Data Augmentation
  • 17. SYSTEM 1: 1-D CNNs ● Filters moving in just one direction and are generally used for problems involving text classification and signal smoothing.
  • 18. SYSTEM 2: 2-D CNNs ● Believed to be more suited for tasks requiring inherent assumption of the two dimensional spatial patterns of images, such as ours.
  • 19. 2-D CNN based model configuration used by us for the tasks of gender identification and face verification.
  • 20. SYSTEM 3: 2D CNN + Data Augmentation ● We carried out horizontal flips of the training images followed by a 5 degree anti-clockwise rotation of the images. ● Horizontal flip preserves the symmetry of human-like faces along with helping in doubling the train data size.
  • 21. RESULTS AND ANALYSIS ● Gender Identification and Face verification 1. Using Only List of Pixel Values - We report our accuracy on all 100 classes using the entire list of pixel values. - All the three systems described above were first trained on a total of 7138 caricature images (with a validation split of 0.2) and then tested on 1785 images. - Number of training instances for male faces being 5242 while that for female faces being 1896, dataset suffers from class imbalance problem. - Because of gender class imbalance problem, we show the results of our systems individually on male and female classes.
  • 22.
  • 23. Using Landmark points of L3 combined with pixel values ● Effect of the landmark points on the accuracy obtained by four machine learning classifiers, namely LinearSVC, Random Forest, Gradient Boosting Classifier and a Voting of the three classifiers. ● Trained and tested on total of 2,339 caricature images belonging to 20 classes of the dataset. ● First Run: (x,y) coordinates of 15 landmark points along with 96*96 dimensional list of pixel values fed to the classifiers, resulting in 2946 features. ● Second Run: Only list of pixel values fed to the classifiers, resulting in 2916 features.
  • 24.
  • 25. Accuracy Report ● Face verification on 100 classes: 30.96% ● Gender Identification task on 100 classes: ● Male : 79.33% ● Female : 66.17% ● Face verification using Landmark points on 20 classes: 33.8%
  • 26. Task 2: Caricature Generation
  • 27. Face Alignment and Normalization For better result, all the images must confine to certain standards. So, all the original image-caricature pairs were subsequently aligned and normalized.
  • 28. Mesh Warping Mesh Warping is the module we created to warp image according to the final landmark points. We used Dlib landmark detector for obtaining landmark points. We used these 68 points and 8 points on the boundary of the original face to calculate a Delaunay Triangulation.
  • 29. Face Averaging To calculate the mean face, we first need to calculate the average of all landmarks coordinates in the real faces. This is done by simply averaging the x and y values of the landmarks coordinates. After that we used our Mesh Warping module to calculate the average face.
  • 30. Relationships among original image, corresponding caricature and mean face When a caricaturist sees a face, he/she has the ability to identify the distinctive facial features by comparing it with the mean face hidden in his/her mind [7].
  • 31. Artist style capture algorithm
  • 32. Neural Network Model The network is capable to learn from the training set by constructing an input-output mapping for the problem automatically.
  • 33. Result for Task 2: Initial approach result( not much exaggeration ) Second approach result by decreasing training data.
  • 34. Training Data given as input to get sad caricature. Sad face of Smriti Irani obtained after training on Mulayam’s image.
  • 35. CONCLUSION In our project 2-D Convolutional Neural Network based model fed with augmented data performed the best pushing the accuracy to 30.96% on the face verification task, and to 66.17% and 79.33% respectively, on the identification of female and male faces. In order to further improve the accuracy, we conducted three independent experiments to capture the facial keypoints of the caricature faces and analyzed the detailed results which were not satisfactory enough to be used further as feature set of the tasks and so, we showed them only for the task of face verification on 20 classes leading to a best accuracy of 33.8% using Gradient Boosting Classier.
  • 36. FUTURE WORKS The task of landmark detection of cartoon faces has a substantial scope of improvement based on the results we achieved. Improvement in face alignment would also significantly affect the accuracy of the results. Recent advancements of CNNs such as the Multi-task Cascaded Convolutional Neural Networks (MTCNN) can be trained and used for the task of joint face detection and alignment of such cartoon faces.
  • 37. References : ● [1] Bharadwaj, S., Bhatt, H. S., Vatsa, M., and Singh, R. Domain specific learning for newborn face recognition. IEEE Transactions on Information Forensics and Security 11 (2016), 1630{1641. ● [2] Hsu, R.-L., and Jain, A. K. Generating discriminating cartoon faces using interacting snakes. IEEE Trans. Pattern Anal. Mach. Intell. 25 (2003), 1388{1398. ● [3] J urgen Schmidhuber, journal=Neural networks : the official journal of the International Neural Network Society, y. v. p. Deep learning in neural networks: An overview. ● [4] Kingma, D. P., and Ba, J. Adam: A method for stochastic optimization. CoRR abs/1412.6980 (2014). ● [5] Klare, B., Burge, M., Klontz, J. C., Bruegge, R. W. V., and Jain, A. K. Face recognition performance: Role of demographic information. IEEE Transactions on Information Forensics and Security 7 (2012), 1789{1801. ● [6] Kumar, N., Berg, A. C., Belhumeur, P. N., and Nayar, S. K. Attribute and simile classifiers for face verification. 2009 IEEE 12th International Conference on Computer Vision (2009), 365{372.[7] Lai, K., Chung, P., and Edirisinghe, E. Novel approach to neural network based caricature generation.
  • 38. ● [7] Lai, K., Chung, P., and Edirisinghe, E. Novel approach to neural ● network based caricature generation. ● [8] LeCun, Y. Gradient-based learning applied to document recognition. ● [9] Liang, L., Chen, H., Xu, Y.-Q., and Shum, H.-Y. Example-based caricature generation with exaggeration. In Computer Graphics and Applications,2002. Proceedings. 10th Pacic Conference on (2002), IEEE, pp. 386{393. ● [10] Liao, S., Jain, A. K., and Li, S. Z. Partial face recognition: Alignment-free approach. IEEE Transactions on Pattern Analysis and Machine Intelligence 35 (2013), 1193{1205. ● [11] Mishra, A., Rai, S. N., Mishra, A., and Jawahar, C. V. Iiit-cfw: A benchmark database of cartoon faces in the wild. In ECCV Workshops (2016). ● [12] Shan, C., Gong, S., and McOwan, P. W. Facial expression recognition based on local binary patterns: A comprehensive study. Image Vision Comput. 27 (2009), 803{816. ● [13] Sun, Y., Liang, D., Wang, X., and Tang, X. Deepid3: Face recognition with very deep neural networks. CoRR abs/1502.00873 (2015). ● [14] Takayama, K., Johan, H., and Nishita, T. Face detection and face recognition of cartoon characters using feature extraction. In Image, Electronics and Visual Computing Workshop (2012), p. 48.