Deep Convolutional Embedding for Painting Clustering: Case Study on Picasso's Artworks (DS2020)

•

1 like•197 views

This document proposes using a Deep Convolutional Embedding method to cluster Pablo Picasso's artworks in an unsupervised manner based on visual features. It trains an autoencoder to learn a nonlinear mapping of the artwork images to a latent space, then performs k-means clustering in that space. Testing on a database of 439 Picasso paintings, it achieved good clustering performance according to evaluation metrics. Future work will label clusters based on art literature and analyze the encoder to identify distinguishing objects that drove the clustering.

Data & Analytics

Deep Convolutional Embedding for Painting
Clustering: Case Study on Picasso’s Artworks
Giovanna Castellano, Gennaro Vessio
CILAB, Computer Science Department, University of Bari, Italy
gennaro.vessio@uniba.it

Context
Our cultural heritage is of inestimable
importance for the cultural, historical
and economic growth of our society
It is important for:
● art historians
● economists
● museum curators
● …
● computer scientists!
2

The role of
Computer Science
Research by computer scientists in this context is
mainly focused on applying machine learning/pattern
recognition for:
● classiﬁcation and categorization
● link prediction
● information retrieval
● knowledge discovery
● …
This growing interest has been motivated by the
increasing availability of large-scale digitized art
collections (e.g., WikiArt)
3

Motivations
Human beings ﬁnd similarity relationships
among paintings based on their aesthetic
perception
This perception (which can also be inﬂuenced by
subjective experience) is:
● extremely hard to conceptualize
● diﬃcult to translate into features and labels
Our goal is to develop an automatic tool to group
digital paintings based on:
● “visual” features
● an unsupervised approach
4

Limitations of traditional methods
1. Applying traditional algorithms like k-means on the high-dimensional raw pixel
space can be ineﬀective
2. The application of reduction techniques, such as PCA, can ignore nonlinear
relationships between the original input and the latent feature space
3. Some variants of k-means (e.g., spectral clustering) are computationally
expensive as the data grows
4. Engineering meaningful features based on domain knowledge is extremely
diﬃcult
5

We propose to use a reﬁnement of the Deep Convolutional Embedding Clustering
(DCEC) method recently proposed by Guo et al.
Main beneﬁts:
1. deep learning algorithms are good at mapping input to output data due to their
exceptional ability to express nonlinear representations
2. conv layers are even better when the input is complex image data
6
Key points of the proposed method

Model training works in two phases:
1. parameter initialization
a. use a convolutional autoencoder to learn a nonlinear mapping between the original space X
and a latent spaze Z
2. parameter optimization
a. initialize k cluster centroids with k-means
b. compute a soft assignment between the embedded points and cluster centroids
c. compute an auxiliary target distribution
d. minimize the KL divergence with respect to the computed target distribution
Input reconstruction and cluster assignment are jointly optimized
8
Key points

Case study
We used a database that collects 439 artworks
by a very popular artist: Pablo Picasso
This was done to evaluate the eﬀectiveness of
the method in ﬁnding meaningful clusters within
the artist’s production
9

Experimental setting
● Input resized to 128×128 pixels and normalized to [0, 1]
● Autoencoder pre-training:
○ epochs: 200
○ mini-batch size: 128
○ optimizer: AdaMax
● Overall training:
○ delta: 0.001
○ optimizer: AdaMax
● Evaluation metrics:
○ silhouette score
○ Calinski-Harabasz index
10

Clustering performance
11
# clusters silhouette score Calinski-Harabasz index
2 0.933 0.737
3 0.936 0.771
4 0.951 0.768
5 0.965 1.000
6 0.962 0.812

Conclusion
Encouraging preliminary results were obtained, which conﬁrm the eﬀectiveness of
the deep clustering approach to address highly complex image domains, such as
the artistic one
Future work will use much of the existing literature on Picasso to try to label
paintings to perform a much more systematic evaluation, even according to
external clustering criteria
Finally, the convolutional layers of the encoder will be analyzed to ﬁnd out which
are the distinctive objects in the paintings that led to their clustering
13

Similar to Deep Convolutional Embedding for Painting Clustering: Case Study on Picasso's Artworks (DS2020)

Symbolic Background Knowledge for Machine LearningSteffen Staab

Cahall Final Intern PresentationDaniel Cahall

Exploring Machine Learning for Libraries and Archives: Present and FutureBohyun Kim

Object Classification in Images of Neoclassical Artifacts using Deep LearningBernhard Bermeitinger

2019 cvpr paper_overviewLEE HOSEONG

2019 cvpr paper overview by Ho Seong LeeMoazzem Hossain

Mat189: Cluster Analysis with NBA Sports DataKathleneNgo

Do Better ImageNet Models Transfer Better... for Image Recommendation?Denis Parra Santander

MediaEval 2015 - UNED-UV @ Retrieving Diverse Social Images Task - Postermultimediaeval

Image Object Detection PipelineAbhinav Dadhich

物件偵測與辨識技術CHENHuiMei

Lecture_16_Self-supervised_Learning.pptxKarimdabbabi

IRJET- Finding Dominant Color in the Artistic Painting using Data Mining ...IRJET Journal

Deep Learning behind Prismalostleaves

Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017Universitat Politècnica de Catalunya

Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020Universitat Politècnica de Catalunya

Deep Neural Networks PresentationBohdan Klimenko

CP923.docbutest

CAM-1.pptxhimarusti

Image Segmentation Using Deep Learning : A surveyNUPUR YADAV

Similar to Deep Convolutional Embedding for Painting Clustering: Case Study on Picasso's Artworks (DS2020) (20)

Symbolic Background Knowledge for Machine Learning

Cahall Final Intern Presentation

Exploring Machine Learning for Libraries and Archives: Present and Future

Object Classification in Images of Neoclassical Artifacts using Deep Learning

2019 cvpr paper_overview

2019 cvpr paper overview by Ho Seong Lee

Mat189: Cluster Analysis with NBA Sports Data

Do Better ImageNet Models Transfer Better... for Image Recommendation?

MediaEval 2015 - UNED-UV @ Retrieving Diverse Social Images Task - Poster

Image Object Detection Pipeline

物件偵測與辨識技術

Lecture_16_Self-supervised_Learning.pptx

IRJET- Finding Dominant Color in the Artistic Painting using Data Mining ...

Deep Learning behind Prisma

Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017

Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020

Deep Neural Networks Presentation

CP923.doc

CAM-1.pptx

Image Segmentation Using Deep Learning : A survey

Recently uploaded

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H

Midocean dropshipping via API with DroFxolyaivanovalion

Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh9953056974 Low Rate Call Girls In Saket, Delhi NCR

Week-01-2.ppt BBB human Computer interactionfulawalesam

ALSO dropshipping via API with DroFx.pptxolyaivanovalion

Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823

April 2024 - Crypto Market Report's Analysismanisha194592

BabyOno dropshipping via API with DroFx.pptxolyaivanovalion

Introduction-to-Machine-Learning (1).pptxfirstjob4

Invezz.com - Grow your wealth with trading signalsInvezz1

Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls

Halmar dropshipping via API with DroFxolyaivanovalion

Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE9953056974 Low Rate Call Girls In Saket, Delhi NCR

Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlkumarajju5765

BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692

BigBuy dropshipping via API with DroFx.pptxolyaivanovalion

Ravak dropshipping via API with DroFx.pptxolyaivanovalion

Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71

Edukaciniai dropshipping via API with DroFxolyaivanovalion

Recently uploaded (20)

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf

Midocean dropshipping via API with DroFx

Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh

Week-01-2.ppt BBB human Computer interaction

ALSO dropshipping via API with DroFx.pptx

Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...

April 2024 - Crypto Market Report's Analysis

BabyOno dropshipping via API with DroFx.pptx

Introduction-to-Machine-Learning (1).pptx

Invezz.com - Grow your wealth with trading signals

Best VIP Call Girls Noida Sector 39 Call Me: 8448380779

Halmar dropshipping via API with DroFx

Generative AI on Enterprise Cloud with NiFi and Milvus

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE

Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl

BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx

BigBuy dropshipping via API with DroFx.pptx

Ravak dropshipping via API with DroFx.pptx

Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha

Edukaciniai dropshipping via API with DroFx

Deep Convolutional Embedding for Painting Clustering: Case Study on Picasso's Artworks (DS2020)

1. Deep Convolutional Embedding for Painting Clustering: Case Study on Picasso’s Artworks Giovanna Castellano, Gennaro Vessio CILAB, Computer Science Department, University of Bari, Italy gennaro.vessio@uniba.it

2. Context Our cultural heritage is of inestimable importance for the cultural, historical and economic growth of our society It is important for: ● art historians ● economists ● museum curators ● … ● computer scientists! 2

3. The role of Computer Science Research by computer scientists in this context is mainly focused on applying machine learning/pattern recognition for: ● classiﬁcation and categorization ● link prediction ● information retrieval ● knowledge discovery ● … This growing interest has been motivated by the increasing availability of large-scale digitized art collections (e.g., WikiArt) 3

4. Motivations Human beings find similarity relationships among paintings based on their aesthetic perception This perception (which can also be influenced by subjective experience) is: ● extremely hard to conceptualize ● difficult to translate into features and labels Our goal is to develop an automatic tool to group digital paintings based on: ● “visual” features ● an unsupervised approach 4

5. Limitations of traditional methods 1. Applying traditional algorithms like k-means on the high-dimensional raw pixel space can be ineﬀective 2. The application of reduction techniques, such as PCA, can ignore nonlinear relationships between the original input and the latent feature space 3. Some variants of k-means (e.g., spectral clustering) are computationally expensive as the data grows 4. Engineering meaningful features based on domain knowledge is extremely diﬃcult 5

6. We propose to use a reﬁnement of the Deep Convolutional Embedding Clustering (DCEC) method recently proposed by Guo et al. Main beneﬁts: 1. deep learning algorithms are good at mapping input to output data due to their exceptional ability to express nonlinear representations 2. conv layers are even better when the input is complex image data 6 Key points of the proposed method

7. Overall network architecture 7

8. Model training works in two phases: 1. parameter initialization a. use a convolutional autoencoder to learn a nonlinear mapping between the original space X and a latent spaze Z 2. parameter optimization a. initialize k cluster centroids with k-means b. compute a soft assignment between the embedded points and cluster centroids c. compute an auxiliary target distribution d. minimize the KL divergence with respect to the computed target distribution Input reconstruction and cluster assignment are jointly optimized 8 Key points

9. Case study We used a database that collects 439 artworks by a very popular artist: Pablo Picasso This was done to evaluate the eﬀectiveness of the method in ﬁnding meaningful clusters within the artist’s production 9

10. Experimental setting ● Input resized to 128×128 pixels and normalized to [0, 1] ● Autoencoder pre-training: ○ epochs: 200 ○ mini-batch size: 128 ○ optimizer: AdaMax ● Overall training: ○ delta: 0.001 ○ optimizer: AdaMax ● Evaluation metrics: ○ silhouette score ○ Calinski-Harabasz index 10

11. Clustering performance 11 # clusters silhouette score Calinski-Harabasz index 2 0.933 0.737 3 0.936 0.771 4 0.951 0.768 5 0.965 1.000 6 0.962 0.812

12. Qualitative evaluation 12

13. Conclusion Encouraging preliminary results were obtained, which confirm the effectiveness of the deep clustering approach to address highly complex image domains, such as the artistic one Future work will use much of the existing literature on Picasso to try to label paintings to perform a much more systematic evaluation, even according to external clustering criteria Finally, the convolutional layers of the encoder will be analyzed to find out which are the distinctive objects in the paintings that led to their clustering 13

Deep Convolutional Embedding for Painting Clustering: Case Study on Picasso's Artworks (DS2020)

Recommended

Recommended

More Related Content

Similar to Deep Convolutional Embedding for Painting Clustering: Case Study on Picasso's Artworks (DS2020)

Similar to Deep Convolutional Embedding for Painting Clustering: Case Study on Picasso's Artworks (DS2020) (20)

More from Gennaro Vessio

More from Gennaro Vessio (9)

Recently uploaded

Recently uploaded (20)

Deep Convolutional Embedding for Painting Clustering: Case Study on Picasso's Artworks (DS2020)