SlideShare a Scribd company logo
Saliency-based Models of Image
Content and their Application to
Auto-Annotation by Semantic
Propagation
Jonathon S. Hare and Paul H. Lewis
Intelligence, Agents, Multimedia Group
Department of Electronics and Computer Science
University of Southampton
{jsh02r, phl}@ecs.soton.ac.uk
Introduction
• Image search is much easier if all of the images in a collection
have adequate annotations
• Annotating images manually is a time-consuming and
laborious process
• We propose a simple method for automatic image
annotation based on the idea that visually similar images
should have similar annotations
• Visual similarity is assessed using local descriptors of salient
regions in an information model
• We also address how the quality of the automatic
annotations can be assessed
Image Auto-Annotation
Propagating the Semantics (I)
• Simple idea based on propagating semantics between
visually similar images
• Image content is modelled using concepts from
information theory
• Modelling the images in this way allows us to assess
image similarity in a well-founded framework
• We tried two models:
• Vector-Space
• Latent Semantic Indexing
Image Auto-Annotation
Propagating the Semantics (II)
• A training corpus of pre-annotated images is created
• An unannotated image is compared to images in the
training corpus using the information models to find
visually similar images
• The annotations from a number of the closest visually
similar images in the training corpus are then applied, or
propagated, to the unannotated image
Information Theory
Modelling Textual Information::vector-spaces
• A common approach to modelling documents containing
textual information is to use a vector-space
• Each document is represented by a vector of term-
occurrences
• Each element of the vector is the count of the
number of times the corresponding term occurred in
the document
• Similar documents should have similar vectors
• i.e. the angle between vectors is small
Information Theory
Modelling Textual Information::latent semantic indexing
• LSI takes the vector-space model a step further
• LSI attempts to deal with issues of synonymy and
polysemy between terms by using linear algebra
• Term-occurrence vectors are arranged in a term-
document matrix and factored using SVD
• Using the resulting matrices from the SVD it is
possible to create a rank-k estimate of the original
term-document matrix
• Queries can be performed in the k-subspace instead
of the original space
Modelling Images
Representing Images using Textual Information Models
• Salient regions are found in
the image from peaks in a
difference-of-Gaussian
pyramid
• Local descriptors (SIFT) are
calculated for each region
• The feature descriptors are
quantised by assigning them to
the closest visual terms in a
predefined vocabulary
• Term-occurrence vectors are
calculated
Experimentation
Dataset
• Test dataset consisted of 697
annotated photographic
images
• Original annotations were
processed to remove plurals
and correct mistakes
• 170 processed annotation
terms in total
• Training and test sets were
created by randomly cutting
the dataset into halves
Experimentation
Evaluation Technique::considerations (I)
• For any auto-annotation system to be worthwhile it must
perform better than if the annotations were guessed
based on the empirical distribution of keywords in the
training set
Experimentation
Evaluation Technique::considerations (II)
• Images in the training set may have been incorrectly
annotated
• For comparative purposes this is not a problem as all
algorithms have to deal with the same data
• However, the reported overall performance is likely to
be less than it would be with correct annotations
Experimentation
Evaluation Technique::performance measure
• A reasonable assumption to make of
an annotation algorithm should be
that it will return approximately the
correct number of annotations
• Previous auto-annotation work has
used the normalised score measure
developed by Barnard et al.
• However, this measure does not
sufficiently weight incorrect
guesses, resulting in many more
guessed annotations than correct
annotations when the score is
maximised
r = Number of correctly predicted
annotations
n = Number of actual true annotations
w = Number of wrongly predicted
annotations
N = Number of terms in annotation
vocabulary
Experimentation
Evaluation Technique::performance measure
• We have instead adopted the
use of precision and recall to
measure performance
• Unlike in retrieval, we want
both high precision and high
recall
r = Number of correctly predicted
annotations
n = Number of actual true annotations
w = Number of wrongly predicted
annotations
Experimentation
Results (I)
• Two models, using
vocabularies of 3000 visual
terms
• Vector-Space
• LSI (K=40)
Experimentation
Results (II)
True Annotations Tree, Bush, Sky Temple, Sky
Flower, Bush, Tree,
Building, Sidewalk
Empirical
Annotations
Tree, Building, People,
Bush, Grass
Tree, Building, People,
Bush, Grass
Tree, Building, People,
Bush, Grass
Vector-Space
Annotations
Tree, Bush
Tree, Building, Grass,
Sidewalk, Pole, People,
Clear Sky
Flower, Bush, Tree,
Building, Partially Cloudy
Sky
LSI Annotations
Tree, Bush, Grass,
Sidewalk
Steps, Wall
Flower, Bush, Tree,
Ground
Future Work
(I)
• Our current annotation approach is slightly deficient in
that it doesn’t allow us to select individual terms
• This needs to be addressed
• This work used a fixed number of images from which to
draw the annotations
• It is possible that this number could be chosen
dynamically for each unannotated image, based on the
similarity between its vector representation and the
vectors of the training images
Future Work
(II)
• The SIFT local descriptor is generated from grey-level
information only
• The addition of other local descriptors may improve
performance
• A local colour descriptor is currently planned
Conclusions
• The results show promise for our relatively simple auto-
annotation technique
• The LSI based method marginally outperforms the vector-
space approach
• This result confirms the findings of our work on image
retrieval using the two approaches

More Related Content

What's hot

Content based image retrieval
Content based image retrievalContent based image retrieval
Content based image retrieval
rubaiyat11
 
Content Based Image Retrieval
Content Based Image Retrieval Content Based Image Retrieval
Content Based Image Retrieval
Swati Chauhan
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image Retrieval
Rayan Dasoriya
 
CBIR in the Era of Deep Learning
CBIR in the Era of Deep LearningCBIR in the Era of Deep Learning
CBIR in the Era of Deep Learning
Xiaohu ZHU
 
CONTENT BASED IMAGE RETRIEVAL SYSTEM
CONTENT BASED IMAGE RETRIEVAL SYSTEMCONTENT BASED IMAGE RETRIEVAL SYSTEM
CONTENT BASED IMAGE RETRIEVAL SYSTEM
Vamsi IV
 
Cbir ‐ features
Cbir ‐ featuresCbir ‐ features
Cbir ‐ features
tanweimin666
 
CBIR by deep learning
CBIR by deep learningCBIR by deep learning
CBIR by deep learning
Vigen Sahakyan
 
Cbir final ppt
Cbir final pptCbir final ppt
Cbir final ppt
rinki nag
 
Content based image retrieval(cbir)
Content based image retrieval(cbir)Content based image retrieval(cbir)
Content based image retrieval(cbir)paddu123
 
IEEE Projects 2014-2015
IEEE Projects 2014-2015IEEE Projects 2014-2015
IEEE Projects 2014-2015
Finalyear Projects
 
Content based image retrieval using clustering Algorithm(CBIR)
Content based image retrieval using clustering Algorithm(CBIR)Content based image retrieval using clustering Algorithm(CBIR)
Content based image retrieval using clustering Algorithm(CBIR)
Raja Sekar
 
Image processing using labview
Image processing using labviewImage processing using labview
Image processing using labview
vikrammutneja1
 
Artist Assistant AI(AAA)
Artist Assistant AI(AAA)Artist Assistant AI(AAA)
Artist Assistant AI(AAA)
Gunhee Lee
 
Content Based Image and Video Retrieval Algorithm
Content Based Image and Video Retrieval AlgorithmContent Based Image and Video Retrieval Algorithm
Content Based Image and Video Retrieval AlgorithmAkshit Bum
 
Deep learning for 3-D Scene Reconstruction and Modeling
Deep learning for 3-D Scene Reconstruction and Modeling Deep learning for 3-D Scene Reconstruction and Modeling
Deep learning for 3-D Scene Reconstruction and Modeling
Yu Huang
 
Comparative between global threshold and adaptative threshold concepts in ima...
Comparative between global threshold and adaptative threshold concepts in ima...Comparative between global threshold and adaptative threshold concepts in ima...
Comparative between global threshold and adaptative threshold concepts in ima...
AssiaHAMZA
 
Image formation
Image formationImage formation
Image formation
Syed Zaid Irshad
 
Domainspecificsubgraph extraction ieee-bigdata2016
Domainspecificsubgraph extraction ieee-bigdata2016Domainspecificsubgraph extraction ieee-bigdata2016
Domainspecificsubgraph extraction ieee-bigdata2016
Sarasi Sarangi
 

What's hot (20)

Content based image retrieval
Content based image retrievalContent based image retrieval
Content based image retrieval
 
Content Based Image Retrieval
Content Based Image Retrieval Content Based Image Retrieval
Content Based Image Retrieval
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image Retrieval
 
CBIR
CBIRCBIR
CBIR
 
CBIR in the Era of Deep Learning
CBIR in the Era of Deep LearningCBIR in the Era of Deep Learning
CBIR in the Era of Deep Learning
 
CONTENT BASED IMAGE RETRIEVAL SYSTEM
CONTENT BASED IMAGE RETRIEVAL SYSTEMCONTENT BASED IMAGE RETRIEVAL SYSTEM
CONTENT BASED IMAGE RETRIEVAL SYSTEM
 
Cbir ‐ features
Cbir ‐ featuresCbir ‐ features
Cbir ‐ features
 
CBIR by deep learning
CBIR by deep learningCBIR by deep learning
CBIR by deep learning
 
Cbir final ppt
Cbir final pptCbir final ppt
Cbir final ppt
 
Content based image retrieval(cbir)
Content based image retrieval(cbir)Content based image retrieval(cbir)
Content based image retrieval(cbir)
 
IEEE Projects 2014-2015
IEEE Projects 2014-2015IEEE Projects 2014-2015
IEEE Projects 2014-2015
 
Content based image retrieval using clustering Algorithm(CBIR)
Content based image retrieval using clustering Algorithm(CBIR)Content based image retrieval using clustering Algorithm(CBIR)
Content based image retrieval using clustering Algorithm(CBIR)
 
Image processing using labview
Image processing using labviewImage processing using labview
Image processing using labview
 
Artist Assistant AI(AAA)
Artist Assistant AI(AAA)Artist Assistant AI(AAA)
Artist Assistant AI(AAA)
 
Content Based Image and Video Retrieval Algorithm
Content Based Image and Video Retrieval AlgorithmContent Based Image and Video Retrieval Algorithm
Content Based Image and Video Retrieval Algorithm
 
Deep learning for 3-D Scene Reconstruction and Modeling
Deep learning for 3-D Scene Reconstruction and Modeling Deep learning for 3-D Scene Reconstruction and Modeling
Deep learning for 3-D Scene Reconstruction and Modeling
 
Comparative between global threshold and adaptative threshold concepts in ima...
Comparative between global threshold and adaptative threshold concepts in ima...Comparative between global threshold and adaptative threshold concepts in ima...
Comparative between global threshold and adaptative threshold concepts in ima...
 
CBIR MIni project2
CBIR MIni project2CBIR MIni project2
CBIR MIni project2
 
Image formation
Image formationImage formation
Image formation
 
Domainspecificsubgraph extraction ieee-bigdata2016
Domainspecificsubgraph extraction ieee-bigdata2016Domainspecificsubgraph extraction ieee-bigdata2016
Domainspecificsubgraph extraction ieee-bigdata2016
 

Viewers also liked

BUILDING A SCALABLE MULTIMEDIA WEB OBSERVATORY
BUILDING A SCALABLE MULTIMEDIA WEB OBSERVATORYBUILDING A SCALABLE MULTIMEDIA WEB OBSERVATORY
BUILDING A SCALABLE MULTIMEDIA WEB OBSERVATORY
Jonathon Hare
 
The Art and Science of Image Retrieval
The Art and Science of Image RetrievalThe Art and Science of Image Retrieval
The Art and Science of Image Retrieval
Jonathon Hare
 
A Linear-Algebraic Technique with an Application in Semantic Image Retrieval
A Linear-Algebraic Technique with an Application in Semantic Image RetrievalA Linear-Algebraic Technique with an Application in Semantic Image Retrieval
A Linear-Algebraic Technique with an Application in Semantic Image Retrieval
Jonathon Hare
 
IMAGE DIVERSITY ANALYSIS: CONTEXT, OPINION AND BIAS
IMAGE DIVERSITY ANALYSIS: CONTEXT, OPINION AND BIASIMAGE DIVERSITY ANALYSIS: CONTEXT, OPINION AND BIAS
IMAGE DIVERSITY ANALYSIS: CONTEXT, OPINION AND BIAS
Jonathon Hare
 
OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
Jonathon Hare
 
Sharp images and fuzzy concepts: Multimedia retrieval and the semantic gap
Sharp images and fuzzy concepts: Multimedia retrieval and the semantic gapSharp images and fuzzy concepts: Multimedia retrieval and the semantic gap
Sharp images and fuzzy concepts: Multimedia retrieval and the semantic gap
Jonathon Hare
 
Mind the Gap: Another look at the problem of the semantic gap in image retrieval
Mind the Gap: Another look at the problem of the semantic gap in image retrievalMind the Gap: Another look at the problem of the semantic gap in image retrieval
Mind the Gap: Another look at the problem of the semantic gap in image retrieval
Jonathon Hare
 
Bridging the Semantic Gap in Multimedia Information Retrieval: Top-down and B...
Bridging the Semantic Gap in Multimedia Information Retrieval: Top-down and B...Bridging the Semantic Gap in Multimedia Information Retrieval: Top-down and B...
Bridging the Semantic Gap in Multimedia Information Retrieval: Top-down and B...
Jonathon Hare
 
A brief introduction to extracting information from images
A brief introduction to extracting information from imagesA brief introduction to extracting information from images
A brief introduction to extracting information from images
Jonathon Hare
 
SEWM'14 keynote: Mining Events from Multimedia Streams
SEWM'14 keynote: Mining Events from Multimedia StreamsSEWM'14 keynote: Mining Events from Multimedia Streams
SEWM'14 keynote: Mining Events from Multimedia Streams
Jonathon Hare
 
WAISFest 2011: Southampton Goggles
WAISFest 2011: Southampton GogglesWAISFest 2011: Southampton Goggles
WAISFest 2011: Southampton Goggles
Jonathon Hare
 
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...Spot the Dog: An overview of semantic retrieval of unannotated images in the ...
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...
Jonathon Hare
 
Mining Events from Multimedia Streams (WAIS Research group seminar June 2014)
Mining Events from Multimedia Streams (WAIS Research group seminar June 2014)Mining Events from Multimedia Streams (WAIS Research group seminar June 2014)
Mining Events from Multimedia Streams (WAIS Research group seminar June 2014)
Jonathon Hare
 
Multimedia Information Retrieval
Multimedia Information RetrievalMultimedia Information Retrieval
Multimedia Information Retrieval
Stephane Marchand-Maillet
 

Viewers also liked (14)

BUILDING A SCALABLE MULTIMEDIA WEB OBSERVATORY
BUILDING A SCALABLE MULTIMEDIA WEB OBSERVATORYBUILDING A SCALABLE MULTIMEDIA WEB OBSERVATORY
BUILDING A SCALABLE MULTIMEDIA WEB OBSERVATORY
 
The Art and Science of Image Retrieval
The Art and Science of Image RetrievalThe Art and Science of Image Retrieval
The Art and Science of Image Retrieval
 
A Linear-Algebraic Technique with an Application in Semantic Image Retrieval
A Linear-Algebraic Technique with an Application in Semantic Image RetrievalA Linear-Algebraic Technique with an Application in Semantic Image Retrieval
A Linear-Algebraic Technique with an Application in Semantic Image Retrieval
 
IMAGE DIVERSITY ANALYSIS: CONTEXT, OPINION AND BIAS
IMAGE DIVERSITY ANALYSIS: CONTEXT, OPINION AND BIASIMAGE DIVERSITY ANALYSIS: CONTEXT, OPINION AND BIAS
IMAGE DIVERSITY ANALYSIS: CONTEXT, OPINION AND BIAS
 
OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
 
Sharp images and fuzzy concepts: Multimedia retrieval and the semantic gap
Sharp images and fuzzy concepts: Multimedia retrieval and the semantic gapSharp images and fuzzy concepts: Multimedia retrieval and the semantic gap
Sharp images and fuzzy concepts: Multimedia retrieval and the semantic gap
 
Mind the Gap: Another look at the problem of the semantic gap in image retrieval
Mind the Gap: Another look at the problem of the semantic gap in image retrievalMind the Gap: Another look at the problem of the semantic gap in image retrieval
Mind the Gap: Another look at the problem of the semantic gap in image retrieval
 
Bridging the Semantic Gap in Multimedia Information Retrieval: Top-down and B...
Bridging the Semantic Gap in Multimedia Information Retrieval: Top-down and B...Bridging the Semantic Gap in Multimedia Information Retrieval: Top-down and B...
Bridging the Semantic Gap in Multimedia Information Retrieval: Top-down and B...
 
A brief introduction to extracting information from images
A brief introduction to extracting information from imagesA brief introduction to extracting information from images
A brief introduction to extracting information from images
 
SEWM'14 keynote: Mining Events from Multimedia Streams
SEWM'14 keynote: Mining Events from Multimedia StreamsSEWM'14 keynote: Mining Events from Multimedia Streams
SEWM'14 keynote: Mining Events from Multimedia Streams
 
WAISFest 2011: Southampton Goggles
WAISFest 2011: Southampton GogglesWAISFest 2011: Southampton Goggles
WAISFest 2011: Southampton Goggles
 
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...Spot the Dog: An overview of semantic retrieval of unannotated images in the ...
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...
 
Mining Events from Multimedia Streams (WAIS Research group seminar June 2014)
Mining Events from Multimedia Streams (WAIS Research group seminar June 2014)Mining Events from Multimedia Streams (WAIS Research group seminar June 2014)
Mining Events from Multimedia Streams (WAIS Research group seminar June 2014)
 
Multimedia Information Retrieval
Multimedia Information RetrievalMultimedia Information Retrieval
Multimedia Information Retrieval
 

Similar to Saliency-based Models of Image Content and their Application to Auto-Annotation by Semantic Propagation

Show observe and tell giang nguyen
Show observe and tell   giang nguyenShow observe and tell   giang nguyen
Show observe and tell giang nguyen
Nguyen Giang
 
Evolving a Medical Image Similarity Search
Evolving a Medical Image Similarity SearchEvolving a Medical Image Similarity Search
Evolving a Medical Image Similarity Search
Sujit Pal
 
Knowing when to look
Knowing when to lookKnowing when to look
Knowing when to look
JaeHo Jang
 
Neural Networks for Machine Learning and Deep Learning
Neural Networks for Machine Learning and Deep LearningNeural Networks for Machine Learning and Deep Learning
Neural Networks for Machine Learning and Deep Learning
comifa7406
 
A COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVAL
A COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVALA COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVAL
A COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVAL
cscpconf
 
A comparative analysis of retrieval techniques in content based image retrieval
A comparative analysis of retrieval techniques in content based image retrievalA comparative analysis of retrieval techniques in content based image retrieval
A comparative analysis of retrieval techniques in content based image retrieval
csandit
 
REVIEW PPT.pptx
REVIEW PPT.pptxREVIEW PPT.pptx
REVIEW PPT.pptx
SaravanaD2
 
sa-mincut-aditya.ppt
sa-mincut-aditya.pptsa-mincut-aditya.ppt
sa-mincut-aditya.ppt
aashnareddy1
 
PPT s12-machine vision-s2
PPT s12-machine vision-s2PPT s12-machine vision-s2
PPT s12-machine vision-s2
Binus Online Learning
 
Learning a Joint Embedding Representation for Image Search using Self-supervi...
Learning a Joint Embedding Representation for Image Search using Self-supervi...Learning a Joint Embedding Representation for Image Search using Self-supervi...
Learning a Joint Embedding Representation for Image Search using Self-supervi...
Sujit Pal
 
SECURE IMAGE RETRIEVAL BASED ON HYBRID FEATURES AND HASHES
SECURE IMAGE RETRIEVAL BASED ON HYBRID FEATURES AND HASHESSECURE IMAGE RETRIEVAL BASED ON HYBRID FEATURES AND HASHES
SECURE IMAGE RETRIEVAL BASED ON HYBRID FEATURES AND HASHES
ranjit banshpal
 
GAN Evaluation
GAN EvaluationGAN Evaluation
GAN Evaluation
Dongheon Lee
 
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
IJERA Editor
 
Image quality improvement of Low-resolution camera using Data fusion technique
Image quality improvement of Low-resolution camera using Data fusion techniqueImage quality improvement of Low-resolution camera using Data fusion technique
Image quality improvement of Low-resolution camera using Data fusion technique
Sayed Abulhasan Quadri
 
A Hybrid Approach for Content Based Image Retrieval System
A Hybrid Approach for Content Based Image Retrieval SystemA Hybrid Approach for Content Based Image Retrieval System
A Hybrid Approach for Content Based Image Retrieval System
IOSR Journals
 
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
StampedeCon
 
A novel Image Retrieval System using an effective region based shape represen...
A novel Image Retrieval System using an effective region based shape represen...A novel Image Retrieval System using an effective region based shape represen...
A novel Image Retrieval System using an effective region based shape represen...
CSCJournals
 

Similar to Saliency-based Models of Image Content and their Application to Auto-Annotation by Semantic Propagation (20)

Show observe and tell giang nguyen
Show observe and tell   giang nguyenShow observe and tell   giang nguyen
Show observe and tell giang nguyen
 
Evolving a Medical Image Similarity Search
Evolving a Medical Image Similarity SearchEvolving a Medical Image Similarity Search
Evolving a Medical Image Similarity Search
 
Knowing when to look
Knowing when to lookKnowing when to look
Knowing when to look
 
Neural Networks for Machine Learning and Deep Learning
Neural Networks for Machine Learning and Deep LearningNeural Networks for Machine Learning and Deep Learning
Neural Networks for Machine Learning and Deep Learning
 
A COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVAL
A COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVALA COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVAL
A COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVAL
 
A comparative analysis of retrieval techniques in content based image retrieval
A comparative analysis of retrieval techniques in content based image retrievalA comparative analysis of retrieval techniques in content based image retrieval
A comparative analysis of retrieval techniques in content based image retrieval
 
REVIEW PPT.pptx
REVIEW PPT.pptxREVIEW PPT.pptx
REVIEW PPT.pptx
 
sa-mincut-aditya.ppt
sa-mincut-aditya.pptsa-mincut-aditya.ppt
sa-mincut-aditya.ppt
 
sa.ppt
sa.pptsa.ppt
sa.ppt
 
sa-mincut-aditya.ppt
sa-mincut-aditya.pptsa-mincut-aditya.ppt
sa-mincut-aditya.ppt
 
PPT s12-machine vision-s2
PPT s12-machine vision-s2PPT s12-machine vision-s2
PPT s12-machine vision-s2
 
poster
posterposter
poster
 
Learning a Joint Embedding Representation for Image Search using Self-supervi...
Learning a Joint Embedding Representation for Image Search using Self-supervi...Learning a Joint Embedding Representation for Image Search using Self-supervi...
Learning a Joint Embedding Representation for Image Search using Self-supervi...
 
SECURE IMAGE RETRIEVAL BASED ON HYBRID FEATURES AND HASHES
SECURE IMAGE RETRIEVAL BASED ON HYBRID FEATURES AND HASHESSECURE IMAGE RETRIEVAL BASED ON HYBRID FEATURES AND HASHES
SECURE IMAGE RETRIEVAL BASED ON HYBRID FEATURES AND HASHES
 
GAN Evaluation
GAN EvaluationGAN Evaluation
GAN Evaluation
 
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
 
Image quality improvement of Low-resolution camera using Data fusion technique
Image quality improvement of Low-resolution camera using Data fusion techniqueImage quality improvement of Low-resolution camera using Data fusion technique
Image quality improvement of Low-resolution camera using Data fusion technique
 
A Hybrid Approach for Content Based Image Retrieval System
A Hybrid Approach for Content Based Image Retrieval SystemA Hybrid Approach for Content Based Image Retrieval System
A Hybrid Approach for Content Based Image Retrieval System
 
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
 
A novel Image Retrieval System using an effective region based shape represen...
A novel Image Retrieval System using an effective region based shape represen...A novel Image Retrieval System using an effective region based shape represen...
A novel Image Retrieval System using an effective region based shape represen...
 

Recently uploaded

一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
NABLAS株式会社
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 

Recently uploaded (20)

一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 

Saliency-based Models of Image Content and their Application to Auto-Annotation by Semantic Propagation

  • 1. Saliency-based Models of Image Content and their Application to Auto-Annotation by Semantic Propagation Jonathon S. Hare and Paul H. Lewis Intelligence, Agents, Multimedia Group Department of Electronics and Computer Science University of Southampton {jsh02r, phl}@ecs.soton.ac.uk
  • 2. Introduction • Image search is much easier if all of the images in a collection have adequate annotations • Annotating images manually is a time-consuming and laborious process • We propose a simple method for automatic image annotation based on the idea that visually similar images should have similar annotations • Visual similarity is assessed using local descriptors of salient regions in an information model • We also address how the quality of the automatic annotations can be assessed
  • 3. Image Auto-Annotation Propagating the Semantics (I) • Simple idea based on propagating semantics between visually similar images • Image content is modelled using concepts from information theory • Modelling the images in this way allows us to assess image similarity in a well-founded framework • We tried two models: • Vector-Space • Latent Semantic Indexing
  • 4. Image Auto-Annotation Propagating the Semantics (II) • A training corpus of pre-annotated images is created • An unannotated image is compared to images in the training corpus using the information models to find visually similar images • The annotations from a number of the closest visually similar images in the training corpus are then applied, or propagated, to the unannotated image
  • 5. Information Theory Modelling Textual Information::vector-spaces • A common approach to modelling documents containing textual information is to use a vector-space • Each document is represented by a vector of term- occurrences • Each element of the vector is the count of the number of times the corresponding term occurred in the document • Similar documents should have similar vectors • i.e. the angle between vectors is small
  • 6. Information Theory Modelling Textual Information::latent semantic indexing • LSI takes the vector-space model a step further • LSI attempts to deal with issues of synonymy and polysemy between terms by using linear algebra • Term-occurrence vectors are arranged in a term- document matrix and factored using SVD • Using the resulting matrices from the SVD it is possible to create a rank-k estimate of the original term-document matrix • Queries can be performed in the k-subspace instead of the original space
  • 7. Modelling Images Representing Images using Textual Information Models • Salient regions are found in the image from peaks in a difference-of-Gaussian pyramid • Local descriptors (SIFT) are calculated for each region • The feature descriptors are quantised by assigning them to the closest visual terms in a predefined vocabulary • Term-occurrence vectors are calculated
  • 8. Experimentation Dataset • Test dataset consisted of 697 annotated photographic images • Original annotations were processed to remove plurals and correct mistakes • 170 processed annotation terms in total • Training and test sets were created by randomly cutting the dataset into halves
  • 9. Experimentation Evaluation Technique::considerations (I) • For any auto-annotation system to be worthwhile it must perform better than if the annotations were guessed based on the empirical distribution of keywords in the training set
  • 10. Experimentation Evaluation Technique::considerations (II) • Images in the training set may have been incorrectly annotated • For comparative purposes this is not a problem as all algorithms have to deal with the same data • However, the reported overall performance is likely to be less than it would be with correct annotations
  • 11. Experimentation Evaluation Technique::performance measure • A reasonable assumption to make of an annotation algorithm should be that it will return approximately the correct number of annotations • Previous auto-annotation work has used the normalised score measure developed by Barnard et al. • However, this measure does not sufficiently weight incorrect guesses, resulting in many more guessed annotations than correct annotations when the score is maximised r = Number of correctly predicted annotations n = Number of actual true annotations w = Number of wrongly predicted annotations N = Number of terms in annotation vocabulary
  • 12. Experimentation Evaluation Technique::performance measure • We have instead adopted the use of precision and recall to measure performance • Unlike in retrieval, we want both high precision and high recall r = Number of correctly predicted annotations n = Number of actual true annotations w = Number of wrongly predicted annotations
  • 13. Experimentation Results (I) • Two models, using vocabularies of 3000 visual terms • Vector-Space • LSI (K=40)
  • 14. Experimentation Results (II) True Annotations Tree, Bush, Sky Temple, Sky Flower, Bush, Tree, Building, Sidewalk Empirical Annotations Tree, Building, People, Bush, Grass Tree, Building, People, Bush, Grass Tree, Building, People, Bush, Grass Vector-Space Annotations Tree, Bush Tree, Building, Grass, Sidewalk, Pole, People, Clear Sky Flower, Bush, Tree, Building, Partially Cloudy Sky LSI Annotations Tree, Bush, Grass, Sidewalk Steps, Wall Flower, Bush, Tree, Ground
  • 15. Future Work (I) • Our current annotation approach is slightly deficient in that it doesn’t allow us to select individual terms • This needs to be addressed • This work used a fixed number of images from which to draw the annotations • It is possible that this number could be chosen dynamically for each unannotated image, based on the similarity between its vector representation and the vectors of the training images
  • 16. Future Work (II) • The SIFT local descriptor is generated from grey-level information only • The addition of other local descriptors may improve performance • A local colour descriptor is currently planned
  • 17. Conclusions • The results show promise for our relatively simple auto- annotation technique • The LSI based method marginally outperforms the vector- space approach • This result confirms the findings of our work on image retrieval using the two approaches