SlideShare a Scribd company logo
1 of 18
IMAGE RETRIEVAL:
  	

CONTENT 
VERSUS 
  	

CONTEXT	

Thijs Westerveld	

	

Teoria e Tecnologia della Comunicazione	

Sistemi Informativi Multimediali AA’11-’12
	

Angelo Oldani 	

744818
INTRODUCTION	



         Through these slides will be presented the paper “Image
         retrieval: content versus context” of Thijs Westerveld.
         
         	

         This paper presents a “new” approach to image retrieval
         that takes the best from two worlds.	

         	

         It combine image features – content
                   collateral text – context
INDEX	




TRADITIONAL IMAGE RETRIEVAL	


LATENT SEMANTIC INDEXING	


FEATURE EXTRACTION PROCESS	


EXPERIMENTS	

DISCUSSION
CONTEXT
BASE
IMAGE
RETRIEVAL	


               May be based on two modes:	

               •  Annotations that are manually added.	

               •  Collateral text available with an image.	

               	

               The similarity between images is then based on the
               similarity between the associated texts.
CONTEXT
BASE
IMAGE
RETRIEVAL	

PROBLEMS	


               •  Synonymy	

                  Use different words to describe the same subject in
                  different documents.	

                  	

               •  Ambiguity	

                  Same words describe different subjects.
CONTENT
BASE
IMAGE
RETRIEVAL	


               Return images that are visually most similar.	

               Similarity is based on a set of low-level image
               features like a:	

               •  colour	

               •  shape	

               •  texture	

               •  …..
CONTENT
BASE
IMAGE
RETRIEVAL	

PROBLEMS	


               Semantic gap
LATENT
SEMANTIC
INDEXING
(LSI)	


              LSI is a method that uses co-occurrence statistics of
              terms to find the semantics behind a document’s terms. 	

              Documents using similar terms are probably related. 	

                      	

	





                     RESERVATION	


                    DOUBLE ROOM	

                               SHOWER	


                         BREAKFAST
LATENT
SEMANTIC
INDEXING
(LSI)	

              No one has combined text and image into the same
              semantic space using LSI.	

              List of terms from both modalities in one term document
              matrix and then apply the SVD resulting in a semantic space
              that contains both visual and textual items.
LATENT
SEMANTIC
INDEXING
(LSI)	

CALCULATING
IMAGE TERMS	





                 To use LSI on image content is necessary to
                 define a set of discrete image features that
                 has the same distribuiton as the set of textual
                 terms.	


                                  Set terms that is sparse as the set
                                  of the textual terms.	

                 CALCULATING
                 IMAGE TERMS	


                                  Set of therms that is the same size
                                  of the textual terms.
FEATURE
EXTRACTION	




          Should extract the indexing terms from documents.	

                     TEXTUAL                 IMAGE
                      TERMS	

                FEATURES	



                 Image captions	

           Colours	

                                             Textures
SPARSE
SET OF
IMAGE TERMS	

               COLOUR FEATURES	


           Has been used HSV colour space divided into 18 Hues, 3
           Saturations and 3 Values and were extracted two sets of
           features:	

           •  Histogram for the whole image.	

           •  Binary value of the most frequent color for each
              block.	


               TEXTURE FEATURES	


           Has been used gabor filters at 3 different wavelengths and
           four orientation and was extracted the average energy for
           each combination of wavelengths and orientation. Avg
           energy values are quantified into 128 bands and disregarding
           the values	

that fall within the lower 16 bands.
SPARSE
SET OF
IMAGE TERMS	

                    TERM FREQUENCIES	





                         Tot. #terms	

   Avg. #terms/doc	

   ratio	

          Text	

              4283	

           27	

         158:1	

         Image	

             37752	

           625	

         63:1	

      Combination	

          42035	

           598	

         70:1
SMALL 
SET OF
IMAGE TERMS	

               COLOUR FEATURES	


           Has been used HSV colour space divided into 18 Hues, 3
           Saturations and 3 Values and were extracted two sets of
           features:	

           •  Histogram for each block.	

           •  Histogram for whole image. 	




               TEXTURE FEATURES	


           Has been used gabor filters at 3 different wavelengths and
           four orientation and was extracted the average energy for
           each combination of wavelengths and orientation. Avg
           energy values are quantified into 10 bands and
           disregarding the values	

that fall within the lower 2 bands.
SMALL
SET OF
IMAGE TERMS	

                    TERM FREQUENCIES	





                         Tot. #terms	

   Avg. #terms/doc	

   ratio	

          Text	

              4283	

           27	

         158:1	

         Image	

              4442	

          1131	

         4:1	

      Combination	

           8725	

          1158	

         8:1
EXPERIMENT	

                         3379 images from Reformatorisch Dagblad
                         online archive together with their
                         captions.	


           Set of 20 documents as query	

           
           3 indexes (LSI indexing):	

           •  Visual terms	

           •  Textual term	

           •  Visual  Textual terms	

           	

           Top 100 returned documents
EXPERIMENT	

RESULTS	





             	

             •  The small set of image features seems to perform
                somewhat better than the sparse set 	

             	

             •  The combined approach for this set of features
                outperforms both the image and the text approach for
                queries with many relevant documents in the data set.
DISCUSSION
	



            Latent Semantic Indexing can help bridge the semantic
            gap	



                         LIMITS	


            •  Research based on very small set of images 	

            •  Text is not available with every image

More Related Content

What's hot

Text Extraction from Image using Python
Text Extraction from Image using PythonText Extraction from Image using Python
Text Extraction from Image using Pythonijtsrd
 
Review of Use of Nonlocal Spectral – Spatial Structured Sparse Representation...
Review of Use of Nonlocal Spectral – Spatial Structured Sparse Representation...Review of Use of Nonlocal Spectral – Spatial Structured Sparse Representation...
Review of Use of Nonlocal Spectral – Spatial Structured Sparse Representation...IJERA Editor
 
Implementation of content based image retrieval using the cfsd algorithm
Implementation of content based image retrieval using the cfsd algorithmImplementation of content based image retrieval using the cfsd algorithm
Implementation of content based image retrieval using the cfsd algorithmeSAT Publishing House
 

What's hot (6)

Text Extraction from Image using Python
Text Extraction from Image using PythonText Extraction from Image using Python
Text Extraction from Image using Python
 
Multimedia searching
Multimedia searchingMultimedia searching
Multimedia searching
 
Review of Use of Nonlocal Spectral – Spatial Structured Sparse Representation...
Review of Use of Nonlocal Spectral – Spatial Structured Sparse Representation...Review of Use of Nonlocal Spectral – Spatial Structured Sparse Representation...
Review of Use of Nonlocal Spectral – Spatial Structured Sparse Representation...
 
Ct31628631
Ct31628631Ct31628631
Ct31628631
 
OOPSLA04.ppt
OOPSLA04.pptOOPSLA04.ppt
OOPSLA04.ppt
 
Implementation of content based image retrieval using the cfsd algorithm
Implementation of content based image retrieval using the cfsd algorithmImplementation of content based image retrieval using the cfsd algorithm
Implementation of content based image retrieval using the cfsd algorithm
 

Viewers also liked

Selling electronic devices!?
Selling electronic devices!?Selling electronic devices!?
Selling electronic devices!?fraillunatic2504
 
xcel energy EEI_Pres_November_2007SEC
xcel energy  EEI_Pres_November_2007SECxcel energy  EEI_Pres_November_2007SEC
xcel energy EEI_Pres_November_2007SECfinance26
 
pnsr_the_power_of_people_report
pnsr_the_power_of_people_reportpnsr_the_power_of_people_report
pnsr_the_power_of_people_reportSharon Czarnek
 
педагогіка123
педагогіка123педагогіка123
педагогіка123Igor Shevtsov
 
A study on existing and required facilities or amenities for
A study on existing and required facilities or amenities forA study on existing and required facilities or amenities for
A study on existing and required facilities or amenities forAlexander Decker
 
Stacy Davis ETEC Timeline
Stacy Davis ETEC TimelineStacy Davis ETEC Timeline
Stacy Davis ETEC TimelineStacy Davis
 
Back to school scholarships
Back to school scholarshipsBack to school scholarships
Back to school scholarshipsjerry guhyem
 
Programma "Horizon 2020"
Programma "Horizon 2020"Programma "Horizon 2020"
Programma "Horizon 2020"EnricoPanini
 
Tutorials: The Range
Tutorials: The RangeTutorials: The Range
Tutorials: The RangeMedia4math
 
Amnistia30urte
Amnistia30urteAmnistia30urte
Amnistia30urteetengabe
 
мовознавець
мовознавецьмовознавець
мовознавецьgalina90210
 

Viewers also liked (18)

Semantics
Semantics Semantics
Semantics
 
Selling electronic devices!?
Selling electronic devices!?Selling electronic devices!?
Selling electronic devices!?
 
xcel energy EEI_Pres_November_2007SEC
xcel energy  EEI_Pres_November_2007SECxcel energy  EEI_Pres_November_2007SEC
xcel energy EEI_Pres_November_2007SEC
 
pnsr_the_power_of_people_report
pnsr_the_power_of_people_reportpnsr_the_power_of_people_report
pnsr_the_power_of_people_report
 
педагогіка123
педагогіка123педагогіка123
педагогіка123
 
A study on existing and required facilities or amenities for
A study on existing and required facilities or amenities forA study on existing and required facilities or amenities for
A study on existing and required facilities or amenities for
 
Stacy Davis ETEC Timeline
Stacy Davis ETEC TimelineStacy Davis ETEC Timeline
Stacy Davis ETEC Timeline
 
La ley de_ohm[1]
La ley de_ohm[1]La ley de_ohm[1]
La ley de_ohm[1]
 
Road Safety Products
Road Safety ProductsRoad Safety Products
Road Safety Products
 
Future of india
Future of indiaFuture of india
Future of india
 
Back to school scholarships
Back to school scholarshipsBack to school scholarships
Back to school scholarships
 
Programma "Horizon 2020"
Programma "Horizon 2020"Programma "Horizon 2020"
Programma "Horizon 2020"
 
Tutorials: The Range
Tutorials: The RangeTutorials: The Range
Tutorials: The Range
 
Resume Only
Resume OnlyResume Only
Resume Only
 
CV Aditya - Oct 2016
CV Aditya - Oct 2016CV Aditya - Oct 2016
CV Aditya - Oct 2016
 
Amnistia30urte
Amnistia30urteAmnistia30urte
Amnistia30urte
 
мовознавець
мовознавецьмовознавець
мовознавець
 
Ijrdt11 140005
Ijrdt11 140005Ijrdt11 140005
Ijrdt11 140005
 

Similar to Content vs Context

A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...
A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...
A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...Matthias Trapp
 
Learning a Joint Embedding Representation for Image Search using Self-supervi...
Learning a Joint Embedding Representation for Image Search using Self-supervi...Learning a Joint Embedding Representation for Image Search using Self-supervi...
Learning a Joint Embedding Representation for Image Search using Self-supervi...Sujit Pal
 
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...Spot the Dog: An overview of semantic retrieval of unannotated images in the ...
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...Jonathon Hare
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Scienceinventy
 
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...Zahra Mansoori
 
Reference Scope Identification of Citances Using Convolutional Neural Network
Reference Scope Identification of Citances Using Convolutional Neural NetworkReference Scope Identification of Citances Using Convolutional Neural Network
Reference Scope Identification of Citances Using Convolutional Neural NetworkSaurav Jha
 
Parsimonious topic models with salient word discovery
Parsimonious topic models with salient word discoveryParsimonious topic models with salient word discovery
Parsimonious topic models with salient word discoveryieeepondy
 
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...Jonathon Hare
 
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...Jonathon Hare
 
SECURE IMAGE RETRIEVAL BASED ON HYBRID FEATURES AND HASHES
SECURE IMAGE RETRIEVAL BASED ON HYBRID FEATURES AND HASHESSECURE IMAGE RETRIEVAL BASED ON HYBRID FEATURES AND HASHES
SECURE IMAGE RETRIEVAL BASED ON HYBRID FEATURES AND HASHESranjit banshpal
 
Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...Jonathon Hare
 
Segmentation - based Historical Handwritten Word Spotting using document-spec...
Segmentation - based Historical Handwritten Word Spotting using document-spec...Segmentation - based Historical Handwritten Word Spotting using document-spec...
Segmentation - based Historical Handwritten Word Spotting using document-spec...Konstantinos Zagoris
 
Probabilistic data structures. Part 4. Similarity
Probabilistic data structures. Part 4. SimilarityProbabilistic data structures. Part 4. Similarity
Probabilistic data structures. Part 4. SimilarityAndrii Gakhov
 
Evolving a Medical Image Similarity Search
Evolving a Medical Image Similarity SearchEvolving a Medical Image Similarity Search
Evolving a Medical Image Similarity SearchSujit Pal
 
Show observe and tell giang nguyen
Show observe and tell   giang nguyenShow observe and tell   giang nguyen
Show observe and tell giang nguyenNguyen Giang
 

Similar to Content vs Context (20)

A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...
A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...
A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...
 
Learning a Joint Embedding Representation for Image Search using Self-supervi...
Learning a Joint Embedding Representation for Image Search using Self-supervi...Learning a Joint Embedding Representation for Image Search using Self-supervi...
Learning a Joint Embedding Representation for Image Search using Self-supervi...
 
Das09112008
Das09112008Das09112008
Das09112008
 
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...Spot the Dog: An overview of semantic retrieval of unannotated images in the ...
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
 
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...
 
Reference Scope Identification of Citances Using Convolutional Neural Network
Reference Scope Identification of Citances Using Convolutional Neural NetworkReference Scope Identification of Citances Using Convolutional Neural Network
Reference Scope Identification of Citances Using Convolutional Neural Network
 
Ac03401600163.
Ac03401600163.Ac03401600163.
Ac03401600163.
 
Parsimonious topic models with salient word discovery
Parsimonious topic models with salient word discoveryParsimonious topic models with salient word discovery
Parsimonious topic models with salient word discovery
 
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
 
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
 
SECURE IMAGE RETRIEVAL BASED ON HYBRID FEATURES AND HASHES
SECURE IMAGE RETRIEVAL BASED ON HYBRID FEATURES AND HASHESSECURE IMAGE RETRIEVAL BASED ON HYBRID FEATURES AND HASHES
SECURE IMAGE RETRIEVAL BASED ON HYBRID FEATURES AND HASHES
 
Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...
 
Visual Search
Visual SearchVisual Search
Visual Search
 
Segmentation - based Historical Handwritten Word Spotting using document-spec...
Segmentation - based Historical Handwritten Word Spotting using document-spec...Segmentation - based Historical Handwritten Word Spotting using document-spec...
Segmentation - based Historical Handwritten Word Spotting using document-spec...
 
B0310408
B0310408B0310408
B0310408
 
LSDI 2.pptx
LSDI 2.pptxLSDI 2.pptx
LSDI 2.pptx
 
Probabilistic data structures. Part 4. Similarity
Probabilistic data structures. Part 4. SimilarityProbabilistic data structures. Part 4. Similarity
Probabilistic data structures. Part 4. Similarity
 
Evolving a Medical Image Similarity Search
Evolving a Medical Image Similarity SearchEvolving a Medical Image Similarity Search
Evolving a Medical Image Similarity Search
 
Show observe and tell giang nguyen
Show observe and tell   giang nguyenShow observe and tell   giang nguyen
Show observe and tell giang nguyen
 

Recently uploaded

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 

Recently uploaded (20)

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Content vs Context

  • 1. IMAGE RETRIEVAL: CONTENT VERSUS CONTEXT Thijs Westerveld Teoria e Tecnologia della Comunicazione Sistemi Informativi Multimediali AA’11-’12 Angelo Oldani 744818
  • 2. INTRODUCTION Through these slides will be presented the paper “Image retrieval: content versus context” of Thijs Westerveld. This paper presents a “new” approach to image retrieval that takes the best from two worlds. It combine image features – content collateral text – context
  • 3. INDEX TRADITIONAL IMAGE RETRIEVAL LATENT SEMANTIC INDEXING FEATURE EXTRACTION PROCESS EXPERIMENTS DISCUSSION
  • 4. CONTEXT BASE IMAGE RETRIEVAL May be based on two modes: •  Annotations that are manually added. •  Collateral text available with an image. The similarity between images is then based on the similarity between the associated texts.
  • 5. CONTEXT BASE IMAGE RETRIEVAL PROBLEMS •  Synonymy Use different words to describe the same subject in different documents. •  Ambiguity Same words describe different subjects.
  • 6. CONTENT BASE IMAGE RETRIEVAL Return images that are visually most similar. Similarity is based on a set of low-level image features like a: •  colour •  shape •  texture •  …..
  • 8. LATENT SEMANTIC INDEXING (LSI) LSI is a method that uses co-occurrence statistics of terms to find the semantics behind a document’s terms. Documents using similar terms are probably related. RESERVATION DOUBLE ROOM SHOWER BREAKFAST
  • 9. LATENT SEMANTIC INDEXING (LSI) No one has combined text and image into the same semantic space using LSI. List of terms from both modalities in one term document matrix and then apply the SVD resulting in a semantic space that contains both visual and textual items.
  • 10. LATENT SEMANTIC INDEXING (LSI) CALCULATING IMAGE TERMS To use LSI on image content is necessary to define a set of discrete image features that has the same distribuiton as the set of textual terms. Set terms that is sparse as the set of the textual terms. CALCULATING IMAGE TERMS Set of therms that is the same size of the textual terms.
  • 11. FEATURE EXTRACTION Should extract the indexing terms from documents. TEXTUAL IMAGE TERMS FEATURES Image captions Colours Textures
  • 12. SPARSE SET OF IMAGE TERMS COLOUR FEATURES Has been used HSV colour space divided into 18 Hues, 3 Saturations and 3 Values and were extracted two sets of features: •  Histogram for the whole image. •  Binary value of the most frequent color for each block. TEXTURE FEATURES Has been used gabor filters at 3 different wavelengths and four orientation and was extracted the average energy for each combination of wavelengths and orientation. Avg energy values are quantified into 128 bands and disregarding the values that fall within the lower 16 bands.
  • 13. SPARSE SET OF IMAGE TERMS TERM FREQUENCIES Tot. #terms Avg. #terms/doc ratio Text 4283 27 158:1 Image 37752 625 63:1 Combination 42035 598 70:1
  • 14. SMALL SET OF IMAGE TERMS COLOUR FEATURES Has been used HSV colour space divided into 18 Hues, 3 Saturations and 3 Values and were extracted two sets of features: •  Histogram for each block. •  Histogram for whole image. TEXTURE FEATURES Has been used gabor filters at 3 different wavelengths and four orientation and was extracted the average energy for each combination of wavelengths and orientation. Avg energy values are quantified into 10 bands and disregarding the values that fall within the lower 2 bands.
  • 15. SMALL SET OF IMAGE TERMS TERM FREQUENCIES Tot. #terms Avg. #terms/doc ratio Text 4283 27 158:1 Image 4442 1131 4:1 Combination 8725 1158 8:1
  • 16. EXPERIMENT 3379 images from Reformatorisch Dagblad online archive together with their captions. Set of 20 documents as query 3 indexes (LSI indexing): •  Visual terms •  Textual term •  Visual Textual terms Top 100 returned documents
  • 17. EXPERIMENT RESULTS •  The small set of image features seems to perform somewhat better than the sparse set •  The combined approach for this set of features outperforms both the image and the text approach for queries with many relevant documents in the data set.
  • 18. DISCUSSION Latent Semantic Indexing can help bridge the semantic gap LIMITS •  Research based on very small set of images •  Text is not available with every image