SlideShare a Scribd company logo
1 of 17
Visual-Textual Joint Relevance Learning
For Tag-Based Social Image Search
Sushil Kumar(16EC65R16)
M.Tech, VIPES
IIT Kharagpur
1
Content
• Introduction
• Related Works
A) Social Image Search
B) Hypergraph Learning
• Hypergraph Analysis
• Visual-Textual Relevance Learning
• Experimental Results
• Conclusion
• References
2
Introduction
Conventional tag-based social image search methods cannot achieve satisfactory results for
two reasons
1) Too much noise in user-provided tags
2) It lacks an optimal ranking strategy
Ranking options
1) Time based ranking
2) Interestingness based ranking
Most existing Algorithms usually explore visual content and tags separately or sequentially.
3
Fig. 1[1]. Schematic illustration of the proposed visual-textual joint relevance learning approach.
4
Related works
A) Social Image Search
1) Separated Methods: In separated methods, only the textual content or the
visual content is employed for tag analysis.
2) Sequential Methods: In sequential methods, the visual content and the tags are
sequentially employed.
B) Hypergraph Learning
Hypergraph has been employed for image retrieval and object recognition.
5
Fig. 2[1]. Illustration of different social image search methods.
6
Hypergraph Analysis
In a simple graph, samples are represented by vertices and an edge links the
two related vertices.
Hyperedge in a hypergraph is able to link more than two vertices.
Vertices represents images and hyperedges are representing tag/visual terms.
A hypergraph G = (V,E,w) is composed by a vertex set V, an edge set E, and the weights
of the edges w.
The hypergraph G can be denoted by a |V| × |E| incidence matrix H with entries defined
as
h 𝑣, 𝑒 =
1, 𝑖𝑓 𝑣 ∈ 𝑒
0, 𝑖𝑓 𝑣 ∉ 𝑒
7
(1)
For a vertex v ∈ V, its vertex degree can be estimated by
d(v) =
𝑒∈ 𝐸
𝑤 𝑒 ℎ(𝑣, 𝑒)
For a hyperedge e ∈ E, its hyperedge degree can be estimated by
δ 𝑒 =
v∈V
ℎ(𝑣, 𝑒)
W denote the diagonal matrix of the hyperedge weights
W 𝑖, 𝑗 =
𝑤 𝑖 , 𝑖𝑓 𝑖 = 𝑗
0, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
8
(4)
(3)
(2)
Learning
Regularization framework
Arg minf {λRemp( f ) +Ω ( f )}
where f is the to-be-learned classification function, Ω( f ) is a regularizer on the
hypergraph, Remp( f ) is empirical loss, and λ > 0 is a weighting parameter.
9
(5)
(6)
In matrix form
Ω( f ) = f T Δ f ; Δ=I-Θ
Θ=Dv
-1/2HWDe
-1HTDv
-1/2a
Dv and De the diagonal matrices of the vertex degree and the
hyperedge degree.
The loss term is defined as:
Remp(f)= 𝑓−𝑦 2 = u∈V f (u) − y (u)) 2
y is an n × 1 initial label vector.
10
(7)
(9)
Step 1. Hypergraph Construction
1. For social image set X = (x1, x2, . . . , xn) as a vertex in the hypergraph G = (V,E,w).
2. Generate a bag-of-visual-words description fi
bow for each image xi, where fi
bow (k, 1) = 1 indicates
that xi contains the kth visual word. Construct hyperedges by using fi
bow.
3. For each image, the tags are ranked and only top min (nl , ni ) tags are left for further processing.
4. Generate a bag-of-textual-words description fi
tag for each image xi, where fi
tag (k, 1) = 1 indicates that
xi contains the kth selected tag.
5. Construct hyperedges by using fi
tag.
6. Generate the incidence matrix Hi , the diagonal matrices of the vertex degrees and the hyperedge
degrees Dv and De, the initial weights of all hyperedges w respectively.
Visual-Textual Relevance Learning
Algorithm
11
Step 2. Pseudo-Relevant Sample Selection
The Flickr Distance is employed to estimate the semantic relevance of an image xi to
the query tag tq , and the top K results are selected as the pseudo-relevant images.
Step 3. Relevance Learning on Hypergraph
Conduct semi-supervised learning on the hypergraph structure. Iteratively learn the
to-be-learned relevance score vector f and the weights for hyperedge w.
12
Fig. 3[1]. Examples of hyperedge construction. (a) Example of textual hyperedge
construction, where three hyperedges are generated by tags “people,” “gun,”
and “tank.” (b) Example of visual hyperedge construction, where three hyperedges are
generated by three visual words.
Fig. 4[1]. Example of the connection
between two images.
13
Query Seq. HG HG+WE HG+WE(VIS) HG+WE(TAG)
Airshow 0.4193 0.5759 0.7183 0.5847 0.6869
Apple 0.2433 0.6975 0.8128 0.8100 0.7875
Aquarium 0.5640 0.8163 0.9346 0.9189 0.9134
Basin 0.2981 0.4911 0.6115 0.6178 0.5946
Beach 0.5986 0.8270 1.0000 0.9949 0.9869
Bird 0.8931 0.9576 0.9653 0.9375 0.9618
Bmw 0.5910 0.6244 0.7265 0.7048 0.6826
Table [1][1]:The NDCG @7 results of different methods. the best result in each row is marked in blue
Experimental Results
14
NDCG=Normalised Discounted Cumulative Gain
(a)
(b)
(c)
(d)
(e)
Fig. 5. Top results obtained by
different methods for the query apple.
(a) Sequential social image relevance
learning.
(b) Hypergraph-based relevance
learning.
(c) Hypergraph based relevance
learning with hyperedge weight
estimation, i.e., the proposed method.
(d) Proposed learning method with
merely visual information.
(e) Proposed learning method with
merely tag information.
15
Conclusion
• In the proposed method, both visual content and tags are used to generate the
hyperedges of a hypergraph, and a relevance learning procedure is performed on
the hypergraph structure
• Experimental results demonstrate that the proposed method achieved better results
compared with many base line methods including sequential social image ranking,
Hypergraph-based relevance learning, HG-WE (Visual) and HG-WE (Tag).
16
References
[1]. Y. Gao, M. Wang, Z. J. Zha, J. Shen, X. Li and X. Wu, "Visual-Textual
Joint Relevance Learning for Tag-Based Social Image Search," in IEEE
Transactions on Image Processing, vol. 22, no. 1, pp. 363-376, Jan. 2013.
[2]. M. Wang, K. Yang, X. S. Hua, and H.-J. Zhang, “Toward a relevant and
diverse search of social images,” IEEE Trans. Multimedia, vol. 12, no. 8, pp.
829–842, Dec. 2010.
[3]. Q. Liu, Y. Huang, and D. Metaxas, “Hypergraph with sampling for image
retrieval,” Pattern Recognit., vol. 44, nos. 10–11, pp. 2255–2262, 2011.
17

More Related Content

What's hot

Raw 2009 -THE ROLE OF LATEST FIXATIONS ON ONGOING VISUAL SEARCH A MODEL TO E...
Raw 2009 -THE ROLE OF LATEST FIXATIONS ON ONGOING VISUAL SEARCH  A MODEL TO E...Raw 2009 -THE ROLE OF LATEST FIXATIONS ON ONGOING VISUAL SEARCH  A MODEL TO E...
Raw 2009 -THE ROLE OF LATEST FIXATIONS ON ONGOING VISUAL SEARCH A MODEL TO E...Giacomo Veneri
 
A Hybrid Trademark Retrieval System Using Four-Gray-Level Zernike Moments & ...
A Hybrid Trademark Retrieval System Using Four-Gray-Level Zernike Moments & ...A Hybrid Trademark Retrieval System Using Four-Gray-Level Zernike Moments & ...
A Hybrid Trademark Retrieval System Using Four-Gray-Level Zernike Moments & ...Kazi Mostafa
 
A Framework for Soccer Video Processing and AnalysisBased on Enhanced Algorit...
A Framework for Soccer Video Processing and AnalysisBased on Enhanced Algorit...A Framework for Soccer Video Processing and AnalysisBased on Enhanced Algorit...
A Framework for Soccer Video Processing and AnalysisBased on Enhanced Algorit...CSCJournals
 
Text Extraction from Image using Python
Text Extraction from Image using PythonText Extraction from Image using Python
Text Extraction from Image using Pythonijtsrd
 
Region filling and object removal by exemplar based image inpainting
Region filling and object removal by exemplar based image inpaintingRegion filling and object removal by exemplar based image inpainting
Region filling and object removal by exemplar based image inpaintingWoonghee Lee
 
optimal subsampling
optimal subsamplingoptimal subsampling
optimal subsamplingTian Tian
 
A Review on Image Inpainting to Restore Image
A Review on Image Inpainting to Restore ImageA Review on Image Inpainting to Restore Image
A Review on Image Inpainting to Restore ImageIOSR Journals
 
Interpolation Technique using Non Linear Partial Differential Equation with E...
Interpolation Technique using Non Linear Partial Differential Equation with E...Interpolation Technique using Non Linear Partial Differential Equation with E...
Interpolation Technique using Non Linear Partial Differential Equation with E...CSCJournals
 
Digital image processing lab 1
Digital image processing lab 1Digital image processing lab 1
Digital image processing lab 1Moe Moe Myint
 
Text extraction from images
Text extraction from imagesText extraction from images
Text extraction from imagesGarby Baby
 
Image colorization
Image colorizationImage colorization
Image colorizationPankti Fadia
 
11.graph cut based local binary patterns for content based image retrieval
11.graph cut based local binary patterns for content based image retrieval11.graph cut based local binary patterns for content based image retrieval
11.graph cut based local binary patterns for content based image retrievalAlexander Decker
 
3.[13 21]framework of smart mobile rfid networks
3.[13 21]framework of smart mobile rfid networks3.[13 21]framework of smart mobile rfid networks
3.[13 21]framework of smart mobile rfid networksAlexander Decker
 
3.[18 30]graph cut based local binary patterns for content based image retrieval
3.[18 30]graph cut based local binary patterns for content based image retrieval3.[18 30]graph cut based local binary patterns for content based image retrieval
3.[18 30]graph cut based local binary patterns for content based image retrievalAlexander Decker
 
11.framework of smart mobile rfid networks
11.framework of smart mobile rfid networks11.framework of smart mobile rfid networks
11.framework of smart mobile rfid networksAlexander Decker
 
Digital Image Processing (Lab 08)
Digital Image Processing (Lab 08)Digital Image Processing (Lab 08)
Digital Image Processing (Lab 08)Moe Moe Myint
 

What's hot (20)

Raw 2009 -THE ROLE OF LATEST FIXATIONS ON ONGOING VISUAL SEARCH A MODEL TO E...
Raw 2009 -THE ROLE OF LATEST FIXATIONS ON ONGOING VISUAL SEARCH  A MODEL TO E...Raw 2009 -THE ROLE OF LATEST FIXATIONS ON ONGOING VISUAL SEARCH  A MODEL TO E...
Raw 2009 -THE ROLE OF LATEST FIXATIONS ON ONGOING VISUAL SEARCH A MODEL TO E...
 
A Hybrid Trademark Retrieval System Using Four-Gray-Level Zernike Moments & ...
A Hybrid Trademark Retrieval System Using Four-Gray-Level Zernike Moments & ...A Hybrid Trademark Retrieval System Using Four-Gray-Level Zernike Moments & ...
A Hybrid Trademark Retrieval System Using Four-Gray-Level Zernike Moments & ...
 
A Framework for Soccer Video Processing and AnalysisBased on Enhanced Algorit...
A Framework for Soccer Video Processing and AnalysisBased on Enhanced Algorit...A Framework for Soccer Video Processing and AnalysisBased on Enhanced Algorit...
A Framework for Soccer Video Processing and AnalysisBased on Enhanced Algorit...
 
Text Extraction from Image using Python
Text Extraction from Image using PythonText Extraction from Image using Python
Text Extraction from Image using Python
 
Sharath copy
Sharath   copySharath   copy
Sharath copy
 
Image Inpainting
Image InpaintingImage Inpainting
Image Inpainting
 
Region filling and object removal by exemplar based image inpainting
Region filling and object removal by exemplar based image inpaintingRegion filling and object removal by exemplar based image inpainting
Region filling and object removal by exemplar based image inpainting
 
optimal subsampling
optimal subsamplingoptimal subsampling
optimal subsampling
 
A Review on Image Inpainting to Restore Image
A Review on Image Inpainting to Restore ImageA Review on Image Inpainting to Restore Image
A Review on Image Inpainting to Restore Image
 
point processing
point processingpoint processing
point processing
 
Interpolation Technique using Non Linear Partial Differential Equation with E...
Interpolation Technique using Non Linear Partial Differential Equation with E...Interpolation Technique using Non Linear Partial Differential Equation with E...
Interpolation Technique using Non Linear Partial Differential Equation with E...
 
Digital image processing lab 1
Digital image processing lab 1Digital image processing lab 1
Digital image processing lab 1
 
Text extraction from images
Text extraction from imagesText extraction from images
Text extraction from images
 
Image colorization
Image colorizationImage colorization
Image colorization
 
11.graph cut based local binary patterns for content based image retrieval
11.graph cut based local binary patterns for content based image retrieval11.graph cut based local binary patterns for content based image retrieval
11.graph cut based local binary patterns for content based image retrieval
 
3.[13 21]framework of smart mobile rfid networks
3.[13 21]framework of smart mobile rfid networks3.[13 21]framework of smart mobile rfid networks
3.[13 21]framework of smart mobile rfid networks
 
3.[18 30]graph cut based local binary patterns for content based image retrieval
3.[18 30]graph cut based local binary patterns for content based image retrieval3.[18 30]graph cut based local binary patterns for content based image retrieval
3.[18 30]graph cut based local binary patterns for content based image retrieval
 
11.framework of smart mobile rfid networks
11.framework of smart mobile rfid networks11.framework of smart mobile rfid networks
11.framework of smart mobile rfid networks
 
Digital Image Processing (Lab 08)
Digital Image Processing (Lab 08)Digital Image Processing (Lab 08)
Digital Image Processing (Lab 08)
 
Test
TestTest
Test
 

Similar to Visual textual joint relevance learning for tag based social image search

An Hypergraph Object Oriented Model For Image Segmentation And Annotation
An Hypergraph Object Oriented Model For Image Segmentation And AnnotationAn Hypergraph Object Oriented Model For Image Segmentation And Annotation
An Hypergraph Object Oriented Model For Image Segmentation And AnnotationCrystal Sanchez
 
自然方策勾配法の基礎と応用
自然方策勾配法の基礎と応用自然方策勾配法の基礎と応用
自然方策勾配法の基礎と応用Ryo Iwaki
 
ShawnQuinnCSS581FinalProjectReport
ShawnQuinnCSS581FinalProjectReportShawnQuinnCSS581FinalProjectReport
ShawnQuinnCSS581FinalProjectReportShawn Quinn
 
Classification of Fine Art Oil Paintings by Semantic Category
Classification of Fine Art Oil Paintings by Semantic CategoryClassification of Fine Art Oil Paintings by Semantic Category
Classification of Fine Art Oil Paintings by Semantic CategoryBill Kromydas
 
Learning to Rank Image Tags With Limited Training Examples
Learning to Rank Image Tags With Limited Training ExamplesLearning to Rank Image Tags With Limited Training Examples
Learning to Rank Image Tags With Limited Training Examples1crore projects
 
Learning with Relative Attributes
Learning with Relative AttributesLearning with Relative Attributes
Learning with Relative AttributesVikas Jain
 
Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search
Visual-Textual Joint Relevance Learning for Tag-Based Social Image SearchVisual-Textual Joint Relevance Learning for Tag-Based Social Image Search
Visual-Textual Joint Relevance Learning for Tag-Based Social Image SearchSOYEON KIM
 
Colour Object Recognition using Biologically Inspired Model
Colour Object Recognition using Biologically Inspired ModelColour Object Recognition using Biologically Inspired Model
Colour Object Recognition using Biologically Inspired Modelijsrd.com
 
$$ Formulating semantic image annotation as a supervised learning problem
$$ Formulating semantic image annotation as a supervised learning problem$$ Formulating semantic image annotation as a supervised learning problem
$$ Formulating semantic image annotation as a supervised learning problemmhmt82
 
$$ Formulating semantic image annotation as a supervised learning problem
$$ Formulating semantic image annotation as a supervised learning problem$$ Formulating semantic image annotation as a supervised learning problem
$$ Formulating semantic image annotation as a supervised learning problemmhmt82
 
A Hough Transform Based On a Map-Reduce Algorithm
A Hough Transform Based On a Map-Reduce AlgorithmA Hough Transform Based On a Map-Reduce Algorithm
A Hough Transform Based On a Map-Reduce AlgorithmIJERA Editor
 
Real-time Multi-object Face Recognition Using Content Based Image Retrieval (...
Real-time Multi-object Face Recognition Using Content Based Image Retrieval (...Real-time Multi-object Face Recognition Using Content Based Image Retrieval (...
Real-time Multi-object Face Recognition Using Content Based Image Retrieval (...IJECEIAES
 
Machine Learning and Artificial Neural Networks.ppt
Machine Learning and Artificial Neural Networks.pptMachine Learning and Artificial Neural Networks.ppt
Machine Learning and Artificial Neural Networks.pptAnshika865276
 
An adaptive-model-for-blind-image-restoration-using-bayesian-approach
An adaptive-model-for-blind-image-restoration-using-bayesian-approachAn adaptive-model-for-blind-image-restoration-using-bayesian-approach
An adaptive-model-for-blind-image-restoration-using-bayesian-approachCemal Ardil
 
Vehicle Recognition Using VIBE and SVM
Vehicle Recognition Using VIBE and SVMVehicle Recognition Using VIBE and SVM
Vehicle Recognition Using VIBE and SVMCSEIJJournal
 
3ways to improve semantic segmentation
3ways to improve semantic segmentation3ways to improve semantic segmentation
3ways to improve semantic segmentationFrozen Paradise
 
VEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVMVEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVMcseij
 
VEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVM VEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVM cseij
 
VEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVMVEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVMcseij
 
An Improved Image Fusion Scheme Based on Markov Random Fields with Image Enha...
An Improved Image Fusion Scheme Based on Markov Random Fields with Image Enha...An Improved Image Fusion Scheme Based on Markov Random Fields with Image Enha...
An Improved Image Fusion Scheme Based on Markov Random Fields with Image Enha...Editor IJCATR
 

Similar to Visual textual joint relevance learning for tag based social image search (20)

An Hypergraph Object Oriented Model For Image Segmentation And Annotation
An Hypergraph Object Oriented Model For Image Segmentation And AnnotationAn Hypergraph Object Oriented Model For Image Segmentation And Annotation
An Hypergraph Object Oriented Model For Image Segmentation And Annotation
 
自然方策勾配法の基礎と応用
自然方策勾配法の基礎と応用自然方策勾配法の基礎と応用
自然方策勾配法の基礎と応用
 
ShawnQuinnCSS581FinalProjectReport
ShawnQuinnCSS581FinalProjectReportShawnQuinnCSS581FinalProjectReport
ShawnQuinnCSS581FinalProjectReport
 
Classification of Fine Art Oil Paintings by Semantic Category
Classification of Fine Art Oil Paintings by Semantic CategoryClassification of Fine Art Oil Paintings by Semantic Category
Classification of Fine Art Oil Paintings by Semantic Category
 
Learning to Rank Image Tags With Limited Training Examples
Learning to Rank Image Tags With Limited Training ExamplesLearning to Rank Image Tags With Limited Training Examples
Learning to Rank Image Tags With Limited Training Examples
 
Learning with Relative Attributes
Learning with Relative AttributesLearning with Relative Attributes
Learning with Relative Attributes
 
Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search
Visual-Textual Joint Relevance Learning for Tag-Based Social Image SearchVisual-Textual Joint Relevance Learning for Tag-Based Social Image Search
Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search
 
Colour Object Recognition using Biologically Inspired Model
Colour Object Recognition using Biologically Inspired ModelColour Object Recognition using Biologically Inspired Model
Colour Object Recognition using Biologically Inspired Model
 
$$ Formulating semantic image annotation as a supervised learning problem
$$ Formulating semantic image annotation as a supervised learning problem$$ Formulating semantic image annotation as a supervised learning problem
$$ Formulating semantic image annotation as a supervised learning problem
 
$$ Formulating semantic image annotation as a supervised learning problem
$$ Formulating semantic image annotation as a supervised learning problem$$ Formulating semantic image annotation as a supervised learning problem
$$ Formulating semantic image annotation as a supervised learning problem
 
A Hough Transform Based On a Map-Reduce Algorithm
A Hough Transform Based On a Map-Reduce AlgorithmA Hough Transform Based On a Map-Reduce Algorithm
A Hough Transform Based On a Map-Reduce Algorithm
 
Real-time Multi-object Face Recognition Using Content Based Image Retrieval (...
Real-time Multi-object Face Recognition Using Content Based Image Retrieval (...Real-time Multi-object Face Recognition Using Content Based Image Retrieval (...
Real-time Multi-object Face Recognition Using Content Based Image Retrieval (...
 
Machine Learning and Artificial Neural Networks.ppt
Machine Learning and Artificial Neural Networks.pptMachine Learning and Artificial Neural Networks.ppt
Machine Learning and Artificial Neural Networks.ppt
 
An adaptive-model-for-blind-image-restoration-using-bayesian-approach
An adaptive-model-for-blind-image-restoration-using-bayesian-approachAn adaptive-model-for-blind-image-restoration-using-bayesian-approach
An adaptive-model-for-blind-image-restoration-using-bayesian-approach
 
Vehicle Recognition Using VIBE and SVM
Vehicle Recognition Using VIBE and SVMVehicle Recognition Using VIBE and SVM
Vehicle Recognition Using VIBE and SVM
 
3ways to improve semantic segmentation
3ways to improve semantic segmentation3ways to improve semantic segmentation
3ways to improve semantic segmentation
 
VEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVMVEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVM
 
VEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVM VEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVM
 
VEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVMVEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVM
 
An Improved Image Fusion Scheme Based on Markov Random Fields with Image Enha...
An Improved Image Fusion Scheme Based on Markov Random Fields with Image Enha...An Improved Image Fusion Scheme Based on Markov Random Fields with Image Enha...
An Improved Image Fusion Scheme Based on Markov Random Fields with Image Enha...
 

Recently uploaded

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxShobhayan Kirtania
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...anjaliyadav012327
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 

Recently uploaded (20)

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 

Visual textual joint relevance learning for tag based social image search

  • 1. Visual-Textual Joint Relevance Learning For Tag-Based Social Image Search Sushil Kumar(16EC65R16) M.Tech, VIPES IIT Kharagpur 1
  • 2. Content • Introduction • Related Works A) Social Image Search B) Hypergraph Learning • Hypergraph Analysis • Visual-Textual Relevance Learning • Experimental Results • Conclusion • References 2
  • 3. Introduction Conventional tag-based social image search methods cannot achieve satisfactory results for two reasons 1) Too much noise in user-provided tags 2) It lacks an optimal ranking strategy Ranking options 1) Time based ranking 2) Interestingness based ranking Most existing Algorithms usually explore visual content and tags separately or sequentially. 3
  • 4. Fig. 1[1]. Schematic illustration of the proposed visual-textual joint relevance learning approach. 4
  • 5. Related works A) Social Image Search 1) Separated Methods: In separated methods, only the textual content or the visual content is employed for tag analysis. 2) Sequential Methods: In sequential methods, the visual content and the tags are sequentially employed. B) Hypergraph Learning Hypergraph has been employed for image retrieval and object recognition. 5
  • 6. Fig. 2[1]. Illustration of different social image search methods. 6
  • 7. Hypergraph Analysis In a simple graph, samples are represented by vertices and an edge links the two related vertices. Hyperedge in a hypergraph is able to link more than two vertices. Vertices represents images and hyperedges are representing tag/visual terms. A hypergraph G = (V,E,w) is composed by a vertex set V, an edge set E, and the weights of the edges w. The hypergraph G can be denoted by a |V| × |E| incidence matrix H with entries defined as h 𝑣, 𝑒 = 1, 𝑖𝑓 𝑣 ∈ 𝑒 0, 𝑖𝑓 𝑣 ∉ 𝑒 7 (1)
  • 8. For a vertex v ∈ V, its vertex degree can be estimated by d(v) = 𝑒∈ 𝐸 𝑤 𝑒 ℎ(𝑣, 𝑒) For a hyperedge e ∈ E, its hyperedge degree can be estimated by δ 𝑒 = v∈V ℎ(𝑣, 𝑒) W denote the diagonal matrix of the hyperedge weights W 𝑖, 𝑗 = 𝑤 𝑖 , 𝑖𝑓 𝑖 = 𝑗 0, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒 8 (4) (3) (2)
  • 9. Learning Regularization framework Arg minf {λRemp( f ) +Ω ( f )} where f is the to-be-learned classification function, Ω( f ) is a regularizer on the hypergraph, Remp( f ) is empirical loss, and λ > 0 is a weighting parameter. 9 (5) (6)
  • 10. In matrix form Ω( f ) = f T Δ f ; Δ=I-Θ Θ=Dv -1/2HWDe -1HTDv -1/2a Dv and De the diagonal matrices of the vertex degree and the hyperedge degree. The loss term is defined as: Remp(f)= 𝑓−𝑦 2 = u∈V f (u) − y (u)) 2 y is an n × 1 initial label vector. 10 (7) (9)
  • 11. Step 1. Hypergraph Construction 1. For social image set X = (x1, x2, . . . , xn) as a vertex in the hypergraph G = (V,E,w). 2. Generate a bag-of-visual-words description fi bow for each image xi, where fi bow (k, 1) = 1 indicates that xi contains the kth visual word. Construct hyperedges by using fi bow. 3. For each image, the tags are ranked and only top min (nl , ni ) tags are left for further processing. 4. Generate a bag-of-textual-words description fi tag for each image xi, where fi tag (k, 1) = 1 indicates that xi contains the kth selected tag. 5. Construct hyperedges by using fi tag. 6. Generate the incidence matrix Hi , the diagonal matrices of the vertex degrees and the hyperedge degrees Dv and De, the initial weights of all hyperedges w respectively. Visual-Textual Relevance Learning Algorithm 11
  • 12. Step 2. Pseudo-Relevant Sample Selection The Flickr Distance is employed to estimate the semantic relevance of an image xi to the query tag tq , and the top K results are selected as the pseudo-relevant images. Step 3. Relevance Learning on Hypergraph Conduct semi-supervised learning on the hypergraph structure. Iteratively learn the to-be-learned relevance score vector f and the weights for hyperedge w. 12
  • 13. Fig. 3[1]. Examples of hyperedge construction. (a) Example of textual hyperedge construction, where three hyperedges are generated by tags “people,” “gun,” and “tank.” (b) Example of visual hyperedge construction, where three hyperedges are generated by three visual words. Fig. 4[1]. Example of the connection between two images. 13
  • 14. Query Seq. HG HG+WE HG+WE(VIS) HG+WE(TAG) Airshow 0.4193 0.5759 0.7183 0.5847 0.6869 Apple 0.2433 0.6975 0.8128 0.8100 0.7875 Aquarium 0.5640 0.8163 0.9346 0.9189 0.9134 Basin 0.2981 0.4911 0.6115 0.6178 0.5946 Beach 0.5986 0.8270 1.0000 0.9949 0.9869 Bird 0.8931 0.9576 0.9653 0.9375 0.9618 Bmw 0.5910 0.6244 0.7265 0.7048 0.6826 Table [1][1]:The NDCG @7 results of different methods. the best result in each row is marked in blue Experimental Results 14 NDCG=Normalised Discounted Cumulative Gain
  • 15. (a) (b) (c) (d) (e) Fig. 5. Top results obtained by different methods for the query apple. (a) Sequential social image relevance learning. (b) Hypergraph-based relevance learning. (c) Hypergraph based relevance learning with hyperedge weight estimation, i.e., the proposed method. (d) Proposed learning method with merely visual information. (e) Proposed learning method with merely tag information. 15
  • 16. Conclusion • In the proposed method, both visual content and tags are used to generate the hyperedges of a hypergraph, and a relevance learning procedure is performed on the hypergraph structure • Experimental results demonstrate that the proposed method achieved better results compared with many base line methods including sequential social image ranking, Hypergraph-based relevance learning, HG-WE (Visual) and HG-WE (Tag). 16
  • 17. References [1]. Y. Gao, M. Wang, Z. J. Zha, J. Shen, X. Li and X. Wu, "Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search," in IEEE Transactions on Image Processing, vol. 22, no. 1, pp. 363-376, Jan. 2013. [2]. M. Wang, K. Yang, X. S. Hua, and H.-J. Zhang, “Toward a relevant and diverse search of social images,” IEEE Trans. Multimedia, vol. 12, no. 8, pp. 829–842, Dec. 2010. [3]. Q. Liu, Y. Huang, and D. Metaxas, “Hypergraph with sampling for image retrieval,” Pattern Recognit., vol. 44, nos. 10–11, pp. 2255–2262, 2011. 17