SlideShare a Scribd company logo
1 of 21
Download to read offline
Mathias Lux
This work is licensed under a Creative
Commons Attribution 3.0 Unported License.
What is LIRE?
• Library for CBIR
• Easy access & instant “success”
• Few loc to index & search
It’s based on Lucene
• Java text retrieval framework
– based on inverted lists

• Top level Apache project

• Extends to Solr
Modular Feature
Architecture
LireFeature as the basic Interface
• Extraction,
• Distance function,

• Serialization (byte[] based)
• toString(), field name, …
Fast Access & Linear
Search
• Efficient coding of serialization
– transformation to byte[]
– run length coding for sparse vectors

• Custom Lucene codec
– Lucene field compression
– update to DocValues in v1.0
Search with sub Linear
T ime Complexity
• Hashing based approach for global features
– Locality sensitive hashing
• bit sampling

– Proximity based hashing

• nearest neighbors as “buckets”,
• cp. work of G. Amato

• Local features supported

– SIFT, SURF, k-means, VLAD
Tools
• Parallel Indexing
– consumer-producer based
– up to the capabilities of the VM / HDD

• Intermediate byte based data format
– small footprint, efficient, relative paths
Extending LIRE
• Implement a global feature
– extraction, distance function, serialization

• Lire takes care of the rest
– Parallel indexing, hashing, search
Using Parts of LIRE
Take what you need …

• Feature implementations

– cp. work of Xinchao Li et al. at Mediaeval 2013

• Image processing

– Canny Edge Detector, SWT (coming soon),

• Tools & code base

– FastMap, Suffix Tree Clustering, …
UCID Data Set

MAP

precision 10

ER

CEDD

0,431

0,420

0,553

CEDD

Color Correlogram

0,586

0,480

0,370

Color Correlogram

Color Layout

0,277

0,285

0,679

Color Layout

Edge Histogram

0,180

0,202

0,813

Edge Histogram

FCTH

0,447

0,415

0,531

FCTH

JCD

0,470

0,435

0,508

JCD

Joint Histogram

0,348

0,313

0,603

Joint Histogram

LBP Opponent Joined

0,266

0,267

0,729

LBP Opponent Joined

Local Binary Patterns (LBP)

0,228

0,221

0,714

Local Binary Patterns (LBP)

Opponent Histogram

0,319

0,309

0,649

Opponent Histogram

PHOG

0,232

0,235

0,725

PHOG

RGB Color Histogram

0,403

0,358

0,550

RGB Color Histogram

Rotation Invariant LBP

0,165

0,174

0,813

Rotation Invariant LBP

Scalable Color

0,172

0,183

0,840

Scalable Color

SPCEDD

0,575

0,487

0,366

SPCEDD

SPLBP

0,264

0,251

0,683

SPLBP

Surf BoVW

0,348

0,313

0,634

Surf BoVW

VLAD-SURF

0,370

0,356

0,603

VLAD-SURF
SIMPLICity Data Set

MAP

precision 10

ER

CEDD

0,513

0,706

0,193

Color Correlogram

0,498

0,740

0,159

Color Layout

0,439

0,612

0,303

Edge Histogram

0,333

0,500

0,401

FCTH

0,499

0,703

0,207

JCD

0,520

0,730

0,183

JCD

Joint Histogram

0,449

0,689

0,197

Joint Histogram

LBP Opponent Joined

0,418

0,569

0,347

LBP Opponent Joined

Local Binary Patterns (LBP)

0,358

0,587

0,295

Local Binary Patterns (LBP)

OpponentHistogram

0,450

0,635

0,270

OpponentHistogram

PHOG

0,365

0,547

0,355

PHOG

RGB Color Histogram

0,450

0,704

0,191

RGB Color Histogram

Rotation Invariant LBP

0,338

0,520

0,375

Rotation Invariant LBP

Scalable Color

0,305

0,470

0,464

Scalable Color

SPCEDD

0,599

0,772

0,144

SPCEDD

SPLBP

0,395

0,556

0,348

SPLBP

SURF BoVW

0,338

0,464

0,475

SURF BoVW

VLAD-SURF

0,365

0,518

0,407

VLAD-SURF

CEDD

Color Correlogram
Color Layout
Edge Histogram

FCTH
Hashing - BitSampling
1,000
0,900

JCD

0,800

CEDD

0,700

FCTH

0,600

ACC

0,500

PHOG

0,400

OPH

0,300

ColHist

0,200

ColLay

0,100

EH
SPCEDD

0,000

0

500

1000

1500

2000

2500

100k images from flickr, 50 results cp. to linear search

3000
Hashing - Proximity
1,000

JCD

0,900

CEDD

0,800
0,700

FCTH

0,600

ACC

0,500

PHOG

0,400

OPHIST

0,300

ColHist

0,200

Collay

0,100

EH

0,000

SPCEDD

0

500

1000

1500

2000

2500

100k images from flickr, 50 results cp. to linear search

3000
Apache Solr Integration
• Motivation:
– Use a search and retrieval server with all its tools

• Objectives:
– indexing & management

– efficient content based image search
– content based ranking of results
Solr Plugin
• Custom Request Handler
– Uses Solr’s request and response framework

– Allows for content based retrieval

• Custom ValueSourceFunction
– Added to text based search queries
– Allows for ranking based on the distance function
Solr Plugin
• Custom type of index field
– DocValue based binary field
– transmission base64 encoded

• Custom Indexer
– XML documents to be uploaded to Solr
SOLR Plugin
• http://demo-itec.uni-

klu.ac.at/liredemo/wipo.html

• Local demo
Future Work
• DocValues based indexing

– make linear search faster

• Proximity hashing

– metric spaces approach
– more accurate

• Release version 1.0

– adding docs & features freeze
Acknowledgements
I’d like to thank Anna-Maria Pasterk, Arthur Li, Arthur Pitman,
Bastian Hösch, Benjamin Sznajder, Christian Penz, Christine

Keim, Christoph Kofler, Dan Hanley, Daniel Pötzinger, Fabrizio

Falchi, Franz Graf, Giuseppe Amato, Glenn Macstravic, James
Charters, Janine Lachner, Katharina Tomanec, Lukas Esterle,

Manuel Oraze, Marian Kogler, Marko Keuschnig, Michael Riegler,
Rodrigo Carvalho Rezende, Roman Divotkey, Roman Kern,
Savvas Chatzichristofis and Sandeep Gupta.
Lecture Book
T hanks for listening …
• Mathias Lux
• mlux@itec.uni-klu.ac.at

More Related Content

Similar to LIRE presentation at the ACM Multimedia Open Source Software Competition 2013

OSPF Summary LSA (Type 3 LSA)
OSPF Summary LSA (Type 3 LSA)OSPF Summary LSA (Type 3 LSA)
OSPF Summary LSA (Type 3 LSA)NetProtocol Xpert
 
Andy Parsons Pivotal June 2011
Andy Parsons Pivotal June 2011Andy Parsons Pivotal June 2011
Andy Parsons Pivotal June 2011Andy Parsons
 
Conceptos básicos. Seminario web 1: Introducción a NoSQL
Conceptos básicos. Seminario web 1: Introducción a NoSQLConceptos básicos. Seminario web 1: Introducción a NoSQL
Conceptos básicos. Seminario web 1: Introducción a NoSQLMongoDB
 
[Spark meetup] Spark Streaming Overview
[Spark meetup] Spark Streaming Overview[Spark meetup] Spark Streaming Overview
[Spark meetup] Spark Streaming OverviewStratio
 
OrientDB - The 2nd generation of (multi-model) NoSQL
OrientDB - The 2nd generation of  (multi-model) NoSQLOrientDB - The 2nd generation of  (multi-model) NoSQL
OrientDB - The 2nd generation of (multi-model) NoSQLRoberto Franchini
 
Bgpcep odl summit 2015
Bgpcep odl summit 2015Bgpcep odl summit 2015
Bgpcep odl summit 2015Giles Heron
 

Similar to LIRE presentation at the ACM Multimedia Open Source Software Competition 2013 (11)

OSPF Summary LSA (Type 3 LSA)
OSPF Summary LSA (Type 3 LSA)OSPF Summary LSA (Type 3 LSA)
OSPF Summary LSA (Type 3 LSA)
 
Andy Parsons Pivotal June 2011
Andy Parsons Pivotal June 2011Andy Parsons Pivotal June 2011
Andy Parsons Pivotal June 2011
 
Mpls technology
Mpls technologyMpls technology
Mpls technology
 
SenseiDB
SenseiDBSenseiDB
SenseiDB
 
Mpls te
Mpls teMpls te
Mpls te
 
Conceptos básicos. Seminario web 1: Introducción a NoSQL
Conceptos básicos. Seminario web 1: Introducción a NoSQLConceptos básicos. Seminario web 1: Introducción a NoSQL
Conceptos básicos. Seminario web 1: Introducción a NoSQL
 
Osp fv3 cs
Osp fv3 csOsp fv3 cs
Osp fv3 cs
 
[Spark meetup] Spark Streaming Overview
[Spark meetup] Spark Streaming Overview[Spark meetup] Spark Streaming Overview
[Spark meetup] Spark Streaming Overview
 
OrientDB - The 2nd generation of (multi-model) NoSQL
OrientDB - The 2nd generation of  (multi-model) NoSQLOrientDB - The 2nd generation of  (multi-model) NoSQL
OrientDB - The 2nd generation of (multi-model) NoSQL
 
Bgpcep odl summit 2015
Bgpcep odl summit 2015Bgpcep odl summit 2015
Bgpcep odl summit 2015
 
Cisco ospf
Cisco ospf Cisco ospf
Cisco ospf
 

More from dermotte

Invited Talk OAGM Workshop Salzburg, May 2015
Invited Talk OAGM Workshop Salzburg, May 2015Invited Talk OAGM Workshop Salzburg, May 2015
Invited Talk OAGM Workshop Salzburg, May 2015dermotte
 
CBMI 2013 Presentation: User Intentions in Multimedia
CBMI 2013 Presentation: User Intentions in MultimediaCBMI 2013 Presentation: User Intentions in Multimedia
CBMI 2013 Presentation: User Intentions in Multimediadermotte
 
Why did you record this video?
Why did you record this video?Why did you record this video?
Why did you record this video?dermotte
 
Content based image retrieval with LIRe
Content based image retrieval with LIReContent based image retrieval with LIRe
Content based image retrieval with LIRedermotte
 
Ohne LIRe keine Bildsuche
Ohne LIRe keine BildsucheOhne LIRe keine Bildsuche
Ohne LIRe keine Bildsuchedermotte
 
Callisto: Content Based Tag Recommendation for Images
Callisto: Content Based Tag Recommendation for ImagesCallisto: Content Based Tag Recommendation for Images
Callisto: Content Based Tag Recommendation for Imagesdermotte
 
User Intentions or "The other end of the camera ..."
User Intentions or "The other end of the camera ..."User Intentions or "The other end of the camera ..."
User Intentions or "The other end of the camera ..."dermotte
 
Visual Information Retrieval
Visual Information RetrievalVisual Information Retrieval
Visual Information Retrievaldermotte
 
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...dermotte
 
Power Laws Popularity And Interestingness
Power Laws Popularity And InterestingnessPower Laws Popularity And Interestingness
Power Laws Popularity And Interestingnessdermotte
 
Aspects of broad folksonomies
Aspects of broad folksonomiesAspects of broad folksonomies
Aspects of broad folksonomiesdermotte
 

More from dermotte (11)

Invited Talk OAGM Workshop Salzburg, May 2015
Invited Talk OAGM Workshop Salzburg, May 2015Invited Talk OAGM Workshop Salzburg, May 2015
Invited Talk OAGM Workshop Salzburg, May 2015
 
CBMI 2013 Presentation: User Intentions in Multimedia
CBMI 2013 Presentation: User Intentions in MultimediaCBMI 2013 Presentation: User Intentions in Multimedia
CBMI 2013 Presentation: User Intentions in Multimedia
 
Why did you record this video?
Why did you record this video?Why did you record this video?
Why did you record this video?
 
Content based image retrieval with LIRe
Content based image retrieval with LIReContent based image retrieval with LIRe
Content based image retrieval with LIRe
 
Ohne LIRe keine Bildsuche
Ohne LIRe keine BildsucheOhne LIRe keine Bildsuche
Ohne LIRe keine Bildsuche
 
Callisto: Content Based Tag Recommendation for Images
Callisto: Content Based Tag Recommendation for ImagesCallisto: Content Based Tag Recommendation for Images
Callisto: Content Based Tag Recommendation for Images
 
User Intentions or "The other end of the camera ..."
User Intentions or "The other end of the camera ..."User Intentions or "The other end of the camera ..."
User Intentions or "The other end of the camera ..."
 
Visual Information Retrieval
Visual Information RetrievalVisual Information Retrieval
Visual Information Retrieval
 
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
 
Power Laws Popularity And Interestingness
Power Laws Popularity And InterestingnessPower Laws Popularity And Interestingness
Power Laws Popularity And Interestingness
 
Aspects of broad folksonomies
Aspects of broad folksonomiesAspects of broad folksonomies
Aspects of broad folksonomies
 

Recently uploaded

Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 

Recently uploaded (20)

Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 

LIRE presentation at the ACM Multimedia Open Source Software Competition 2013

  • 1. Mathias Lux This work is licensed under a Creative Commons Attribution 3.0 Unported License.
  • 2. What is LIRE? • Library for CBIR • Easy access & instant “success” • Few loc to index & search
  • 3. It’s based on Lucene • Java text retrieval framework – based on inverted lists • Top level Apache project • Extends to Solr
  • 4. Modular Feature Architecture LireFeature as the basic Interface • Extraction, • Distance function, • Serialization (byte[] based) • toString(), field name, …
  • 5. Fast Access & Linear Search • Efficient coding of serialization – transformation to byte[] – run length coding for sparse vectors • Custom Lucene codec – Lucene field compression – update to DocValues in v1.0
  • 6. Search with sub Linear T ime Complexity • Hashing based approach for global features – Locality sensitive hashing • bit sampling – Proximity based hashing • nearest neighbors as “buckets”, • cp. work of G. Amato • Local features supported – SIFT, SURF, k-means, VLAD
  • 7. Tools • Parallel Indexing – consumer-producer based – up to the capabilities of the VM / HDD • Intermediate byte based data format – small footprint, efficient, relative paths
  • 8. Extending LIRE • Implement a global feature – extraction, distance function, serialization • Lire takes care of the rest – Parallel indexing, hashing, search
  • 9. Using Parts of LIRE Take what you need … • Feature implementations – cp. work of Xinchao Li et al. at Mediaeval 2013 • Image processing – Canny Edge Detector, SWT (coming soon), • Tools & code base – FastMap, Suffix Tree Clustering, …
  • 10. UCID Data Set MAP precision 10 ER CEDD 0,431 0,420 0,553 CEDD Color Correlogram 0,586 0,480 0,370 Color Correlogram Color Layout 0,277 0,285 0,679 Color Layout Edge Histogram 0,180 0,202 0,813 Edge Histogram FCTH 0,447 0,415 0,531 FCTH JCD 0,470 0,435 0,508 JCD Joint Histogram 0,348 0,313 0,603 Joint Histogram LBP Opponent Joined 0,266 0,267 0,729 LBP Opponent Joined Local Binary Patterns (LBP) 0,228 0,221 0,714 Local Binary Patterns (LBP) Opponent Histogram 0,319 0,309 0,649 Opponent Histogram PHOG 0,232 0,235 0,725 PHOG RGB Color Histogram 0,403 0,358 0,550 RGB Color Histogram Rotation Invariant LBP 0,165 0,174 0,813 Rotation Invariant LBP Scalable Color 0,172 0,183 0,840 Scalable Color SPCEDD 0,575 0,487 0,366 SPCEDD SPLBP 0,264 0,251 0,683 SPLBP Surf BoVW 0,348 0,313 0,634 Surf BoVW VLAD-SURF 0,370 0,356 0,603 VLAD-SURF
  • 11. SIMPLICity Data Set MAP precision 10 ER CEDD 0,513 0,706 0,193 Color Correlogram 0,498 0,740 0,159 Color Layout 0,439 0,612 0,303 Edge Histogram 0,333 0,500 0,401 FCTH 0,499 0,703 0,207 JCD 0,520 0,730 0,183 JCD Joint Histogram 0,449 0,689 0,197 Joint Histogram LBP Opponent Joined 0,418 0,569 0,347 LBP Opponent Joined Local Binary Patterns (LBP) 0,358 0,587 0,295 Local Binary Patterns (LBP) OpponentHistogram 0,450 0,635 0,270 OpponentHistogram PHOG 0,365 0,547 0,355 PHOG RGB Color Histogram 0,450 0,704 0,191 RGB Color Histogram Rotation Invariant LBP 0,338 0,520 0,375 Rotation Invariant LBP Scalable Color 0,305 0,470 0,464 Scalable Color SPCEDD 0,599 0,772 0,144 SPCEDD SPLBP 0,395 0,556 0,348 SPLBP SURF BoVW 0,338 0,464 0,475 SURF BoVW VLAD-SURF 0,365 0,518 0,407 VLAD-SURF CEDD Color Correlogram Color Layout Edge Histogram FCTH
  • 14. Apache Solr Integration • Motivation: – Use a search and retrieval server with all its tools • Objectives: – indexing & management – efficient content based image search – content based ranking of results
  • 15. Solr Plugin • Custom Request Handler – Uses Solr’s request and response framework – Allows for content based retrieval • Custom ValueSourceFunction – Added to text based search queries – Allows for ranking based on the distance function
  • 16. Solr Plugin • Custom type of index field – DocValue based binary field – transmission base64 encoded • Custom Indexer – XML documents to be uploaded to Solr
  • 18. Future Work • DocValues based indexing – make linear search faster • Proximity hashing – metric spaces approach – more accurate • Release version 1.0 – adding docs & features freeze
  • 19. Acknowledgements I’d like to thank Anna-Maria Pasterk, Arthur Li, Arthur Pitman, Bastian Hösch, Benjamin Sznajder, Christian Penz, Christine Keim, Christoph Kofler, Dan Hanley, Daniel Pötzinger, Fabrizio Falchi, Franz Graf, Giuseppe Amato, Glenn Macstravic, James Charters, Janine Lachner, Katharina Tomanec, Lukas Esterle, Manuel Oraze, Marian Kogler, Marko Keuschnig, Michael Riegler, Rodrigo Carvalho Rezende, Roman Divotkey, Roman Kern, Savvas Chatzichristofis and Sandeep Gupta.
  • 21. T hanks for listening … • Mathias Lux • mlux@itec.uni-klu.ac.at