SlideShare a Scribd company logo
1 of 67
Oge Marques
Florida Atlantic University
     Boca Raton, FL - USA
    VIR is a highly interdisciplinary field, but …

                                 Image and       (Multimedia)
                                                                  Information
                                   Video          Database
                                                                    Retrieval
                                 Processing        Systems




                                                   Visual
                              Machine                                 Computer
                              Learning          Information            Vision
                                                  Retrieval



                                                  Visual data
                                                                 Human Visual
                                  Data Mining    modeling and
                                                                  Perception
                                                representation




Klagenfurt - June 2010
    There are many things that I believe…




             … but cannot prove




Klagenfurt - June 2010
The “big mismatch”




Klagenfurt - June 2010
    Part I
              ◦  10 years after the “end of the early years”
                    Where are we now?
             Part II
              ◦  Medical image retrieval
                    Challenges and opportunities
             Part III
              ◦  Where is VIR headed?
                    Advice for young researchers




Klagenfurt - June 2010
    It’s been 10 years since the “end of the early
              years” [Smeulders et al., 2000]




              ◦  Are the challenges from 2000 still relevant?
              ◦  Are the directions and guidelines from 2000 still
                 appropriate?

Klagenfurt - June 2010
    Revisiting the ‘Concluding Remarks’ from
              [Smeulders et al., 2000]:

              ◦  Driving forces
                    “[…] content-based image retrieval (CBIR) will continue
                     to grow in every direction: new audiences, new
                     purposes, new styles of use, new modes of interaction,
                     larger data sets, and new methods to solve the
                     problems.”




Klagenfurt - June 2010
    Yes, we have seen many new audiences, new
              purposes, new styles of use, and new modes
              of interaction emerge.

             Each of these usually requires new methods
              to solve the problems that they bring.

             However, not too many researchers see them
              as a driving force (as they should).



Klagenfurt - June 2010
    Revisiting the ‘Concluding Remarks’ from
              [Smeulders et al., 2000]:

              ◦  Heritage of computer vision
                    “An important obstacle to overcome […] is to realize
                     that image retrieval does not entail solving the general
                     image understanding problem.”




Klagenfurt - June 2010
    I’m afraid I have bad news…
              ◦  Computer vision hasn’t made so much progress
                 during the past 10 years.

              ◦  Some classical problems 

                 (including image 

                 understanding)

                 remain unresolved.

              ◦  Similarly, CBIR from a 

                 pure computer vision

                 perspective didn’t work 

                 too well either.


Klagenfurt - June 2010
    Revisiting the ‘Concluding Remarks’ from
              [Smeulders et al., 2000]:

              ◦  Influence on computer vision
                    “[…] CBIR offers a different look at traditional computer
                     vision problems: large data sets, no reliance on strong
                     segmentation, and revitalized interest in color image
                     processing and invariance.”




Klagenfurt - June 2010
    The adoption of large data sets became standard
              practice in computer vision (see Torralba’s work).
             No reliance on strong segmentation (still
              unresolved)  new areas of research, e.g.,
              automatic ROI extraction and RBIR.
             Color image processing and color descriptors
              became incredibly popular, useful, and (to some
              degree) effective.
              Invariance still a huge problem
              ◦  But it’s cheaper than ever to have multiple views.



Klagenfurt - June 2010
    Revisiting the ‘Concluding Remarks’ from
              [Smeulders et al., 2000]:

              ◦  Similarity and learning
                    “We make a pledge for the importance of human-
                     based similarity rather than general similarity. Also,
                     the connection between image semantics, image data,
                     and query context will have to be made clearer in the
                     future.”
                    “[…] in order to bring semantics to the user, learning is
                     inevitable.”



Klagenfurt - June 2010
    The authors were pointing in the right
              direction (human in the loop, role of context,
              benefits from learning,…)

             However:
              ◦  Similarity is a tough problem to crack and model.
                    Even the understanding of how humans judge image
                     similarity is very limited.
              ◦  Machine learning is almost inevitable…
                    … but sometimes it can be abused.



Klagenfurt - June 2010
    Revisiting the ‘Concluding Remarks’ from
              [Smeulders et al., 2000]:

              ◦  Interaction
                    Better visualization options, more control to the user,
                     ability to provide feedback […]




Klagenfurt - June 2010
    Significant progress on visualization
              interfaces and devices.

             Relevance Feedback: still a very tricky
              tradeoff (effort vs. perceived benefit), but
              more popular than ever (rating, thumbs up/
              down, etc.)




Klagenfurt - June 2010
    Revisiting the ‘Concluding Remarks’ from
              [Smeulders et al., 2000]:

              ◦  Need for databases
                    “The connection between CBIR and database research
                     is likely to increase in the future. […] problems like the
                     definition of suitable query languages, efficient search
                     in high dimensional feature space, search in the
                     presence of changing similarity measures are largely
                     unsolved […]”




Klagenfurt - June 2010
    Very little progress

              ◦  Image search and retrieval has benefited much
                 more from document information retrieval than
                 from database research.




Klagenfurt - June 2010
    Revisiting the ‘Concluding Remarks’ from
              [Smeulders et al., 2000]:

              ◦  The problem of evaluation
                    CBIR could use a reference standard against which new
                     algorithms could be evaluated (similar to TREC in the
                     field of text recognition).
                    “A comprehensive and publicly available collection of
                     images, sorted by class and retrieval purposes,
                     together with a protocol to standardize experimental
                     practices, will be instrumental in the next phase of
                     CBIR.”


Klagenfurt - June 2010
    Significant progress on benchmarks,
              standardized datasets, etc.

              ◦  ImageCLEF
              ◦  Pascal VOC Challenge
              ◦  MSRA dataset
              ◦  Simplicity dataset
              ◦  UCID dataset and ground truth (GT)
              ◦  Accio / SIVAL dataset and GT
              ◦  Caltech 101, Caltech 256
              ◦  LabelMe


Klagenfurt - June 2010
    Revisiting the ‘Concluding Remarks’ from
              [Smeulders et al., 2000]:

              ◦  Semantic gap and other sources
                    “A critical point in the advancement of CBIR is the
                     semantic gap, where the meaning of an image is rarely
                     self-evident. […] One way to resolve the semantic gap
                     comes from sources outside the image by integrating
                     other sources of information about the image in the
                     query.”




Klagenfurt - June 2010
    The semantic gap problem has not been
              solved (and maybe will never be…)

             What are the alternatives?
              1.  Treat visual similarity and semantic relatedness
                  differently
                    Examples: Alipr, Google similarity search, etc.
              2.  Improve both (text-based and visual) search
                  methods independently
              3.  Trust the user
                    CFIR, collaborative filtering, crowdsourcing, games.


Klagenfurt - June 2010
    Challenges
              ◦  We’re entering a new country…
                        How much can we bring?
                        Do we speak the language?
                        Do we know their culture?
                        Do they understand us and where we come from?
             Opportunities
              ◦  They use images (extensively)
              ◦  They have expert knowledge
              ◦  Domains are narrow (almost by definition)
              ◦  Fewer clients, but potentially more $$


Klagenfurt - June 2010
    An overview of the challenges

              ◦  Different terminology
              ◦  Standards (e.g., DICOM)
              ◦  Modality dependencies
              ◦  Equipment dependencies
              ◦  Privacy issues
              ◦  Proprietary data
              ◦  A tough sell?




Klagenfurt - June 2010
    Be prepared for:
              ◦  New acronyms
                    CBMIR (Content-Based Medical Image Retrieval)
                    PACS (Picture Archiving and Communication System)
                    DICOM (Digital Imaging and COmmunication in
                     Medicine)
                    Hospital Information Systems (HIS)
                    Radiological Information Systems (RIS)
              ◦  New phrases
                    Imaging informatics
              ◦  Lots of technical medical terms



Klagenfurt - June 2010
    DICOM (http://medical.nema.org/)
              ◦  Global IT standard, created in 1993, used in
                 virtually all hospitals worldwide.
              ◦  Designed to ensure the interoperability of different
                 systems and manage related workflow.
              ◦  Will be required by all EHR systems that include
                 imaging information as an integral part of the
                 patient record.
              ◦  750+ technical and medical experts participate in
                 20+ active DICOM working groups.
              ◦  Standard is updated 4-5 times per year.
              ◦  Many available tools! (see http://www.idoimaging.com/)

Klagenfurt - June 2010
    The IRMA code [Lehmann et al., 2003]
              ◦  4 axes with 3 to 4 positions, each in
                 {0,...9,a,...,z}, where "0" denotes "unspecified" to
                 determine the end of a path along an axis.
                   Technical code (T) describes the imaging
                    modality
                   Directional code (D) models body orientations
                   Anatomical code (A) refers to the body region
                    examined
                   Biological code (B) describes the biological
                    system examined.


Klagenfurt - June 2010
    The IRMA code [Lehmann et al., 2003]
              ◦  The entire code results in a character string of <14
                 characters (IRMA: TTTT – DDD – AAA – BBB).




           Example: “x-ray, projection radiography,
           analog, high energy – sagittal, left lateral
           decubitus, inspiration – chest, lung –
           respiratory system, lung”




                                   Source: [Lehmann et al., 2003]
Klagenfurt - June 2010
    The IRMA code
             [Lehmann et
             al., 2003]

             ◦  The companion
                tool…




                         Source: [Lehmann et al., 2004]


Klagenfurt - June 2010
    Most current retrieval systems in clinical use rely
              on text keywords such as DICOM header
              information to perform retrieval.
             CBIR has been widely researched in a variety of
              domains and provides an intuitive and expressive
              method for querying visual data using features,
              e.g. color, shape, and texture.
             Current CBIR systems:
              ◦  are not easily integrated into the healthcare
                 environment;
              ◦  have not been widely evaluated using a large dataset;
                 and
              ◦  lack the ability to perform relevance feedback to refine
                 retrieval results.


                               Source: [Hsu et al., 2009]
Klagenfurt - June 2010
    CBMIR is still a relatively
              small dot on the map of
              the medical imaging
              community.




                            Source: Program of SPIE Medical Imaging 2010 Multiconference
Klagenfurt - June 2010
    New gaps!

              ◦  Just when you
                 thought the
                 semantic gap
                 was your only
                 problem…




              Source: [Deserno, Antani, and Long, 2009]


Klagenfurt - June 2010
    USA
              ◦  NIH (National Institutes of Health)
                    NIBIB - National Institute of Biomedical Imaging and
                     Bioengineering
                    NCI - National Cancer Institute
                    NLM – National Libraries of Medicine
              ◦  Several universities and hospitals
             Europe
              ◦  Aachen University (Germany)
              ◦  Geneva University (Switzerland)
             Big companies (Siemens, GE, etc.)

Klagenfurt - June 2010
    IRMA (Image Retrieval in Medical Applications)

              ◦  Aachen University (Germany)
                    http://ganymed.imib.rwth-aachen.de/irma/

              ◦  3 online demos:
                    IRMA Query demo: allows the evaluation of CBIR on several
                     databases.
                    IRMA Extended Query Refinement demo: CBIR from the IRMA
                     database (a subset of 10,000 images).
                    Spine Pathology and Image Retrieval Systems (SPIRS)
                     designed by the NLM/NIH (USA): holds information of
                     ~17,000 spine x-rays.



Klagenfurt - June 2010
    MedGIFT (GNU Image Finding Tool)

              ◦  Geneva University (Switzerland)
                    http://www.sim.hcuge.ch/medgift/

              ◦  Large effort, including projects such as:
                        Talisman (lung image retrieval)
                        Case-based fracture image retrieval system
                        Onco-Media: medical image retrieval + grid computing
                        ImageCLEF: evaluation and validation
                        medSearch



Klagenfurt - June 2010
    WebMIRS

              ◦  NIH / NLM (USA)
                    http://archive.nlm.nih.gov/proj/webmirs/index.php

              ◦  Query by text + navigation by categories

              ◦  Uses datasets and related x-ray images from the
                 National Health and Nutrition Examination Survey
                 (NHANES)




Klagenfurt - June 2010
    SPIRS (Spine Pathology & Image Retrieval
              System): Web-based image retrieval system
              for large biomedical databases
              ◦  NIH / UCLA (USA)
              ◦  Great case study on highly specialized CBMIR




Klagenfurt - June 2010           Source: [Hsu et al., 2009]
    National Biomedical Imaging Archive (NBIA)

              ◦  NCI / NIH (USA)
                    https://imaging.nci.nih.gov/

              ◦  Search based on metadata (DICOM fields)
              ◦  3 search options:
                    Simple
                    Advanced
                    Dynamic




Klagenfurt - June 2010
    ARSS Goldminer

              ◦  American Roentgen Ray Society (USA)
                    http://goldminer.arrs.org/

              ◦  Query by text
              ◦  Results can be filtered by:
                    Modality
                    Age
                    Sex




Klagenfurt - June 2010
    Yottalook Images

              ◦  iVirtuoso (USA)
                    http://www.yottalook.com/

              ◦  Developed and maintained by four radiologists
              ◦  Query by text
              ◦  Claims to use 4 “core technologies”:
                        "natural query analysis”
                        "semantic ontology”
                        “relevance algorithm”
                        a specialized content delivery system that provides
                         high yield content based on the search term.


Klagenfurt - June 2010
    ImageCLEF Medical Image Retrieval 2010
                    http://www.imageclef.org/2010/medical
              ◦  Data set: 77,000 images from articles published in
                 Radiology and Radiographics including text of the
                 captions and link to the html of the full text articles.
              ◦  3 types of tasks:
                    Modality Classification: given an image, return its
                     modality (MR, CT, XR, etc.)
                    Ad-hoc retrieval: classic medical retrieval task, with 3
                     “flavors”: textual, mixed and semantic queries
                    Case-based retrieval: retrieve cases including images
                     that might best suit the provided case description.


Klagenfurt - June 2010
    Better user interfaces, which are responsive,
              highly interactive, and capable of supporting
              relevance feedback.
              ◦  In other words, address the “Performance Gap
                 Category” and the “Usability Gap Category”.




Klagenfurt - June 2010
    New applications of CBMIR, including:
              ◦  Teaching
              ◦  Research
              ◦  Diagnosis
              ◦  PACS and Electronic Patient Records


             CBMIR evaluation using medical experts

             Integration of local and global features



Klagenfurt - June 2010
    New descriptors
              ◦  Example: the Fuzzy Rule Based Compact Composite
                 Descriptor (CCD), which includes global image
                 features capturing both brightness and texture
                 characteristics in a 1D Histogram [Chatzichristofis &
                 Boutalis, 2009]




Klagenfurt - June 2010
    Partial match schemes (see [Hsu et al., 2009])




                              Source: [Hsu et al., 2009]
Klagenfurt - June 2010
    New devices (e.g., iPad)




Klagenfurt - June 2010
    Advice for [young] researchers

              ◦  In this last part, I’ve compiled pieces and bits of
                 advice that I believe might help researchers who are
                 entering the field.

              ◦  They focus on research avenues that I personally
                 consider to be the most promising.




Klagenfurt - June 2010
    LOOK…

              ◦  at yourself (how do you search for images and
                 videos?)

              ◦  around (related areas and how they have grown)

              ◦  at Google (and other major players)




Klagenfurt - June 2010
    Which sites do you use?
              ◦  Why?
             Which search options do you use?
              ◦  What do you do when the returned results aren’t
                 good?
             What is the single most useful feature that
              you wish those sites had?

             What are your intentions and how do you
              express them?


Klagenfurt - June 2010
  Semi-automatic image annotation
           Tag recommendation systems

           Story annotation engines
           Content-based image filtering

           Copyright detection
           Watermark detection
              ◦  and many more




Klagenfurt - June 2010
    Google Similarity Search (VisualRank) [Jing &
              Baluja, 2008]



             Google Goggles (mobile visual search)




Klagenfurt - June 2010
    THINK…

              ◦  mobile devices

              ◦  new devices and services

              ◦  social networks

              ◦  games




Klagenfurt - June 2010
    Google Goggles understands narrow-domain
              search and retrieval




             Several other apps for iPhone, iPad, and
              Android (e.g., kooaba and Fetch!)


Klagenfurt - June 2010
  Flickr (b. 2004)
           YouTube (b. 2005)

           Flip video cameras (b. 2006)
           iPhone (b. 2007)

           iPad (b. 2010)




Klagenfurt - June 2010
    The Web 2.0 has brought about:
              ◦  New data sources
              ◦  New usage patterns
              ◦  New understanding about the users, their needs,
                 habits, preferences
              ◦  New opportunities
              ◦  Lots of metadata!

              ◦  A chance to experience a true paradigm shift
                    Before: image annotation is tedious, labor-intensive,
                     expensive
                    After: image annotation is fun!

Klagenfurt - June 2010
◦    Google Image Labeler

         ◦    Games with a purpose (GWAP):
                The ESP Game
                Squigl
                Matchin




Klagenfurt - June 2010
    UNDERSTAND…

              ◦  human intentions

              ◦  human emotions

              ◦  user’s preferences and needs




Klagenfurt - June 2010
    CREATE…

              ◦  better interfaces

              ◦  better user experience

              ◦  new business opportunities (added value)




Klagenfurt - June 2010
    Image Genius (sponsored by FAU / will
     become startup)
    Fully functional online prototype of a medical image
     retrieval system (MEDIX) with DICOM capabilities
    Unsupervised ROI extraction from an image
     (by Gustavo B. Borba, UTFPR, Brazil)
–    I believe (but cannot prove…) that successful
            VIR solutions will:
            •  combine content-based image retrieval (CBIR) with
               metadata (high-level semantic-based image
               retrieval)
            •  only be truly successful in narrow domains
            •  include the user in the loop
                –  Relevance Feedback (RF)
                –  Collaborative efforts (tagging, rating, annotating)
            •  provide friendly, intuitive interfaces
            •  incorporate results and insights from cognitive
               science, particularly human visual attention,
               perception, and memory


Klagenfurt - June 2010
    “Image search and retrieval” is not a problem,
            but rather a collection of related problems that
            look like one.

           There is a great need for good solutions to
            specific problems.

           10 years after “the end of the early years”,
            research in visual information retrieval still has
            many open problems, challenges, and
            opportunities.

Klagenfurt - June 2010
Questions?




                         omarques@fau.edu

Klagenfurt - June 2010

More Related Content

What's hot

Lecture 1 computer vision introduction
Lecture 1 computer vision introductionLecture 1 computer vision introduction
Lecture 1 computer vision introductioncairo university
 
SSII2021 [SS2] Deepfake Generation and Detection – An Overview (ディープフェイクの生成と検出)
SSII2021 [SS2] Deepfake Generation and Detection – An Overview (ディープフェイクの生成と検出)SSII2021 [SS2] Deepfake Generation and Detection – An Overview (ディープフェイクの生成と検出)
SSII2021 [SS2] Deepfake Generation and Detection – An Overview (ディープフェイクの生成と検出)SSII
 
Practical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DLPractical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DLAlbert Y. C. Chen
 
Elegant Resume
Elegant ResumeElegant Resume
Elegant Resumebutest
 
Multimedia Information Retrieval: What is it, and why isn't ...
Multimedia Information Retrieval: What is it, and why isn't ...Multimedia Information Retrieval: What is it, and why isn't ...
Multimedia Information Retrieval: What is it, and why isn't ...webhostingguy
 
Information Architecture Course Part 2 - Spring 2013 - Class 1
Information Architecture Course Part 2 - Spring 2013 - Class 1Information Architecture Course Part 2 - Spring 2013 - Class 1
Information Architecture Course Part 2 - Spring 2013 - Class 1Keith Schengili-Roberts
 
Information Architecture - Part 1 - Spring 2013 - Class 1
Information Architecture - Part 1 - Spring 2013 - Class 1Information Architecture - Part 1 - Spring 2013 - Class 1
Information Architecture - Part 1 - Spring 2013 - Class 1Keith Schengili-Roberts
 

What's hot (8)

Lecture 1 computer vision introduction
Lecture 1 computer vision introductionLecture 1 computer vision introduction
Lecture 1 computer vision introduction
 
SSII2021 [SS2] Deepfake Generation and Detection – An Overview (ディープフェイクの生成と検出)
SSII2021 [SS2] Deepfake Generation and Detection – An Overview (ディープフェイクの生成と検出)SSII2021 [SS2] Deepfake Generation and Detection – An Overview (ディープフェイクの生成と検出)
SSII2021 [SS2] Deepfake Generation and Detection – An Overview (ディープフェイクの生成と検出)
 
Practical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DLPractical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DL
 
16 ijcse-01237
16 ijcse-0123716 ijcse-01237
16 ijcse-01237
 
Elegant Resume
Elegant ResumeElegant Resume
Elegant Resume
 
Multimedia Information Retrieval: What is it, and why isn't ...
Multimedia Information Retrieval: What is it, and why isn't ...Multimedia Information Retrieval: What is it, and why isn't ...
Multimedia Information Retrieval: What is it, and why isn't ...
 
Information Architecture Course Part 2 - Spring 2013 - Class 1
Information Architecture Course Part 2 - Spring 2013 - Class 1Information Architecture Course Part 2 - Spring 2013 - Class 1
Information Architecture Course Part 2 - Spring 2013 - Class 1
 
Information Architecture - Part 1 - Spring 2013 - Class 1
Information Architecture - Part 1 - Spring 2013 - Class 1Information Architecture - Part 1 - Spring 2013 - Class 1
Information Architecture - Part 1 - Spring 2013 - Class 1
 

Viewers also liked

Image retrieval: challenges and opportunities
Image retrieval: challenges and opportunitiesImage retrieval: challenges and opportunities
Image retrieval: challenges and opportunitiesOge Marques
 
Multimodal Analysis for Bridging Semantic Gap with Biologically Inspired Algo...
Multimodal Analysis for Bridging Semantic Gap with Biologically Inspired Algo...Multimodal Analysis for Bridging Semantic Gap with Biologically Inspired Algo...
Multimodal Analysis for Bridging Semantic Gap with Biologically Inspired Algo...techkrish
 
A Comparative Study of Content Based Image Retrieval Trends and Approaches
A Comparative Study of Content Based Image Retrieval Trends and ApproachesA Comparative Study of Content Based Image Retrieval Trends and Approaches
A Comparative Study of Content Based Image Retrieval Trends and ApproachesCSCJournals
 
UNSUPERVISED VISUAL HASHING WITH SEMANTIC ASSISTANT FOR CONTENT-BASED IMAGE R...
UNSUPERVISED VISUAL HASHING WITH SEMANTIC ASSISTANT FOR CONTENT-BASED IMAGE R...UNSUPERVISED VISUAL HASHING WITH SEMANTIC ASSISTANT FOR CONTENT-BASED IMAGE R...
UNSUPERVISED VISUAL HASHING WITH SEMANTIC ASSISTANT FOR CONTENT-BASED IMAGE R...Nexgen Technology
 
The Real Problem of Bridging the Multimedia “Semantic Gap”
The Real Problem of Bridging the Multimedia “Semantic Gap” The Real Problem of Bridging the Multimedia “Semantic Gap”
The Real Problem of Bridging the Multimedia “Semantic Gap” jrs21
 
Cbir final ppt
Cbir final pptCbir final ppt
Cbir final pptrinki nag
 
Content based image retrieval
Content based image retrievalContent based image retrieval
Content based image retrievalrubaiyat11
 
Literature Review on Content Based Image Retrieval
Literature Review on Content Based Image RetrievalLiterature Review on Content Based Image Retrieval
Literature Review on Content Based Image RetrievalUpekha Vandebona
 
Content Based Image and Video Retrieval Algorithm
Content Based Image and Video Retrieval AlgorithmContent Based Image and Video Retrieval Algorithm
Content Based Image and Video Retrieval AlgorithmAkshit Bum
 
Content Based Image Retrieval
Content Based Image Retrieval Content Based Image Retrieval
Content Based Image Retrieval Swati Chauhan
 
Presentatie Worldcafe 16 November 2010 Bo
Presentatie Worldcafe 16 November 2010 BoPresentatie Worldcafe 16 November 2010 Bo
Presentatie Worldcafe 16 November 2010 BoSietze Jan Kamstra
 
Coordination And Integration Of Federal
Coordination And Integration Of FederalCoordination And Integration Of Federal
Coordination And Integration Of FederalKathyRees
 
Why Do Bad Things Happen To Gods People
Why Do Bad Things Happen To Gods PeopleWhy Do Bad Things Happen To Gods People
Why Do Bad Things Happen To Gods PeopleEric Olson
 

Viewers also liked (20)

Image retrieval: challenges and opportunities
Image retrieval: challenges and opportunitiesImage retrieval: challenges and opportunities
Image retrieval: challenges and opportunities
 
Multimodal Analysis for Bridging Semantic Gap with Biologically Inspired Algo...
Multimodal Analysis for Bridging Semantic Gap with Biologically Inspired Algo...Multimodal Analysis for Bridging Semantic Gap with Biologically Inspired Algo...
Multimodal Analysis for Bridging Semantic Gap with Biologically Inspired Algo...
 
A Comparative Study of Content Based Image Retrieval Trends and Approaches
A Comparative Study of Content Based Image Retrieval Trends and ApproachesA Comparative Study of Content Based Image Retrieval Trends and Approaches
A Comparative Study of Content Based Image Retrieval Trends and Approaches
 
UNSUPERVISED VISUAL HASHING WITH SEMANTIC ASSISTANT FOR CONTENT-BASED IMAGE R...
UNSUPERVISED VISUAL HASHING WITH SEMANTIC ASSISTANT FOR CONTENT-BASED IMAGE R...UNSUPERVISED VISUAL HASHING WITH SEMANTIC ASSISTANT FOR CONTENT-BASED IMAGE R...
UNSUPERVISED VISUAL HASHING WITH SEMANTIC ASSISTANT FOR CONTENT-BASED IMAGE R...
 
The Real Problem of Bridging the Multimedia “Semantic Gap”
The Real Problem of Bridging the Multimedia “Semantic Gap” The Real Problem of Bridging the Multimedia “Semantic Gap”
The Real Problem of Bridging the Multimedia “Semantic Gap”
 
Cbir final ppt
Cbir final pptCbir final ppt
Cbir final ppt
 
Content based image retrieval
Content based image retrievalContent based image retrieval
Content based image retrieval
 
Literature Review on Content Based Image Retrieval
Literature Review on Content Based Image RetrievalLiterature Review on Content Based Image Retrieval
Literature Review on Content Based Image Retrieval
 
Visual Search
Visual SearchVisual Search
Visual Search
 
CBIR
CBIRCBIR
CBIR
 
Content Based Image and Video Retrieval Algorithm
Content Based Image and Video Retrieval AlgorithmContent Based Image and Video Retrieval Algorithm
Content Based Image and Video Retrieval Algorithm
 
Content-based Image Retrieval
Content-based Image RetrievalContent-based Image Retrieval
Content-based Image Retrieval
 
CBIR
CBIRCBIR
CBIR
 
Content Based Image Retrieval
Content Based Image Retrieval Content Based Image Retrieval
Content Based Image Retrieval
 
Presentatie Worldcafe 16 November 2010 Bo
Presentatie Worldcafe 16 November 2010 BoPresentatie Worldcafe 16 November 2010 Bo
Presentatie Worldcafe 16 November 2010 Bo
 
Coordination And Integration Of Federal
Coordination And Integration Of FederalCoordination And Integration Of Federal
Coordination And Integration Of Federal
 
Why Do Bad Things Happen To Gods People
Why Do Bad Things Happen To Gods PeopleWhy Do Bad Things Happen To Gods People
Why Do Bad Things Happen To Gods People
 
EPA DROE Email 7.11.03
EPA DROE Email 7.11.03EPA DROE Email 7.11.03
EPA DROE Email 7.11.03
 
IBSC Ci
IBSC CiIBSC Ci
IBSC Ci
 
Linkedin
LinkedinLinkedin
Linkedin
 

Similar to Recent advances in visual information retrieval marques klu june 2010

Ipres 2011 The Costs and Economics of Preservation
Ipres 2011 The Costs and Economics of PreservationIpres 2011 The Costs and Economics of Preservation
Ipres 2011 The Costs and Economics of Preservationneilgrindley
 
A Literature Survey on Image Linguistic Visual Question Answering
A Literature Survey on Image Linguistic Visual Question AnsweringA Literature Survey on Image Linguistic Visual Question Answering
A Literature Survey on Image Linguistic Visual Question AnsweringIRJET Journal
 
Interactive Video Search: Where is the User in the Age of Deep Learning?
Interactive Video Search: Where is the User in the Age of Deep Learning?Interactive Video Search: Where is the User in the Age of Deep Learning?
Interactive Video Search: Where is the User in the Age of Deep Learning?klschoef
 
Qo E E2 E5 User Centric Approach Katrien De Moor
Qo E E2 E5   User Centric Approach   Katrien De MoorQo E E2 E5   User Centric Approach   Katrien De Moor
Qo E E2 E5 User Centric Approach Katrien De Moorimec.archive
 
ANALYSIS OF LUNG NODULE DETECTION AND STAGE CLASSIFICATION USING FASTER RCNN ...
ANALYSIS OF LUNG NODULE DETECTION AND STAGE CLASSIFICATION USING FASTER RCNN ...ANALYSIS OF LUNG NODULE DETECTION AND STAGE CLASSIFICATION USING FASTER RCNN ...
ANALYSIS OF LUNG NODULE DETECTION AND STAGE CLASSIFICATION USING FASTER RCNN ...IRJET Journal
 
Towards a System Support of Collaborative Knowledge Work
Towards a System Support of Collaborative Knowledge WorkTowards a System Support of Collaborative Knowledge Work
Towards a System Support of Collaborative Knowledge Worknmundbrod
 
Btp report linu&rupam
Btp report linu&rupamBtp report linu&rupam
Btp report linu&rupamLinu George
 
JISC Observatory: Horizon Scanning for Higher & Further Education
JISC Observatory: Horizon Scanning for Higher & Further EducationJISC Observatory: Horizon Scanning for Higher & Further Education
JISC Observatory: Horizon Scanning for Higher & Further EducationThom Bunting
 
Brief History of Visual Representation Learning
Brief History of Visual Representation LearningBrief History of Visual Representation Learning
Brief History of Visual Representation LearningSangwoo Mo
 
Delineating Cancer Genomics through Data Visualization
Delineating Cancer Genomics through  Data VisualizationDelineating Cancer Genomics through  Data Visualization
Delineating Cancer Genomics through Data VisualizationRupam Das
 
The Opportunities and Challenges of Putting the Latest Computer Vision and De...
The Opportunities and Challenges of Putting the Latest Computer Vision and De...The Opportunities and Challenges of Putting the Latest Computer Vision and De...
The Opportunities and Challenges of Putting the Latest Computer Vision and De...Albert Y. C. Chen
 
IRJET- Navigation and Camera Reading System for Visually Impaired
IRJET- Navigation and Camera Reading System for Visually ImpairedIRJET- Navigation and Camera Reading System for Visually Impaired
IRJET- Navigation and Camera Reading System for Visually ImpairedIRJET Journal
 
Factors Affecting How We Work
Factors Affecting How We WorkFactors Affecting How We Work
Factors Affecting How We WorkAlan Lepofsky
 
JISC Observatory: Horizon Scanning for Higher and Further Education
JISC Observatory: Horizon Scanning for Higher and Further EducationJISC Observatory: Horizon Scanning for Higher and Further Education
JISC Observatory: Horizon Scanning for Higher and Further EducationThom Bunting
 

Similar to Recent advances in visual information retrieval marques klu june 2010 (20)

Scientific visualization
Scientific visualizationScientific visualization
Scientific visualization
 
[Nux]05 ux
[Nux]05 ux[Nux]05 ux
[Nux]05 ux
 
Ipres 2011 The Costs and Economics of Preservation
Ipres 2011 The Costs and Economics of PreservationIpres 2011 The Costs and Economics of Preservation
Ipres 2011 The Costs and Economics of Preservation
 
A Literature Survey on Image Linguistic Visual Question Answering
A Literature Survey on Image Linguistic Visual Question AnsweringA Literature Survey on Image Linguistic Visual Question Answering
A Literature Survey on Image Linguistic Visual Question Answering
 
Interactive Video Search: Where is the User in the Age of Deep Learning?
Interactive Video Search: Where is the User in the Age of Deep Learning?Interactive Video Search: Where is the User in the Age of Deep Learning?
Interactive Video Search: Where is the User in the Age of Deep Learning?
 
STI Summit 2011 - Visual analytics and linked data
STI Summit 2011 - Visual analytics and linked dataSTI Summit 2011 - Visual analytics and linked data
STI Summit 2011 - Visual analytics and linked data
 
Qo E E2 E5 User Centric Approach Katrien De Moor
Qo E E2 E5   User Centric Approach   Katrien De MoorQo E E2 E5   User Centric Approach   Katrien De Moor
Qo E E2 E5 User Centric Approach Katrien De Moor
 
40120140501006
4012014050100640120140501006
40120140501006
 
Parents
ParentsParents
Parents
 
ANALYSIS OF LUNG NODULE DETECTION AND STAGE CLASSIFICATION USING FASTER RCNN ...
ANALYSIS OF LUNG NODULE DETECTION AND STAGE CLASSIFICATION USING FASTER RCNN ...ANALYSIS OF LUNG NODULE DETECTION AND STAGE CLASSIFICATION USING FASTER RCNN ...
ANALYSIS OF LUNG NODULE DETECTION AND STAGE CLASSIFICATION USING FASTER RCNN ...
 
Towards a System Support of Collaborative Knowledge Work
Towards a System Support of Collaborative Knowledge WorkTowards a System Support of Collaborative Knowledge Work
Towards a System Support of Collaborative Knowledge Work
 
Btp report linu&rupam
Btp report linu&rupamBtp report linu&rupam
Btp report linu&rupam
 
JISC Observatory: Horizon Scanning for Higher & Further Education
JISC Observatory: Horizon Scanning for Higher & Further EducationJISC Observatory: Horizon Scanning for Higher & Further Education
JISC Observatory: Horizon Scanning for Higher & Further Education
 
Brief History of Visual Representation Learning
Brief History of Visual Representation LearningBrief History of Visual Representation Learning
Brief History of Visual Representation Learning
 
Delineating Cancer Genomics through Data Visualization
Delineating Cancer Genomics through  Data VisualizationDelineating Cancer Genomics through  Data Visualization
Delineating Cancer Genomics through Data Visualization
 
The Opportunities and Challenges of Putting the Latest Computer Vision and De...
The Opportunities and Challenges of Putting the Latest Computer Vision and De...The Opportunities and Challenges of Putting the Latest Computer Vision and De...
The Opportunities and Challenges of Putting the Latest Computer Vision and De...
 
Promise notes
Promise notesPromise notes
Promise notes
 
IRJET- Navigation and Camera Reading System for Visually Impaired
IRJET- Navigation and Camera Reading System for Visually ImpairedIRJET- Navigation and Camera Reading System for Visually Impaired
IRJET- Navigation and Camera Reading System for Visually Impaired
 
Factors Affecting How We Work
Factors Affecting How We WorkFactors Affecting How We Work
Factors Affecting How We Work
 
JISC Observatory: Horizon Scanning for Higher and Further Education
JISC Observatory: Horizon Scanning for Higher and Further EducationJISC Observatory: Horizon Scanning for Higher and Further Education
JISC Observatory: Horizon Scanning for Higher and Further Education
 

Recently uploaded

Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 

Recently uploaded (20)

Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 

Recent advances in visual information retrieval marques klu june 2010

  • 1. Oge Marques Florida Atlantic University Boca Raton, FL - USA
  • 2.   VIR is a highly interdisciplinary field, but … Image and (Multimedia) Information Video Database Retrieval Processing Systems Visual Machine Computer Learning Information Vision Retrieval Visual data Human Visual Data Mining modeling and Perception representation Klagenfurt - June 2010
  • 3.   There are many things that I believe…   … but cannot prove Klagenfurt - June 2010
  • 5.   Part I ◦  10 years after the “end of the early years”   Where are we now?   Part II ◦  Medical image retrieval   Challenges and opportunities   Part III ◦  Where is VIR headed?   Advice for young researchers Klagenfurt - June 2010
  • 6.
  • 7.   It’s been 10 years since the “end of the early years” [Smeulders et al., 2000] ◦  Are the challenges from 2000 still relevant? ◦  Are the directions and guidelines from 2000 still appropriate? Klagenfurt - June 2010
  • 8.   Revisiting the ‘Concluding Remarks’ from [Smeulders et al., 2000]: ◦  Driving forces   “[…] content-based image retrieval (CBIR) will continue to grow in every direction: new audiences, new purposes, new styles of use, new modes of interaction, larger data sets, and new methods to solve the problems.” Klagenfurt - June 2010
  • 9.   Yes, we have seen many new audiences, new purposes, new styles of use, and new modes of interaction emerge.   Each of these usually requires new methods to solve the problems that they bring.   However, not too many researchers see them as a driving force (as they should). Klagenfurt - June 2010
  • 10.   Revisiting the ‘Concluding Remarks’ from [Smeulders et al., 2000]: ◦  Heritage of computer vision   “An important obstacle to overcome […] is to realize that image retrieval does not entail solving the general image understanding problem.” Klagenfurt - June 2010
  • 11.   I’m afraid I have bad news… ◦  Computer vision hasn’t made so much progress during the past 10 years. ◦  Some classical problems 
 (including image 
 understanding)
 remain unresolved. ◦  Similarly, CBIR from a 
 pure computer vision
 perspective didn’t work 
 too well either. Klagenfurt - June 2010
  • 12.   Revisiting the ‘Concluding Remarks’ from [Smeulders et al., 2000]: ◦  Influence on computer vision   “[…] CBIR offers a different look at traditional computer vision problems: large data sets, no reliance on strong segmentation, and revitalized interest in color image processing and invariance.” Klagenfurt - June 2010
  • 13.   The adoption of large data sets became standard practice in computer vision (see Torralba’s work).   No reliance on strong segmentation (still unresolved)  new areas of research, e.g., automatic ROI extraction and RBIR.   Color image processing and color descriptors became incredibly popular, useful, and (to some degree) effective.   Invariance still a huge problem ◦  But it’s cheaper than ever to have multiple views. Klagenfurt - June 2010
  • 14.   Revisiting the ‘Concluding Remarks’ from [Smeulders et al., 2000]: ◦  Similarity and learning   “We make a pledge for the importance of human- based similarity rather than general similarity. Also, the connection between image semantics, image data, and query context will have to be made clearer in the future.”   “[…] in order to bring semantics to the user, learning is inevitable.” Klagenfurt - June 2010
  • 15.   The authors were pointing in the right direction (human in the loop, role of context, benefits from learning,…)   However: ◦  Similarity is a tough problem to crack and model.   Even the understanding of how humans judge image similarity is very limited. ◦  Machine learning is almost inevitable…   … but sometimes it can be abused. Klagenfurt - June 2010
  • 16.   Revisiting the ‘Concluding Remarks’ from [Smeulders et al., 2000]: ◦  Interaction   Better visualization options, more control to the user, ability to provide feedback […] Klagenfurt - June 2010
  • 17.   Significant progress on visualization interfaces and devices.   Relevance Feedback: still a very tricky tradeoff (effort vs. perceived benefit), but more popular than ever (rating, thumbs up/ down, etc.) Klagenfurt - June 2010
  • 18.   Revisiting the ‘Concluding Remarks’ from [Smeulders et al., 2000]: ◦  Need for databases   “The connection between CBIR and database research is likely to increase in the future. […] problems like the definition of suitable query languages, efficient search in high dimensional feature space, search in the presence of changing similarity measures are largely unsolved […]” Klagenfurt - June 2010
  • 19.   Very little progress ◦  Image search and retrieval has benefited much more from document information retrieval than from database research. Klagenfurt - June 2010
  • 20.   Revisiting the ‘Concluding Remarks’ from [Smeulders et al., 2000]: ◦  The problem of evaluation   CBIR could use a reference standard against which new algorithms could be evaluated (similar to TREC in the field of text recognition).   “A comprehensive and publicly available collection of images, sorted by class and retrieval purposes, together with a protocol to standardize experimental practices, will be instrumental in the next phase of CBIR.” Klagenfurt - June 2010
  • 21.   Significant progress on benchmarks, standardized datasets, etc. ◦  ImageCLEF ◦  Pascal VOC Challenge ◦  MSRA dataset ◦  Simplicity dataset ◦  UCID dataset and ground truth (GT) ◦  Accio / SIVAL dataset and GT ◦  Caltech 101, Caltech 256 ◦  LabelMe Klagenfurt - June 2010
  • 22.   Revisiting the ‘Concluding Remarks’ from [Smeulders et al., 2000]: ◦  Semantic gap and other sources   “A critical point in the advancement of CBIR is the semantic gap, where the meaning of an image is rarely self-evident. […] One way to resolve the semantic gap comes from sources outside the image by integrating other sources of information about the image in the query.” Klagenfurt - June 2010
  • 23.   The semantic gap problem has not been solved (and maybe will never be…)   What are the alternatives? 1.  Treat visual similarity and semantic relatedness differently   Examples: Alipr, Google similarity search, etc. 2.  Improve both (text-based and visual) search methods independently 3.  Trust the user   CFIR, collaborative filtering, crowdsourcing, games. Klagenfurt - June 2010
  • 24.
  • 25.   Challenges ◦  We’re entering a new country…   How much can we bring?   Do we speak the language?   Do we know their culture?   Do they understand us and where we come from?   Opportunities ◦  They use images (extensively) ◦  They have expert knowledge ◦  Domains are narrow (almost by definition) ◦  Fewer clients, but potentially more $$ Klagenfurt - June 2010
  • 26.   An overview of the challenges ◦  Different terminology ◦  Standards (e.g., DICOM) ◦  Modality dependencies ◦  Equipment dependencies ◦  Privacy issues ◦  Proprietary data ◦  A tough sell? Klagenfurt - June 2010
  • 27.   Be prepared for: ◦  New acronyms   CBMIR (Content-Based Medical Image Retrieval)   PACS (Picture Archiving and Communication System)   DICOM (Digital Imaging and COmmunication in Medicine)   Hospital Information Systems (HIS)   Radiological Information Systems (RIS) ◦  New phrases   Imaging informatics ◦  Lots of technical medical terms Klagenfurt - June 2010
  • 28.   DICOM (http://medical.nema.org/) ◦  Global IT standard, created in 1993, used in virtually all hospitals worldwide. ◦  Designed to ensure the interoperability of different systems and manage related workflow. ◦  Will be required by all EHR systems that include imaging information as an integral part of the patient record. ◦  750+ technical and medical experts participate in 20+ active DICOM working groups. ◦  Standard is updated 4-5 times per year. ◦  Many available tools! (see http://www.idoimaging.com/) Klagenfurt - June 2010
  • 29.   The IRMA code [Lehmann et al., 2003] ◦  4 axes with 3 to 4 positions, each in {0,...9,a,...,z}, where "0" denotes "unspecified" to determine the end of a path along an axis.   Technical code (T) describes the imaging modality   Directional code (D) models body orientations   Anatomical code (A) refers to the body region examined   Biological code (B) describes the biological system examined. Klagenfurt - June 2010
  • 30.   The IRMA code [Lehmann et al., 2003] ◦  The entire code results in a character string of <14 characters (IRMA: TTTT – DDD – AAA – BBB). Example: “x-ray, projection radiography, analog, high energy – sagittal, left lateral decubitus, inspiration – chest, lung – respiratory system, lung” Source: [Lehmann et al., 2003] Klagenfurt - June 2010
  • 31.   The IRMA code [Lehmann et al., 2003] ◦  The companion tool… Source: [Lehmann et al., 2004] Klagenfurt - June 2010
  • 32.   Most current retrieval systems in clinical use rely on text keywords such as DICOM header information to perform retrieval.   CBIR has been widely researched in a variety of domains and provides an intuitive and expressive method for querying visual data using features, e.g. color, shape, and texture.   Current CBIR systems: ◦  are not easily integrated into the healthcare environment; ◦  have not been widely evaluated using a large dataset; and ◦  lack the ability to perform relevance feedback to refine retrieval results. Source: [Hsu et al., 2009] Klagenfurt - June 2010
  • 33.   CBMIR is still a relatively small dot on the map of the medical imaging community. Source: Program of SPIE Medical Imaging 2010 Multiconference Klagenfurt - June 2010
  • 34.   New gaps! ◦  Just when you thought the semantic gap was your only problem… Source: [Deserno, Antani, and Long, 2009] Klagenfurt - June 2010
  • 35.   USA ◦  NIH (National Institutes of Health)   NIBIB - National Institute of Biomedical Imaging and Bioengineering   NCI - National Cancer Institute   NLM – National Libraries of Medicine ◦  Several universities and hospitals   Europe ◦  Aachen University (Germany) ◦  Geneva University (Switzerland)   Big companies (Siemens, GE, etc.) Klagenfurt - June 2010
  • 36.   IRMA (Image Retrieval in Medical Applications) ◦  Aachen University (Germany)   http://ganymed.imib.rwth-aachen.de/irma/ ◦  3 online demos:   IRMA Query demo: allows the evaluation of CBIR on several databases.   IRMA Extended Query Refinement demo: CBIR from the IRMA database (a subset of 10,000 images).   Spine Pathology and Image Retrieval Systems (SPIRS) designed by the NLM/NIH (USA): holds information of ~17,000 spine x-rays. Klagenfurt - June 2010
  • 37.   MedGIFT (GNU Image Finding Tool) ◦  Geneva University (Switzerland)   http://www.sim.hcuge.ch/medgift/ ◦  Large effort, including projects such as:   Talisman (lung image retrieval)   Case-based fracture image retrieval system   Onco-Media: medical image retrieval + grid computing   ImageCLEF: evaluation and validation   medSearch Klagenfurt - June 2010
  • 38.   WebMIRS ◦  NIH / NLM (USA)   http://archive.nlm.nih.gov/proj/webmirs/index.php ◦  Query by text + navigation by categories ◦  Uses datasets and related x-ray images from the National Health and Nutrition Examination Survey (NHANES) Klagenfurt - June 2010
  • 39.   SPIRS (Spine Pathology & Image Retrieval System): Web-based image retrieval system for large biomedical databases ◦  NIH / UCLA (USA) ◦  Great case study on highly specialized CBMIR Klagenfurt - June 2010 Source: [Hsu et al., 2009]
  • 40.   National Biomedical Imaging Archive (NBIA) ◦  NCI / NIH (USA)   https://imaging.nci.nih.gov/ ◦  Search based on metadata (DICOM fields) ◦  3 search options:   Simple   Advanced   Dynamic Klagenfurt - June 2010
  • 41.   ARSS Goldminer ◦  American Roentgen Ray Society (USA)   http://goldminer.arrs.org/ ◦  Query by text ◦  Results can be filtered by:   Modality   Age   Sex Klagenfurt - June 2010
  • 42.   Yottalook Images ◦  iVirtuoso (USA)   http://www.yottalook.com/ ◦  Developed and maintained by four radiologists ◦  Query by text ◦  Claims to use 4 “core technologies”:   "natural query analysis”   "semantic ontology”   “relevance algorithm”   a specialized content delivery system that provides high yield content based on the search term. Klagenfurt - June 2010
  • 43.   ImageCLEF Medical Image Retrieval 2010   http://www.imageclef.org/2010/medical ◦  Data set: 77,000 images from articles published in Radiology and Radiographics including text of the captions and link to the html of the full text articles. ◦  3 types of tasks:   Modality Classification: given an image, return its modality (MR, CT, XR, etc.)   Ad-hoc retrieval: classic medical retrieval task, with 3 “flavors”: textual, mixed and semantic queries   Case-based retrieval: retrieve cases including images that might best suit the provided case description. Klagenfurt - June 2010
  • 44.   Better user interfaces, which are responsive, highly interactive, and capable of supporting relevance feedback. ◦  In other words, address the “Performance Gap Category” and the “Usability Gap Category”. Klagenfurt - June 2010
  • 45.   New applications of CBMIR, including: ◦  Teaching ◦  Research ◦  Diagnosis ◦  PACS and Electronic Patient Records   CBMIR evaluation using medical experts   Integration of local and global features Klagenfurt - June 2010
  • 46.   New descriptors ◦  Example: the Fuzzy Rule Based Compact Composite Descriptor (CCD), which includes global image features capturing both brightness and texture characteristics in a 1D Histogram [Chatzichristofis & Boutalis, 2009] Klagenfurt - June 2010
  • 47.   Partial match schemes (see [Hsu et al., 2009]) Source: [Hsu et al., 2009] Klagenfurt - June 2010
  • 48.   New devices (e.g., iPad) Klagenfurt - June 2010
  • 49.
  • 50.   Advice for [young] researchers ◦  In this last part, I’ve compiled pieces and bits of advice that I believe might help researchers who are entering the field. ◦  They focus on research avenues that I personally consider to be the most promising. Klagenfurt - June 2010
  • 51.   LOOK… ◦  at yourself (how do you search for images and videos?) ◦  around (related areas and how they have grown) ◦  at Google (and other major players) Klagenfurt - June 2010
  • 52.   Which sites do you use? ◦  Why?   Which search options do you use? ◦  What do you do when the returned results aren’t good?   What is the single most useful feature that you wish those sites had?   What are your intentions and how do you express them? Klagenfurt - June 2010
  • 53.   Semi-automatic image annotation   Tag recommendation systems   Story annotation engines   Content-based image filtering   Copyright detection   Watermark detection ◦  and many more Klagenfurt - June 2010
  • 54.   Google Similarity Search (VisualRank) [Jing & Baluja, 2008]   Google Goggles (mobile visual search) Klagenfurt - June 2010
  • 55.   THINK… ◦  mobile devices ◦  new devices and services ◦  social networks ◦  games Klagenfurt - June 2010
  • 56.   Google Goggles understands narrow-domain search and retrieval   Several other apps for iPhone, iPad, and Android (e.g., kooaba and Fetch!) Klagenfurt - June 2010
  • 57.   Flickr (b. 2004)   YouTube (b. 2005)   Flip video cameras (b. 2006)   iPhone (b. 2007)   iPad (b. 2010) Klagenfurt - June 2010
  • 58.   The Web 2.0 has brought about: ◦  New data sources ◦  New usage patterns ◦  New understanding about the users, their needs, habits, preferences ◦  New opportunities ◦  Lots of metadata! ◦  A chance to experience a true paradigm shift   Before: image annotation is tedious, labor-intensive, expensive   After: image annotation is fun! Klagenfurt - June 2010
  • 59. ◦  Google Image Labeler ◦  Games with a purpose (GWAP):   The ESP Game   Squigl   Matchin Klagenfurt - June 2010
  • 60.   UNDERSTAND… ◦  human intentions ◦  human emotions ◦  user’s preferences and needs Klagenfurt - June 2010
  • 61.   CREATE… ◦  better interfaces ◦  better user experience ◦  new business opportunities (added value) Klagenfurt - June 2010
  • 62.   Image Genius (sponsored by FAU / will become startup)
  • 63.   Fully functional online prototype of a medical image retrieval system (MEDIX) with DICOM capabilities
  • 64.   Unsupervised ROI extraction from an image (by Gustavo B. Borba, UTFPR, Brazil)
  • 65. –  I believe (but cannot prove…) that successful VIR solutions will: •  combine content-based image retrieval (CBIR) with metadata (high-level semantic-based image retrieval) •  only be truly successful in narrow domains •  include the user in the loop –  Relevance Feedback (RF) –  Collaborative efforts (tagging, rating, annotating) •  provide friendly, intuitive interfaces •  incorporate results and insights from cognitive science, particularly human visual attention, perception, and memory Klagenfurt - June 2010
  • 66.   “Image search and retrieval” is not a problem, but rather a collection of related problems that look like one.   There is a great need for good solutions to specific problems.   10 years after “the end of the early years”, research in visual information retrieval still has many open problems, challenges, and opportunities. Klagenfurt - June 2010
  • 67. Questions? omarques@fau.edu Klagenfurt - June 2010