Multimedia Semantics: Metadata, Analysis and Interaction

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

0 comments

Post a comment

    Post a comment
    Embed Video
    Edit your comment Cancel

    Notes on slide 1

    Diagram is messy. Try to show largest part of MPEG-7 in one slide. From Alia and Michiel: MPEG-7 so far???

    Who? experts, lay persons Why? information searching, annotation tasks, How? entering query, finding items of interest, displaying results What? fact finding, information gathering, sensemaking, location-based mobile search (Pub Canary Wharf)

    3 Favorites

    Multimedia Semantics: Metadata, Analysis and Interaction - Presentation Transcript

    1. Multimedia Semantics: Metadata, Analysis and Interaction Raphael Troncy < [email_address] > Multimedia Semantics, EURECOM
    2. Some BIG numbers
      • User Generated Content (Jul'09)
        • 3.7+ billion photos
        • 10+ billion photos
        • 110+ million videos
          • 20 hours uploaded / min ≈ 75 000 full length movies / week
      • Archived TV content
        • 1.5 million hours ≈ 120 km of shelves
        • 300000 hours | 1 petabyte / year
      • News content
      • Content difficult to search and reuse
        • Barely invisible for the search engines
    3. Image/Video indexing
      • Techniques used by mainstream search engines
        • search term occurs in the filename or in the caption or in user tags
        • no semantics
      • Image indexing: main problem
        • an image is not alphabetic: there is no countable discrete units, that, in combination will provide the meaning of the image
        • image descriptors are not given with the image: one needs to extract or interpret them
      • Video indexing: additional problem
        • a video has additionally a temporal dimension to take into account
        • a video has a priori no discrete units neither (i.e. frames, shots, sequences cannot be absolutely defined)
    4. Why is it so difficult to find appropriate multimedia content, to reuse and repurpose content previously published and to present this content in interfaces that vary with user needs?
    5. Sounds Familiar?
      • [Arnold Smeulders, PAMI, 2000 ] The semantic gap is the lack of coincidence between the information that one can extract from the sensory data and the interpretation that the same data has for a user in a given situation
    6. long way
      • a little drop of semantics goes a
      Jim Hendler [ 1997 ]
    7. Agenda
      • Semantics in multimedia analysis
        • Detecting concepts for video indexing
        • Evaluating interactive search tasks
      • Semantics in metadata
        • Multimedia metadata interoperability
        • Expose your data following 4 basic principles
        • Re-use a growing amount of publicly open datasets
      • Semantics in user interfaces
        • Provide meaningful presentation of underlying data
        • Explore large knowledge bases powered by linked data
    8. The science of labeling
      • Automatically detecting the presence of a concept in a video stream
      • Naming visual information
      airplane
    9. The Computer Vision Approach
      • Building detectors one-at-the-time
        • a face detector for frontal faces
        • a face detector for non -frontal faces
      3 years later One (or more) PhD for every new concept
    10. So how about these?
    11. A Simple Concept Detector
    12. K-nearest neighbor
    13. Linear Classification
    14. Support Vector Machine
    15. Supervised Learner
    16. NIST TRECVID Evaluation
      • Until 2001, everybody defined his own concepts
        • Using specific and small data sets
        • Hard to compare methodologies
      • Since 2001, worldwide evaluation by NIST
        • Promote progress in video retrieval search
        • Provide common datasets (shots, ASR, key frames)
        • Use open, metrics-based evaluation
      Large-Scale Concept Ontology for Multimedia
    17. Success and Criticism
      • More and more concept detectors available:
        • TRECVID 2005: 101 concept lexicon
        • TRECVID 2006: 491 concept lexicon
        • MediaMill Challenge 2007: 572 concept lexicon
      • ... but focus is on the final result
        • relative merit of indexing methods: ignore intermediary steps while systems become more complex (several features and learning methods)
      • ... but concept detectors developed mismatch user information needs
    18. TRECVID Interactive Video Search Task
      • Query selection:
        • by keyword,
        • by concept,
        • by example
      • Topics unknown
      • Test set
        • English (2004)
        • Chinese (2005-6)
        • Dutch (2007-8-9)
    19. VideOlympics
      • Benchmark performance cannot be sole criterion
        • Experience of searcher counts
        • Usability of systems matters
      • VideoOlympics: live interactive search task
        • Simultaneous exposure of video retrieval systems
        • Showcase that goes beyond a regular demo session
        • Fun to do (participants) & Fun to watch (audience)
    20. VideOlympics Setup
      • One display
        • TRECVID like queries
        • Results pushed by searchers
    21. Agenda
      • Semantics in multimedia analysis
        • Detecting concepts for video indexing
        • Evaluating interactive search tasks
      • Semantics in metadata
        • Multimedia metadata interoperability
        • Expose your data following 4 basic principles
        • Re-use a growing amount of publicly open datasets
      • Semantics in user interfaces
        • Provide meaningful presentation of underlying data
        • Explore large knowledge bases powered by linked data
    22. Multimedia: Description methods MPEG-1 MPEG-2 MPEG-4 MPEG-7 MPEG-21 ISO W3C
    23. MPEG-7: a multimedia description language?
      • ISO standard since December of 2001
      • Main components :
        • Descriptors (Ds) and Description Schemes (DSs)
        • DDL (XML Schema + extensions)
      • Concern all types of media
      Part 5 – MDS Multimedia Description Schemes
    24. MPEG-7 and the Semantic Web
      • MDS Upper Layer represented in RDFS
        • 2001: Hunter
        • Later on: link to the ABC upper ontology
      • MDS fully represented in OWL-DL
        • 2004: Tsinaraki et al., DS-MIRF model
      • MPEG-7 fully represented in OWL-DL
        • 2005: Garcia and Celma, Rhizomik model
        • Fully automatic translation of the whole standard
      • MDS and Visual parts represented in OWL-DL
        • 2007: Arndt et al., COMM model
        • Re-engineering MPEG-7 using DOLCE design patterns
    25.  
    26. Example 1: Region Annotation
    27. Example 2: Sequence Annotation
    28.  
    29. Image Annotation with Linked Data The &quot; Big Three &quot; at the Yalta Conference (Wikipedia)
      • Localize a region (bounding box)
      • Annotate the content (interpretation)
        • Tag: Winston Churchill, UK Prime Minister, Allied Forces, WWII
        • Link to knowledge on the Web
      Reg1 :Reg1 foaf:depicts dbpedia:Winston_Churchill ---------------------------------------------- dbpedia:Winston_Churchill dbpedia:spouse dbpedia:Clementine_Churchill dbpedia:Winston_Churchill owl:sameAs fbase:Winston_Churchill
    30. Video Annotation with Linked Data A history of G8 violence ( video ) (© Reuters)
      • Localize a region
      • Annotate the content
        • Tag: G8 Summit, Heiligendamn, 2007
        • Link to knowledge on the Web
      EU Summit, Gothenburg, 2001 Seq1 Seq4 :Seq1 foaf:depicts dbpedia:34th_G8_Summit ---------------------------------------------- dbpedia:33rd_G8_Summit foaf:based_near geo:Heilegendamn geo:Heilegendamn skos:broader geo:Germany
    31. What is linked data?
      • URIs, possibly identifying media fragments
      • + annotations (tags)
      • + links among fragments & annotations
      dbpedia:Zidane foaf:depicts nar:location geonames:2950159 nar:subject nc:15054000 events:id wp:2006_FIFA_World_Cup#Final
    32. Linked Data Principles
      • Tim Berners Lee [2006] ( Design Issues )
        • Use URIs to identify things (anything, not just documents);
        • Use HTTP URIs – globally unique names, distributed ownership – so that people can look up those names;
        • Provide useful information in RDF – when someone looks up a URI;
        • Include RDF links to other URIs – to enable discovery of related information
    33. An Example: DBpedia
      • DBpedia is a community effort to:
        • extract structured &quot;infobox&quot; information from Wikipedia
        • interlink DBpedia with other datasets on the Web
    34. Scrapping infobox data http:// dbpedia.org /resource/Bogotá
    35. Automatic Links Among Open Datasets <http://dbpedia.org/resource/Bogotá> owl:sameAs <http://sws.geonames.org/3688689/> owl:sameAs <http://rdf.freebase.com/ns/guid.9202a8c04000641f8000000000167bab> dbpedia:population &quot;6776009&quot; ... <http://sws.geonames.org/3688689/> owl:sameAs <http://dbpedia.org/resource/Bogotá> wgs84_pos:lat &quot;4.6&quot; wgs84_pos:long &quot;-74.0833333&quot; geo:population &quot;7102602&quot; ... Geonames DBpedia
    36. sameAs.org
    37. Bogotá on Freebase
    38. Bogotá on Geonames
    39. How Much Linked Data is there ?
    40. Linked Data Cloud – August 2007
    41. Linked Data Cloud – March 2008
    42. Linked Data Cloud – September 2008
    43. Linked Data Cloud – March 2009
    44. The Web of Data
      • Expose open datasets in RDF
      • Set RDF links among the data items for different datasets
      • Over 4.5 billion triples, 5 millions links (March 2009)
      • ... still counting
      • Who are the users?
      • Why would they use the cloud?
      • What tasks can be supported?
      • How will the semantics help?
    45. Agenda
      • Semantics in multimedia analysis
        • Detecting concepts for video indexing
        • Evaluating interactive search tasks
      • Semantics in metadata
        • Multimedia metadata interoperability
        • Expose your data following 4 basic principles
        • Re-use a growing amount of publicly open datasets
      • Semantics in user interfaces
        • Provide meaningful presentation of underlying data
        • Explore large knowledge bases powered by linked data
    46. Provide meaningful presentation of data
    47. ... and behind the scene
    48. ... link an artist to more data
    49. ... myspace
    50. ... last.fm
    51. ... IMDb
    52. Going through the Walled Gardens David Simonds: Everywhere and nowhere. 19 May 2008, The Economist .
    53. How can semantics help?
      • Query construction
        • disambiguate input (auto-completion)
        • selection of available terms (grouping and ranking algorithms)
      • (Semantic) search algorithm
        • graph traversal
        • query expansion
        • RDFS/OWL reasoning
      • Presentation of search results
        • grouping by property
        • visualization on timeline, map, etc.
    54. News Workflow Interoperability
      • No integration of media (stories, photo, animation, video)
      • Little (or no) context in the news presentation
      • Lack of interoperability in the current workflow
      NAR Schema Controlled Vocabularies Broadcaster Schema NewsCodes User Vocabulary
    55. Exploratory Search
      • (Ultimate) Goal:
        • Provide an environment for searching and browsing contextualized multimedia news information
      • Required integration:
        • Data: various media, different forms, various sources
        • Metadata: schema integration, semantic models
      • Influence and implications of UI:
        • How to represent semantic multimedia metadata to facilitate presenting information?
        • in other words ... What constraints do end-user interfaces put on the modeling of the metadata?
    56. News and Multimedia Formats (NAR) News Architecture NewsML G2 EventsML G2 SportsML G2
    57. Modeling the News + Media Ontology foaf:Person ≈ nar:Person dc:Subject ≈ nar:Subject sioc:Item ≈ nar:Item geo:lat geo:long +
    58. Enriching the News Metadata
      • Concepts/Entities that are subject of news
        • Thematic categories
        • People
        • Organizations
        • Geopolitical Areas
        • Points of Interest
        • Events
        • Products or artefacts
    59. Enriching the News Metadata NAR Ontology NewsCodes Thesaurus Named Entity Recognition Domain Ontologies
    60. Enriching the News Metadata NAR Ontology NewsCodes Thesaurus Domain Ontologies Concept Detectors
    61. Presenting News Information
      • Dimensions used for searching news items
        • When time 10/07/2006
        • Where location Paris
        • What is depicted J. Chirac, Z. Zidane
        • Why event WC 2006
        • Who photographer Bertrand Guay, AFP
      Metadata
    62. Semantic Search of Multimedia News Powered by ClioPatria 1.0 alpha 3 1,067,754 Total 1,932 INA Broadcast Video (June/July 2006) 61,311 AFP Photos (June/July 2006) 804,446 AFP News Feed (June/July 2006) 53,468 DBpedia, Geonames 34,903 Thesauri: newscodes 104,358 Domain Specific Ontologies: football 7,336 General Ontologies: NAR, DC, FOAF Number of RDF Triples Description
    63.  
    64.  
    65.  
    66. Provide New Dimensions for Exploring
    67. Take Home Message
      • Concept detection challenges: machine learning and IR
        • Features can be extracted and used to describe multimedia content
        • Show generality of approach, dynamic nature of video ( event )
        • Show that an ontology can help
      • Semantic metadata representation challenges: KR
        • Media and metadata can be passed around and among systems
        • Reuse what is there
        • Expose what you make
      • Interaction challenges: CHI
        • Users can be given much richer and more flexible access to (semantically annotated) content
        • ... but we are still figuring out how to do this!
    68. Credits
      • Many people
        • Cees Snoek, Alex Hauptmann, Alan Smeaton, Ivan Herman, Krishna Chandramouli, David Simonds, Laurent Le Meur
        • Colleagues from the Interactive Information Access Group, CWI Amsterdam
      • Datasets
      http:// www.slideshare.net/troncy

    + Raphael TroncyRaphael Troncy, 3 months ago

    custom

    412 views, 3 favs, 0 embeds more stats

    Multimedia Semantics: Metadata, Analysis and Intera more

    More info about this document

    © All Rights Reserved

    Go to text version

    • Total Views 412
      • 412 on SlideShare
      • 0 from embeds
    • Comments 0
    • Favorites 3
    • Downloads 0
    Most viewed embeds

    more

    All embeds

    less

    Flagged as inappropriate Flag as inappropriate
    Flag as inappropriate

    Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

    Cancel
    File a copyright complaint
    Having problems? Go to our helpdesk?

    Categories