Video Indexing And Retrieval - Presentation Transcript
Video Indexing and Retrieval SLIS 5206
Text indexing Began in earnest after the printing press was invented in the 1400s. Scholarly journals began to be published with rapidity and indexing methods Were desperately needed. Pre-coordinate indexing Post-coordinate indexing Computerized indexing: KWIC—keyword in context String searches
Still image indexing
James Turner, 1997: “Indexing Images, Some Considerations”
Images are subject to more than one interpretation; text is not
Text can stand alone; images rarely do
Video indexing
Video indexing applications : news video, film archives, surveillance, user-generated content, distance learning, video conferencing, medical applications, sports.
Video indexing involves “segmentation, analysis and abstraction” of video content (Zhong).
Problems/Goals
Growing amounts of video data
Video data is difficult to index ; dynamic not static. Ex: TV video has 25-30 frames per second. Bibliographic schemes: different manifestations of videos—languages, content added or edited, etc.
Copyright issues: not many videos are in public domain.
Goal = automated semantic indexing; not quite there yet.
Indexing breakdown
Sequence->scene->shot->frame->object
Frame =still image
Key frame =representative still image
Types of indexing: low-level
Based on color histograms, motion & object detection and tracking.
Focuses on appearance
Sequence->shot-> scene
Sequence =group of scenes
Scene =group of shots, similar to a paragraph in a text document.
Shot = “single series of actions with one camera.”
The basic unit of indexing, similar to a word in a text document. Scenes are similar to paragraphs in a text document while sequences are similar to pages or chapters.
Types of indexing: high-level
High level indexing focuses on the content of the video, rather than its appearance
Semantic gap: the difference between description of content and how the user perceives the content.
Ex: content described by indexer as “boat”; user perceives it as “cruise”
Entities mentioned but not seen must still be indexed. Ex: newsreel of Bob Hope making joke about Marilyn Monroe (she may not actually appear in footage)
Scene cut detection algorithms (Zhong)
1. Divide video streams into units (such as shots)
2. Select representative or KEY frame
3. Describe colors and shapes for indexing
Note: Temporal information not included
(must be separately annotated along with metadata)
Metadata
Information that describes a resource; aids in classification
Metadata standards used for video indexing:
FIAF EAD SMIL MPEG-7
Dublin Core XML RDF MPEG-21
Metadata standards, cont’d
FIAF —International Film Archive Federation cataloging rules
0 comments
Post a comment