SlideShare a Scribd company logo
1 of 25
Download to read offline
Workshop on Interactive Information Access
Untangling Tasks and Technologies




Untangling the semantic structure
in a broadcast video archive


                                            Ichiro IDE
                                  Nagoya University, Japan
                   University of Amsterdam, The Netherlands
December 7, 2010
2
                                                                    2

Introduction

• Online digital video archive is becoming a reality
  • Efficient retrieval and browsing
  • Effective reuse
• Our aims:
  • Extract the semantic structure between video data
  • Rearrange video segments and generate new contents
     • Provide a browsing / editing interface based on the
       extracted semantic structure
     • (Semi-) automatic rearrangement of retrieved results
       for answering queries

                            Workshop on Interactive Information Access
                            Workshop on Interactive Information Access
3
                                                                                 3

NII news video archive                                            RAID
                                                                  disk Servers
                                                       PCs for
                                                      capturing

                                           Client
     MPEG-1/2    Closed-caption          Client
                                             PC
      decoder       decoder               PC


     Video archiving server                    DBMS server

         Video archive                              Metadata

      MPEG-1       MPEG-2               Closed-caption       Story
        Video       Video                    text          boundary
      [970 GB]     [5.9 TB]                [79 MB]       [46 k stories]



                                         Data processing server
• Program: NHK News7
• Period: March 16, 2001– (1,700– hours)
                                  Workshop on Interactive Information Access
                                  Workshop on Interactive Information Access
4
                                                               4

Overview of the talk

              Exploring news stories
          along the topic thread structure
                        §
  Cross-language detection of related news stories
      by text and near-duplicate video segments

                        §
       Structuring a broadcast video archive
      based on near-duplicate video segments
                       Workshop on Interactive Information Access
                       Workshop on Interactive Information Access
5
                                                                                                                     5




                 Exploring news stories
             along the topic thread structure
                                         I. Ide, H. Mo, N. Katayama, S. Satoh:
“Exploiting topic thread structures in a news video archive for the semi-automatic generation of video summaries”,
                       2006 IEEE Int. Conf. on Multimedia and Expo (ICME2006), July 2006

                           I. Ide, T. Kinoshita, T. Takahashi, S. Satoh, H. Murase:
                “mediaWalker: A video archive explorer based on time-series semantic structure”,
                         15th ACM Int. Multimedia Conf. Demo Session, Sept. 2007

                   I.Ide , T. Kinoshita, T. Takahashi, H. Mo, N. Katayama, S. Satoh, H. Murase:
         “Exploiting the chronological semantic structure in a large-scale broadcast news video archive
                                              for its efficient exploration”,
                      APSIPA Annual Summit and Conf. (ASC) 2010, to appear in Dec. 2010
                                                   Workshop on Interactive Information Access
                                                   Workshop on Interactive Information Access
6
                                                                      6
Semantic structures in news video
Intra- & Inter-video structure
• Story tracking / Topic threading
                  Intra-video structured videos
       Video-1               Story-3

       Video-2 Story-1           Story-3
                                                   Inter-video
       Video-3 Story-1 Story-2
                                                   structure
       Video-4           Story-2 Story-3

       Video-5 Story-1 Story-2                  Story-5
                  Thread-2                     Thread-1
        …




   Reveals the semantic structure throughout the archive
                              Workshop on Interactive Information Access
                              Workshop on Interactive Information Access
7
                                                             7

Example of a topic thread structure

  Period: 100 days
    Origin
  May 1, 2003
   Story #1




 [Cluster-view]




                     Workshop on Interactive Information Access
                     Workshop on Interactive Information Access
8
                                                                                                 8

Contents of a topic thread structure

    SARS outbreak         Chinese gov. worries                        Chinese gov. watches
      In Beijing        the spread in rural areas                    the spread in rural areas

                      Spreads in                                                  Calms down
                     mainland China                                                in Taiwan




 WHO sends a                                                              WHO declares
mission to Beijing                                                         the cease

Slows downs in mainland                                                     Calms down in
China, spreads in Taiwan                                                    mainland China,
                                                                          reports fromToronto
   Taiwanese doctor found
infected after traveling Japan                                                Search for
                                                    Anti-SARS conference  Infection in Japan
                                                        held in Beijing
                                              Workshop on Interactive Information Access
                                              Workshop on Interactive Information Access
9
                                                          9
Browsing news video by the thread
structure: mediaWalker
                                                 Demo




                  Workshop on Interactive Information Access
                  Workshop on Interactive Information Access
10
                                                                10

Towards Video Story-Telling

  From here




                                           To here
                                            I want to know
                                            how it developed


• Generate a summarized video that explains how
  the story developed between two news stories
  • Select a path (semi-)automatically
  • Summarize the video streams along the path
  Currently under work with Frank
                         Workshop on Interactive Information Access
                         Workshop on Interactive Information Access
14
                                                                                                    14




         Cross-language detection
          of related news stories
by text and near-duplicate video segments



                              A. Ogawa, T. Takahashi, I. Ide, H. Murase:
    “Cross-lingual retrieval of identical news events by near-duplicate video segment detection”,
                14th Intl. Multimedia Modeling Conference (MMM2008), Jan. 2008




                                            Workshop on Interactive Information Access
                                            Workshop on Interactive Information Access
15
                                                                  15

Cross-language news story detection

• Definition
  ― Detect news stories in different
    channels (especially in different
    languages) discussing the same
    event
• Problem
                                                       Near
  ― Text-based approach                              duplicate
     • Low MT * ASR quality
             (Though, recently improving…)
             (Though, recently improving…)
     • Different view-point, culture
• Proposed method
  • Detect near-duplicate video segments
    to complement text information on Interactive Information Access
                           Workshop on Interactive Information Access
                            Workshop
16
                                                                16

Comparison of news video streams
• Identical event should be broadcast in a close timing
  • Compare news programs broadcast within +/- 24 hours




              Compare only the center part
            to avoid super-imposed captions

              Cope with color differences
               by histogram averaging
                         Workshop on Interactive Information Access
                         Workshop on Interactive Information Access
17
                                                                                 17

Example of news stories on a same event
                        <<Keywords>>
                        operation [25], US army [20], Fallujah [18], military
Nov 9, 2004 Story # 1   force [12], troops [7], military strategy [7], attack [5],
19:01 (GMT+9) --        Iraqi army [5], general citizens [5], Iraq [4], …




                        <<Keywords>>
                        city [9], Jean [6], Aaron [6], Iraqi [4], phone, call [3],
Nov 8, 2004 Story # 1   army forces [3], casualties [3], …
22:03 (GMT-5) --
                              Workshop on Interactive Information Access
                              Workshop on Interactive Information Access
18
                                                         18
Cross-language news browsing interface:
topicTraveller

                                                 Demo




                  Workshop on Interactive Information Access
                  Workshop on Interactive Information Access
20
                                                                 20

Result

• Dataset
  – 18 pairs of (JP: 1  US: 2)
  – Ground truth: manually given

                               Sum of
             Text only                           Image only
                           text and Image
 Recall     83% (38/46)     96% (20/46)          43% (20/46)
Precision   72% (38/53)     90% (44/49)          77% (20/26)

 Advantage of using image information

                          Workshop on Interactive Information Access
                          Workshop on Interactive Information Access
21
                                                                                            21




 Structuring a broadcast video archive
based on near-duplicate video segments




                   I. Ide, Y. Shamoto, D. Deguchi, T. Takahashi, H. Murase:
    “Classification of near-duplicate video segments based on their appearance patterns”,
                 20th Int. Conf. on Pattern Recognition (ICPR2010), Aug. 2010.




                                        Workshop on Interactive Information Access
                                        Workshop on Interactive Information Access
22
                                                                 22

Structuring a broadcast video archive

• Structure?
  – For browsing / retrieval
  – Differs among programs / genres
• Applications
  – Advertisement database
  – Related contents detection
     • Related news, …
  – Periodic contents detection
     • Sub-program structure

   Handle in a unified framework

                          Workshop on Interactive Information Access
                          Workshop on Interactive Information Access
23
                                                                 23

Example of appearance patterns


Advertisement



Related news




 Sub-program


• Different distributions for different types            Demo
                          Workshop on Interactive Information Access
                          Workshop on Interactive Information Access
24
                                                                24

Classes of near-duplicate segment types

 1) Advertisement    2) Related news       3) Sub-program




 4) Rebroadcast     5) Similar framing 6) Extracted segment




                         Workshop on Interactive Information Access
                         Workshop on Interactive Information Access
25
                                                                 25

Near-duplicate detection experiment

• Data set
  – 1 week of broadcast from 6 channels in Tokyo area
      Total:     1,008 hours
• Computer environment
  – Cluster computer
     • 40 CPU (Intel Xeon 3.4Ghz, Main Memory: 1.0 GB)
• Computation cost
  – CPU time:      133 days
  – Actual time:     4 days
• Result
  – 3,597,943 pairs     (40,928 unique segments)
                          Workshop on Interactive Information Access
                          Workshop on Interactive Information Access
26
                                                                               26

Automatic classification of classes

• Classification rules             Unique ND segment set

   – Features of near-duplicate
                                   Rebroadcast
     video segments within a
     unique segment set
                                       Advertisement
      • Appearance period
      • Appeared channels                          Sub-
                                                 program
      • Appearance interval
      • Length of the segment                           Similar
                                                       framing
      • Periodic or not
                                                             Extracted   Related
      • Extracted segment or not                             segment      news




        Extracted segment      Original segment
                            Workshop on Interactive Information Access
                            Workshop on Interactive Information Access
27
                                                                                                     27

Evaluation

• 100 unique segment sets per class
  (61 sets for rebroadcast)
                                                               Manual classification
                                                    1)    2)     3)     4)     5)      6)    Misc.
  Automatic classification




                             1) Advertisement       92%    1%     2%      0%    0%      0%     5%
                             2) Related news         1%   51%     7%      0%    5%     17%    19%
                             3) Sub-program          0%    0%    65%      0%    2%      0%    33%
                             4) Rebroadcast          0%    0%     0%    36%     0%      0%    64%
                             5) Similar framing      0%    0%     0%      0%   63%      6%    31%
                             6) Extracted segment    1%   49%     2%      0%    0%     35%    13%

                              Accuracy: 57%                          Cover rate: 77%
                                                     Workshop on Interactive Information Access
                                                     Workshop on Interactive Information Access
28
                                                               28

Future directions
• Now we have structured the archives in various
  ways
   Consider how to exploit the structure

• Reorganize the video data based on an external
  “scenario”
  – News video archive         Wikipedia description
     (Semi-)automatic Documentary generation
  – Cooking video archive      Plain recipe text
     Multimedia supplementation to a text recipe
  …
                        Workshop on Interactive Information Access
                        Workshop on Interactive Information Access
29
                                                                            29

Summary

• Introduced works on analyzing the semantic
  structures in large-scale news video archives and
  interfaces for efficient understanding of its contents.
Thanks to:
• Nagoya Univ:         Profs. Hiroshi Murase, Daisuke Deguchi
                       Akira Ogawa, Yuji Shamoto, Tomoki Okuoka
• NII:                 Profs. Shin’ichi Satoh, Norio Katayama, Hiroshi Mo
• Gifu Shotoku Gakuen Univ.: Prof. Tomokazu Takahashi
• NetCompass Ltd.:     Tomoyoshi Kinoshita, Takeharu Haraigawa

Funded by:
• JSPS, MEXT, MRI Inc., Kayamori Information Science Fund, Hoso Bunka
  Foundation                   Workshop on Interactive Information Access
                                Workshop on Interactive Information Access

More Related Content

Viewers also liked

Structure of your music video
Structure of your music videoStructure of your music video
Structure of your music videomeganbrant
 
Music Videos (Structure)
Music Videos (Structure)Music Videos (Structure)
Music Videos (Structure)Adil AbbAs
 
H.264 nal and RTP
H.264 nal and RTPH.264 nal and RTP
H.264 nal and RTPYoss Cohen
 
Structure Data 2014: IS VIDEO BIG DATA?, Steve Russell
Structure Data 2014: IS VIDEO BIG DATA?, Steve RussellStructure Data 2014: IS VIDEO BIG DATA?, Steve Russell
Structure Data 2014: IS VIDEO BIG DATA?, Steve RussellGigaom
 
Data structure and its types
Data structure and its typesData structure and its types
Data structure and its typesNavtar Sidhu Brar
 
Introduction of data structure
Introduction of data structureIntroduction of data structure
Introduction of data structureeShikshak
 
Data Structures and Algorithms
Data Structures and AlgorithmsData Structures and Algorithms
Data Structures and AlgorithmsPierre Vigneras
 
Data structure and its types
Data structure and its typesData structure and its types
Data structure and its typesNavtar Sidhu Brar
 
Lecture 1 data structures and algorithms
Lecture 1 data structures and algorithmsLecture 1 data structures and algorithms
Lecture 1 data structures and algorithmsAakash deep Singhal
 
DATA STRUCTURES
DATA STRUCTURESDATA STRUCTURES
DATA STRUCTURESbca2010
 
Fundamentals of data structures
Fundamentals of data structuresFundamentals of data structures
Fundamentals of data structuresNiraj Agarwal
 

Viewers also liked (12)

Structure of your music video
Structure of your music videoStructure of your music video
Structure of your music video
 
Music Videos (Structure)
Music Videos (Structure)Music Videos (Structure)
Music Videos (Structure)
 
H.264 nal and RTP
H.264 nal and RTPH.264 nal and RTP
H.264 nal and RTP
 
Structure Data 2014: IS VIDEO BIG DATA?, Steve Russell
Structure Data 2014: IS VIDEO BIG DATA?, Steve RussellStructure Data 2014: IS VIDEO BIG DATA?, Steve Russell
Structure Data 2014: IS VIDEO BIG DATA?, Steve Russell
 
Data structure and its types
Data structure and its typesData structure and its types
Data structure and its types
 
Introduction of data structure
Introduction of data structureIntroduction of data structure
Introduction of data structure
 
Data Structures and Algorithms
Data Structures and AlgorithmsData Structures and Algorithms
Data Structures and Algorithms
 
Data structure and its types
Data structure and its typesData structure and its types
Data structure and its types
 
Data Structure
Data StructureData Structure
Data Structure
 
Lecture 1 data structures and algorithms
Lecture 1 data structures and algorithmsLecture 1 data structures and algorithms
Lecture 1 data structures and algorithms
 
DATA STRUCTURES
DATA STRUCTURESDATA STRUCTURES
DATA STRUCTURES
 
Fundamentals of data structures
Fundamentals of data structuresFundamentals of data structures
Fundamentals of data structures
 

Similar to Untangling the semantic structure in a broadcast video archive

Mpeg 7 video signature tools for content recognition
Mpeg 7 video signature tools for content recognitionMpeg 7 video signature tools for content recognition
Mpeg 7 video signature tools for content recognitionParag Tamhane
 
A brief introduction to learning cell
A brief introduction to learning cellA brief introduction to learning cell
A brief introduction to learning cellWei Cheng
 
JTS 2010 Presentation 'Audiovisual Heritage and Participatory Culture"
JTS 2010 Presentation 'Audiovisual Heritage and Participatory Culture"JTS 2010 Presentation 'Audiovisual Heritage and Participatory Culture"
JTS 2010 Presentation 'Audiovisual Heritage and Participatory Culture"Johan Oomen
 
Delay Analysis of Layered Video Caching in Crowdsourced Heterogeneous Wireles...
Delay Analysis of Layered Video Caching in Crowdsourced Heterogeneous Wireles...Delay Analysis of Layered Video Caching in Crowdsourced Heterogeneous Wireles...
Delay Analysis of Layered Video Caching in Crowdsourced Heterogeneous Wireles...Behrouz Jedari
 
Social Media Trumps
Social Media TrumpsSocial Media Trumps
Social Media TrumpsMichael Norton
 
Video Description using Deep Learning
Video Description using Deep LearningVideo Description using Deep Learning
Video Description using Deep LearningPranjalMahajan9
 
Future Cities Conference´13 / Peter Steenkiste - "The eXpressive Internet Arc...
Future Cities Conference´13 / Peter Steenkiste - "The eXpressive Internet Arc...Future Cities Conference´13 / Peter Steenkiste - "The eXpressive Internet Arc...
Future Cities Conference´13 / Peter Steenkiste - "The eXpressive Internet Arc...Future Cities Project
 
Content Modelling for Human Action Detection via Multidimensional Approach
Content Modelling for Human Action Detection via Multidimensional ApproachContent Modelling for Human Action Detection via Multidimensional Approach
Content Modelling for Human Action Detection via Multidimensional ApproachCSCJournals
 
Enrichment of News Show Videos with Multimodal Semi-Automatic Analysis
Enrichment of News Show Videos with Multimodal Semi-Automatic AnalysisEnrichment of News Show Videos with Multimodal Semi-Automatic Analysis
Enrichment of News Show Videos with Multimodal Semi-Automatic AnalysisLinkedTV
 
Wiki-Service Bundeswehr @ Enterprise 2.0 Summit 2009
Wiki-Service Bundeswehr @ Enterprise 2.0 Summit 2009Wiki-Service Bundeswehr @ Enterprise 2.0 Summit 2009
Wiki-Service Bundeswehr @ Enterprise 2.0 Summit 2009Dennis Busch
 
Unit 62 scheme of work
Unit 62 scheme of workUnit 62 scheme of work
Unit 62 scheme of workinghamjamie
 
to introduce SNOW
to introduce SNOWto introduce SNOW
to introduce SNOWSNOW.or.kr
 
Information Architecture in the Workplace
Information Architecture in the WorkplaceInformation Architecture in the Workplace
Information Architecture in the WorkplaceCourtney Lawton, M.S.
 
Content Based Video Retrieval Using Integrated Feature Extraction and Persona...
Content Based Video Retrieval Using Integrated Feature Extraction and Persona...Content Based Video Retrieval Using Integrated Feature Extraction and Persona...
Content Based Video Retrieval Using Integrated Feature Extraction and Persona...IJERD Editor
 
FĂŚlles prĂŚsentation-tema 3-thomas-ryborg-max-eckardt
FĂŚlles prĂŚsentation-tema 3-thomas-ryborg-max-eckardtFĂŚlles prĂŚsentation-tema 3-thomas-ryborg-max-eckardt
FĂŚlles prĂŚsentation-tema 3-thomas-ryborg-max-eckardtDigHumFritjof
 
Building Blocks Where's My Content Joyce Schwarz
Building Blocks Where's My Content Joyce SchwarzBuilding Blocks Where's My Content Joyce Schwarz
Building Blocks Where's My Content Joyce SchwarzJoyce Schwarz
 
How Search is Accelerating the Growth of Video in the Enterprise
How Search is Accelerating  the Growth of Video in the EnterpriseHow Search is Accelerating  the Growth of Video in the Enterprise
How Search is Accelerating the Growth of Video in the Enterprisesatishgannu
 
Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...Dag Endresen
 

Similar to Untangling the semantic structure in a broadcast video archive (20)

Mpeg 7 video signature tools for content recognition
Mpeg 7 video signature tools for content recognitionMpeg 7 video signature tools for content recognition
Mpeg 7 video signature tools for content recognition
 
Smw+tutorial berlin-fall-2011
Smw+tutorial berlin-fall-2011Smw+tutorial berlin-fall-2011
Smw+tutorial berlin-fall-2011
 
A brief introduction to learning cell
A brief introduction to learning cellA brief introduction to learning cell
A brief introduction to learning cell
 
JTS 2010 Presentation 'Audiovisual Heritage and Participatory Culture"
JTS 2010 Presentation 'Audiovisual Heritage and Participatory Culture"JTS 2010 Presentation 'Audiovisual Heritage and Participatory Culture"
JTS 2010 Presentation 'Audiovisual Heritage and Participatory Culture"
 
Delay Analysis of Layered Video Caching in Crowdsourced Heterogeneous Wireles...
Delay Analysis of Layered Video Caching in Crowdsourced Heterogeneous Wireles...Delay Analysis of Layered Video Caching in Crowdsourced Heterogeneous Wireles...
Delay Analysis of Layered Video Caching in Crowdsourced Heterogeneous Wireles...
 
Social Media Trumps
Social Media TrumpsSocial Media Trumps
Social Media Trumps
 
Video Description using Deep Learning
Video Description using Deep LearningVideo Description using Deep Learning
Video Description using Deep Learning
 
Future Cities Conference´13 / Peter Steenkiste - "The eXpressive Internet Arc...
Future Cities Conference´13 / Peter Steenkiste - "The eXpressive Internet Arc...Future Cities Conference´13 / Peter Steenkiste - "The eXpressive Internet Arc...
Future Cities Conference´13 / Peter Steenkiste - "The eXpressive Internet Arc...
 
Content Modelling for Human Action Detection via Multidimensional Approach
Content Modelling for Human Action Detection via Multidimensional ApproachContent Modelling for Human Action Detection via Multidimensional Approach
Content Modelling for Human Action Detection via Multidimensional Approach
 
Enrichment of News Show Videos with Multimodal Semi-Automatic Analysis
Enrichment of News Show Videos with Multimodal Semi-Automatic AnalysisEnrichment of News Show Videos with Multimodal Semi-Automatic Analysis
Enrichment of News Show Videos with Multimodal Semi-Automatic Analysis
 
Wiki-Service Bundeswehr @ Enterprise 2.0 Summit 2009
Wiki-Service Bundeswehr @ Enterprise 2.0 Summit 2009Wiki-Service Bundeswehr @ Enterprise 2.0 Summit 2009
Wiki-Service Bundeswehr @ Enterprise 2.0 Summit 2009
 
Embedded service oriented monitoring, diagnostics and control: towards the as...
Embedded service oriented monitoring, diagnostics and control: towards the as...Embedded service oriented monitoring, diagnostics and control: towards the as...
Embedded service oriented monitoring, diagnostics and control: towards the as...
 
Unit 62 scheme of work
Unit 62 scheme of workUnit 62 scheme of work
Unit 62 scheme of work
 
to introduce SNOW
to introduce SNOWto introduce SNOW
to introduce SNOW
 
Information Architecture in the Workplace
Information Architecture in the WorkplaceInformation Architecture in the Workplace
Information Architecture in the Workplace
 
Content Based Video Retrieval Using Integrated Feature Extraction and Persona...
Content Based Video Retrieval Using Integrated Feature Extraction and Persona...Content Based Video Retrieval Using Integrated Feature Extraction and Persona...
Content Based Video Retrieval Using Integrated Feature Extraction and Persona...
 
FĂŚlles prĂŚsentation-tema 3-thomas-ryborg-max-eckardt
FĂŚlles prĂŚsentation-tema 3-thomas-ryborg-max-eckardtFĂŚlles prĂŚsentation-tema 3-thomas-ryborg-max-eckardt
FĂŚlles prĂŚsentation-tema 3-thomas-ryborg-max-eckardt
 
Building Blocks Where's My Content Joyce Schwarz
Building Blocks Where's My Content Joyce SchwarzBuilding Blocks Where's My Content Joyce Schwarz
Building Blocks Where's My Content Joyce Schwarz
 
How Search is Accelerating the Growth of Video in the Enterprise
How Search is Accelerating  the Growth of Video in the EnterpriseHow Search is Accelerating  the Growth of Video in the Enterprise
How Search is Accelerating the Growth of Video in the Enterprise
 
Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...
 

Recently uploaded

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 

Recently uploaded (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 

Untangling the semantic structure in a broadcast video archive

  • 1. Workshop on Interactive Information Access Untangling Tasks and Technologies Untangling the semantic structure in a broadcast video archive Ichiro IDE Nagoya University, Japan University of Amsterdam, The Netherlands December 7, 2010
  • 2. 2 2 Introduction • Online digital video archive is becoming a reality • Efficient retrieval and browsing • Effective reuse • Our aims: • Extract the semantic structure between video data • Rearrange video segments and generate new contents • Provide a browsing / editing interface based on the extracted semantic structure • (Semi-) automatic rearrangement of retrieved results for answering queries Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 3. 3 3 NII news video archive RAID disk Servers PCs for capturing Client MPEG-1/2 Closed-caption Client PC decoder decoder PC Video archiving server DBMS server Video archive Metadata MPEG-1 MPEG-2 Closed-caption Story Video Video text boundary [970 GB] [5.9 TB] [79 MB] [46 k stories] Data processing server • Program: NHK News7 • Period: March 16, 2001– (1,700– hours) Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 4. 4 4 Overview of the talk Exploring news stories along the topic thread structure § Cross-language detection of related news stories by text and near-duplicate video segments § Structuring a broadcast video archive based on near-duplicate video segments Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 5. 5 5 Exploring news stories along the topic thread structure I. Ide, H. Mo, N. Katayama, S. Satoh: “Exploiting topic thread structures in a news video archive for the semi-automatic generation of video summaries”, 2006 IEEE Int. Conf. on Multimedia and Expo (ICME2006), July 2006 I. Ide, T. Kinoshita, T. Takahashi, S. Satoh, H. Murase: “mediaWalker: A video archive explorer based on time-series semantic structure”, 15th ACM Int. Multimedia Conf. Demo Session, Sept. 2007 I.Ide , T. Kinoshita, T. Takahashi, H. Mo, N. Katayama, S. Satoh, H. Murase: “Exploiting the chronological semantic structure in a large-scale broadcast news video archive for its efficient exploration”, APSIPA Annual Summit and Conf. (ASC) 2010, to appear in Dec. 2010 Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 6. 6 6 Semantic structures in news video Intra- & Inter-video structure • Story tracking / Topic threading Intra-video structured videos Video-1 Story-3 Video-2 Story-1 Story-3  Inter-video Video-3 Story-1 Story-2 structure Video-4 Story-2 Story-3 Video-5 Story-1 Story-2 Story-5 Thread-2 Thread-1 …  Reveals the semantic structure throughout the archive Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 7. 7 7 Example of a topic thread structure Period: 100 days Origin May 1, 2003 Story #1 [Cluster-view] Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 8. 8 8 Contents of a topic thread structure SARS outbreak Chinese gov. worries Chinese gov. watches In Beijing the spread in rural areas the spread in rural areas Spreads in Calms down mainland China in Taiwan WHO sends a WHO declares mission to Beijing the cease Slows downs in mainland Calms down in China, spreads in Taiwan mainland China, reports fromToronto Taiwanese doctor found infected after traveling Japan Search for Anti-SARS conference Infection in Japan held in Beijing Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 9. 9 9 Browsing news video by the thread structure: mediaWalker Demo Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 10. 10 10 Towards Video Story-Telling From here To here I want to know how it developed • Generate a summarized video that explains how the story developed between two news stories • Select a path (semi-)automatically • Summarize the video streams along the path Currently under work with Frank Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 11. 14 14 Cross-language detection of related news stories by text and near-duplicate video segments A. Ogawa, T. Takahashi, I. Ide, H. Murase: “Cross-lingual retrieval of identical news events by near-duplicate video segment detection”, 14th Intl. Multimedia Modeling Conference (MMM2008), Jan. 2008 Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 12. 15 15 Cross-language news story detection • Definition ― Detect news stories in different channels (especially in different languages) discussing the same event • Problem Near ― Text-based approach duplicate • Low MT * ASR quality (Though, recently improving…) (Though, recently improving…) • Different view-point, culture • Proposed method • Detect near-duplicate video segments to complement text information on Interactive Information Access Workshop on Interactive Information Access Workshop
  • 13. 16 16 Comparison of news video streams • Identical event should be broadcast in a close timing • Compare news programs broadcast within +/- 24 hours Compare only the center part to avoid super-imposed captions Cope with color differences by histogram averaging Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 14. 17 17 Example of news stories on a same event <<Keywords>> operation [25], US army [20], Fallujah [18], military Nov 9, 2004 Story # 1 force [12], troops [7], military strategy [7], attack [5], 19:01 (GMT+9) -- Iraqi army [5], general citizens [5], Iraq [4], … <<Keywords>> city [9], Jean [6], Aaron [6], Iraqi [4], phone, call [3], Nov 8, 2004 Story # 1 army forces [3], casualties [3], … 22:03 (GMT-5) -- Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 15. 18 18 Cross-language news browsing interface: topicTraveller Demo Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 16. 20 20 Result • Dataset – 18 pairs of (JP: 1  US: 2) – Ground truth: manually given Sum of Text only Image only text and Image Recall 83% (38/46) 96% (20/46) 43% (20/46) Precision 72% (38/53) 90% (44/49) 77% (20/26)  Advantage of using image information Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 17. 21 21 Structuring a broadcast video archive based on near-duplicate video segments I. Ide, Y. Shamoto, D. Deguchi, T. Takahashi, H. Murase: “Classification of near-duplicate video segments based on their appearance patterns”, 20th Int. Conf. on Pattern Recognition (ICPR2010), Aug. 2010. Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 18. 22 22 Structuring a broadcast video archive • Structure? – For browsing / retrieval – Differs among programs / genres • Applications – Advertisement database – Related contents detection • Related news, … – Periodic contents detection • Sub-program structure  Handle in a unified framework Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 19. 23 23 Example of appearance patterns Advertisement Related news Sub-program • Different distributions for different types Demo Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 20. 24 24 Classes of near-duplicate segment types 1) Advertisement 2) Related news 3) Sub-program 4) Rebroadcast 5) Similar framing 6) Extracted segment Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 21. 25 25 Near-duplicate detection experiment • Data set – 1 week of broadcast from 6 channels in Tokyo area  Total: 1,008 hours • Computer environment – Cluster computer • 40 CPU (Intel Xeon 3.4Ghz, Main Memory: 1.0 GB) • Computation cost – CPU time: 133 days – Actual time: 4 days • Result – 3,597,943 pairs (40,928 unique segments) Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 22. 26 26 Automatic classification of classes • Classification rules Unique ND segment set – Features of near-duplicate Rebroadcast video segments within a unique segment set Advertisement • Appearance period • Appeared channels Sub- program • Appearance interval • Length of the segment Similar framing • Periodic or not Extracted Related • Extracted segment or not segment news Extracted segment Original segment Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 23. 27 27 Evaluation • 100 unique segment sets per class (61 sets for rebroadcast) Manual classification 1) 2) 3) 4) 5) 6) Misc. Automatic classification 1) Advertisement 92% 1% 2% 0% 0% 0% 5% 2) Related news 1% 51% 7% 0% 5% 17% 19% 3) Sub-program 0% 0% 65% 0% 2% 0% 33% 4) Rebroadcast 0% 0% 0% 36% 0% 0% 64% 5) Similar framing 0% 0% 0% 0% 63% 6% 31% 6) Extracted segment 1% 49% 2% 0% 0% 35% 13%  Accuracy: 57% Cover rate: 77% Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 24. 28 28 Future directions • Now we have structured the archives in various ways  Consider how to exploit the structure • Reorganize the video data based on an external “scenario” – News video archive  Wikipedia description  (Semi-)automatic Documentary generation – Cooking video archive  Plain recipe text  Multimedia supplementation to a text recipe … Workshop on Interactive Information Access Workshop on Interactive Information Access
  • 25. 29 29 Summary • Introduced works on analyzing the semantic structures in large-scale news video archives and interfaces for efficient understanding of its contents. Thanks to: • Nagoya Univ: Profs. Hiroshi Murase, Daisuke Deguchi Akira Ogawa, Yuji Shamoto, Tomoki Okuoka • NII: Profs. Shin’ichi Satoh, Norio Katayama, Hiroshi Mo • Gifu Shotoku Gakuen Univ.: Prof. Tomokazu Takahashi • NetCompass Ltd.: Tomoyoshi Kinoshita, Takeharu Haraigawa Funded by: • JSPS, MEXT, MRI Inc., Kayamori Information Science Fund, Hoso Bunka Foundation Workshop on Interactive Information Access Workshop on Interactive Information Access