Adaptive Video and Metadata Display using Multimedia Documents
Cyril Concolato
ACM MM 2010 / SAPMIA Workshop, 29/10/2010

Personalized Video Viewing with ROI: Related Works
  Previous works
    “The big picture on small screens delivering acceptable video quality in mobile TV”, Knoche et al., TOMCCAP 2009
      Discusses the best zooming factor depending on the content
    “Adding dynamic visual manipulations to declarative multimedia documents”, Kuijk et al., DocEng 2009
      Zooming onto pictures and creating animated camera motions
    “Animated Picture Presentation Steered by Natural Language”, Reiterer et al., UCMedia 2009
      Virtual camera motion driven by ROI and textual description
  More recent works @ ACM MM 2010
    “Crowd-sourced Automatic Zoom and Scroll for Video Retargeting”, Carlier et al.
      Learning the ROI from user interaction, and creating a retargeted video based on the ROI
    “Impact of Zooming and Enhancing Region of Interests for Optimizing User Experience on Mobile Sports Video”, Song et al.
      User study on the usefulness of ROI for improving the user experience
    “Video Retargeting for Aesthetic Enhancement”, Xiang et al.
      Automatic ROI detection and video creation

Our approach vs. related works
  Automatic ROI detection (RWTH Aachen)
    Similar to existing works with specific detection
  Differentiated H.264/AVC encoding (IBBT-MMLAB)
    Balanced encoding between background and ROIs
  Use of a rich media document
    To display video
    To let the user select an ROI and choose whether or not to zoom
    To show additional metadata with adaptation features
  Reference: “Annotation based personalized adaptation and presentation of videos for mobile applications”, S. De Bruyne, P. Hosten, C. Concolato, M. Asbach, J. De Cock, M. Unger, J. Le Feuvre and R. Van de Walle, Multimedia Tools and Applications, 2011, DOI: 10.1007/s11042-010-0575-2.

Our System
  Principles
    Generate rich media documents from video annotations
      Based on semi-automatic annotations
      Based on templates
  Hierarchical Rich Media Documents
    MPEG-4 BIFS for synchronized & interactive ROI
    W3C SVG & JavaScript for adaptive metadata layout & interaction
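
The interactive ROI zoom mentioned above can be pictured as a plain 2D transform. The sketch below is only an illustration under stated assumptions: the function name `zoomToRoi` and the `{x, y, width, height}` rectangle shape are hypothetical, and the slides do not describe how the BIFS scene actually computes its zoom. It scales the video so that the selected ROI fills as much of the viewport as possible without distortion, then centers the ROI.

```javascript
// Hypothetical helper, not taken from the presented system.
// viewport: { width, height } of the display area, in pixels.
// roi: { x, y, width, height } of the region of interest, in video pixels.
function zoomToRoi(viewport, roi) {
  // Uniform scale: the largest factor for which the ROI still fits the viewport.
  const scale = Math.min(viewport.width / roi.width, viewport.height / roi.height);

  // Translation that maps the ROI center onto the viewport center
  // once the scale has been applied.
  const tx = viewport.width / 2 - scale * (roi.x + roi.width / 2);
  const ty = viewport.height / 2 - scale * (roi.y + roi.height / 2);

  return { scale, tx, ty };
}

// Usage: zoom a 320x240 viewport onto a 160x60 ROI.
const roiZoom = zoomToRoi(
  { width: 320, height: 240 },
  { x: 100, y: 50, width: 160, height: 60 }
);
console.log(roiZoom);
```

A video point (px, py) is then drawn at (scale * px + tx, scale * py + ty); returning to the unzoomed view amounts to using scale 1 and a zero translation.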

Adaptive Rich Media Documents
  Part of a global problem of media adaptation (e.g. MPEG-21 DIA)
  Specificities of documents
    Structured information (e.g. XML)
    The use of media
    The spatial organization (2D/3D, …)
    The temporal aspects (animations, synchronization, …)
    The interactive behavior (events, modifications)
  Existing methods for document adaptation
    Alternatives/switch between document branches
    Constraint solving
    Interpolation between key scenes (e.g. automatic layout, “artistic resizing”)
    Scalable documents

Example of spatial adaptation of Rich Media Documents

Our choices in this work
  Adaptation based on constraint solving
    Screen size, video size, quantity/type of metadata to display
    Author directives
      E.g. priority of text over images, relative positioning of elements, …
  Compiled into a JavaScript algorithm
    Included in the rich media document
    Executed at runtime
  Results
    Size and positions of metadata, font size, split of metadata over several pages, …
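
To make the constraint-based adaptation above more concrete, here is a minimal sketch of what such a runtime layout routine might look like. It is not the authors' algorithm: the names `layoutMetadata`, `paginate`, `itemHeight`, the `textBeforeImages`, `minFont` and `maxFont` directives, and the text-metrics approximations (average glyph width of half the font size, line height of 1.25 times the font size) are all assumptions made for illustration. The sketch picks the largest font size for which all metadata fits on one page of the metadata panel and, if even the minimum font size does not fit, splits the items over several pages.

```javascript
// Hypothetical sketch; names and metrics are illustrative assumptions.

// Height in pixels that one metadata item needs inside a panel of the given width.
function itemHeight(item, fontSize, panelWidth) {
  if (item.type === "image") {
    // Shrink images to the panel width, keep the aspect ratio, never upscale.
    const s = Math.min(1, panelWidth / item.width);
    return Math.ceil(item.height * s);
  }
  // Assumed text metrics: glyph width = fontSize / 2, line height = 1.25 * fontSize.
  const charsPerLine = Math.max(1, Math.floor((2 * panelWidth) / fontSize));
  const lines = Math.ceil(item.text.length / charsPerLine);
  return lines * Math.ceil((fontSize * 5) / 4);
}

// Stack items vertically, opening a new page when the next item would overflow.
// An item taller than the panel still gets a page of its own (and overflows it).
function paginate(ordered, fontSize, panel) {
  const pages = [];
  let current = [];
  let used = 0;
  for (const item of ordered) {
    const height = itemHeight(item, fontSize, panel.width);
    if (current.length > 0 && used + height > panel.height) {
      pages.push(current);
      current = [];
      used = 0;
    }
    current.push({ item, y: used, height });
    used += height;
  }
  if (current.length > 0) pages.push(current);
  return pages;
}

// panel: { width, height }; items: text or image descriptors;
// directives: { textBeforeImages, minFont, maxFont } (assumed author directives).
function layoutMetadata(panel, items, directives) {
  const ordered = [...items];
  if (directives.textBeforeImages) {
    // Author directive: text has priority over images (stable sort keeps relative order).
    const rank = (it) => (it.type === "text" ? 0 : 1);
    ordered.sort((a, b) => rank(a) - rank(b));
  }
  // Start from the largest font and shrink until everything fits on one page
  // or the minimum font size is reached.
  let fontSize = directives.maxFont;
  let pages = paginate(ordered, fontSize, panel);
  while (pages.length > 1 && fontSize > directives.minFont) {
    fontSize -= 1;
    pages = paginate(ordered, fontSize, panel);
  }
  return { fontSize, pages };
}

// Usage: one image and one short text in a 120x100 metadata panel.
const demoLayout = layoutMetadata(
  { width: 120, height: 100 },
  [
    { type: "image", width: 240, height: 120 },
    { type: "text", text: "x".repeat(40) },
  ],
  { textBeforeImages: true, minFont: 10, maxFont: 20 }
);
console.log(demoLayout.fontSize, demoLayout.pages.length);
```

A greedy search like this is far simpler than a general constraint solver, but it produces the same kinds of results listed on the slide: sizes and positions of metadata, a font size, and a split over pages.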

Video and Metadata Display Results
  Reference: Le Feuvre, J., Concolato, C., and Moissinac, J. 2007. GPAC: open source multimedia framework. In Proceedings of the 15th International Conference on Multimedia (Augsburg, Germany, September 25-29, 2007). MULTIMEDIA '07. ACM, New York, NY, 1009-1012. DOI: http://doi.acm.org/10.1145/1291233.1291452

Demonstrations

Conclusions and Future Work
  Functional proof of concept
    How media annotations can leverage document adaptation
    How different rich media languages can be mixed
    How user preferences expressed by interactions can drive the adaptation
  Many aspects can be improved
    Add more constraints
      Pixel density, screen orientation, …
    Improve the algorithm for constraint solving
      Better use of screen space
    Work on the user interface
      When ROIs do not last long enough to be clicked
      When many ROIs are present on the screen at the same time
      When the font size is too small
    User studies
  Future work
    Authoring of adaptive documents

Thank you for your attention!
Questions? Suggestions?
cyril.concolato@telecom-paristech.fr
