SlideShare a Scribd company logo
RELIVING ON DEMAND: A TOTAL VIEWER
EXPERIENCE

           Vivek K. Singh1*, Jiebo Luo2, Dhiraj Joshi2,
          Phoury Lei2, Madirakshi Das2, Peter Stubler2


                              1 University
                              of California, Irvine,
            2 Kodak Research Laboratories, Rochester, NY,




            ACM International Conference on Multimedia – ACMM 2011

1             * Work was done when the author was interning at Kodak Research Laboratories, Eastman Kodak Company, Rochester, NY, USA.
Why do people take pictures?

 1. Digital re-living




 2. Sharing it with
   family and friends
What’s available today?

• Commercial Slideshows (Picasa, iPhoto, ACDsee):
  • Focus on visual appearance only.
  • Don’t understand/utilize semantics (except “FaceMovie”)
• Research efforts: Semantic analysis
  • No interaction
  • Interaction on demand
• Allow different users to dynamically re-direct the flow of
  media reliving experience
Platforms
   Desktop
   Digital frame
   HDTV
   Kodak Gallery
   Mobile
   Kiosk
Preview
 • Re-living of events in user’s life, based on WHO,
   WHERE, and WHEN .
Outline
 • Preview
 • Design principles
 • System design
 • Under the hood (sneak peek)
 • Evaluations
Design principles
 1. User controllable:
    • Responsive to user demand (overcoming intent gap)
 2. Semantically drivable:
    • Events as organizing units
    • Who, when, where; what
 3. Aesthetically pleasing:
    • Dynamic presentation
    • Multimodal (songs, images, videos)
Retrieval vs. Browsing vs. Reliving

• Media by itself is uninteresting unless it performs a
  function (e.g. reliving, sharing) for the human user
• Retrieval
  • Fetching data. Strong intent (e.g. search)
• Browsing
  • Piecemeal reliving. Weak intent (e.g. youtube)
• Reliving
  • Valuable middle ground.
  • Semantically re-direct the flow if desired.
System overview
System overview: Approach
Media data structure

                                                          Media
                               URL                      properties
                   Type              Height, width

                                                        Aesthetic
                                        Aesthetic IVI   properties
        location

                    subjects         dateTime           Semantic
                                                        properties


                                          Score         Suitability
                                                        properties
Pre-processing
                                       Media
                                      Collection




       Date and Time          Aesthetics Value           Face Detection    Location Information
        Extraction              Extraction                                      Extraction



                                                         Face Clustering


      Event Clustering                             Face Labeling
                                                                               Geographic
                                                                               Clustering



                         Metadata
                         Repository
Reordering of event list

• Basic idea


• Time


• People


• Location
Choosing layout
 • Default:




i=     2          3   4   5
Choose transitions

• If (criteria=time || criteria=loc)
   • Slide In/Out
• If (criteria=personi)
   • Face2Face transition



  Transform(θ1, trans.X                Transform(θ2, trans.X
  1,                                   2,
  trans.Y 1, scale 1)                  trans.Y2, scale 2)
Choose song

• If (criteria=time)
   • Select seasonal songs (easily extensible to finer grain)
• If (criteria=loc)
   • Select regional songs
• If (criteria=personi)
   • Select age-based songs (easily extensible to gender)
• Taken from a library of available songs
Show images
 • In time order
 • Higher score => more display time
 • Auto-zoom-crop
   • Find center to focus on
   • Match the aspect ratio required
 • Multiple Holes in transitions
   • Token passing amongst holes
   • Representative image as background
Logging user sessions
     <Interaction>
                      <Click>
                                    <GlobalEventID>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</GlobalEventID>
                                    <SortedEventID>0</SortedEventID
                                    <TimeStamp>10:17:47 AM</TimeStamp>
                                    <Criteria_type>gps</Criteria_type>
                                    <Criteria_value>61.2175937710438 , -149.898739309764</Criteria_value>
                                    <HotSpotClick>False</HotSpotClick>
                      </Click>
                      <Snapshot>
                                    <Locations>
                                                      <loc>-149.898739309764,61.2175937710438</loc>
                                                      <loc>-73.508556462585,40.5956603174603</loc>
                                                      <loc>102.757525301205,25.1018832329317</loc>
                                                      <loc>104.195397,35.86166</loc>
                                                      <loc>6.09306585111111,52.7236709366667</loc>
                                    </Locations>
                                    <People>
                                                      <peo>Jiebo</peo>
                                                      <peo>Joyce</peo>
                                                      <peo>Xinping</peo>
                                                      <peo></peo>
                                                      <peo></peo>
                                    </People>
                                    <SortedEvents>
                                                      <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve>
                                                      <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve>
                                                      <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve>
                                                      <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve>
                                                      <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve>
                                                      <eve></eve>
                                    </SortedEvents>
                                    <PicsShown>
                                                      <pic>c:datajiebocvpr2008103_5972.jpg</pic>
                                                      <pic>c:datajiebocvpr2008103_5973.jpg</pic>
                                                      <pic>c:datajiebolijiang-shangrila-day2108_0043.jpg</pic>
                                                      <pic>c:datajiebolijiang-shangrila-day2108_0044.jpg</pic>
                                    </PicsShown>
                      </Snapshot>
     </Interaction>
Evaluations
• Experiments with 11 families
• 35 user interaction sessions logged
          Age of contributing photographers         23 to 56

          No. of images/ videos in the collection   2,091 to 10,522

          No. of calendar years in time span        3 to 10

          No. of tagged people in the collection    26 to 137

          No. of places in the collection           19 to 45

• Roles
  • 1st person (owner)
  • 2nd person (immediate family)
  • 3rd person (friends, cousins )
Experiment 1: Comparison with commercially available
options
6.2 Experiment 2: Use of different features across
different user demographics
         Females   1.14      1.49            1.13         1.01
         Males     1.41      1.25            2.08         1.43
          Both     1.30      1.27            1.28         1.35
                   All     1st party       2nd party    3rd party
                                                       Active Vs Passive?




         Clicks per axis               Stickiness :Time spent after clicks
Future work
 • Choosing songs more generically/smartly
 • Choosing optimal spatio-temporal placement of
   images in the slide show
   • Choosing layout
   • Choosing transition time?
 • Supporting multiple axes simultaneously
 • Previews

More Related Content

Similar to Reliving on demand a total viewer experience

A Hierarchical Deep Temporal Model for Group Activity Recognition (CVPR16)
A Hierarchical Deep Temporal Model for Group Activity Recognition (CVPR16)A Hierarchical Deep Temporal Model for Group Activity Recognition (CVPR16)
A Hierarchical Deep Temporal Model for Group Activity Recognition (CVPR16)
Mostafa Saad Ibrahim
 
Fast, Cheap, and Actionable: Creating an Affordable User Research Program
Fast, Cheap, and Actionable: Creating an Affordable User Research ProgramFast, Cheap, and Actionable: Creating an Affordable User Research Program
Fast, Cheap, and Actionable: Creating an Affordable User Research Program
Michael Powers
 
Geospatial for Java
Geospatial for JavaGeospatial for Java
Geospatial for Java
Jody Garnett
 
The Seven Wastes of Software Development
The Seven Wastes of Software DevelopmentThe Seven Wastes of Software Development
The Seven Wastes of Software Development
Matt Stine
 
3D打印:從想像到現實
3D打印:從想像到現實3D打印:從想像到現實
3D打印:從想像到現實
Tonny Kuo
 
Understand Your Festival
Understand Your FestivalUnderstand Your Festival
Understand Your Festival
festivalslab
 
Class 5
Class 5Class 5
Class 5
Ana Jofre
 
Android Talks #05 - UI optimization of Android applications
Android Talks #05 - UI optimization of Android applicationsAndroid Talks #05 - UI optimization of Android applications
Android Talks #05 - UI optimization of Android applications
Infinum
 
UMBC IFSM438 Project Management Group Presentation
UMBC IFSM438 Project Management Group PresentationUMBC IFSM438 Project Management Group Presentation
UMBC IFSM438 Project Management Group Presentation
The Integral Worm
 
Improve the communication between an expert and a layman through interactive ...
Improve the communication between an expert and a layman through interactive ...Improve the communication between an expert and a layman through interactive ...
Improve the communication between an expert and a layman through interactive ...
KU Leuven
 
Presentation Selan dos Santos 4Eyes Lab
Presentation Selan dos Santos 4Eyes LabPresentation Selan dos Santos 4Eyes Lab
Presentation Selan dos Santos 4Eyes Lab
selan_rds
 
Final Year Major Project Report ( Year 2010-2014 Batch )
Final Year Major Project Report ( Year 2010-2014 Batch )Final Year Major Project Report ( Year 2010-2014 Batch )
Final Year Major Project Report ( Year 2010-2014 Batch )
parul6792
 
Алексей Ященко и Ярослав Волощук "False simplicity of front-end applications"
Алексей Ященко и Ярослав Волощук "False simplicity of front-end applications"Алексей Ященко и Ярослав Волощук "False simplicity of front-end applications"
Алексей Ященко и Ярослав Волощук "False simplicity of front-end applications"
Fwdays
 
아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션
아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션
아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션
DACON AI 데이콘
 
Human Action Recognition Using 3D Joint Information and HOOFD Features
Human Action Recognition Using 3D Joint Information and HOOFD FeaturesHuman Action Recognition Using 3D Joint Information and HOOFD Features
Human Action Recognition Using 3D Joint Information and HOOFD Features
Barış Üstündağ
 
7 (+/- 2) Steps to Agility
7 (+/- 2) Steps to Agility7 (+/- 2) Steps to Agility
7 (+/- 2) Steps to Agility
Tim Gifford
 
Geo-referenced human-activity-data; access, processing and knowledge extraction
Geo-referenced human-activity-data; access, processing and knowledge extractionGeo-referenced human-activity-data; access, processing and knowledge extraction
Geo-referenced human-activity-data; access, processing and knowledge extraction
Conor Mc Elhinney
 
EBImage - Short Overview
EBImage - Short OverviewEBImage - Short Overview
EBImage - Short Overview
Charles Howard
 
Capital Go 2018 - College Advising with Facial Recognition
Capital Go 2018 - College Advising with Facial RecognitionCapital Go 2018 - College Advising with Facial Recognition
Capital Go 2018 - College Advising with Facial Recognition
Janice Gluck
 

Similar to Reliving on demand a total viewer experience (19)

A Hierarchical Deep Temporal Model for Group Activity Recognition (CVPR16)
A Hierarchical Deep Temporal Model for Group Activity Recognition (CVPR16)A Hierarchical Deep Temporal Model for Group Activity Recognition (CVPR16)
A Hierarchical Deep Temporal Model for Group Activity Recognition (CVPR16)
 
Fast, Cheap, and Actionable: Creating an Affordable User Research Program
Fast, Cheap, and Actionable: Creating an Affordable User Research ProgramFast, Cheap, and Actionable: Creating an Affordable User Research Program
Fast, Cheap, and Actionable: Creating an Affordable User Research Program
 
Geospatial for Java
Geospatial for JavaGeospatial for Java
Geospatial for Java
 
The Seven Wastes of Software Development
The Seven Wastes of Software DevelopmentThe Seven Wastes of Software Development
The Seven Wastes of Software Development
 
3D打印:從想像到現實
3D打印:從想像到現實3D打印:從想像到現實
3D打印:從想像到現實
 
Understand Your Festival
Understand Your FestivalUnderstand Your Festival
Understand Your Festival
 
Class 5
Class 5Class 5
Class 5
 
Android Talks #05 - UI optimization of Android applications
Android Talks #05 - UI optimization of Android applicationsAndroid Talks #05 - UI optimization of Android applications
Android Talks #05 - UI optimization of Android applications
 
UMBC IFSM438 Project Management Group Presentation
UMBC IFSM438 Project Management Group PresentationUMBC IFSM438 Project Management Group Presentation
UMBC IFSM438 Project Management Group Presentation
 
Improve the communication between an expert and a layman through interactive ...
Improve the communication between an expert and a layman through interactive ...Improve the communication between an expert and a layman through interactive ...
Improve the communication between an expert and a layman through interactive ...
 
Presentation Selan dos Santos 4Eyes Lab
Presentation Selan dos Santos 4Eyes LabPresentation Selan dos Santos 4Eyes Lab
Presentation Selan dos Santos 4Eyes Lab
 
Final Year Major Project Report ( Year 2010-2014 Batch )
Final Year Major Project Report ( Year 2010-2014 Batch )Final Year Major Project Report ( Year 2010-2014 Batch )
Final Year Major Project Report ( Year 2010-2014 Batch )
 
Алексей Ященко и Ярослав Волощук "False simplicity of front-end applications"
Алексей Ященко и Ярослав Волощук "False simplicity of front-end applications"Алексей Ященко и Ярослав Волощук "False simplicity of front-end applications"
Алексей Ященко и Ярослав Волощук "False simplicity of front-end applications"
 
아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션
아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션
아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션
 
Human Action Recognition Using 3D Joint Information and HOOFD Features
Human Action Recognition Using 3D Joint Information and HOOFD FeaturesHuman Action Recognition Using 3D Joint Information and HOOFD Features
Human Action Recognition Using 3D Joint Information and HOOFD Features
 
7 (+/- 2) Steps to Agility
7 (+/- 2) Steps to Agility7 (+/- 2) Steps to Agility
7 (+/- 2) Steps to Agility
 
Geo-referenced human-activity-data; access, processing and knowledge extraction
Geo-referenced human-activity-data; access, processing and knowledge extractionGeo-referenced human-activity-data; access, processing and knowledge extraction
Geo-referenced human-activity-data; access, processing and knowledge extraction
 
EBImage - Short Overview
EBImage - Short OverviewEBImage - Short Overview
EBImage - Short Overview
 
Capital Go 2018 - College Advising with Facial Recognition
Capital Go 2018 - College Advising with Facial RecognitionCapital Go 2018 - College Advising with Facial Recognition
Capital Go 2018 - College Advising with Facial Recognition
 

Recently uploaded

AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
IndexBug
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
Zilliz
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 

Recently uploaded (20)

AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 

Reliving on demand a total viewer experience

  • 1. RELIVING ON DEMAND: A TOTAL VIEWER EXPERIENCE Vivek K. Singh1*, Jiebo Luo2, Dhiraj Joshi2, Phoury Lei2, Madirakshi Das2, Peter Stubler2 1 University of California, Irvine, 2 Kodak Research Laboratories, Rochester, NY, ACM International Conference on Multimedia – ACMM 2011 1 * Work was done when the author was interning at Kodak Research Laboratories, Eastman Kodak Company, Rochester, NY, USA.
  • 2. Why do people take pictures? 1. Digital re-living 2. Sharing it with family and friends
  • 3. What’s available today? • Commercial Slideshows (Picasa, iPhoto, ACDsee): • Focus on visual appearance only. • Don’t understand/utilize semantics (except “FaceMovie”) • Research efforts: Semantic analysis • No interaction • Interaction on demand • Allow different users to dynamically re-direct the flow of media reliving experience
  • 4. Platforms  Desktop  Digital frame  HDTV  Kodak Gallery  Mobile  Kiosk
  • 5. Preview • Re-living of events in user’s life, based on WHO, WHERE, and WHEN .
  • 6. Outline • Preview • Design principles • System design • Under the hood (sneak peek) • Evaluations
  • 7. Design principles 1. User controllable: • Responsive to user demand (overcoming intent gap) 2. Semantically drivable: • Events as organizing units • Who, when, where; what 3. Aesthetically pleasing: • Dynamic presentation • Multimodal (songs, images, videos)
  • 8. Retrieval vs. Browsing vs. Reliving • Media by itself is uninteresting unless it performs a function (e.g. reliving, sharing) for the human user • Retrieval • Fetching data. Strong intent (e.g. search) • Browsing • Piecemeal reliving. Weak intent (e.g. youtube) • Reliving • Valuable middle ground. • Semantically re-direct the flow if desired.
  • 11. Media data structure Media URL properties Type Height, width Aesthetic Aesthetic IVI properties location subjects dateTime Semantic properties Score Suitability properties
  • 12. Pre-processing Media Collection Date and Time Aesthetics Value Face Detection Location Information Extraction Extraction Extraction Face Clustering Event Clustering Face Labeling Geographic Clustering Metadata Repository
  • 13. Reordering of event list • Basic idea • Time • People • Location
  • 14. Choosing layout • Default: i= 2 3 4 5
  • 15. Choose transitions • If (criteria=time || criteria=loc) • Slide In/Out • If (criteria=personi) • Face2Face transition Transform(θ1, trans.X Transform(θ2, trans.X 1, 2, trans.Y 1, scale 1) trans.Y2, scale 2)
  • 16. Choose song • If (criteria=time) • Select seasonal songs (easily extensible to finer grain) • If (criteria=loc) • Select regional songs • If (criteria=personi) • Select age-based songs (easily extensible to gender) • Taken from a library of available songs
  • 17. Show images • In time order • Higher score => more display time • Auto-zoom-crop • Find center to focus on • Match the aspect ratio required • Multiple Holes in transitions • Token passing amongst holes • Representative image as background
  • 18. Logging user sessions <Interaction> <Click> <GlobalEventID>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</GlobalEventID> <SortedEventID>0</SortedEventID <TimeStamp>10:17:47 AM</TimeStamp> <Criteria_type>gps</Criteria_type> <Criteria_value>61.2175937710438 , -149.898739309764</Criteria_value> <HotSpotClick>False</HotSpotClick> </Click> <Snapshot> <Locations> <loc>-149.898739309764,61.2175937710438</loc> <loc>-73.508556462585,40.5956603174603</loc> <loc>102.757525301205,25.1018832329317</loc> <loc>104.195397,35.86166</loc> <loc>6.09306585111111,52.7236709366667</loc> </Locations> <People> <peo>Jiebo</peo> <peo>Joyce</peo> <peo>Xinping</peo> <peo></peo> <peo></peo> </People> <SortedEvents> <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve> <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve> <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve> <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve> <eve>urn:guid:f1337996-3c28-4345-b4fb-c4f1b788fc05</eve> <eve></eve> </SortedEvents> <PicsShown> <pic>c:datajiebocvpr2008103_5972.jpg</pic> <pic>c:datajiebocvpr2008103_5973.jpg</pic> <pic>c:datajiebolijiang-shangrila-day2108_0043.jpg</pic> <pic>c:datajiebolijiang-shangrila-day2108_0044.jpg</pic> </PicsShown> </Snapshot> </Interaction>
  • 19. Evaluations • Experiments with 11 families • 35 user interaction sessions logged Age of contributing photographers 23 to 56 No. of images/ videos in the collection 2,091 to 10,522 No. of calendar years in time span 3 to 10 No. of tagged people in the collection 26 to 137 No. of places in the collection 19 to 45 • Roles • 1st person (owner) • 2nd person (immediate family) • 3rd person (friends, cousins )
  • 20. Experiment 1: Comparison with commercially available options
  • 21. 6.2 Experiment 2: Use of different features across different user demographics Females 1.14 1.49 1.13 1.01 Males 1.41 1.25 2.08 1.43 Both 1.30 1.27 1.28 1.35 All 1st party 2nd party 3rd party Active Vs Passive? Clicks per axis Stickiness :Time spent after clicks
  • 22. Future work • Choosing songs more generically/smartly • Choosing optimal spatio-temporal placement of images in the slide show • Choosing layout • Choosing transition time? • Supporting multiple axes simultaneously • Previews

Editor's Notes

  1. People don’t want to see images, they want to re-live and share the experiences