SlideShare a Scribd company logo
1 of 43
Download to read offline
Advances in Semantic Analysis
       of Multimedia




Dr. Gerald Friedland
International Computer Science Institute
Berkeley, CA
friedland@icsi.berkeley.edu
The Internet Today




                     2
Internet Use Today




Raphaël Troncy: Linked Media: Weaving non-textual content into the Semantic Web, MozCamp, 03/2009.
                                                                                                3
Types of Videos




                  4
Addressable Market for
                          Enterprise Video Applications




          Security               Asset Tracking        QA/Operational Efficiency         Intelligent
        $1.2 Billion             $480m by 2010                 $700m                     Marketing
(Total Market $7.8B, 2005)
                              $4.0 Billion Commercially
                               (RFID in 2006 2.4B)        (source: Envysion,
   (Source: JP Freeman) (Total Asset protection $14.7B) Arrowsight, corporate
                                                                                            $200m
                                                                                   (source: T3CI corporate
($7B in 06. Source Lehman)(Source: Lehman report 2006)        analysis)                   analysis)




                                       BI                       Training                Government
       Compliance                    $400m
         $450m                                                   $600m
                          (Reporting and Analysis 4B)
   (source: JP Freeman)                                    (source: Forrester      (Intelligence, Defense,
                            (Total BI market $13.3B)
                                                          Enterprise Software        Homeland Security) 5
                          (source: IDC BI tools 03-08)
                                                              report 2005)
Multimedia Capabilities:
       1985


• Record
• Store
• Play
• Random Seek
• Annotate Manually


                                  6
Multimedia Capabilities:
       2009

• Record
• Store
• Stream
• Play
• Random Seek
• Annotate Manually

                                  7
Multimedia Capabilities:
      Wanted
       • Semantic Navigation
       • Search
       • Content Compare
       • Object Cut & Paste
       • Annotate Automatically
       • Infer over Content

=> Make multimedia “understandable”
for computers.
                                      8
Problems


•Multimedia data very dense manual
 annotation not feasable
•Multimedia content analysis is
 difficult and rarely good enough to
 create reliable products.


                                       9
My Research...
         Network                     Knowledge

     Semantic Web



         Context                    Understanding

  Semantic Computing



   Machine Learning                  Recognition

   Artificial Intelligence



         Filtering                    Features

  Signal/Text Processing



           Images           Audio     Video         Text
My Research...


Hypotheses:
• Multimedia content analysis works
  better when every cue is taken into
  account (eg. video AND audio).
• Semantic is enabled through
  context. Converts AI research into
  products.
Context
Sources of Context:
• Inclusion of prior knowledge
• Combination of algorithms
• Multimodality:
  – audio+video+...
  – extra hardware
• Human interaction
• ...

                                 12
Context as Key:
 Example 1



      →   Cut          Horse    →
          Paste   ^V   Meadow




Visual Object Extraction

                                    13
Simple Interactive
        Object Extraction (SIOX)


           →                   →




Image          User Input           Output


  Context delivered by human interaction
                                             14
SIOX: Algorithm Idea
                   Color Signatures from image retrieval:




Y. Rubner, C. Tomasi, and L. J. Guibas: The Earth Mover’s Distance as a Metric for Image
Retrieval. Int. Journal of Computer Vision, 40(2):99–121, 2000.


Idea: Instead of searching and image database, use Color
          Signatures to search inside an image.



                                                                                           15
SIOX in GIMP
             SIOX
            Button




G. Friedland, K. Jantz, T. Lenz, F. Wiesel, R. Rojas: “Object Cut and Paste in
Images and Videos”, International Journal of Semantic Computing Vol 1,
             No 2, pp. 221-247, World Scientific, USA, June 2007.            16
SIOX in Inkscape




                   17
SIOX in Blender




                  18
Extensions
Extracting multiple similar
objects at once:




          →




                              19
Sub-Pixel Refinement
      Problem: Spill colors and foreground
      disappearance



           →



Original          SIOX         GraphCut


           →



                                          20
Sub-Pixel Refinement
Detail Refinement Brush:
Coarse Interaction



                    →




                    →




                          21
VideoSIOX

1st Frame:




Subsequent
Frames:


             22
More Information



 http://www.siox.org




                       23
Shoesurfer




             24
Shoesurfer




             25
Shoesurfer




             26
Shoesurfer




             27
Shoesurfer




             28
Context as Key:
Example 2




                  29
Speaker Diarization: Who
            Spoke When?
            Audiotrack:


             Segmentation:




             Clustering:



G. Friedland, O. Vinyals, Y. Huang, C. Müller: “Prosodic and other Long-Term
Features for Speaker Diarization”, IEEE Transactions on Audio, Speech, and
Language Processing, Vol 17, No 5, pp 985--993, July 2009.
                                                                       30
Analyzing Meetings




                     31
Dominance Estimation
I Know You...



http://www.icsi.berkeley.edu/
~fractor/ioda_demo.avi




                                33
Narrative Theme Navigation




G. Friedland, L. Gottlieb, A. Janin: “Joke-o-mat: Browsing Sitcoms Punchline by
Punchline”, Proceedings of ACM Multimedia, Beijing, China, October 2009.
                                                                          34
Joke-O-Mat: Demo




http://www.youtube.com/watch?v=1qfa84Ulm5s




                                         35
Connecting Multimedia
and Semantic Technologies
   GStreamer

     Appscio
                   User
       Device   Component 1
       Driver
                   User
                Component 2
       Source                 Recorder
                    .
                    .
                    .
        File       User
                Component n




                                         36
Semantic Media
Framework
   Pipeline Framework
                                    Integrated
      C/C++/Java                   Development
       Interface                   Environment

                        Events             Code
     Custom Event
       Source 1
                             Video Application Server
                                  Web Technology
     Custom Event
                                    Interface
       Source 2
          .                  Scripting & Logic Engine
          .
          .
     Custom Event                Services Connector
       Source n


  http://www.appscio.com
                                                        37
Semantic Analysis of
Multimedia Data
• enables automatic logical
  inference on perceptually
  encoded data
• enables more “natural”
  interaction with the computer:
  “do what the user means”
• Interfaces nicely with Semantic
  Web technologies

                                    38
A note...




            James A. Hendler


                          39
MySTT



 Open-Source, open-model,
 state-of-the-art speech
 recognizer for multiparty
 conversations.

 Release Date: February 2010
                               40
4th IEEE International
  Conference on Semantic
  Computing 2010




Paper Deadline: May 3rd, 2010
                                41
Upcoming...




              42
Thank You!
Questions?
Contact:
Dr. Gerald Friedland
International Computer Science Institute
Berkeley, CA
http://www.gerald-friedland.org
friedland@icsi.berkeley.edu                43

More Related Content

Viewers also liked

Paper-Based Piezoelectric Touch Pads
Paper-Based Piezoelectric Touch PadsPaper-Based Piezoelectric Touch Pads
Paper-Based Piezoelectric Touch Pads
Vicky Wang
 
Touch paper presentation-tarek
Touch paper presentation-tarekTouch paper presentation-tarek
Touch paper presentation-tarek
Tarek Gaber
 
2009 01 0521 Sae Paper Smart Touch
2009 01 0521 Sae Paper Smart Touch2009 01 0521 Sae Paper Smart Touch
2009 01 0521 Sae Paper Smart Touch
zztdn3
 
The chemistry of meals, ready to-eat
The chemistry of meals, ready to-eatThe chemistry of meals, ready to-eat
The chemistry of meals, ready to-eat
Logan Van Eldik
 
Plasma deposited thermocouple
Plasma deposited thermocouplePlasma deposited thermocouple
Plasma deposited thermocouple
I'am Ajas
 
SMARCOS HIG Paper on Designing Touch Screen Interfaces
SMARCOS HIG Paper on Designing Touch Screen InterfacesSMARCOS HIG Paper on Designing Touch Screen Interfaces
SMARCOS HIG Paper on Designing Touch Screen Interfaces
Smarcos Eu
 

Viewers also liked (15)

PACER: Fine-grained Interactive Paper via Hybrid Camera and Touch Gestures on...
PACER: Fine-grained Interactive Paper via Hybrid Camera and Touch Gestures on...PACER: Fine-grained Interactive Paper via Hybrid Camera and Touch Gestures on...
PACER: Fine-grained Interactive Paper via Hybrid Camera and Touch Gestures on...
 
Paper-Based Piezoelectric Touch Pads
Paper-Based Piezoelectric Touch PadsPaper-Based Piezoelectric Touch Pads
Paper-Based Piezoelectric Touch Pads
 
SiOx Nanoparticals.PDF
SiOx Nanoparticals.PDFSiOx Nanoparticals.PDF
SiOx Nanoparticals.PDF
 
Touch paper presentation-tarek
Touch paper presentation-tarekTouch paper presentation-tarek
Touch paper presentation-tarek
 
2009 01 0521 Sae Paper Smart Touch
2009 01 0521 Sae Paper Smart Touch2009 01 0521 Sae Paper Smart Touch
2009 01 0521 Sae Paper Smart Touch
 
Biobased Biodegradable Food Packaging
Biobased Biodegradable Food PackagingBiobased Biodegradable Food Packaging
Biobased Biodegradable Food Packaging
 
107 yun-yu wang - 7538029 - method of room temperature growth of si ox on s...
107   yun-yu wang - 7538029 - method of room temperature growth of si ox on s...107   yun-yu wang - 7538029 - method of room temperature growth of si ox on s...
107 yun-yu wang - 7538029 - method of room temperature growth of si ox on s...
 
Films and paper tear strength tester
Films and paper tear strength testerFilms and paper tear strength tester
Films and paper tear strength tester
 
The chemistry of meals, ready to-eat
The chemistry of meals, ready to-eatThe chemistry of meals, ready to-eat
The chemistry of meals, ready to-eat
 
Plasma deposited thermocouple
Plasma deposited thermocouplePlasma deposited thermocouple
Plasma deposited thermocouple
 
Touch Channel Management White Paper
Touch Channel Management White PaperTouch Channel Management White Paper
Touch Channel Management White Paper
 
Thin Film Silicon Nanowire - Prof.Rusli
Thin Film Silicon Nanowire - Prof.RusliThin Film Silicon Nanowire - Prof.Rusli
Thin Film Silicon Nanowire - Prof.Rusli
 
SMARCOS HIG Paper on Designing Touch Screen Interfaces
SMARCOS HIG Paper on Designing Touch Screen InterfacesSMARCOS HIG Paper on Designing Touch Screen Interfaces
SMARCOS HIG Paper on Designing Touch Screen Interfaces
 
Nano technology based bio degradable plastics
Nano technology based bio degradable plasticsNano technology based bio degradable plastics
Nano technology based bio degradable plastics
 
Transparent Retort Pouches by Parikh Packaging Private Ltd., Gujarat, Ahmedabad
Transparent Retort Pouches by Parikh Packaging Private Ltd., Gujarat, AhmedabadTransparent Retort Pouches by Parikh Packaging Private Ltd., Gujarat, Ahmedabad
Transparent Retort Pouches by Parikh Packaging Private Ltd., Gujarat, Ahmedabad
 

Similar to Semantics And Multimedia

Future of technical innovation 3 trends that impact enterprise users
Future of technical innovation   3 trends that impact enterprise usersFuture of technical innovation   3 trends that impact enterprise users
Future of technical innovation 3 trends that impact enterprise users
John Gibbon
 
ITCamp 2012 - Tim Huckaby - Keynote
ITCamp 2012 - Tim Huckaby - KeynoteITCamp 2012 - Tim Huckaby - Keynote
ITCamp 2012 - Tim Huckaby - Keynote
ITCamp
 
I Minds2009 Future Media Prof Rik Van De Walle (Ibbt Mm Lab U Gent)
I Minds2009 Future Media  Prof  Rik Van De Walle (Ibbt Mm Lab U Gent)I Minds2009 Future Media  Prof  Rik Van De Walle (Ibbt Mm Lab U Gent)
I Minds2009 Future Media Prof Rik Van De Walle (Ibbt Mm Lab U Gent)
imec.archive
 
Luiz eduardo. introduction to mobile snitch
Luiz eduardo. introduction to mobile snitchLuiz eduardo. introduction to mobile snitch
Luiz eduardo. introduction to mobile snitch
Yury Chemerkin
 
Intel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick KnupfferIntel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick Knupffer
IntelAPAC
 
A Data Modelling Framework to Unify Cyber Security Knowledge
A Data Modelling Framework to Unify Cyber Security KnowledgeA Data Modelling Framework to Unify Cyber Security Knowledge
A Data Modelling Framework to Unify Cyber Security Knowledge
Vaticle
 

Similar to Semantics And Multimedia (20)

Future of technical innovation 3 trends that impact enterprise users
Future of technical innovation   3 trends that impact enterprise usersFuture of technical innovation   3 trends that impact enterprise users
Future of technical innovation 3 trends that impact enterprise users
 
Into the twilight zone innovations for education
Into the twilight zone innovations for educationInto the twilight zone innovations for education
Into the twilight zone innovations for education
 
OIT Technology, Communications, Japan
OIT Technology, Communications, JapanOIT Technology, Communications, Japan
OIT Technology, Communications, Japan
 
Vom PC zum Roboter
Vom PC zum RoboterVom PC zum Roboter
Vom PC zum Roboter
 
OW2 Community and more!
OW2 Community and more!OW2 Community and more!
OW2 Community and more!
 
DeMarle-MFAEmergent Media at Champlain College
DeMarle-MFAEmergent Media at Champlain CollegeDeMarle-MFAEmergent Media at Champlain College
DeMarle-MFAEmergent Media at Champlain College
 
ITCamp 2012 - Tim Huckaby - Keynote
ITCamp 2012 - Tim Huckaby - KeynoteITCamp 2012 - Tim Huckaby - Keynote
ITCamp 2012 - Tim Huckaby - Keynote
 
I Minds2009 Future Media Prof Rik Van De Walle (Ibbt Mm Lab U Gent)
I Minds2009 Future Media  Prof  Rik Van De Walle (Ibbt Mm Lab U Gent)I Minds2009 Future Media  Prof  Rik Van De Walle (Ibbt Mm Lab U Gent)
I Minds2009 Future Media Prof Rik Van De Walle (Ibbt Mm Lab U Gent)
 
I3master
I3masterI3master
I3master
 
AI: The New Player in Cybersecurity (Nov. 08, 2023)
AI: The New Player in Cybersecurity (Nov. 08, 2023)AI: The New Player in Cybersecurity (Nov. 08, 2023)
AI: The New Player in Cybersecurity (Nov. 08, 2023)
 
Luiz eduardo. introduction to mobile snitch
Luiz eduardo. introduction to mobile snitchLuiz eduardo. introduction to mobile snitch
Luiz eduardo. introduction to mobile snitch
 
Creating compelling user interfaces
Creating compelling user interfacesCreating compelling user interfaces
Creating compelling user interfaces
 
Frameworks2 go business insights delivered socially exponentiality & noiseles...
Frameworks2 go business insights delivered socially exponentiality & noiseles...Frameworks2 go business insights delivered socially exponentiality & noiseles...
Frameworks2 go business insights delivered socially exponentiality & noiseles...
 
Cognitive Digital Twin by Fariz Saračević
Cognitive Digital Twin by Fariz SaračevićCognitive Digital Twin by Fariz Saračević
Cognitive Digital Twin by Fariz Saračević
 
Intel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick KnupfferIntel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick Knupffer
 
Power of Social Collaboration and Business Technology Adoption
Power of Social Collaboration and Business Technology AdoptionPower of Social Collaboration and Business Technology Adoption
Power of Social Collaboration and Business Technology Adoption
 
Big Data Big Media the new paradigm of multimedia content management with Per...
Big Data Big Media the new paradigm of multimedia content management with Per...Big Data Big Media the new paradigm of multimedia content management with Per...
Big Data Big Media the new paradigm of multimedia content management with Per...
 
Tsunami of Technologies. Are we prepared?
Tsunami of Technologies. Are we prepared?Tsunami of Technologies. Are we prepared?
Tsunami of Technologies. Are we prepared?
 
A Data Modelling Framework to Unify Cyber Security Knowledge
A Data Modelling Framework to Unify Cyber Security KnowledgeA Data Modelling Framework to Unify Cyber Security Knowledge
A Data Modelling Framework to Unify Cyber Security Knowledge
 
Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012
Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012
Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 

Semantics And Multimedia