SlideShare a Scribd company logo
1 of 45
Download to read offline
Humans & Machines
                     collaborating on vision
                                     Pietro Perona
                           California Institute of Technology

                          NSF Workshop - Frontiers in Vision
                              Cambridge, 23 Aug 2011




Friday, August 26, 2011
“Collaborative vision’’ ?
                                        Pietro Perona
                              California Institute of Technology

                             NSF Workshop - Frontiers in Vision
                                 Cambridge, 23 Aug 2011




Friday, August 26, 2011
Objectives

                     • Sketch new area of research
                     • Sampler of initial work
                     • Drawing lessons
                     • Brainstorm: potential, way forward

Friday, August 26, 2011
Plan

                     • Define area (10’)
                     • Presentations (50’): Perona, Geman,
                          Grauman, Berg, Belongie
                     • Discussion (15’)


Friday, August 26, 2011
Definition



Friday, August 26, 2011
6
Friday, August 26, 2011
?


                          6
Friday, August 26, 2011
7


Friday, August 26, 2011
Friday, August 26, 2011
Friday, August 26, 2011
Friday, August 26, 2011
9
Friday, August 26, 2011
Lessons:
                     • Visual queries
                          •   Easy for humans

                          •   Difficult for machines

                     • Much information is available on line
                     • Pictures are digital dark matter
                     • Experts not providing visual knowledge
                                                 10
Friday, August 26, 2011
Unsupervised learning?




                                                          11
                          [Fergus et al., CVPR03]
Friday, August 26, 2011
Unsupervised learning?




                                                          11
                          [Fergus et al., CVPR03]
Friday, August 26, 2011
12
Friday, August 26, 2011
Friday, August 26, 2011
Throat




Friday, August 26, 2011
Throat




Friday, August 26, 2011
Visual knowledge




          Categorical (experts)   14   Task-oriented (practitioners)
Friday, August 26, 2011
World


                                                                         Ob




                                   n
                                                                           ser




                                    io
                                                                                 va




                                 at
                                                                                   tio




                               rv
                                                                                      n




                             se
                           Ob    Science,
                                                  Shared           Education                            Users
                                 expertise      knowledge
                 Experts




                                                                                             ers
                                                                                          sw

                                                                                                   es
                                                                                         An

                                                                                                 eri
                                             Models          Image




                                                                                              Qu
                                                           annotations




                                                                                                    Machine vision
                                  Annotators                 Automata                                 scientists
                                                      15
Friday, August 26, 2011
World


                                                                         Ob




                                   n
                                                                           ser




                                    io
                                                                                 va




                                 at
                                                                                   tio




                               rv
                                                                                      n




                             se
                           Ob    Science,
                                                  Shared           Education                            Users
                                 expertise      knowledge
                 Experts




                                                                                             ers
                                                                                          sw

                                                                                                   es
                                                                                         An

                                                                                                 eri
                                             Models          Image




                                                                                              Qu
                                                           annotations




                                                                                                    Machine vision
                                  Annotators                 Automata                                 scientists
                                                      15
Friday, August 26, 2011
World


                                                                         Ob




                                   n
                                                                           ser




                                    io
                                                                                 va




                                 at
                                                                                   tio




                               rv
                                                                                      n




                             se
                           Ob    Science,
                                                  Shared           Education                            Users
                                 expertise      knowledge
                 Experts




                                                                                             ers
                                                                                          sw

                                                                                                   es
                                                                                         An

                                                                                                 eri
                                             Models          Image




                                                                                              Qu
                                                           annotations




                                                                                                    Machine vision
                                  Annotators                 Automata                                 scientists
                                                      15
Friday, August 26, 2011
World


                                                                         Ob




                                   n
                                                                           ser




                                    io
                                                                                 va




                                 at
                                                                                   tio




                               rv
                                                                                      n




                             se
                           Ob    Science,
                                                  Shared           Education                            Users
                                 expertise      knowledge
                 Experts




                                                                                             ers
                                                                                          sw

                                                                                                   es
                                                                                         An

                                                                                                 eri
                                             Models          Image




                                                                                              Qu
                                                           annotations




                                                                                                    Machine vision
                                  Annotators                 Automata                                 scientists
                                                      15
Friday, August 26, 2011
Some progress...



Friday, August 26, 2011
DUCKS Waterbirds
                              Mallard         American Black Duck




               Canada Goose         Red Necked Grebe          Clutter




Friday, August 26, 2011
Multidimensional signals and annotators
                                                   p(xi | zi = 1)
                                               2
                                              xi


                                   1    2
                          xi =   (xi , xi )                  1
                                                            xi



                   p(xi | zi = 0)
Friday, August 26, 2011
Multidimensional signals and annotators
                                                   p(xi | zi = 1)
                                               2
                                              xi


                                   1    2
                          xi =   (xi , xi )                  1
                                                            xi



                   p(xi | zi = 0)
Friday, August 26, 2011
Multidimensional signals and annotators
                                                        p(xi | zi = 1)
                                               2
                                              xi


                                   1    2
                          xi =   (xi , xi )                          1
                                                                    xi
                                                   τj            1    2
                                                        wj =   (wj , wj )



                   p(xi | zi = 0)
Friday, August 26, 2011
α                      γ
                     Full model                       annotators


                                              σj           wj         τj
                          β   θz
                                                                           M



                      zi      xi              yij                     lij


                                   Ji          ij
              images                    N   labels                     |Lij |

                                                     [Welinder et al., NIPS2010]
Friday, August 26, 2011
Is there a duck in the image?
   2
  xi




                                                           1
                                                          xi
Friday, August 26, 2011
Is there a duck in the image?
   2
  xi




                                                           1
                                                          xi
Friday, August 26, 2011
Is there a duck in the image?
   2
  xi




                                                           1
                                                          xi
Friday, August 26, 2011
Is there a duck in the image?
   2
  xi




                                                           1
                                                          xi
Friday, August 26, 2011
Is there a duck in the image?
   2
  xi




                                                           1
                                                          xi
Friday, August 26, 2011
Is there a duck in the image?
   2
  xi




                                                           1
                                                          xi
Friday, August 26, 2011
Is there a duck in the image?
   2
  xi




                                                           1
                                                          xi
Friday, August 26, 2011
Concluding...



Friday, August 26, 2011
Collaborative vision
   100%
            Automation




                0%             Performance   100%

Friday, August 26, 2011
Collaborative vision
   100%
            Automation




                0%             Performance   100%

Friday, August 26, 2011
Collaborative vision
   100%
            Automation




                0%             Performance   100%

Friday, August 26, 2011
Collaborative vision
   100%
            Automation




                0%             Performance   100%

Friday, August 26, 2011
Collaborative vision
   100%
                                      +
            Automation




                                 Applications
                                 Training data
                                      -
                                 Complexity
                                 Cost

                0%             Performance       100%

Friday, August 26, 2011
World


                                                                         Ob




                                   n
                                                                           ser




                                    io
                                                                                 va




                                 at
                                                                                   tio




                               rv
                                                                                      n




                             se
                           Ob    Science,
                                                  Shared           Education                            Users
                                 expertise      knowledge
                 Experts




                                                                                             ers
                                                                                          sw

                                                                                                   es
                                                                                         An

                                                                                                 eri
                                             Models          Image




                                                                                              Qu
                                                           annotations




                                                                                                    Machine vision
                                  Annotators                 Automata                                 scientists
                                                      24
Friday, August 26, 2011
New research directions
             • Incremental learning
             • Models of human vision, decision, attention
             • Systems composed of machines and humans
             • Performance bounds (humans, machines)
             • Representations (human-machine-friendly)
             • Extracting visual knowledge from experts

Friday, August 26, 2011

More Related Content

More from zukun

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009zukun
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVzukun
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Informationzukun
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statisticszukun
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibrationzukun
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionzukun
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluationzukun
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-softwarezukun
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptorszukun
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectorszukun
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-introzukun
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video searchzukun
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video searchzukun
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video searchzukun
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learningzukun
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionzukun
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick startzukun
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysiszukun
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structureszukun
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities zukun
 

More from zukun (20)

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Information
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
 

Recently uploaded

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 

Recently uploaded (20)

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 

Fcv hum mach_perona

  • 1. Humans & Machines collaborating on vision Pietro Perona California Institute of Technology NSF Workshop - Frontiers in Vision Cambridge, 23 Aug 2011 Friday, August 26, 2011
  • 2. “Collaborative vision’’ ? Pietro Perona California Institute of Technology NSF Workshop - Frontiers in Vision Cambridge, 23 Aug 2011 Friday, August 26, 2011
  • 3. Objectives • Sketch new area of research • Sampler of initial work • Drawing lessons • Brainstorm: potential, way forward Friday, August 26, 2011
  • 4. Plan • Define area (10’) • Presentations (50’): Perona, Geman, Grauman, Berg, Belongie • Discussion (15’) Friday, August 26, 2011
  • 7. ? 6 Friday, August 26, 2011
  • 13. Lessons: • Visual queries • Easy for humans • Difficult for machines • Much information is available on line • Pictures are digital dark matter • Experts not providing visual knowledge 10 Friday, August 26, 2011
  • 14. Unsupervised learning? 11 [Fergus et al., CVPR03] Friday, August 26, 2011
  • 15. Unsupervised learning? 11 [Fergus et al., CVPR03] Friday, August 26, 2011
  • 20. Visual knowledge Categorical (experts) 14 Task-oriented (practitioners) Friday, August 26, 2011
  • 21. World Ob n ser io va at tio rv n se Ob Science, Shared Education Users expertise knowledge Experts ers sw es An eri Models Image Qu annotations Machine vision Annotators Automata scientists 15 Friday, August 26, 2011
  • 22. World Ob n ser io va at tio rv n se Ob Science, Shared Education Users expertise knowledge Experts ers sw es An eri Models Image Qu annotations Machine vision Annotators Automata scientists 15 Friday, August 26, 2011
  • 23. World Ob n ser io va at tio rv n se Ob Science, Shared Education Users expertise knowledge Experts ers sw es An eri Models Image Qu annotations Machine vision Annotators Automata scientists 15 Friday, August 26, 2011
  • 24. World Ob n ser io va at tio rv n se Ob Science, Shared Education Users expertise knowledge Experts ers sw es An eri Models Image Qu annotations Machine vision Annotators Automata scientists 15 Friday, August 26, 2011
  • 26. DUCKS Waterbirds Mallard American Black Duck Canada Goose Red Necked Grebe Clutter Friday, August 26, 2011
  • 27. Multidimensional signals and annotators p(xi | zi = 1) 2 xi 1 2 xi = (xi , xi ) 1 xi p(xi | zi = 0) Friday, August 26, 2011
  • 28. Multidimensional signals and annotators p(xi | zi = 1) 2 xi 1 2 xi = (xi , xi ) 1 xi p(xi | zi = 0) Friday, August 26, 2011
  • 29. Multidimensional signals and annotators p(xi | zi = 1) 2 xi 1 2 xi = (xi , xi ) 1 xi τj 1 2 wj = (wj , wj ) p(xi | zi = 0) Friday, August 26, 2011
  • 30. α γ Full model annotators σj wj τj β θz M zi xi yij lij Ji ij images N labels |Lij | [Welinder et al., NIPS2010] Friday, August 26, 2011
  • 31. Is there a duck in the image? 2 xi 1 xi Friday, August 26, 2011
  • 32. Is there a duck in the image? 2 xi 1 xi Friday, August 26, 2011
  • 33. Is there a duck in the image? 2 xi 1 xi Friday, August 26, 2011
  • 34. Is there a duck in the image? 2 xi 1 xi Friday, August 26, 2011
  • 35. Is there a duck in the image? 2 xi 1 xi Friday, August 26, 2011
  • 36. Is there a duck in the image? 2 xi 1 xi Friday, August 26, 2011
  • 37. Is there a duck in the image? 2 xi 1 xi Friday, August 26, 2011
  • 39. Collaborative vision 100% Automation 0% Performance 100% Friday, August 26, 2011
  • 40. Collaborative vision 100% Automation 0% Performance 100% Friday, August 26, 2011
  • 41. Collaborative vision 100% Automation 0% Performance 100% Friday, August 26, 2011
  • 42. Collaborative vision 100% Automation 0% Performance 100% Friday, August 26, 2011
  • 43. Collaborative vision 100% + Automation Applications Training data - Complexity Cost 0% Performance 100% Friday, August 26, 2011
  • 44. World Ob n ser io va at tio rv n se Ob Science, Shared Education Users expertise knowledge Experts ers sw es An eri Models Image Qu annotations Machine vision Annotators Automata scientists 24 Friday, August 26, 2011
  • 45. New research directions • Incremental learning • Models of human vision, decision, attention • Systems composed of machines and humans • Performance bounds (humans, machines) • Representations (human-machine-friendly) • Extracting visual knowledge from experts Friday, August 26, 2011