SlideShare a Scribd company logo
1 of 11
Lecture 12 What happens if we solve object recognition? 6.870 Object Recognition and Scene Understanding  http://people.csail.mit.edu/torralba/courses/6.870/6.870.recognition.htm
[object Object],[object Object],[object Object]
Polygons world Blocks world Imagine that object recognition and segmentation is solved, so, now what?
Here word recognition is solved, we can access the meaning of the words. But yet, we are far from having solved language understanding. Detecting objects is just one small piece of understanding scenes (it might not even be the hardest). Images and sequence tell stories, and the structure of those stories are as complex as sentences, paragraphs and books . “ I went to the airport by car, but it took me  a very long time because of the traffic.”
Images, with our current set of features, look more like strange sentences: I performed the action of going from one place to another in order to reach the point from where there are devices with wings in which people can get inside and perform the action of going from one place to another even when the other place is really far away. To perform the first action, I used another device that lacks wings and that, instead, has four round things attached to the sides. It took me a long time to complete the first action as there was many other people using the same device-with-four-round-things attached to them performing similar actions to me and trying to occupy the same space as me.
How to give a talk ,[object Object],[object Object]
How to give a talk ,[object Object],[object Object],[object Object]
How to give a talk ,[object Object],[object Object],[object Object],[object Object]
How to give a talk ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],The best advice I got came from Yair Weiss while preparing my job talk:  “ just give a good talk”
How to give the project class talk ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Next week Alec Rivers Scene Understanding Based on Object Relationships Gokberk Cinbis Category Level 3D Object Detection Using View-Invariant Representations Hueihan Jhuang and Sharat Chikkerur Video shot boundary detection using GIST representation Jenny Yuen Semiautomatic alignment of text and images Nathaniel R Twarog A Filtering Approach to Image Segmentation: Perceptual Grouping in Feature Space Nicolas Pinto Evaluating dense feature descriptor and multi-kernel learning for face detection/recognition  Tilke Judd and Vladimir Bychkovsky Identify the same people in different photographs from the same event Tom Kollar Context-based object priors for scene understanding Tom Ouyang Hand-Drawn Sketch Recognition, A Vision-Based Approach Papers due this Friday: send PDF by email

More Related Content

Similar to Mit6870 orsu lecture12

Monday Night, March 3rd, Visual Rhetoric
Monday Night, March 3rd, Visual RhetoricMonday Night, March 3rd, Visual Rhetoric
Monday Night, March 3rd, Visual Rhetoric
Miami University
 
Unstructure: Smashing the Boundaries of Data (SxSWi 2014)
Unstructure: Smashing the Boundaries of Data (SxSWi 2014)Unstructure: Smashing the Boundaries of Data (SxSWi 2014)
Unstructure: Smashing the Boundaries of Data (SxSWi 2014)
Ian Varley
 

Similar to Mit6870 orsu lecture12 (20)

A talk about talking!
A talk about talking!A talk about talking!
A talk about talking!
 
Concept design-second class
Concept design-second classConcept design-second class
Concept design-second class
 
The Role of Empathy in Design
The Role of Empathy in DesignThe Role of Empathy in Design
The Role of Empathy in Design
 
Seth Godin - BAD PowerPoint
Seth Godin - BAD PowerPointSeth Godin - BAD PowerPoint
Seth Godin - BAD PowerPoint
 
Concept design
Concept design Concept design
Concept design
 
Monday Night, March 3rd, Visual Rhetoric
Monday Night, March 3rd, Visual RhetoricMonday Night, March 3rd, Visual Rhetoric
Monday Night, March 3rd, Visual Rhetoric
 
PiL vct file
PiL vct filePiL vct file
PiL vct file
 
What Is Thesis In Essay.pdf
What Is Thesis In Essay.pdfWhat Is Thesis In Essay.pdf
What Is Thesis In Essay.pdf
 
Unpicking how (some) participatory design methods work (with older people)
Unpicking how (some) participatory design methods work (with older people)Unpicking how (some) participatory design methods work (with older people)
Unpicking how (some) participatory design methods work (with older people)
 
Unstructure: Smashing the Boundaries of Data (SxSWi 2014)
Unstructure: Smashing the Boundaries of Data (SxSWi 2014)Unstructure: Smashing the Boundaries of Data (SxSWi 2014)
Unstructure: Smashing the Boundaries of Data (SxSWi 2014)
 
FMP Proposal Y2.docx
FMP Proposal Y2.docxFMP Proposal Y2.docx
FMP Proposal Y2.docx
 
Would Plato love Lego, inspirED seminar, University of Dundee 17 April 2013,
Would Plato love Lego, inspirED seminar, University of Dundee 17 April 2013, Would Plato love Lego, inspirED seminar, University of Dundee 17 April 2013,
Would Plato love Lego, inspirED seminar, University of Dundee 17 April 2013,
 
3. proposal sf 2017
3. proposal sf 20173. proposal sf 2017
3. proposal sf 2017
 
15 research debrief
15 research debrief15 research debrief
15 research debrief
 
Why use video by david mann
Why use video by david mannWhy use video by david mann
Why use video by david mann
 
Design thinking, critical artefacts
Design thinking, critical artefactsDesign thinking, critical artefacts
Design thinking, critical artefacts
 
Presenting: structure story and support
Presenting: structure story and supportPresenting: structure story and support
Presenting: structure story and support
 
Planning booklet will wheatley
Planning booklet will wheatleyPlanning booklet will wheatley
Planning booklet will wheatley
 
Leaving Classrooms Behind - Interaction 18 - Education Summit (annotated)
Leaving Classrooms Behind - Interaction 18 - Education Summit (annotated)Leaving Classrooms Behind - Interaction 18 - Education Summit (annotated)
Leaving Classrooms Behind - Interaction 18 - Education Summit (annotated)
 
Assignment 03 video monologues
Assignment 03 video monologuesAssignment 03 video monologues
Assignment 03 video monologues
 

More from zukun

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009
zukun
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
zukun
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Information
zukun
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
zukun
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
zukun
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
zukun
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
zukun
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
zukun
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
zukun
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
zukun
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
zukun
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
zukun
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
zukun
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
zukun
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
zukun
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
zukun
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
zukun
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
zukun
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
zukun
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
zukun
 

More from zukun (20)

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Information
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
 

Recently uploaded

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 

Mit6870 orsu lecture12

  • 1. Lecture 12 What happens if we solve object recognition? 6.870 Object Recognition and Scene Understanding http://people.csail.mit.edu/torralba/courses/6.870/6.870.recognition.htm
  • 2.
  • 3. Polygons world Blocks world Imagine that object recognition and segmentation is solved, so, now what?
  • 4. Here word recognition is solved, we can access the meaning of the words. But yet, we are far from having solved language understanding. Detecting objects is just one small piece of understanding scenes (it might not even be the hardest). Images and sequence tell stories, and the structure of those stories are as complex as sentences, paragraphs and books . “ I went to the airport by car, but it took me a very long time because of the traffic.”
  • 5. Images, with our current set of features, look more like strange sentences: I performed the action of going from one place to another in order to reach the point from where there are devices with wings in which people can get inside and perform the action of going from one place to another even when the other place is really far away. To perform the first action, I used another device that lacks wings and that, instead, has four round things attached to the sides. It took me a long time to complete the first action as there was many other people using the same device-with-four-round-things attached to them performing similar actions to me and trying to occupy the same space as me.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11. Next week Alec Rivers Scene Understanding Based on Object Relationships Gokberk Cinbis Category Level 3D Object Detection Using View-Invariant Representations Hueihan Jhuang and Sharat Chikkerur Video shot boundary detection using GIST representation Jenny Yuen Semiautomatic alignment of text and images Nathaniel R Twarog A Filtering Approach to Image Segmentation: Perceptual Grouping in Feature Space Nicolas Pinto Evaluating dense feature descriptor and multi-kernel learning for face detection/recognition Tilke Judd and Vladimir Bychkovsky Identify the same people in different photographs from the same event Tom Kollar Context-based object priors for scene understanding Tom Ouyang Hand-Drawn Sketch Recognition, A Vision-Based Approach Papers due this Friday: send PDF by email