CS-498
Computer Vision
 Week 1, Day 1
 Computer Vision Examples
 Overview of the Course
 Introduction to Images
Computer Vision and Nearby Fields
 Computer Graphics: Models to Images
© Pixar
http://pichost.me/1578472/
Computer Vision and Nearby Fields (2)
 Computational Photography: Images to images
http://docs.opencv.org/trunk/doc/tutorials/photo/hdr_imaging
ng
Computer Vision: Images to models
Make computers “understand” images and video.
What kind of scene?
Where are the cars?
How far is the building?
…
images.google.com/
https://www.clarifai.com
http://cs.brown.edu/courses/cs1
3D from thousands of images
Building Rome in a Day: Agarwal et al. 2009
http://cs.brown.edu/courses/cs143/
Vision is really hard
 Vision is an amazing feat of natural intelligence
 Visual cortex occupies about 50% of the macaque brain
 More of the human brain is devoted to vision than to anything else
Is that a queen or a bishop?
Who is investing in vision?
 Microsoft
 Google
 NVIDIA
 Direct Supply
 …
Appendix
 What are examples of computer vision being used in the world?
Ridiculously brief history of computer vision
 1966: Minsky assigns computer vision as an undergrad summer project
 1960s: interpretation of synthetic worlds
 1970s: some progress on interpreting selected images
 1980s: ANNs come and go; shift toward geometry and increased mathematical rigor
 1990s: face recognition; statistical analysis in vogue
 2000s: broader recognition; large annotated datasets available; video processing starts
 2030s: robot uprising?
Guzman ’68
Ohta and Kanade ’78
Turk and Pentland ’91
How vision is used now
 Examples of state-of-the-art
Some of the following slides by Steve Seitz
Optical character recognition (OCR)
Technology to convert scanned documents to text
• If you have a scanner, it probably came with OCR software
Digit recognition, AT&T Labs
http://www.research.att.com/~yann/
License plate readers
http://en.wikipedia.org/wiki/Automatic_number_plate_recognition
Face detection
 Many new digital cameras now detect faces
 Canon, Sony, Fuji, …
Smile detection
Sony Cyber-shot® T70 Digital Still Camera
Object recognition (in supermarkets)
LaneHawk by Evolution Robotics
“A smart camera is flush-mounted in the checkout lane, continuously watching for items. When an item is detected and recognized, the cashier verifies the quantity of items that were found under the basket, and continues to close the transaction. The item can remain under the basket, and with LaneHawk, you are assured to get paid for it…”
Vision-based biometrics
“How the Afghan Girl [whose name is Sharbat Gula] was Identified by Her Iris Patterns”
Read the story
Wikipedia
Login without a password…
Fingerprint scanners on many new laptops, other devices
Face recognition systems now beginning to appear more widely
http://www.sensiblevision.com/
Object recognition (in mobile phones)
Point & Find, Nokia
Google Goggles
Special effects: shape capture
The Matrix movies, ESC Entertainment, XYZRGB, NRC
Special effects: motion capture
Pirates of the Caribbean, Industrial Light and Magic
Sports
Sportvision first down line
Nice explanation on www.howstuffworks.com
http://www.sportvision.com/video.html
Smart cars
 Mobileye
 Vision systems currently in high-end BMW, GM, Volvo models
 By 2010: 70% of car manufacturers.
Slide content courtesy of Amnon Shashua
Google cars
Oct 9, 2010. "Google Cars Drive Themselves, in Traffic". The New York Times. John Markoff
June 24, 2011. "Nevada state law paves the way for driverless cars". Financial Post. Christine Dobby
Aug 9, 2011. "Human error blamed after Google's driverless car sparks five-vehicle crash". The Star (Toronto)
Interactive Games: Kinect
 Object Recognition: http://www.youtube.com/watch?feature=iv&v=fQ59dXOo63o
 Mario: http://www.youtube.com/watch?v=8CTJL5lUjHg
 3D: http://www.youtube.com/watch?v=7QrnwoO1-8A
 Robot: http://www.youtube.com/watch?v=w8BmgtMKFbY
Vision in space
Vision systems (JPL) used for several tasks
• Panorama stitching
• 3D terrain modeling
• Obstacle detection, position tracking
• For more, read “Computer Vision on Mars” by Matthies et al.
NASA's Mars Exploration Rover Spirit captured this westward view from atop a low plateau where Spirit spent the closing months of 2007.
Industrial robots
Vision-guided robots position nut runners on wheels
Mobile robots
http://www.robocup.org/
NASA’s Mars Spirit Rover
http://en.wikipedia.org/wiki/Spirit_rover
Saxena et al. 2008
STAIR at Stanford
Medical imaging
Image guided surgery
Grimson et al., MIT
3D imaging
MRI, CT
