SlideShare a Scribd company logo
Computer Vision
Introduction
What is computer vision?
Terminator 2
Every picture tells a story
• Goal of computer vision is to write computer programs that can
interpret images
Can computers match (or beat) human vision?
• Yes and no (but mostly no!)
– humans are much better at “hard” things
– computers can be better at “easy” things
Optical illusions
Copyright A.Kitaoka 2003
Why is computer vision difficult?
• Inverse problem
• Ill-posed
• High-dimensional data
• Noise
• Variation
Earth viewers (3D modeling)
Image from Microsoft’s Virtual Earth
(see also: Google Earth)
Google streetview
Photosynth
http://labs.live.com/photosynth/
http://www.youtube.com/watch?v=p16frKJLVi0
by Noah Snavely, Steve Seitz, and Rick Szeliski
Optical character recognition
Digit recognition, AT&T labs
http://www.research.att.com/~yann/
Technology to convert scanned docs to text
• If you have a scanner, it probably came with OCR software
License plate readers
http://en.wikipedia.org/wiki/Automatic_number_plate_recognition
Face detection
• Many new digital cameras now detect faces
– Canon, Sony, Fuji, …
Smile detection
Sony Cyber-shot® T70 Digital Still Camera
Object recognition (in supermarkets)
LaneHawk by EvolutionRobotics
“A smart camera is flush-mounted in the checkout lane, continuously watching
for items. When an item is detected and recognized, the cashier verifies the
quantity of items that were found under the basket, and continues to close the
transaction. The item can remain under the basket, and with LaneHawk,you are
assured to get paid for it… “
Face recognition
Who is she?
Vision-based biometrics
“How the Afghan Girl was Identified by Her Iris Patterns” Read the story
Login without a password…
Fingerprint scanners on
many new laptops,
other devices
Face recognition systems now
beginning to appear more widely
http://www.sensiblevision.com/
Object recognition (in mobile phones)
• This is becoming real:
– Microsoft Research
– Point & Find, Nokia, NTT Docomo
The Matrix movies, ESC Entertainment, XYZRGB, NRC
Special effects: shape capture
Bullet time:
http://www.youtube.com/watch?v=J5ryLMZTO5M
Pirates of the Carribean, Industrial Light and Magic
Click here for interactive demo
Special effects: motion capture
Sports
Sportvision first down line
Nice explanation on www.howstuffworks.com
http://www.youtube.com/watch?v=UyPU2l9rdvo
Smart cars
• Mobileye
– Vision systems currently in high-end BMW, GM, Volvo models
– By 2010: 70% of car manufacturers.
– Video demo
Vision-based interaction (and games)
Nintendo Wii has camera-based IR
tracking built in. See Lee’s work at
CMU on clever tricks on using it to
create a multi-touch display!
Digimask: put your face on a 3D avatar.
“Game turns moviegoers into Human Joysticks”, CNET
Camera tracking a crowd, based on this work.
Vision-based HCI
• Reatrix: http://www.youtube.com/watch?v=QzsQKULMbiU
Gaming
• Sony Eyetoy • Microsoft Natal
http://www.youtube.com/watch?v=AOXoh
r4XE-4&feature=related
http://www.youtube.com/watch?v=1BRSf
CuLYHc
Motion capture
• Marker-based motion capture
– http://www.youtube.com/watch?v=V0yT8mwg9nc
• Organic motion
• http://www.organicmotion.com/
Looking at people
• Hand gesture
• Head pose
• Expression
• Identity
http://www.youtube.com/watch?v=NwVBzx0LMNQ
Vision in space
Vision systems (JPL) used for several tasks
• Panorama stitching
• 3D terrain modeling
• Obstacle detection, position tracking
• For more, read “Computer Vision on Mars” by Matthies et al.
NASA'S Mars Exploration Rover Spirit captured this westward view from atop
a low plateau where Spirit spent the closing months of 2007.
Gigapan
• http://www.gigapan.org/index.php
• HP TouchSmart with Gigapn demo at Chicago
O’Hare airport
Robotics
http://www.robocup.org/NASA’s Mars Spirit Rover
http://en.wikipedia.org/wiki/Spirit_rover
Medical imaging
Image guided surgery
Grimson et al., MIT
3D imaging
MRI, CT
Digital comestics
Inpainting
Bertalmio et al. SIGGRAPH 00
Debluring
Fergus et al. SIGGRAPH 06
Digital photo albums
• Picasa, Flickr, Photobucket, etc.
• Categorization
• Tagging
• Search
Computational photography
• Image acquisition
• Hardware/software
• Optics
• Shuttle speed
• Novel sensors
• Multiple camera
• Multiple shots
• Multi flash
• Applications: high dynamic range imaging, super resolution,
photomontage, panorama moasicing, debluring, light field,
camera projector system…
Image and video search
• Google
• YouTubes
• Microsoft
• Yahoo
Current state of the art
• You just saw examples of current systems.
– Many of these are less than 5 years old
• This is a very active research area, and rapidly changing
– Many new applications in the next 5 years
• To learn more about vision applications and companies
– David Lowe maintains an excellent overview of
vision companies
• http://www.cs.ubc.ca/spider/lowe/vision.html
• Confluence of vision, graphics, learning,
sensing and signal processing
Software and hardware
• Algorithms: processing images and videos
• Camera: acquiring images/videos
• Embedded system
Topics
• Image formation: camera model, camera calibration,
radiometry, color, shading
• Early vision: stereopsis, structure from motion, illumination,
reflectance, shape from X, texture
• Mid-level vision: segmentation, grouping, Kalman filter,
particle filter, shape representation
• High-level vision: correspondence, matching, object
detection, object recognition, visual tracking
• Recent topics: image and video retrieval, internet vision
Related topics
Textbooks and references
• Textbook
– Computer Vision: A Modern Approach, David Forsyth and Jean Ponce
– Computer Vision: Algorithms and Applications (draft), Richard Szeliski
• Reference for background study:
– Introductory Techniques for 3-D Computer Vision, Emanuele Trucco and Alessandro
Verri
– Multiple View Geometry in Computer Vision, Richard Hartley and Andrew Zisserman
– An Invitation to 3-D Vision by Yi Ma, Stefano Soatto, and Jana Kosecka
– Robot Vision, Berthold Horn
– Learning OpenCV: Computer Vision with OpenCV Library, Gary Bradski and Adrian
Kaehler
• Reading assignments will be from the text and additional material that will
be handed out or made available on the web page
• All lecture slides will be available on the course website
http://faculty.ucmerced.edu/mhyang/course/cse274/index.htm
Grading
• Based on projects
• No midterm or final
• 20% Homework
• 40% Programming assignments
• 40% Term project
Project 1: features
Project 2: Lucas-Kande Tracker
http://www.youtube.com/watch?v=yoQ8pSXrl4g
Project 3: object detection

More Related Content

What's hot

Comp4010 Lecture5 Interaction and Prototyping
Comp4010 Lecture5 Interaction and PrototypingComp4010 Lecture5 Interaction and Prototyping
Comp4010 Lecture5 Interaction and Prototyping
Mark Billinghurst
 
CHI 2015: The Glass Class - Designing Wearable Interfaces
CHI 2015: The Glass Class - Designing Wearable InterfacesCHI 2015: The Glass Class - Designing Wearable Interfaces
CHI 2015: The Glass Class - Designing Wearable Interfaces
Mark Billinghurst
 
Multimodal Multi-sensory Interaction for Mixed Reality
Multimodal Multi-sensory Interaction for Mixed RealityMultimodal Multi-sensory Interaction for Mixed Reality
Multimodal Multi-sensory Interaction for Mixed Reality
Mark Billinghurst
 
COMP Lecture1 - Introduction to Virtual Reality
COMP Lecture1 - Introduction to Virtual RealityCOMP Lecture1 - Introduction to Virtual Reality
COMP Lecture1 - Introduction to Virtual Reality
Mark Billinghurst
 
SVR2011 Keynote
SVR2011 KeynoteSVR2011 Keynote
SVR2011 Keynote
Mark Billinghurst
 
Solid 2014 kobayashi
Solid 2014 kobayashiSolid 2014 kobayashi
Solid 2014 kobayashi
Shigeru Kobayashi
 
Vision on Tap 6/21/2009
Vision on Tap 6/21/2009Vision on Tap 6/21/2009
Vision on Tap 6/21/2009
kevingc
 
Natural Interaction for Augmented Reality Applications
Natural Interaction for Augmented Reality ApplicationsNatural Interaction for Augmented Reality Applications
Natural Interaction for Augmented Reality Applications
Mark Billinghurst
 
The UX of Tomorrow: Designing for the Unknown by Jeff Feddersen
The UX of Tomorrow: Designing for the Unknown by Jeff FeddersenThe UX of Tomorrow: Designing for the Unknown by Jeff Feddersen
The UX of Tomorrow: Designing for the Unknown by Jeff Feddersen
Oxford Tech + UX
 
Siggraph 2014: The Glass Class - Designing Wearable Interfaces
Siggraph 2014: The Glass Class - Designing Wearable InterfacesSiggraph 2014: The Glass Class - Designing Wearable Interfaces
Siggraph 2014: The Glass Class - Designing Wearable Interfaces
Mark Billinghurst
 
Design for failure in the IoT: what could possibly go wrong?
Design for failure in the IoT: what could possibly go wrong?Design for failure in the IoT: what could possibly go wrong?
Design for failure in the IoT: what could possibly go wrong?
Claire Rowland
 
UX for Connected Products: EuroIA16
UX for Connected Products: EuroIA16UX for Connected Products: EuroIA16
UX for Connected Products: EuroIA16
Claire Rowland
 
UX Scotland 2018: Systems, discontinuities and thinking beyond UI: Key quest...
UX Scotland 2018: Systems, discontinuities and thinking beyond UI:  Key quest...UX Scotland 2018: Systems, discontinuities and thinking beyond UI:  Key quest...
UX Scotland 2018: Systems, discontinuities and thinking beyond UI: Key quest...
Claire Rowland
 
Foundations of Interaction Design
Foundations of Interaction DesignFoundations of Interaction Design
Foundations of Interaction Design
Karen McGrane
 
UX Day Mannheim: UX for systems of connected products
UX Day Mannheim: UX for systems of connected productsUX Day Mannheim: UX for systems of connected products
UX Day Mannheim: UX for systems of connected products
Claire Rowland
 
Mobile UX - the intricacies of designing for mobile devices
Mobile UX - the intricacies of designing for mobile devicesMobile UX - the intricacies of designing for mobile devices
Mobile UX - the intricacies of designing for mobile devices
Antony Ribot
 
A short history of interaction design
A short history of interaction designA short history of interaction design
A short history of interaction design
Remy Bourganel
 
Interusability: designing a coherent system UX
Interusability: designing a coherent system UXInterusability: designing a coherent system UX
Interusability: designing a coherent system UX
Claire Rowland
 
Glass' evolution, some suggestions
Glass' evolution, some suggestionsGlass' evolution, some suggestions
Glass' evolution, some suggestions
Russell Sullivan
 
Interaction modalities, technologies and tools for interactive art
Interaction modalities, technologies and tools for interactive artInteraction modalities, technologies and tools for interactive art
Interaction modalities, technologies and tools for interactive art
Jorge Cardoso
 

What's hot (20)

Comp4010 Lecture5 Interaction and Prototyping
Comp4010 Lecture5 Interaction and PrototypingComp4010 Lecture5 Interaction and Prototyping
Comp4010 Lecture5 Interaction and Prototyping
 
CHI 2015: The Glass Class - Designing Wearable Interfaces
CHI 2015: The Glass Class - Designing Wearable InterfacesCHI 2015: The Glass Class - Designing Wearable Interfaces
CHI 2015: The Glass Class - Designing Wearable Interfaces
 
Multimodal Multi-sensory Interaction for Mixed Reality
Multimodal Multi-sensory Interaction for Mixed RealityMultimodal Multi-sensory Interaction for Mixed Reality
Multimodal Multi-sensory Interaction for Mixed Reality
 
COMP Lecture1 - Introduction to Virtual Reality
COMP Lecture1 - Introduction to Virtual RealityCOMP Lecture1 - Introduction to Virtual Reality
COMP Lecture1 - Introduction to Virtual Reality
 
SVR2011 Keynote
SVR2011 KeynoteSVR2011 Keynote
SVR2011 Keynote
 
Solid 2014 kobayashi
Solid 2014 kobayashiSolid 2014 kobayashi
Solid 2014 kobayashi
 
Vision on Tap 6/21/2009
Vision on Tap 6/21/2009Vision on Tap 6/21/2009
Vision on Tap 6/21/2009
 
Natural Interaction for Augmented Reality Applications
Natural Interaction for Augmented Reality ApplicationsNatural Interaction for Augmented Reality Applications
Natural Interaction for Augmented Reality Applications
 
The UX of Tomorrow: Designing for the Unknown by Jeff Feddersen
The UX of Tomorrow: Designing for the Unknown by Jeff FeddersenThe UX of Tomorrow: Designing for the Unknown by Jeff Feddersen
The UX of Tomorrow: Designing for the Unknown by Jeff Feddersen
 
Siggraph 2014: The Glass Class - Designing Wearable Interfaces
Siggraph 2014: The Glass Class - Designing Wearable InterfacesSiggraph 2014: The Glass Class - Designing Wearable Interfaces
Siggraph 2014: The Glass Class - Designing Wearable Interfaces
 
Design for failure in the IoT: what could possibly go wrong?
Design for failure in the IoT: what could possibly go wrong?Design for failure in the IoT: what could possibly go wrong?
Design for failure in the IoT: what could possibly go wrong?
 
UX for Connected Products: EuroIA16
UX for Connected Products: EuroIA16UX for Connected Products: EuroIA16
UX for Connected Products: EuroIA16
 
UX Scotland 2018: Systems, discontinuities and thinking beyond UI: Key quest...
UX Scotland 2018: Systems, discontinuities and thinking beyond UI:  Key quest...UX Scotland 2018: Systems, discontinuities and thinking beyond UI:  Key quest...
UX Scotland 2018: Systems, discontinuities and thinking beyond UI: Key quest...
 
Foundations of Interaction Design
Foundations of Interaction DesignFoundations of Interaction Design
Foundations of Interaction Design
 
UX Day Mannheim: UX for systems of connected products
UX Day Mannheim: UX for systems of connected productsUX Day Mannheim: UX for systems of connected products
UX Day Mannheim: UX for systems of connected products
 
Mobile UX - the intricacies of designing for mobile devices
Mobile UX - the intricacies of designing for mobile devicesMobile UX - the intricacies of designing for mobile devices
Mobile UX - the intricacies of designing for mobile devices
 
A short history of interaction design
A short history of interaction designA short history of interaction design
A short history of interaction design
 
Interusability: designing a coherent system UX
Interusability: designing a coherent system UXInterusability: designing a coherent system UX
Interusability: designing a coherent system UX
 
Glass' evolution, some suggestions
Glass' evolution, some suggestionsGlass' evolution, some suggestions
Glass' evolution, some suggestions
 
Interaction modalities, technologies and tools for interactive art
Interaction modalities, technologies and tools for interactive artInteraction modalities, technologies and tools for interactive art
Interaction modalities, technologies and tools for interactive art
 

Similar to Introduction

vision.ppt
vision.pptvision.ppt
vision.ppt
ArunKumarS600928
 
vision_2.ppt
vision_2.pptvision_2.ppt
vision_2.ppt
nyomans1
 
vision.ppt
vision.pptvision.ppt
vision.ppt
nyomans1
 
Computer vision introduction
Computer vision  introduction Computer vision  introduction
Computer vision introduction
Wael Badawy
 
Overview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear IndustryOverview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear Industry
Tanvir Moin
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash Course
台灣資料科學年會
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash Course
Jia-Bin Huang
 
Intro
IntroIntro
01Introduction.pptx - C280, Computer Vision
01Introduction.pptx - C280, Computer Vision01Introduction.pptx - C280, Computer Vision
01Introduction.pptx - C280, Computer Vision
butest
 
Computer vision
Computer visionComputer vision
Computer vision
AnkitKamal6
 
Magic Leap Pitch (Development, Manufacturing and Launch Plans)
Magic Leap Pitch (Development, Manufacturing and Launch Plans)Magic Leap Pitch (Development, Manufacturing and Launch Plans)
Magic Leap Pitch (Development, Manufacturing and Launch Plans)
Nicholas Ng
 
Computer Vision
Computer VisionComputer Vision
Computer Vision
Nitin Sharma
 
1.pdf
1.pdf1.pdf
1.pdf
Tony Creat
 
Introduction talk to Computer Vision
Introduction talk to Computer Vision Introduction talk to Computer Vision
Introduction talk to Computer Vision
Chen Sagiv
 
Application in Augmented and Virtual Reality
Application in Augmented and Virtual RealityApplication in Augmented and Virtual Reality
Application in Augmented and Virtual Reality
Mark Billinghurst
 
01 cie552 introduction
01 cie552 introduction01 cie552 introduction
01 cie552 introduction
Elsayed Hemayed
 
ICS1020 CV
ICS1020 CVICS1020 CV
ICS1020 CV
Vanessa Camilleri
 
Lecture No. 1 introduction.pptx
Lecture No. 1 introduction.pptxLecture No. 1 introduction.pptx
Lecture No. 1 introduction.pptx
AlifahadHussain
 
Lecture 1 computer vision introduction
Lecture 1 computer vision introductionLecture 1 computer vision introduction
Lecture 1 computer vision introduction
cairo university
 
OpenCV
OpenCVOpenCV

Similar to Introduction (20)

vision.ppt
vision.pptvision.ppt
vision.ppt
 
vision_2.ppt
vision_2.pptvision_2.ppt
vision_2.ppt
 
vision.ppt
vision.pptvision.ppt
vision.ppt
 
Computer vision introduction
Computer vision  introduction Computer vision  introduction
Computer vision introduction
 
Overview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear IndustryOverview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear Industry
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash Course
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash Course
 
Intro
IntroIntro
Intro
 
01Introduction.pptx - C280, Computer Vision
01Introduction.pptx - C280, Computer Vision01Introduction.pptx - C280, Computer Vision
01Introduction.pptx - C280, Computer Vision
 
Computer vision
Computer visionComputer vision
Computer vision
 
Magic Leap Pitch (Development, Manufacturing and Launch Plans)
Magic Leap Pitch (Development, Manufacturing and Launch Plans)Magic Leap Pitch (Development, Manufacturing and Launch Plans)
Magic Leap Pitch (Development, Manufacturing and Launch Plans)
 
Computer Vision
Computer VisionComputer Vision
Computer Vision
 
1.pdf
1.pdf1.pdf
1.pdf
 
Introduction talk to Computer Vision
Introduction talk to Computer Vision Introduction talk to Computer Vision
Introduction talk to Computer Vision
 
Application in Augmented and Virtual Reality
Application in Augmented and Virtual RealityApplication in Augmented and Virtual Reality
Application in Augmented and Virtual Reality
 
01 cie552 introduction
01 cie552 introduction01 cie552 introduction
01 cie552 introduction
 
ICS1020 CV
ICS1020 CVICS1020 CV
ICS1020 CV
 
Lecture No. 1 introduction.pptx
Lecture No. 1 introduction.pptxLecture No. 1 introduction.pptx
Lecture No. 1 introduction.pptx
 
Lecture 1 computer vision introduction
Lecture 1 computer vision introductionLecture 1 computer vision introduction
Lecture 1 computer vision introduction
 
OpenCV
OpenCVOpenCV
OpenCV
 

Recently uploaded

Philippine Edukasyong Pantahanan at Pangkabuhayan (EPP) Curriculum
Philippine Edukasyong Pantahanan at Pangkabuhayan (EPP) CurriculumPhilippine Edukasyong Pantahanan at Pangkabuhayan (EPP) Curriculum
Philippine Edukasyong Pantahanan at Pangkabuhayan (EPP) Curriculum
MJDuyan
 
HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.
deepaannamalai16
 
Standardized tool for Intelligence test.
Standardized tool for Intelligence test.Standardized tool for Intelligence test.
Standardized tool for Intelligence test.
deepaannamalai16
 
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptxChapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Denish Jangid
 
Leveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit InnovationLeveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit Innovation
TechSoup
 
Electric Fetus - Record Store Scavenger Hunt
Electric Fetus - Record Store Scavenger HuntElectric Fetus - Record Store Scavenger Hunt
Electric Fetus - Record Store Scavenger Hunt
RamseyBerglund
 
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptxBeyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
EduSkills OECD
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
Nicholas Montgomery
 
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching AptitudeUGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
S. Raj Kumar
 
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptxPrésentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
siemaillard
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
Nguyen Thanh Tu Collection
 
Bonku-Babus-Friend by Sathyajith Ray (9)
Bonku-Babus-Friend by Sathyajith Ray  (9)Bonku-Babus-Friend by Sathyajith Ray  (9)
Bonku-Babus-Friend by Sathyajith Ray (9)
nitinpv4ai
 
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdfREASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
giancarloi8888
 
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skillsspot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
haiqairshad
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
Jean Carlos Nunes Paixão
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
HajraNaeem15
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
History of Stoke Newington
 
How to Setup Warehouse & Location in Odoo 17 Inventory
How to Setup Warehouse & Location in Odoo 17 InventoryHow to Setup Warehouse & Location in Odoo 17 Inventory
How to Setup Warehouse & Location in Odoo 17 Inventory
Celine George
 
A Visual Guide to 1 Samuel | A Tale of Two Hearts
A Visual Guide to 1 Samuel | A Tale of Two HeartsA Visual Guide to 1 Samuel | A Tale of Two Hearts
A Visual Guide to 1 Samuel | A Tale of Two Hearts
Steve Thomason
 
Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47
MysoreMuleSoftMeetup
 

Recently uploaded (20)

Philippine Edukasyong Pantahanan at Pangkabuhayan (EPP) Curriculum
Philippine Edukasyong Pantahanan at Pangkabuhayan (EPP) CurriculumPhilippine Edukasyong Pantahanan at Pangkabuhayan (EPP) Curriculum
Philippine Edukasyong Pantahanan at Pangkabuhayan (EPP) Curriculum
 
HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.
 
Standardized tool for Intelligence test.
Standardized tool for Intelligence test.Standardized tool for Intelligence test.
Standardized tool for Intelligence test.
 
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptxChapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptx
 
Leveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit InnovationLeveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit Innovation
 
Electric Fetus - Record Store Scavenger Hunt
Electric Fetus - Record Store Scavenger HuntElectric Fetus - Record Store Scavenger Hunt
Electric Fetus - Record Store Scavenger Hunt
 
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptxBeyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
 
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching AptitudeUGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
 
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptxPrésentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
 
Bonku-Babus-Friend by Sathyajith Ray (9)
Bonku-Babus-Friend by Sathyajith Ray  (9)Bonku-Babus-Friend by Sathyajith Ray  (9)
Bonku-Babus-Friend by Sathyajith Ray (9)
 
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdfREASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
 
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skillsspot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
 
How to Setup Warehouse & Location in Odoo 17 Inventory
How to Setup Warehouse & Location in Odoo 17 InventoryHow to Setup Warehouse & Location in Odoo 17 Inventory
How to Setup Warehouse & Location in Odoo 17 Inventory
 
A Visual Guide to 1 Samuel | A Tale of Two Hearts
A Visual Guide to 1 Samuel | A Tale of Two HeartsA Visual Guide to 1 Samuel | A Tale of Two Hearts
A Visual Guide to 1 Samuel | A Tale of Two Hearts
 
Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47
 

Introduction

  • 2. What is computer vision? Terminator 2
  • 3. Every picture tells a story • Goal of computer vision is to write computer programs that can interpret images
  • 4. Can computers match (or beat) human vision? • Yes and no (but mostly no!) – humans are much better at “hard” things – computers can be better at “easy” things
  • 6. Why is computer vision difficult? • Inverse problem • Ill-posed • High-dimensional data • Noise • Variation
  • 7. Earth viewers (3D modeling) Image from Microsoft’s Virtual Earth (see also: Google Earth)
  • 10. Optical character recognition Digit recognition, AT&T labs http://www.research.att.com/~yann/ Technology to convert scanned docs to text • If you have a scanner, it probably came with OCR software License plate readers http://en.wikipedia.org/wiki/Automatic_number_plate_recognition
  • 11. Face detection • Many new digital cameras now detect faces – Canon, Sony, Fuji, …
  • 12. Smile detection Sony Cyber-shot® T70 Digital Still Camera
  • 13. Object recognition (in supermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching for items. When an item is detected and recognized, the cashier verifies the quantity of items that were found under the basket, and continues to close the transaction. The item can remain under the basket, and with LaneHawk,you are assured to get paid for it… “
  • 15. Vision-based biometrics “How the Afghan Girl was Identified by Her Iris Patterns” Read the story
  • 16. Login without a password… Fingerprint scanners on many new laptops, other devices Face recognition systems now beginning to appear more widely http://www.sensiblevision.com/
  • 17. Object recognition (in mobile phones) • This is becoming real: – Microsoft Research – Point & Find, Nokia, NTT Docomo
  • 18. The Matrix movies, ESC Entertainment, XYZRGB, NRC Special effects: shape capture Bullet time: http://www.youtube.com/watch?v=J5ryLMZTO5M
  • 19. Pirates of the Carribean, Industrial Light and Magic Click here for interactive demo Special effects: motion capture
  • 20. Sports Sportvision first down line Nice explanation on www.howstuffworks.com http://www.youtube.com/watch?v=UyPU2l9rdvo
  • 21. Smart cars • Mobileye – Vision systems currently in high-end BMW, GM, Volvo models – By 2010: 70% of car manufacturers. – Video demo
  • 22. Vision-based interaction (and games) Nintendo Wii has camera-based IR tracking built in. See Lee’s work at CMU on clever tricks on using it to create a multi-touch display! Digimask: put your face on a 3D avatar. “Game turns moviegoers into Human Joysticks”, CNET Camera tracking a crowd, based on this work.
  • 23. Vision-based HCI • Reatrix: http://www.youtube.com/watch?v=QzsQKULMbiU
  • 24. Gaming • Sony Eyetoy • Microsoft Natal http://www.youtube.com/watch?v=AOXoh r4XE-4&feature=related http://www.youtube.com/watch?v=1BRSf CuLYHc
  • 25. Motion capture • Marker-based motion capture – http://www.youtube.com/watch?v=V0yT8mwg9nc • Organic motion • http://www.organicmotion.com/
  • 26. Looking at people • Hand gesture • Head pose • Expression • Identity http://www.youtube.com/watch?v=NwVBzx0LMNQ
  • 27. Vision in space Vision systems (JPL) used for several tasks • Panorama stitching • 3D terrain modeling • Obstacle detection, position tracking • For more, read “Computer Vision on Mars” by Matthies et al. NASA'S Mars Exploration Rover Spirit captured this westward view from atop a low plateau where Spirit spent the closing months of 2007.
  • 28. Gigapan • http://www.gigapan.org/index.php • HP TouchSmart with Gigapn demo at Chicago O’Hare airport
  • 29. Robotics http://www.robocup.org/NASA’s Mars Spirit Rover http://en.wikipedia.org/wiki/Spirit_rover
  • 30. Medical imaging Image guided surgery Grimson et al., MIT 3D imaging MRI, CT
  • 33. Debluring Fergus et al. SIGGRAPH 06
  • 34. Digital photo albums • Picasa, Flickr, Photobucket, etc. • Categorization • Tagging • Search
  • 35. Computational photography • Image acquisition • Hardware/software • Optics • Shuttle speed • Novel sensors • Multiple camera • Multiple shots • Multi flash • Applications: high dynamic range imaging, super resolution, photomontage, panorama moasicing, debluring, light field, camera projector system…
  • 36. Image and video search • Google • YouTubes • Microsoft • Yahoo
  • 37. Current state of the art • You just saw examples of current systems. – Many of these are less than 5 years old • This is a very active research area, and rapidly changing – Many new applications in the next 5 years • To learn more about vision applications and companies – David Lowe maintains an excellent overview of vision companies • http://www.cs.ubc.ca/spider/lowe/vision.html • Confluence of vision, graphics, learning, sensing and signal processing
  • 38. Software and hardware • Algorithms: processing images and videos • Camera: acquiring images/videos • Embedded system
  • 39. Topics • Image formation: camera model, camera calibration, radiometry, color, shading • Early vision: stereopsis, structure from motion, illumination, reflectance, shape from X, texture • Mid-level vision: segmentation, grouping, Kalman filter, particle filter, shape representation • High-level vision: correspondence, matching, object detection, object recognition, visual tracking • Recent topics: image and video retrieval, internet vision
  • 41. Textbooks and references • Textbook – Computer Vision: A Modern Approach, David Forsyth and Jean Ponce – Computer Vision: Algorithms and Applications (draft), Richard Szeliski • Reference for background study: – Introductory Techniques for 3-D Computer Vision, Emanuele Trucco and Alessandro Verri – Multiple View Geometry in Computer Vision, Richard Hartley and Andrew Zisserman – An Invitation to 3-D Vision by Yi Ma, Stefano Soatto, and Jana Kosecka – Robot Vision, Berthold Horn – Learning OpenCV: Computer Vision with OpenCV Library, Gary Bradski and Adrian Kaehler • Reading assignments will be from the text and additional material that will be handed out or made available on the web page • All lecture slides will be available on the course website http://faculty.ucmerced.edu/mhyang/course/cse274/index.htm
  • 42. Grading • Based on projects • No midterm or final • 20% Homework • 40% Programming assignments • 40% Term project
  • 44. Project 2: Lucas-Kande Tracker http://www.youtube.com/watch?v=yoQ8pSXrl4g
  • 45. Project 3: object detection