SlideShare a Scribd company logo
1 of 85
CSCI 455: Intro to Computer Vision
Acknowledgement
Many of the following slides are modified from the
excellent class notes of similar courses offered in
other schools by Prof Yung-Yu Chuang, Fredo
Durand, Alexei Efros, William Freeman, James Hays,
Svetlana Lazebnik, Andrej Karpathy, Fei-Fei Li,
Srinivasa Narasimhan, Silvio Savarese, Steve Seitz,
Richard Szeliski, Noah Snavely and Li Zhang. The
instructor is extremely thankful to the researchers
for making their notes available online. Please feel
free to use and modify any of the slides, but
acknowledge the original sources where
appropriate.
Today
1. What is computer vision?
2. Course overview
3. Image filtering
Today
• Readings
– Szeliski, Chapter 1 (Introduction)
Every image tells a story
• Goal of computer vision:
perceive the “story”
behind the picture
• Compute properties of
the world
– 3D shape
– Names of people or
objects
– What happened?
The goal of computer vision
Can the computer match human
perception?
• Yes and no (mainly no)
– computers can be better at
“easy” things
– humans are much better at
“hard” things
• But huge progress has
been made
– Accelerating in the last 4
years due to deep learning
– What is considered “hard”
keeps changing
Human perception has its
shortcomings
Sinha and Poggio, Nature, 1996
(“The Presidential Illusion”
But humans can tell a lot about a
scene from a little information…
Source: “80 million tiny images” by Torralba, et al.
The goal of computer vision
The goal of computer vision
The goal of computer vision
• Compute the 3D shape of the world
The goal of computer vision
• Computing the 3D shape of the world
Internet Photos (“Colosseum”) Reconstructed 3D cameras
and points
Dense 3D model
The goal of computer vision
• Recognize objects and people
Terminator 2, 1991
slide credit: Fei-Fei, Fergus & Torralba
sky
building
flag
wall
banner
bus
cars
bus
face
street lamp
slide credit: Fei-Fei, Fergus & Torralba
The goal of computer vision
• “Enhance” images
The goal of computer vision
• Forensics
Source: Nayar and Nishino, “Eyes for Relighting”
Source: Nayar and Nishino, “Eyes for Relighting”
Source: Nayar and Nishino, “Eyes for Relighting”
The goal of computer vision
• Improve photos (“Computational Photography”)
Inpainting / image completion
(image credit: Hays and Efros)
Super-resolution (source: 2d3)
Low-light photography
(credit: Hasinoff et al., SIGGRAPH ASIA 2016)
Depth of field on cell phone camera
(source: Google Research Blog)
Why study computer vision?
• Billions of images/videos captured per day
• Huge number of useful applications
• The next slides show the current state of the art
Optical character recognition (OCR)
Digit recognition, AT&T labs (1990’s)
http://yann.lecun.com/exdb/lenet/
• If you have a scanner, it probably came with OCR software
License plate readers
http://en.wikipedia.org/wiki/Automatic_number_plate_recognition
Automatic check processing
Sudoku grabber
http://sudokugrab.blogspot.com/
Face detection
• Nearly all cameras detect faces in real time
– (Why?)
Face Recognition
Face recognition
Who is she? Source: S. Seitz
Vision-based biometrics
“How the Afghan Girl was Identified by Her Iris Patterns” Read the story
Source: S. Seitz
Login without a password
Fingerprint scanners on
many new smartphones
and other devices
Face unlock on Apple iPhone X
See also http://www.sensiblevision.com/
Object recognition (in supermarkets)
LaneHawk by EvolutionRobotics
“A smart camera is flush-mounted in the checkout lane, continuously watching
for items. When an item is detected and recognized, the cashier verifies the
quantity of items that were found under the basket, and continues to close the
transaction. The item can remain under the basket, and with LaneHawk,you are
assured to get paid for it… “
Source: S. Seitz
Object recognition (in mobile phones)
Source: S. Seitz
iPhone Apps: (www.kooaba.com)
Source: S. Lazebnik
Google Goggles
Google Search by Image
Leaf Recognition
Bird Identification
Merlin Bird ID (based on Cornell Tech technology!)
Special effects: camera tracking
Boujou, 2d3
The Matrix movies, ESC Entertainment, XYZRGB, NRC
Special effects: shape capture
Source: S. Seitz
Pirates of the Carribean, Industrial Light and Magic
Special effects: motion capture
Source: S. Seitz
3D face tracking w/ consumer cameras
Snapchat Lenses
Face2Face system (Thies et al.)
Image synthesis
Karras, et al., Progressive Growing of GANs for Improved Quality, Stability, and Variation, ICLR 2018
Image synthesis
Zhu, et al., Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks, ICCV 2017
Sports
Sportvision first down line
Nice explanation on www.howstuffworks.com
Source: S. Seitz
Vision-based interaction (and games)
Nintendo Wii has camera-based IR
tracking built in. See Lee’s work at
CMU on clever tricks on using it to
create a multi-touch display!
Assistive technologies
Kinect
Smart cars
• Mobileye
• Tesla Autopilot
• Safety features in many high-end cars
Self-driving cars
Google Waymo
Vision in space
Vision systems (JPL) uses for several tasks
• Panorama stitching
• 3D terrain modeling
• Obstacle detection, position tracking
• For more, read “Computer Vision on Mars” by Matthies et al.
The Heights of Mount Sharp
http://www.nasa.gov/mission_pages/msl/multimedia/pia16077.html
Panorama captured by Curiosity Rover, August 18, 2012 (Sol 12)
Robotics
NASA’s Mars Curiosity Rover
https://en.wikipedia.org/wiki/Curiosity_(rover)
Amazon Picking Challenge
http://www.robocup2016.org/en/events
/amazon-picking-challenge/
Amazon Prime Air
Medical imaging
3D imaging
(MRI, CT)
Skin cancer classification with deep learning
https://cs.stanford.edu/people/esteva/nature/
Virtual & Augmented Reality
6DoF head tracking Hand & body tracking
3D-360 video capture3D scene understanding
My own work
• Automatic 3D reconstruction from Internet
photo collections
“Statue of Liberty”
3D model
Flickr photos
“Half Dome, Yosemite” “Colosseum, Rome”
Photosynth
City-scale reconstruction
Reconstruction of Dubrovnik, Croatia, from ~40,000 images
Depth from a single image
Current state of the art
• You just saw many examples of current systems.
– Many of these are less than 5 years old
• This is a very active research area, and rapidly
changing
– Many new apps in the next 5 years
– Deep learning powering many modern applications
• Many startups across a dizzying array of areas
– Deep learning, robotics, autonomous vehicles, medical
imaging, construction, inspection, VR/AR, …
Why is computer vision difficult?
Viewpoint variation
Illumination
Scale
Why is computer vision difficult?
Intra-class variation
Background clutter
Motion (Source: S. Lazebnik)
Occlusion
Challenges: local ambiguity
slide credit: Fei-Fei, Fergus & Torralba
But there are lots of cues we can exploit…
Source: S. Lazebnik
Bottom line
• Perception is an inherently ambiguous problem
– Many different 3D scenes could have given rise to a
particular 2D picture
– We often need to use prior knowledge about the
structure of the world
Image source: F. Durand
Important notes
• Textbook:
Rick Szeliski, Computer Vision: Algorithms
and Applications
online at: http://szeliski.org/Book/
Course requirements
• Prerequisites—these are essential!
– Data structures
– A good working knowledge of Python programming
– Linear algebra
– Vector calculus
• Course does not assume prior imaging experience
– computer vision, image processing, graphics, etc.
Course overview (tentative)
1. Low-level vision
– image processing, edge detection,
feature detection, cameras, image
formation
2. Geometry and algorithms
– projective geometry, stereo,
structure from motion, optimization
3. Recognition
– face detection / recognition,
category recognition, segmentation
1. Low-level vision
• Basic image processing and image formation
Filtering, edge detection
* =
Feature extraction Image formation
Project: Hybrid images from image
pyramids
G 1/4
G 1/8
Gaussian 1/2
Project: Feature detection and matching
2. Geometry
Projective geometry
Stereo
Multi-view stereo Structure from motion
Project: Creating panoramas
Project: Single-View Modeling
Project: Photometric Stereo
3. Recognition
Sources: D. Lowe, L. Fei-Fei
Face detection and recognition
Single instance recognition
Category recognition
Project: Convolutional Neural Networks
4. Light, color, and reflectance
Light & Color Reflectance
5. Advanced topics: Internet Vision
Human-aided computer vision
Turning the camera around
Internet datasets
5. Advanced topics
Motion and trackingMonocular motion capture
Novel cameras and displays
3D scanning
Questions?

More Related Content

What's hot

What is computer vision?
What is computer vision?What is computer vision?
What is computer vision?Qentinel
 
Computer Vision Presentation Artificial Intelligence (AI)
Computer Vision Presentation Artificial Intelligence (AI)Computer Vision Presentation Artificial Intelligence (AI)
Computer Vision Presentation Artificial Intelligence (AI)AshTheMidBenchers
 
Introduction to Computer Vision.pdf
Introduction to Computer Vision.pdfIntroduction to Computer Vision.pdf
Introduction to Computer Vision.pdfKnoldus Inc.
 
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...Simplilearn
 
Face detection ppt
Face detection pptFace detection ppt
Face detection pptPooja R
 
Computer vision and Open CV
Computer vision and Open CVComputer vision and Open CV
Computer vision and Open CVChariza Pladin
 
A survey of deep learning approaches to medical applications
A survey of deep learning approaches to medical applicationsA survey of deep learning approaches to medical applications
A survey of deep learning approaches to medical applicationsJoseph Paul Cohen PhD
 
Presentation on "Computer vision".
Presentation on "Computer vision".Presentation on "Computer vision".
Presentation on "Computer vision".MobeenAhmed25
 
Introduction to Object recognition
Introduction to Object recognitionIntroduction to Object recognition
Introduction to Object recognitionAshiq Ullah
 

What's hot (20)

Computer vision
Computer vision Computer vision
Computer vision
 
Computer vision
Computer visionComputer vision
Computer vision
 
Computer vision
Computer visionComputer vision
Computer vision
 
What is computer vision?
What is computer vision?What is computer vision?
What is computer vision?
 
Computer vision ppt
Computer vision pptComputer vision ppt
Computer vision ppt
 
Computer vision ppt
Computer vision pptComputer vision ppt
Computer vision ppt
 
Computer Vision Presentation Artificial Intelligence (AI)
Computer Vision Presentation Artificial Intelligence (AI)Computer Vision Presentation Artificial Intelligence (AI)
Computer Vision Presentation Artificial Intelligence (AI)
 
Computer Vision
Computer VisionComputer Vision
Computer Vision
 
Computer vision
Computer visionComputer vision
Computer vision
 
Computer vision
Computer visionComputer vision
Computer vision
 
Introduction to Computer Vision.pdf
Introduction to Computer Vision.pdfIntroduction to Computer Vision.pdf
Introduction to Computer Vision.pdf
 
Computer vision
Computer visionComputer vision
Computer vision
 
Object Recognition
Object RecognitionObject Recognition
Object Recognition
 
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
 
Face detection ppt
Face detection pptFace detection ppt
Face detection ppt
 
Computer vision and Open CV
Computer vision and Open CVComputer vision and Open CV
Computer vision and Open CV
 
Computer Vision Introduction
Computer Vision IntroductionComputer Vision Introduction
Computer Vision Introduction
 
A survey of deep learning approaches to medical applications
A survey of deep learning approaches to medical applicationsA survey of deep learning approaches to medical applications
A survey of deep learning approaches to medical applications
 
Presentation on "Computer vision".
Presentation on "Computer vision".Presentation on "Computer vision".
Presentation on "Computer vision".
 
Introduction to Object recognition
Introduction to Object recognitionIntroduction to Object recognition
Introduction to Object recognition
 

Similar to Computer vision introduction

vision_2.ppt
vision_2.pptvision_2.ppt
vision_2.pptnyomans1
 
vision.ppt
vision.pptvision.ppt
vision.pptnyomans1
 
Introduction talk to Computer Vision
Introduction talk to Computer Vision Introduction talk to Computer Vision
Introduction talk to Computer Vision Chen Sagiv
 
01Introduction.pptx - C280, Computer Vision
01Introduction.pptx - C280, Computer Vision01Introduction.pptx - C280, Computer Vision
01Introduction.pptx - C280, Computer Visionbutest
 
Overview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear IndustryOverview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear IndustryTanvir Moin
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash CourseJia-Bin Huang
 
Materi_01_VK_2223_3.pdf
Materi_01_VK_2223_3.pdfMateri_01_VK_2223_3.pdf
Materi_01_VK_2223_3.pdfichsan6
 
Image Processing and Computer Vision in iOS
Image Processing and Computer Vision in iOSImage Processing and Computer Vision in iOS
Image Processing and Computer Vision in iOSOge Marques
 
Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...
Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...
Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...Skolkovo Robotics Center
 
Data Con LA 2019 - State of the Art of Innovation in Computer Vision by Chris...
Data Con LA 2019 - State of the Art of Innovation in Computer Vision by Chris...Data Con LA 2019 - State of the Art of Innovation in Computer Vision by Chris...
Data Con LA 2019 - State of the Art of Innovation in Computer Vision by Chris...Data Con LA
 
Promises of Deep Learning
Promises of Deep LearningPromises of Deep Learning
Promises of Deep LearningDavid Khosid
 
Scene classification using Convolutional Neural Networks - Jayani Withanawasam
Scene classification using Convolutional Neural Networks - Jayani WithanawasamScene classification using Convolutional Neural Networks - Jayani Withanawasam
Scene classification using Convolutional Neural Networks - Jayani WithanawasamWithTheBest
 

Similar to Computer vision introduction (20)

Introduction
IntroductionIntroduction
Introduction
 
vision.ppt
vision.pptvision.ppt
vision.ppt
 
vision_2.ppt
vision_2.pptvision_2.ppt
vision_2.ppt
 
vision.ppt
vision.pptvision.ppt
vision.ppt
 
Introduction talk to Computer Vision
Introduction talk to Computer Vision Introduction talk to Computer Vision
Introduction talk to Computer Vision
 
01Introduction.pptx - C280, Computer Vision
01Introduction.pptx - C280, Computer Vision01Introduction.pptx - C280, Computer Vision
01Introduction.pptx - C280, Computer Vision
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Overview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear IndustryOverview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear Industry
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash Course
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash Course
 
Intro
IntroIntro
Intro
 
Materi_01_VK_2223_3.pdf
Materi_01_VK_2223_3.pdfMateri_01_VK_2223_3.pdf
Materi_01_VK_2223_3.pdf
 
1.pdf
1.pdf1.pdf
1.pdf
 
Image Processing and Computer Vision in iOS
Image Processing and Computer Vision in iOSImage Processing and Computer Vision in iOS
Image Processing and Computer Vision in iOS
 
Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...
Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...
Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...
 
Data Con LA 2019 - State of the Art of Innovation in Computer Vision by Chris...
Data Con LA 2019 - State of the Art of Innovation in Computer Vision by Chris...Data Con LA 2019 - State of the Art of Innovation in Computer Vision by Chris...
Data Con LA 2019 - State of the Art of Innovation in Computer Vision by Chris...
 
Promises of Deep Learning
Promises of Deep LearningPromises of Deep Learning
Promises of Deep Learning
 
vision-1.ppt
vision-1.pptvision-1.ppt
vision-1.ppt
 
ICS1020 CV
ICS1020 CVICS1020 CV
ICS1020 CV
 
Scene classification using Convolutional Neural Networks - Jayani Withanawasam
Scene classification using Convolutional Neural Networks - Jayani WithanawasamScene classification using Convolutional Neural Networks - Jayani Withanawasam
Scene classification using Convolutional Neural Networks - Jayani Withanawasam
 

More from Wael Badawy

HTML introduction
HTML introduction HTML introduction
HTML introduction Wael Badawy
 
Np complete reductions
Np complete reductionsNp complete reductions
Np complete reductionsWael Badawy
 
N F A - Non Deterministic Finite Automata
N F A - Non Deterministic Finite AutomataN F A - Non Deterministic Finite Automata
N F A - Non Deterministic Finite AutomataWael Badawy
 
Computer Vision Cameras
Computer Vision CamerasComputer Vision Cameras
Computer Vision CamerasWael Badawy
 
Computer Vision Gans
Computer Vision GansComputer Vision Gans
Computer Vision GansWael Badawy
 
Computer Vision image classification
Computer Vision image classificationComputer Vision image classification
Computer Vision image classificationWael Badawy
 
Computer Vision Structure from motion
Computer Vision Structure from motionComputer Vision Structure from motion
Computer Vision Structure from motionWael Badawy
 
Universal turing
Universal turing Universal turing
Universal turing Wael Badawy
 
Turing variations
Turing variations Turing variations
Turing variations Wael Badawy
 
Time complexity
Time complexity Time complexity
Time complexity Wael Badawy
 
Regular pumping
Regular pumping Regular pumping
Regular pumping Wael Badawy
 
Regular pumping examples
Regular pumping examples Regular pumping examples
Regular pumping examples Wael Badawy
 
Regular properties
Regular propertiesRegular properties
Regular propertiesWael Badawy
 
Regular expressions
Regular expressions Regular expressions
Regular expressions Wael Badawy
 
Pushdown Automota
Pushdown Automota Pushdown Automota
Pushdown Automota Wael Badawy
 
Pda accept context free
Pda accept context freePda accept context free
Pda accept context freeWael Badawy
 
Computer Vision sfm
Computer Vision sfmComputer Vision sfm
Computer Vision sfmWael Badawy
 
Computer vision - photometric
Computer vision - photometricComputer vision - photometric
Computer vision - photometricWael Badawy
 

More from Wael Badawy (20)

HTML introduction
HTML introduction HTML introduction
HTML introduction
 
Np complete reductions
Np complete reductionsNp complete reductions
Np complete reductions
 
N F A - Non Deterministic Finite Automata
N F A - Non Deterministic Finite AutomataN F A - Non Deterministic Finite Automata
N F A - Non Deterministic Finite Automata
 
Parsers -
Parsers -Parsers -
Parsers -
 
Computer Vision Cameras
Computer Vision CamerasComputer Vision Cameras
Computer Vision Cameras
 
Computer Vision Gans
Computer Vision GansComputer Vision Gans
Computer Vision Gans
 
Computer Vision image classification
Computer Vision image classificationComputer Vision image classification
Computer Vision image classification
 
Computer Vision Structure from motion
Computer Vision Structure from motionComputer Vision Structure from motion
Computer Vision Structure from motion
 
Universal turing
Universal turing Universal turing
Universal turing
 
Turing Machine
Turing MachineTuring Machine
Turing Machine
 
Turing variations
Turing variations Turing variations
Turing variations
 
Time complexity
Time complexity Time complexity
Time complexity
 
Regular pumping
Regular pumping Regular pumping
Regular pumping
 
Regular pumping examples
Regular pumping examples Regular pumping examples
Regular pumping examples
 
Regular properties
Regular propertiesRegular properties
Regular properties
 
Regular expressions
Regular expressions Regular expressions
Regular expressions
 
Pushdown Automota
Pushdown Automota Pushdown Automota
Pushdown Automota
 
Pda accept context free
Pda accept context freePda accept context free
Pda accept context free
 
Computer Vision sfm
Computer Vision sfmComputer Vision sfm
Computer Vision sfm
 
Computer vision - photometric
Computer vision - photometricComputer vision - photometric
Computer vision - photometric
 

Recently uploaded

_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 

Recently uploaded (20)

_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.Compdf
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 

Computer vision introduction

  • 1. CSCI 455: Intro to Computer Vision
  • 2. Acknowledgement Many of the following slides are modified from the excellent class notes of similar courses offered in other schools by Prof Yung-Yu Chuang, Fredo Durand, Alexei Efros, William Freeman, James Hays, Svetlana Lazebnik, Andrej Karpathy, Fei-Fei Li, Srinivasa Narasimhan, Silvio Savarese, Steve Seitz, Richard Szeliski, Noah Snavely and Li Zhang. The instructor is extremely thankful to the researchers for making their notes available online. Please feel free to use and modify any of the slides, but acknowledge the original sources where appropriate.
  • 3. Today 1. What is computer vision? 2. Course overview 3. Image filtering
  • 4. Today • Readings – Szeliski, Chapter 1 (Introduction)
  • 5. Every image tells a story • Goal of computer vision: perceive the “story” behind the picture • Compute properties of the world – 3D shape – Names of people or objects – What happened?
  • 6. The goal of computer vision
  • 7. Can the computer match human perception? • Yes and no (mainly no) – computers can be better at “easy” things – humans are much better at “hard” things • But huge progress has been made – Accelerating in the last 4 years due to deep learning – What is considered “hard” keeps changing
  • 8. Human perception has its shortcomings Sinha and Poggio, Nature, 1996 (“The Presidential Illusion”
  • 9. But humans can tell a lot about a scene from a little information… Source: “80 million tiny images” by Torralba, et al.
  • 10.
  • 11. The goal of computer vision
  • 12. The goal of computer vision
  • 13. The goal of computer vision • Compute the 3D shape of the world
  • 14. The goal of computer vision • Computing the 3D shape of the world Internet Photos (“Colosseum”) Reconstructed 3D cameras and points Dense 3D model
  • 15. The goal of computer vision • Recognize objects and people Terminator 2, 1991
  • 16. slide credit: Fei-Fei, Fergus & Torralba
  • 18. The goal of computer vision • “Enhance” images
  • 19.
  • 20. The goal of computer vision • Forensics Source: Nayar and Nishino, “Eyes for Relighting”
  • 21. Source: Nayar and Nishino, “Eyes for Relighting”
  • 22. Source: Nayar and Nishino, “Eyes for Relighting”
  • 23.
  • 24. The goal of computer vision • Improve photos (“Computational Photography”) Inpainting / image completion (image credit: Hays and Efros) Super-resolution (source: 2d3) Low-light photography (credit: Hasinoff et al., SIGGRAPH ASIA 2016) Depth of field on cell phone camera (source: Google Research Blog)
  • 25. Why study computer vision? • Billions of images/videos captured per day • Huge number of useful applications • The next slides show the current state of the art
  • 26. Optical character recognition (OCR) Digit recognition, AT&T labs (1990’s) http://yann.lecun.com/exdb/lenet/ • If you have a scanner, it probably came with OCR software License plate readers http://en.wikipedia.org/wiki/Automatic_number_plate_recognition Automatic check processing Sudoku grabber http://sudokugrab.blogspot.com/
  • 27. Face detection • Nearly all cameras detect faces in real time – (Why?)
  • 29. Face recognition Who is she? Source: S. Seitz
  • 30. Vision-based biometrics “How the Afghan Girl was Identified by Her Iris Patterns” Read the story Source: S. Seitz
  • 31. Login without a password Fingerprint scanners on many new smartphones and other devices Face unlock on Apple iPhone X See also http://www.sensiblevision.com/
  • 32. Object recognition (in supermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching for items. When an item is detected and recognized, the cashier verifies the quantity of items that were found under the basket, and continues to close the transaction. The item can remain under the basket, and with LaneHawk,you are assured to get paid for it… “ Source: S. Seitz
  • 33. Object recognition (in mobile phones) Source: S. Seitz
  • 38. Bird Identification Merlin Bird ID (based on Cornell Tech technology!)
  • 39. Special effects: camera tracking Boujou, 2d3
  • 40. The Matrix movies, ESC Entertainment, XYZRGB, NRC Special effects: shape capture Source: S. Seitz
  • 41. Pirates of the Carribean, Industrial Light and Magic Special effects: motion capture Source: S. Seitz
  • 42. 3D face tracking w/ consumer cameras Snapchat Lenses Face2Face system (Thies et al.)
  • 43. Image synthesis Karras, et al., Progressive Growing of GANs for Improved Quality, Stability, and Variation, ICLR 2018
  • 44. Image synthesis Zhu, et al., Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks, ICCV 2017
  • 45. Sports Sportvision first down line Nice explanation on www.howstuffworks.com Source: S. Seitz
  • 46. Vision-based interaction (and games) Nintendo Wii has camera-based IR tracking built in. See Lee’s work at CMU on clever tricks on using it to create a multi-touch display! Assistive technologies
  • 48. Smart cars • Mobileye • Tesla Autopilot • Safety features in many high-end cars
  • 50. Vision in space Vision systems (JPL) uses for several tasks • Panorama stitching • 3D terrain modeling • Obstacle detection, position tracking • For more, read “Computer Vision on Mars” by Matthies et al. The Heights of Mount Sharp http://www.nasa.gov/mission_pages/msl/multimedia/pia16077.html Panorama captured by Curiosity Rover, August 18, 2012 (Sol 12)
  • 51. Robotics NASA’s Mars Curiosity Rover https://en.wikipedia.org/wiki/Curiosity_(rover) Amazon Picking Challenge http://www.robocup2016.org/en/events /amazon-picking-challenge/ Amazon Prime Air
  • 52. Medical imaging 3D imaging (MRI, CT) Skin cancer classification with deep learning https://cs.stanford.edu/people/esteva/nature/
  • 53.
  • 54. Virtual & Augmented Reality 6DoF head tracking Hand & body tracking 3D-360 video capture3D scene understanding
  • 55. My own work • Automatic 3D reconstruction from Internet photo collections “Statue of Liberty” 3D model Flickr photos “Half Dome, Yosemite” “Colosseum, Rome”
  • 57. City-scale reconstruction Reconstruction of Dubrovnik, Croatia, from ~40,000 images
  • 58. Depth from a single image
  • 59. Current state of the art • You just saw many examples of current systems. – Many of these are less than 5 years old • This is a very active research area, and rapidly changing – Many new apps in the next 5 years – Deep learning powering many modern applications • Many startups across a dizzying array of areas – Deep learning, robotics, autonomous vehicles, medical imaging, construction, inspection, VR/AR, …
  • 60. Why is computer vision difficult? Viewpoint variation Illumination Scale
  • 61. Why is computer vision difficult? Intra-class variation Background clutter Motion (Source: S. Lazebnik) Occlusion
  • 62. Challenges: local ambiguity slide credit: Fei-Fei, Fergus & Torralba
  • 63. But there are lots of cues we can exploit… Source: S. Lazebnik
  • 64. Bottom line • Perception is an inherently ambiguous problem – Many different 3D scenes could have given rise to a particular 2D picture – We often need to use prior knowledge about the structure of the world Image source: F. Durand
  • 65.
  • 66.
  • 67.
  • 68. Important notes • Textbook: Rick Szeliski, Computer Vision: Algorithms and Applications online at: http://szeliski.org/Book/
  • 69. Course requirements • Prerequisites—these are essential! – Data structures – A good working knowledge of Python programming – Linear algebra – Vector calculus • Course does not assume prior imaging experience – computer vision, image processing, graphics, etc.
  • 70. Course overview (tentative) 1. Low-level vision – image processing, edge detection, feature detection, cameras, image formation 2. Geometry and algorithms – projective geometry, stereo, structure from motion, optimization 3. Recognition – face detection / recognition, category recognition, segmentation
  • 71. 1. Low-level vision • Basic image processing and image formation Filtering, edge detection * = Feature extraction Image formation
  • 72. Project: Hybrid images from image pyramids G 1/4 G 1/8 Gaussian 1/2
  • 73.
  • 74.
  • 80. 3. Recognition Sources: D. Lowe, L. Fei-Fei Face detection and recognition Single instance recognition Category recognition
  • 82. 4. Light, color, and reflectance Light & Color Reflectance
  • 83. 5. Advanced topics: Internet Vision Human-aided computer vision Turning the camera around Internet datasets
  • 84. 5. Advanced topics Motion and trackingMonocular motion capture Novel cameras and displays 3D scanning