SlideShare a Scribd company logo
1 of 58
Kathleen Tuite
Julia Robinson Math Festival
Math + Photography
+ Computer Science
Dimensions
21 3
2-Dimensional
3-Dimensional (3D)
How do humans judge depth?
?
How do humans judge depth?
?
5ft45° 90°
How do humans judge depth?
5ft
5ft45° 90°
6 cm
depth perception
Cameras are like eyes
Stereographic Camera
Stereograph of Yosemite (1901)
Stereograph Viewer
iPhone insideMirrors
What if we had many eyes?
lots of photos ~ lots of eyes
Maybe we can make a collage?
We can make a 3D model!
How to make a 3D model from photos
Step 1: Find recognizable pieces
(features) of photo
• Corners of windows
• Distinct texture
• Building details
Step 2: Find the same features in
multiple photos
Typically there are 100s of matches between
photos
Step 3: Triangulate to figure out how
far away things are
Step 3: Triangulate to figure out how
far away things are
Structure from Motion
Step 3 in a bit more detail
• Don’t know what the 3D model should look like
• Don’t know where the photos were taken from
• Do know what matches between different photos
Hey computer, can
you figure this out
for me?
No
problem!
Can we reconstruct all of UW
campus in 3D?
Campus map
Info about a 3D model
Teams
Go outside and take
pictures of buildings
Add pictures to flags
Computer
vision
algorithms
Capture flags for your
team and conquer
buildings
Look for flags on the map
Feedback for an accepted photo
1058 new points!
Players only earn points for taking useful photos
Seed Models
Players can start their own seed models from small photo collections
UW Empire Cornell Empire
Competition Details
Two rounds
Six weeks total
45 players
Over 109,000 photos!
Coverage of UW Campus
What kind of math went into this?
• Geometry and trigonometry
– How a camera captures a 3D scene in a 2D photos
– How to extract 3D shape from lots of different 2D
photos
– Lots of matrices and linear algebra behind the scenes
(the computer does the hard part)
Programming and computers can help
us do more math
• Thousands of
photos
• Millions of 3D
points
• Set up the problem
and have the
computer do the
hard work
Depth cameras
Depth cameras
Project Tango:
a cell phone with a depth camera
Take photos and try Photosynth!
Photosynth Cheat/Easter Egg
…press “c”
Project Euler (sounds like “Oiler”)
Thanks!
• Cool links
– http://photosynth.net/
– https://projecteuler.net/
– http://www.poppy3d.com/
Kathleen Tuite
ktuite@cs.washington.edu
www.superfiretruck.com

More Related Content

Similar to How do computers see in 3D?

vision_2.ppt
vision_2.pptvision_2.ppt
vision_2.pptnyomans1
 
vision.ppt
vision.pptvision.ppt
vision.pptnyomans1
 
Introduction talk to Computer Vision
Introduction talk to Computer Vision Introduction talk to Computer Vision
Introduction talk to Computer Vision Chen Sagiv
 
Overview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear IndustryOverview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear IndustryTanvir Moin
 
Introducting the art pipeline
Introducting the art pipelineIntroducting the art pipeline
Introducting the art pipelineDavid Edwards
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash CourseJia-Bin Huang
 
Steam presentation deux 3 d prints from photographs
Steam presentation deux  3 d prints from photographsSteam presentation deux  3 d prints from photographs
Steam presentation deux 3 d prints from photographsScott Eastellerson
 
3D Modeling The World Around
3D Modeling The World Around 3D Modeling The World Around
3D Modeling The World Around Victor Gramm
 
Computer Vision Course includes deep learning
Computer Vision Course includes deep learningComputer Vision Course includes deep learning
Computer Vision Course includes deep learninggigap29589
 
HCI-software engineering life cycle.pptx
HCI-software engineering life cycle.pptxHCI-software engineering life cycle.pptx
HCI-software engineering life cycle.pptxMousoomiS1
 
Digital stereoscopic imaging (1)
Digital stereoscopic imaging (1)Digital stereoscopic imaging (1)
Digital stereoscopic imaging (1)kamsaliraviteja
 
Data Challenges with 3D Computer Vision
Data Challenges with 3D Computer VisionData Challenges with 3D Computer Vision
Data Challenges with 3D Computer VisionMartin Scholl
 
Computer vision introduction
Computer vision  introduction Computer vision  introduction
Computer vision introduction Wael Badawy
 
Introduction to three.js & Leap Motion
Introduction to three.js & Leap MotionIntroduction to three.js & Leap Motion
Introduction to three.js & Leap MotionLee Trout
 

Similar to How do computers see in 3D? (20)

3D modeling and animation
3D modeling and animation3D modeling and animation
3D modeling and animation
 
vision.ppt
vision.pptvision.ppt
vision.ppt
 
vision_2.ppt
vision_2.pptvision_2.ppt
vision_2.ppt
 
vision.ppt
vision.pptvision.ppt
vision.ppt
 
Introduction talk to Computer Vision
Introduction talk to Computer Vision Introduction talk to Computer Vision
Introduction talk to Computer Vision
 
Overview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear IndustryOverview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear Industry
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Introducting the art pipeline
Introducting the art pipelineIntroducting the art pipeline
Introducting the art pipeline
 
ICS1020 CV
ICS1020 CVICS1020 CV
ICS1020 CV
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash Course
 
Steam presentation deux 3 d prints from photographs
Steam presentation deux  3 d prints from photographsSteam presentation deux  3 d prints from photographs
Steam presentation deux 3 d prints from photographs
 
3D Modeling The World Around
3D Modeling The World Around 3D Modeling The World Around
3D Modeling The World Around
 
Computer Vision Course includes deep learning
Computer Vision Course includes deep learningComputer Vision Course includes deep learning
Computer Vision Course includes deep learning
 
HCI-software engineering life cycle.pptx
HCI-software engineering life cycle.pptxHCI-software engineering life cycle.pptx
HCI-software engineering life cycle.pptx
 
Digital stereoscopic imaging (1)
Digital stereoscopic imaging (1)Digital stereoscopic imaging (1)
Digital stereoscopic imaging (1)
 
Data Challenges with 3D Computer Vision
Data Challenges with 3D Computer VisionData Challenges with 3D Computer Vision
Data Challenges with 3D Computer Vision
 
10833762.ppt
10833762.ppt10833762.ppt
10833762.ppt
 
Computer vision introduction
Computer vision  introduction Computer vision  introduction
Computer vision introduction
 
Introduction to three.js & Leap Motion
Introduction to three.js & Leap MotionIntroduction to three.js & Leap Motion
Introduction to three.js & Leap Motion
 
Holography
HolographyHolography
Holography
 

How do computers see in 3D?

Editor's Notes

  1. The first thing I’m going to show you…
  2. LengthLength and widthLength and width and depth
  3. Here’s a 2d videogame… mario. Every object has a Width, height… and they move, but they’re FLAT like pieces of paper. always at the same depth on the screen
  4. How many of you have seen minecraft? This minecraft world is made out of things with volume, like lego blocks… things that we can stack
  5. How do you judge depth? Supposetheres a cat in front of us and we want to figure out the distnace to the catUse geometry to figure this out. Make a triangle…
  6. We use geometry to figure out how far away stuff is
  7. Use trigonometry to get this length… kknow the cat is five feet away from the original position.
  8. Don’t have to move around… have two eyesMakes a triangle between our two eyes and some point in the world
  9. When you model this with geometry… triangles
  10. When this camera takes a picture, it looks like this… two pictures from different viewpoints. Slightly differennt…
  11. Toy I grew up with
  12. this poppy device is like the modern version of this, excpet you put in your eyephone
  13. Doesn’t work because the building is in a 3D world, it’s not flat, and the pictures don’t line up
  14. Things that are easy for a computer to detect… really good at corners and where the color changes a lot
  15. We use geometry to figure out how far away stuff is
  16. We use geometry to figure out how far away stuff is
  17. Super technical, but I want to share it with you guys anyway.Gives you shape by moving
  18. Geometry program on the computer figures it out
  19. For us, when we got people to play photocity, the latter happened. And what’s surprising about our game is that while less than 50 people played, and only a small fraction of those made a significant contribution, we still collected over 100,000 photos and put together one of the largest, most comprehensive set of photos of a single location.
  20. This is the website for PhotoCity. It’s got a big map of the campus, and you can click on anything on the map to get more info, which shows up on the side. At the bottom, there’s a bar that says how many flags each team has.
  21. Here’s a more detailed picture of the map with castles and flags representing 3D models in the world that players can grow by taking photos.
  22. Now let me walk you through a cycle of play.The player starts out by identifying a target building on the map…And then they go OUTSIDE and take photosAnd upload them to the flag that they’re targetingWe run our computer vision to find out how much new geometry was created and whether the player was able to capture any flags or conquer any buildings!
  23. Flags are big virtual columns at GPS locations in the world. We’ve colored all the points in this particular flag column orange to direct the players’ attention to expanding this model to the left
  24. When the model grows, the flag gets filled in, and eventually captured by a player.
  25. And then Our system automatically places a new flag as the model grows
  26. Players extend buildings and capture flags by taking and uploading photos. Their photos are automatically accepted or rejected by the system, and when they’re accepted, they actually add 3D points to the 3D model. This shows where the new points came from – they’re the glowing orange points – so players can learn what works about their pictures when they work.
  27. One thing players can do instead of expanding existing buildings, is go out and start their own from scratch. You need anywhere from 20 to 200 photos to start a seed.5 continents around the world
  28. A bunch of different seed models by players
  29. So, now I’m going to tell you about the big competition we had last spring between university of washington and cornell.
  30. Two round that each lasted 3 weeks. We got about 45 players.
  31. This shows the coverage of photos over UW campus – each blue dot is where a photo was taken
  32. And here’s a flythrough of one part of the university of washington campus – this is red square at UW and this model was made from a 3,000-photo subset of all the photos.
  33. Here’s the last video I’m going to show you of the results produced by these players and this game. This particular model was made from 7000 photos and contains almost 20 million 3D points. As computer vision algorithms get even better, we’ll be able to use the original 50,000 photos of uw and 50,000 photos of Cornell and run these through new algorithms and techniques to get even better-looking models.
  34. Cornell arts quad. 5000 images (1/10th of the total images at cornell)As computer vision techniques get better, we’ll be able to rerun these photos and generate clearer models
  35. We’re not doing repetitive math problems by hand
  36. How many of you have an xbox with a kinect? Lets your xbox see in 3D
  37. Right now, kinect is probably the only depth camera you have… you’re going to grow up in a world where these cameras are everywhere… in your cell phone, in your parents’ car
  38. Free program from microsoft that lets you build models from photos
  39. Whenyoureinteractign with it, press c… shows you the geometry of the situation
  40. ive always loved math… math degree in college… ive found that being good at math and programming and using those skills together lets me work on really interesting problems that also involve my hobbies like photography.
  41. Here are some links for you to follow up with at home… photosynth in particular because it’ll do the 3d reconstruction stuff I was talking about for you with your own photos