SlideShare a Scribd company logo
1 of 23
Internship Challenge Presentation:
Estimating Distance between Two Balls App
Author: IKUTO OKI
Date: 2021/06/18
May Internship Challenge
Estimating Distance between Two Balls App
Overview
An application for estimating the distance between two balls
from a monocular RGB image
3
ball
ball
distance
Overview
4
There is no restriction on the direction of image capture
Overview
5
Goal : estimate distance between two balls
distance
There are no restriction on the direction of image capture
Base technology & theory
6
Semantic segmentation
why : estimating area of each ball
When the distance between objects increases by x, the apparent size
increases by 1/x2
・ to estimate the distance to the ball from the area
of the ball
Triangle congruence theorems
why : triangulate the distance between two balls from
the distance between ball and camera
Method -overview-
7
Triangle congruence theorem needs
𝐿1:distance between camera and ball
𝐿2:distance between camera and ball
𝜃:angle between ball1, ball2, and camera
𝜃
𝐿1
𝐿2
𝐷
ball1
ball2
𝐿1, 𝐿2, 𝜃 → 𝐷
Method -overview-
8
Triangle congruence theorem needs
𝐿1:distance between camera and ball
𝐿2:distance between camera and ball
𝜃:angle between ball1, ball2, and camera
𝜃
𝐿1
𝐿2
𝐷
ball1
ball2
prior information
・camera angle of view
・number of pixels when the ball is
at the reference distance
𝐿1, 𝐿2, 𝜃 → 𝐷
Method -estimating distance between ball and camera-
9
𝜃
𝐿1
𝐿2
𝐷
ball1
ball2
Triangle congruence theorem needs
𝐿1:distance between camera and ball
𝐿2:distance between camera and ball
𝜃:angle between ball1, ball2, and camera
Method -estimating distance between ball and camera-
10
1 : Calibrating distance of the ball (prior information)
· count the number of pixels when the ball is at the reference distance
· use five annotation images
2 : Calculating the area of each ball
· semantic segmentation
· labeling two balls using K-MEANS
3 : Calculating the distance between ball and camera
· use law of distance 𝑆0
1
𝑥2 ⇄ 𝐿0𝑥
1 : Calibrating distance of the ball (prior information)
· count the number of pixels when the ball is at the reference distance
· use five annotation images
2 : Calculate the area of each ball
· semantic segmentation
· rabeling two balls using K-MEANS
3 : Calculating the distance between ball and camera
· use law of distance 𝑆0
1
𝑥2 ⇄ 𝐿0𝑥
Method -estimating distance between ball and camera-
Ball distance=0.5[m](= 𝐿0) annotation image
X0[pixels]
U [pixels]
V [pixels]
S0=X0/(U2+V2)
11
Method -estimating distance between ball and camera-
12
1 : Calibrating distance of the ball (prior information)
· count the number of pixels when the ball is at the reference distance
· use five annotation images
2 : Calculating the area of each ball
· semantic segmentation
· labelling two balls using K-MEANS
3 : Calculating the distance between ball and camera
· use law of distance 𝑆0
1
𝑥2 ⇄ 𝐿0𝑥
output of semseg
output of K-MEANS
1 : Calibrating distance of the ball (prior information)
· count the number of pixels when the ball is at the reference distance
· use five annotation images
2 : Calculating the area of each ball
· semantic segmentation
· labelling two balls using K-MEANS
3 : Calculating the distance between ball and camera
· use law of distance 𝑆0
1
𝑥2 ⇄ 𝐿0𝑥
Method -estimating distance between ball and camera-
13
𝑥:scale factor, 𝐿𝑥: 𝑑𝑖𝑠𝑡𝑎𝑛𝑐𝑒 𝑡ℎ𝑎𝑡 𝑠𝑐𝑎𝑙𝑒 𝑓𝑎𝑐𝑡𝑜𝑟 𝑖𝑠 𝑥
1
𝑥2 =
𝑆𝑥
𝑆0
, 𝑆0
1
𝑥2 ⇄ 𝐿0𝑥 (= 𝐿𝑥)より
𝐿𝑥 = 𝐿0
𝑆0
𝑆𝑥
Method -estimate angle between ball1, ball2, and camera-
14
𝜃
𝐿1
𝐿2
𝐷
ball1
ball2
Triangle congruence theorems needs
𝐿1:distance between camera and ball
𝐿2:distance between camera and ball
𝜃:angle between ball1, ball2, and camera
15
Method -estimate angle between ball1, ball2, and camera-
𝜃
Ball1(𝑢1, 𝑣1)
ball2 (𝑢2, 𝑣2)
𝛼: vertical angle
𝛽: horizontal angle
Find the Euclidean distance and
normalize by the angle of view
▶︎ estimate 𝜃 = 𝐴 ×
𝐵
𝐶
𝐵 = (𝑢1 − 𝑢2)2 + (𝑢1−𝑢2)2
𝐴 = 𝛼2 + 𝛽2
𝐶 = 𝑈𝑚𝑎𝑥
2
+ 𝑉
𝑚𝑎𝑥
2
16
Method -triangle congruence theorems -
Triangle congruence theorems needs
𝐿1:distance between camera and ball
𝐿2:distance between camera and ball
𝜃:angle between ball1, ball2, and camera
𝜃
𝐿1
𝐿2
𝐷
ball1
ball2
<repost>
17
Method -triangle congruence theorems -
Triangle congruence theorems needs
𝐿1:distance between camera and ball
𝐿2:distance between camera and ball
𝜃:angle between ball1, ball2, and camera
𝜃
𝐿1
𝐿2
𝐷
ball1
ball2
<repost>
𝐷= 𝐿1
2
+ 𝐿2
2
− 2𝐿1𝐿2𝑐𝑜𝑠𝜃
(Law of cosines)
Result
18
9905 9906 9908 9909 9910
Result
19
9906
9905 9908 9909 9910
Discussion1
20
No.1 possible source of error is the accuracy of semseg
→ Increase annotation images and new ways to learn
9906
Correct distance : 50[cm]
Output distance 345[cm]
Discussion2
21
Photo distortion may affect accuracy
→Obtaining internal parameters and correcting
[1],[2]http://www.tctec.co.jp/solution/img_pro/camera/
Distortion[1] No distortion[2]
Application, future potential
22
All this program needs are balls and a monocular
visible light camera. Because it is simple to use, it can
be installed on Smartphone.
Program only needs one direction to work
K-MEANS (k=2) was used, but another algorithm(e.g. X-MEANS)
can be used to estimate if more than three balls are included
Thank you for listening!

More Related Content

What's hot (10)

Ejerccio uno
Ejerccio unoEjerccio uno
Ejerccio uno
 
Đề Thi HK2 Toán 7 - THCS - THPT Quốc Tế Canada
Đề Thi HK2 Toán 7 - THCS - THPT Quốc Tế CanadaĐề Thi HK2 Toán 7 - THCS - THPT Quốc Tế Canada
Đề Thi HK2 Toán 7 - THCS - THPT Quốc Tế Canada
 
Pirâmides questões resolvidas do livro fundamentos de matemática elementar ...
Pirâmides   questões resolvidas do livro fundamentos de matemática elementar ...Pirâmides   questões resolvidas do livro fundamentos de matemática elementar ...
Pirâmides questões resolvidas do livro fundamentos de matemática elementar ...
 
Test3 circle square
Test3  circle squareTest3  circle square
Test3 circle square
 
Answers for c
Answers for cAnswers for c
Answers for c
 
Test2 circle square with the correction
Test2 circle square with the correctionTest2 circle square with the correction
Test2 circle square with the correction
 
Module 3 circle area and perimeter
Module 3 circle area and perimeterModule 3 circle area and perimeter
Module 3 circle area and perimeter
 
Test 1 square circle
Test 1 square circleTest 1 square circle
Test 1 square circle
 
Đề Thi HK2 Toán 8 - THCS Bình Trị Đông
Đề Thi HK2 Toán 8 - THCS Bình Trị Đông Đề Thi HK2 Toán 8 - THCS Bình Trị Đông
Đề Thi HK2 Toán 8 - THCS Bình Trị Đông
 
Calculos completos 28 2011
Calculos completos 28 2011Calculos completos 28 2011
Calculos completos 28 2011
 

Similar to May internship challenge: Estimating Distance between Two Balls App

Introduction on Photogrammetry.pdf
Introduction on Photogrammetry.pdfIntroduction on Photogrammetry.pdf
Introduction on Photogrammetry.pdf
BSuresh26
 
Golf Regression Paper1
Golf Regression Paper1Golf Regression Paper1
Golf Regression Paper1
Scott Naleway
 
roboGolf-CIT-research-congress
roboGolf-CIT-research-congressroboGolf-CIT-research-congress
roboGolf-CIT-research-congress
Raffy Lauglaug
 
BallCatchingRobot
BallCatchingRobotBallCatchingRobot
BallCatchingRobot
gauravbrd
 

Similar to May internship challenge: Estimating Distance between Two Balls App (13)

Introduction on Photogrammetry.pdf
Introduction on Photogrammetry.pdfIntroduction on Photogrammetry.pdf
Introduction on Photogrammetry.pdf
 
Distance Estimation to Image Objects Using Adapted Scale
Distance Estimation to Image Objects Using Adapted ScaleDistance Estimation to Image Objects Using Adapted Scale
Distance Estimation to Image Objects Using Adapted Scale
 
Distance Estimation to Image Objects Using Adapted Scale
Distance Estimation to Image Objects Using Adapted ScaleDistance Estimation to Image Objects Using Adapted Scale
Distance Estimation to Image Objects Using Adapted Scale
 
The flow of baseline estimation using a single omnidirectional camera
The flow of baseline estimation using a single omnidirectional cameraThe flow of baseline estimation using a single omnidirectional camera
The flow of baseline estimation using a single omnidirectional camera
 
Golf Regression Paper1
Golf Regression Paper1Golf Regression Paper1
Golf Regression Paper1
 
roboGolf-CIT-research-congress
roboGolf-CIT-research-congressroboGolf-CIT-research-congress
roboGolf-CIT-research-congress
 
Hawkeye
HawkeyeHawkeye
Hawkeye
 
Fuzzy-proportional-integral-derivative-based controller for object tracking i...
Fuzzy-proportional-integral-derivative-based controller for object tracking i...Fuzzy-proportional-integral-derivative-based controller for object tracking i...
Fuzzy-proportional-integral-derivative-based controller for object tracking i...
 
BallCatchingRobot
BallCatchingRobotBallCatchingRobot
BallCatchingRobot
 
Enhanced Algorithm for Obstacle Detection and Avoidance Using a Hybrid of Pla...
Enhanced Algorithm for Obstacle Detection and Avoidance Using a Hybrid of Pla...Enhanced Algorithm for Obstacle Detection and Avoidance Using a Hybrid of Pla...
Enhanced Algorithm for Obstacle Detection and Avoidance Using a Hybrid of Pla...
 
Saad alsheekh multi view
Saad alsheekh  multi viewSaad alsheekh  multi view
Saad alsheekh multi view
 
GNSS and Total Station Survey
GNSS and Total Station SurveyGNSS and Total Station Survey
GNSS and Total Station Survey
 
Survey work presentation
Survey work presentationSurvey work presentation
Survey work presentation
 

More from Ridge-i, Inc.

More from Ridge-i, Inc. (8)

Unsupervised Video Anomaly Detection: A brief overview
Unsupervised Video Anomaly Detection: A brief overviewUnsupervised Video Anomaly Detection: A brief overview
Unsupervised Video Anomaly Detection: A brief overview
 
Continual Learning Introduction
Continual Learning IntroductionContinual Learning Introduction
Continual Learning Introduction
 
Introduction to Few shot learning
Introduction to Few shot learningIntroduction to Few shot learning
Introduction to Few shot learning
 
Explainable AI
Explainable AIExplainable AI
Explainable AI
 
May internship challenge: Font Generator
May internship challenge: Font GeneratorMay internship challenge: Font Generator
May internship challenge: Font Generator
 
How to learn with non-reliable labels?
How to learn with non-reliable labels?How to learn with non-reliable labels?
How to learn with non-reliable labels?
 
How to learn with non-reliable labels? (Japanese version)
How to learn with non-reliable labels? (Japanese version)How to learn with non-reliable labels? (Japanese version)
How to learn with non-reliable labels? (Japanese version)
 
May internship challenge: User Authentication System only using image data: C...
May internship challenge: User Authentication System only using image data: C...May internship challenge: User Authentication System only using image data: C...
May internship challenge: User Authentication System only using image data: C...
 

Recently uploaded

TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc
 
CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)
Wonjun Hwang
 

Recently uploaded (20)

TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
 
Overview of Hyperledger Foundation
Overview of Hyperledger FoundationOverview of Hyperledger Foundation
Overview of Hyperledger Foundation
 
Introduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptxIntroduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptx
 
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
 
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe
 
Top 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTop 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development Companies
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 
ERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage IntacctERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage Intacct
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptxCyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
 
Event-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingEvent-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream Processing
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 

May internship challenge: Estimating Distance between Two Balls App

  • 1. Internship Challenge Presentation: Estimating Distance between Two Balls App Author: IKUTO OKI Date: 2021/06/18
  • 2. May Internship Challenge Estimating Distance between Two Balls App
  • 3. Overview An application for estimating the distance between two balls from a monocular RGB image 3 ball ball distance
  • 4. Overview 4 There is no restriction on the direction of image capture
  • 5. Overview 5 Goal : estimate distance between two balls distance There are no restriction on the direction of image capture
  • 6. Base technology & theory 6 Semantic segmentation why : estimating area of each ball When the distance between objects increases by x, the apparent size increases by 1/x2 ・ to estimate the distance to the ball from the area of the ball Triangle congruence theorems why : triangulate the distance between two balls from the distance between ball and camera
  • 7. Method -overview- 7 Triangle congruence theorem needs 𝐿1:distance between camera and ball 𝐿2:distance between camera and ball 𝜃:angle between ball1, ball2, and camera 𝜃 𝐿1 𝐿2 𝐷 ball1 ball2 𝐿1, 𝐿2, 𝜃 → 𝐷
  • 8. Method -overview- 8 Triangle congruence theorem needs 𝐿1:distance between camera and ball 𝐿2:distance between camera and ball 𝜃:angle between ball1, ball2, and camera 𝜃 𝐿1 𝐿2 𝐷 ball1 ball2 prior information ・camera angle of view ・number of pixels when the ball is at the reference distance 𝐿1, 𝐿2, 𝜃 → 𝐷
  • 9. Method -estimating distance between ball and camera- 9 𝜃 𝐿1 𝐿2 𝐷 ball1 ball2 Triangle congruence theorem needs 𝐿1:distance between camera and ball 𝐿2:distance between camera and ball 𝜃:angle between ball1, ball2, and camera
  • 10. Method -estimating distance between ball and camera- 10 1 : Calibrating distance of the ball (prior information) · count the number of pixels when the ball is at the reference distance · use five annotation images 2 : Calculating the area of each ball · semantic segmentation · labeling two balls using K-MEANS 3 : Calculating the distance between ball and camera · use law of distance 𝑆0 1 𝑥2 ⇄ 𝐿0𝑥
  • 11. 1 : Calibrating distance of the ball (prior information) · count the number of pixels when the ball is at the reference distance · use five annotation images 2 : Calculate the area of each ball · semantic segmentation · rabeling two balls using K-MEANS 3 : Calculating the distance between ball and camera · use law of distance 𝑆0 1 𝑥2 ⇄ 𝐿0𝑥 Method -estimating distance between ball and camera- Ball distance=0.5[m](= 𝐿0) annotation image X0[pixels] U [pixels] V [pixels] S0=X0/(U2+V2) 11
  • 12. Method -estimating distance between ball and camera- 12 1 : Calibrating distance of the ball (prior information) · count the number of pixels when the ball is at the reference distance · use five annotation images 2 : Calculating the area of each ball · semantic segmentation · labelling two balls using K-MEANS 3 : Calculating the distance between ball and camera · use law of distance 𝑆0 1 𝑥2 ⇄ 𝐿0𝑥 output of semseg output of K-MEANS
  • 13. 1 : Calibrating distance of the ball (prior information) · count the number of pixels when the ball is at the reference distance · use five annotation images 2 : Calculating the area of each ball · semantic segmentation · labelling two balls using K-MEANS 3 : Calculating the distance between ball and camera · use law of distance 𝑆0 1 𝑥2 ⇄ 𝐿0𝑥 Method -estimating distance between ball and camera- 13 𝑥:scale factor, 𝐿𝑥: 𝑑𝑖𝑠𝑡𝑎𝑛𝑐𝑒 𝑡ℎ𝑎𝑡 𝑠𝑐𝑎𝑙𝑒 𝑓𝑎𝑐𝑡𝑜𝑟 𝑖𝑠 𝑥 1 𝑥2 = 𝑆𝑥 𝑆0 , 𝑆0 1 𝑥2 ⇄ 𝐿0𝑥 (= 𝐿𝑥)より 𝐿𝑥 = 𝐿0 𝑆0 𝑆𝑥
  • 14. Method -estimate angle between ball1, ball2, and camera- 14 𝜃 𝐿1 𝐿2 𝐷 ball1 ball2 Triangle congruence theorems needs 𝐿1:distance between camera and ball 𝐿2:distance between camera and ball 𝜃:angle between ball1, ball2, and camera
  • 15. 15 Method -estimate angle between ball1, ball2, and camera- 𝜃 Ball1(𝑢1, 𝑣1) ball2 (𝑢2, 𝑣2) 𝛼: vertical angle 𝛽: horizontal angle Find the Euclidean distance and normalize by the angle of view ▶︎ estimate 𝜃 = 𝐴 × 𝐵 𝐶 𝐵 = (𝑢1 − 𝑢2)2 + (𝑢1−𝑢2)2 𝐴 = 𝛼2 + 𝛽2 𝐶 = 𝑈𝑚𝑎𝑥 2 + 𝑉 𝑚𝑎𝑥 2
  • 16. 16 Method -triangle congruence theorems - Triangle congruence theorems needs 𝐿1:distance between camera and ball 𝐿2:distance between camera and ball 𝜃:angle between ball1, ball2, and camera 𝜃 𝐿1 𝐿2 𝐷 ball1 ball2 <repost>
  • 17. 17 Method -triangle congruence theorems - Triangle congruence theorems needs 𝐿1:distance between camera and ball 𝐿2:distance between camera and ball 𝜃:angle between ball1, ball2, and camera 𝜃 𝐿1 𝐿2 𝐷 ball1 ball2 <repost> 𝐷= 𝐿1 2 + 𝐿2 2 − 2𝐿1𝐿2𝑐𝑜𝑠𝜃 (Law of cosines)
  • 20. Discussion1 20 No.1 possible source of error is the accuracy of semseg → Increase annotation images and new ways to learn 9906 Correct distance : 50[cm] Output distance 345[cm]
  • 21. Discussion2 21 Photo distortion may affect accuracy →Obtaining internal parameters and correcting [1],[2]http://www.tctec.co.jp/solution/img_pro/camera/ Distortion[1] No distortion[2]
  • 22. Application, future potential 22 All this program needs are balls and a monocular visible light camera. Because it is simple to use, it can be installed on Smartphone. Program only needs one direction to work K-MEANS (k=2) was used, but another algorithm(e.g. X-MEANS) can be used to estimate if more than three balls are included
  • 23. Thank you for listening!