SlideShare a Scribd company logo
1 of 32
ARJUN AK
S7IT
TKR18IT003. 1
CONTENTS
● INTRODUCTION
● LITURATURE SURVEY
● METHODOLOGY
● FLOWCHART
● NETWORK ARCHITECTURE
● SELECTION OF GAME ENGINE
● SELECTION OF AR SDK
● VUFORIA
● ARCore AND ARKit
● ADVANTAGE AND DISADVANTAGES
● CONCLUSION
● REFERENCES
22
INTRODUCTION
● User Experience is a very important factor of any software product and thus
companies spend a lot of time and money in development stage to make sure that
customers get a hassle-free experience.
● In today’s age it is observed that in order to further improve user experience the use
of Machine Learning is to be employed.
● We plan on surveying all the possible ways to implement Computer Vision based
marker less object tracking and integrate it with Augmented Reality Software
Development Kits to create an interface ready for customers to use.
● The main idea is to track hand landmarks using Computer Vision and use them as
placeholders for AR watch object.
33
LITERATURE SURVEY
SL NO PAPER AUTHOR YEAR
1 MediaPipe Hands: On-device Real-
time Hand Tracking
Fan Zhang, Valentin
Bazarevsky, Andrey
Vakunov, Andrei
Tkachenka, George Sung
18 Jun 2020.
2 Pose Anchor: A Single-stage Hand
Keypoint Detection Network
Yuan Li, Xinggang
Wang, Wenyu Liu and
Bin Feng
July 2020
3 A Study on MQTT based
Environmental Parameters,
Monitoring and Alarming System
K.J., Reshmaa, Selvin
Peter Paul J., and
Swetha V
2018
4 A Survey of Frameworks and Game
Engines for Serious Game
Development
Brent Cowan and Bill
Kapralos
2014
METHODOLOGY
“Researchers from Google “present’ on-device a real-time solution predicting a
human palm skeleton from a single RGB camera for AR products.
Two models comprise the pipeline:
•A palm detector responsible for bounding box generation across palm and provides it
to,
•A hand landmark model, that maps a palm sketch on the full hand. This is
implemented through MediaPipe ,ML solutions building platform.
55
FLOWCHART
666
● The graph can be divided into 2 subparts, one for hand detection and the other for
points of reference mapping.
● A key optimization here is that the hand detection only runs as needed, saving a lot
of processing power
● To accomplish this start by extracting the palm location in the present video frame
from the generated hand landmarks in the preceding frame, thus preventing the
palm detection each frame
● For toughness, the tracker even provides confidence score for captured hand.
whenever the confidence drops below the threshold then only the palm detector is
reapplied to the upcoming frame.
● “Koller, Hermann and Bowden’s” work shows a novel perspective to learning a
classifier which is on the basis of frames on poorly differently marked sequence
data by combining a Convolution Neural Net in an Expectation Maximisation
algorithm.
METHODOLOGY
77
● This permits the C.N.N. to be trained on a big set of trial images even if limited sequence
level knowledge is available for the source videos.
● This method is explained in situation related to hand shape detection, it could be of a
wider use to any video based recognition task which doesn’t have frame-level labelling
● Instead of using 2 stage architecture pipeline consisting of localizing hand and mapping
landmarks, Pose Anchor presents a single effective network architecture for hand
landmarks detection. An end-to-end C.N.N. is trained on a newly proposed pose anchor
network, which is based on RPN in Faster Region-based C.N.N
● Instead of manually designing hand pose anchors, they are generated using K-means
clustering based on OKS
● The main reason for using pose anchors is to mitigate occlusion to some extent by using
the prior knowledge of hand pose/structure Experiments conducted on (LSM-HPD) and
NZSL were used to show the robustness and feasibility of the architecture.
88
NETWORK DIAGRAM
99
HEAT MAP GENERATED
1010
Selection of Game Engine:
There are many game engines available to make AR applications. To chose one among
them many factors must be kept in mind, they are:
● Scripting: The code written to provide the instructions.
● Rendering: The generation of the 3D scene, the factors to be measured are speed and
accuracy.
● Animation: the change in render per frame to simulate movement.
● Artificial Intelligence: Ability of the computer to make decisions.
● Physics: Real world based calculations on simulated physical interactions.
● Audio: Audio feedback provided that can be spatially distinguished.
● Networking : Users have the ability to play with others online
1111
●A logical approach would be to go with a game engine which is widely used in the market,
so we used 2 Surveys to determine a suitable game engine
●Survey 1: Scan through a database of approximately 200 academic publications for
“serious game”, “educational game”, and “simulator”.
●Survey 2: Narrow down these selected engines from Survey 1 only based on “serious
gaming”
1212
GAME ENGINE AND FRAMEWORK
1313
●By analyzing given information we can clearly conclude that Unity and Unreal are
the most beneficial game engines, we chose Unity as our preferred game engine as the
only advantage of Unreal over Unity is C++ support, but the programming language
to be used is not an restriction in our situation.
14
Unity Engine Properties:
● Lighting can be done before hand or at run time, custom shaders can be created
using shader graphs and shadder programming thus replicating realistic lighting
conditions as well as textures which can be changed based on input to change the
simulation environment.
● Unity uses C# as the programming language behind the implementation of any
application built using the framework and thus complicated logical simulations and
inputs can be generated.
1515
The Machine Learning-Agents Toolkit
provided by the game engine:
●This is an free to use SDK available to integrate ML models with Unity.The 3 main features
in the SDK are
● Sensors, Agents, and Academy
●The agents collect, observe and execute actions. The agent is the component being trained by
constantly optimizing it’s policy known as Brain.
●The Academy manages the simulation,it is a singleton and thus used to keep track of all the
steps involved in the simulation.
●Singleton is defined as a software design pattern that restricts the instantiation of a class to
one "single" instance, any other instances which are created are automatically destroyed.
1616
ML AGENTS DOCUMENTATION
1717
Integration of the model with Unity:
The study for Gesture Recognition for non RC Drone have successfully imported Leap Motion
SDK and Gesture simple control package and used it for detection.
This shows that any kind of trained model can be imported into Unity.
The recommended method to import ML model into Unity:
● STEP 1: Save and export the model to ONNX format as this is the format supported by
Unity.
● STEP 2: Import Unity ML Agents
● STEP 3: Import the model into Unity.
● STEP 4: Access the camera feed frame by frame.
● STEP 5: Resize the render texture if required.
● STEP 6: Read the Softmax layer output from the model.
18
Selection of AR SDK
● SDK has multiple components within the application: identification, tracking, etc
● There are many AR SDKs to chose from like:
Vuforia, Metatio, Wikitude, ARToolKit, D’Fusion, ARmedi, ARCore, ARKit.
● There is another study by Anasse HANAFI in which they are focusing on those SDKs
which give a platform and support and function with hardware.
● They had tested these SDKs on the basis of licence type, target platform, development
platform, tracking type, functionality.
● The most important feature as per our requirement is tracking type and the results are
tabulated in Table
1919
AR SDKs COMPARISON BASED ON TRACKING
2020
Vuforia
●Vuforia is an augmented reality software development kit for mobile devices that enables the
creation of augmented reality applications.
●It uses computer vision technology to recognize and track planar images and 3D objects in real
time .
●Ms. Geetanjali Bhola and Amogh Bansal used Vuforia for Markerless Tracking in Retail
Industry. Their observation is can be referenced for our work to find which will be more suitable
for the process in hand .
2121
VUFORIA RESULTS BASED ON DISTANCE
2222
● Thus we can see that the results are perfect for our scenario, so it can be considered to
use Vuforia , but from Table our requirement is of Motion Tracking thus it would be a
more suitable choice would be to go for ARCore or ARKit.
2323
ARCore and ARKit:
●ARCore and ARKit are Google and Apple’s respective Augmented reality frameworks for
bringing more AR apps to thier platforms . They use the smartphone cameras to add
interactive elements to an existing environment.
●ARCore is used to build for Android Devices and ARKit is used to build for iOS devices.
Unity offers a wrapper class known as ARFoundation which includes both of these SDKs
and an added advantage is that it is not required to import these from an external source.
●Khalid Satori did a study to discover if ARFoundation is a suitable replacement for
ARKit or ARCore. Their observations are shown in Table
2424
AR FOUNDATION TABLE
2525
● As it can be seen from Table that ARFoundation either has all the features or
they are under development. Thus to keep the implementation simple, future
ready and cover all target platforms it is suitable to use ARFoundation as our
SDK.
● ARFoundation can be imported from Unity’s Package Manager which comes
along with Unity during installation.
2626
ADVANTAGES
● Once the content is placed in a room, it is more flexible than marker-based
alternatives.
● Marker less AR significantly increases the average range of motion.
● Marker based alternative relies on the image recognisability whereas
maker less doesn’t.
2727
DISADVANTAGES
● The augmented reality content may not make sense in certain context
● For better experience it is required that the surface has a texture for computer
vision to recognize it.
2828
CONCLUSION
We had a look at various computer vision techniques for implementing marker less
tracking on hands and also determined an appropriate game engine to import that
model into. We also compared various Augmented reality SDKs which can be used to
properly instantiate the desired objects. We found using ARfoundation to implement
the tracking will be much more efficient because of its simplistic nature in
implementation , future ready features and tracking and beacuse of its ablity to cover
all target platforms .
2929
REFERENCES
1. Fan Zhang, Valentin Bazarevsky, Andrey Vakunov, Andrei Tkachenka, George Sung, Chuo-Ling
Chang, Matthias Grundmann. “MediaPipe Hands: On-device Real-time Hand Tracking”
arXiv:2006.10214v1 [cs.CV] 18 Jun 2020.
2. HasCamillo Lugaresi, Jiuqiang Tang, Hadon Nash, Chris Mc-Clanahan, Esha Uboweja, Michael
Hays, Fan Zhang, Chuo-Ling Chang, Ming Guang Yong, Juhyun Lee, Wan-TehChang, Wei Hua,
Manfred Georg, and Matthias Grundmann.Mediapipe: A framework for building perception
pipelines.volume abs/1906.08172, 2019
3. Oscar Koller, Hermann Ney, and Richard Bowden. “Deep Hand: How to Train a CNN on 1 Million
Hand Images When Your Data is Continuous and Weakly Labelled”. 2016 IEEE Conference on
Computer Vision and Pattern Recognition (CVPR).
4. Yuan Li, Xinggang Wang, Wenyu Liu and Bin Feng. “Pose Anchor: A Single-stage Hand Keypoint
Detection Network”. IEEE Transactions on Circuits and Systems for Video Technology (July 2020)
3030
5 .S. Ren, K. He, R. Girshick, and J. Sun, “Faster r-cnn: Towards real-time object detection with region
proposal networks,” in Advances in neural information processing systems, 2015, pp. 91–99
6. T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollar, and C. L. Zitnick,
“Microsoft coco: Common objects in context,” in European conference on computer vision. Springer,
2014, pp. 740–755.
7. Brent Cowan and Bill Kapralos “A Survey of Frameworks and Game Engines for Serious Game
Development” in 2014 IEEE 14th International Conference on Advanced Learning Technologies
8. Arthur Juliani,Vincent-Pierre Berges,Ervin Teng,Andrew Cohen,Jonathan Harper,Chris Elion,Chris
Goy,Yuan Gao,Hunter Henry,Marwan Mattar,Danny Lange. “Unity: A General Platform for Intelligent
Agents” in arxiv.org by Cornell University
9. K.J., Reshmaa, Selvin Peter Paul J., and Swetha V. "A Study on MQTT based Environmental
Parameters, Monitoring and Alarming System". Eurasian Journal of Analytical Chemistry 13 no. SP
(2018)
3131
THANK
YOU
3232

More Related Content

Similar to Marker less augmentedd reality using computer vision

How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...
How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...
How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...Unity Technologies
 
Doc muntation of android
Doc muntation of androidDoc muntation of android
Doc muntation of androidmsramakrishna
 
IRJET- Proposed Design for 3D Map Generation using UAV
IRJET- Proposed Design for 3D Map Generation using UAVIRJET- Proposed Design for 3D Map Generation using UAV
IRJET- Proposed Design for 3D Map Generation using UAVIRJET Journal
 
Debug, Analyze and Optimize Games with Intel Tools
Debug, Analyze and Optimize Games with Intel Tools Debug, Analyze and Optimize Games with Intel Tools
Debug, Analyze and Optimize Games with Intel Tools Matteo Valoriani
 
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...Codemotion
 
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...Codemotion
 
Kudan deck slide share e
Kudan deck slide share eKudan deck slide share e
Kudan deck slide share eTomoko Takeda
 
JIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdf
JIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdfJIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdf
JIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdfSamiraKids
 
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...IRJET Journal
 
Computer-Vision_Integrating-Technology_MOB_17.06.16
Computer-Vision_Integrating-Technology_MOB_17.06.16Computer-Vision_Integrating-Technology_MOB_17.06.16
Computer-Vision_Integrating-Technology_MOB_17.06.16Schuyler Kennedy
 
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloDeep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloLinaro
 
Iirdem design and implementation of finger writing in air by using open cv (c...
Iirdem design and implementation of finger writing in air by using open cv (c...Iirdem design and implementation of finger writing in air by using open cv (c...
Iirdem design and implementation of finger writing in air by using open cv (c...Iaetsd Iaetsd
 
The Real Time Drowisness Detection Using Arm 9
The Real Time Drowisness Detection Using Arm 9The Real Time Drowisness Detection Using Arm 9
The Real Time Drowisness Detection Using Arm 9IOSR Journals
 
IRJET-Implementation of Image Processing using Augmented Reality
IRJET-Implementation of Image Processing using Augmented RealityIRJET-Implementation of Image Processing using Augmented Reality
IRJET-Implementation of Image Processing using Augmented RealityIRJET Journal
 
Mak product overview_no_video
Mak product overview_no_videoMak product overview_no_video
Mak product overview_no_videoPeter Swan
 
Leveraging Artificial Intelligence Processing on Edge Devices
Leveraging Artificial Intelligence Processing on Edge DevicesLeveraging Artificial Intelligence Processing on Edge Devices
Leveraging Artificial Intelligence Processing on Edge DevicesICS
 
Final project report format
Final project report formatFinal project report format
Final project report formatMasud Sarkar
 

Similar to Marker less augmentedd reality using computer vision (20)

How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...
How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...
How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...
 
virtual_chess
virtual_chessvirtual_chess
virtual_chess
 
Android Report
Android ReportAndroid Report
Android Report
 
Doc muntation of android
Doc muntation of androidDoc muntation of android
Doc muntation of android
 
IRJET- Proposed Design for 3D Map Generation using UAV
IRJET- Proposed Design for 3D Map Generation using UAVIRJET- Proposed Design for 3D Map Generation using UAV
IRJET- Proposed Design for 3D Map Generation using UAV
 
Debug, Analyze and Optimize Games with Intel Tools
Debug, Analyze and Optimize Games with Intel Tools Debug, Analyze and Optimize Games with Intel Tools
Debug, Analyze and Optimize Games with Intel Tools
 
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
 
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
 
Kudan deck slide share e
Kudan deck slide share eKudan deck slide share e
Kudan deck slide share e
 
JIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdf
JIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdfJIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdf
JIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdf
 
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
 
Computer-Vision_Integrating-Technology_MOB_17.06.16
Computer-Vision_Integrating-Technology_MOB_17.06.16Computer-Vision_Integrating-Technology_MOB_17.06.16
Computer-Vision_Integrating-Technology_MOB_17.06.16
 
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloDeep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
 
F04402038042
F04402038042F04402038042
F04402038042
 
Iirdem design and implementation of finger writing in air by using open cv (c...
Iirdem design and implementation of finger writing in air by using open cv (c...Iirdem design and implementation of finger writing in air by using open cv (c...
Iirdem design and implementation of finger writing in air by using open cv (c...
 
The Real Time Drowisness Detection Using Arm 9
The Real Time Drowisness Detection Using Arm 9The Real Time Drowisness Detection Using Arm 9
The Real Time Drowisness Detection Using Arm 9
 
IRJET-Implementation of Image Processing using Augmented Reality
IRJET-Implementation of Image Processing using Augmented RealityIRJET-Implementation of Image Processing using Augmented Reality
IRJET-Implementation of Image Processing using Augmented Reality
 
Mak product overview_no_video
Mak product overview_no_videoMak product overview_no_video
Mak product overview_no_video
 
Leveraging Artificial Intelligence Processing on Edge Devices
Leveraging Artificial Intelligence Processing on Edge DevicesLeveraging Artificial Intelligence Processing on Edge Devices
Leveraging Artificial Intelligence Processing on Edge Devices
 
Final project report format
Final project report formatFinal project report format
Final project report format
 

Recently uploaded

OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...ranjana rawat
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduitsrknatarajan
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)simmis5
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 

Recently uploaded (20)

OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 

Marker less augmentedd reality using computer vision

  • 2. CONTENTS ● INTRODUCTION ● LITURATURE SURVEY ● METHODOLOGY ● FLOWCHART ● NETWORK ARCHITECTURE ● SELECTION OF GAME ENGINE ● SELECTION OF AR SDK ● VUFORIA ● ARCore AND ARKit ● ADVANTAGE AND DISADVANTAGES ● CONCLUSION ● REFERENCES 22
  • 3. INTRODUCTION ● User Experience is a very important factor of any software product and thus companies spend a lot of time and money in development stage to make sure that customers get a hassle-free experience. ● In today’s age it is observed that in order to further improve user experience the use of Machine Learning is to be employed. ● We plan on surveying all the possible ways to implement Computer Vision based marker less object tracking and integrate it with Augmented Reality Software Development Kits to create an interface ready for customers to use. ● The main idea is to track hand landmarks using Computer Vision and use them as placeholders for AR watch object. 33
  • 4. LITERATURE SURVEY SL NO PAPER AUTHOR YEAR 1 MediaPipe Hands: On-device Real- time Hand Tracking Fan Zhang, Valentin Bazarevsky, Andrey Vakunov, Andrei Tkachenka, George Sung 18 Jun 2020. 2 Pose Anchor: A Single-stage Hand Keypoint Detection Network Yuan Li, Xinggang Wang, Wenyu Liu and Bin Feng July 2020 3 A Study on MQTT based Environmental Parameters, Monitoring and Alarming System K.J., Reshmaa, Selvin Peter Paul J., and Swetha V 2018 4 A Survey of Frameworks and Game Engines for Serious Game Development Brent Cowan and Bill Kapralos 2014
  • 5. METHODOLOGY “Researchers from Google “present’ on-device a real-time solution predicting a human palm skeleton from a single RGB camera for AR products. Two models comprise the pipeline: •A palm detector responsible for bounding box generation across palm and provides it to, •A hand landmark model, that maps a palm sketch on the full hand. This is implemented through MediaPipe ,ML solutions building platform. 55
  • 7. ● The graph can be divided into 2 subparts, one for hand detection and the other for points of reference mapping. ● A key optimization here is that the hand detection only runs as needed, saving a lot of processing power ● To accomplish this start by extracting the palm location in the present video frame from the generated hand landmarks in the preceding frame, thus preventing the palm detection each frame ● For toughness, the tracker even provides confidence score for captured hand. whenever the confidence drops below the threshold then only the palm detector is reapplied to the upcoming frame. ● “Koller, Hermann and Bowden’s” work shows a novel perspective to learning a classifier which is on the basis of frames on poorly differently marked sequence data by combining a Convolution Neural Net in an Expectation Maximisation algorithm. METHODOLOGY 77
  • 8. ● This permits the C.N.N. to be trained on a big set of trial images even if limited sequence level knowledge is available for the source videos. ● This method is explained in situation related to hand shape detection, it could be of a wider use to any video based recognition task which doesn’t have frame-level labelling ● Instead of using 2 stage architecture pipeline consisting of localizing hand and mapping landmarks, Pose Anchor presents a single effective network architecture for hand landmarks detection. An end-to-end C.N.N. is trained on a newly proposed pose anchor network, which is based on RPN in Faster Region-based C.N.N ● Instead of manually designing hand pose anchors, they are generated using K-means clustering based on OKS ● The main reason for using pose anchors is to mitigate occlusion to some extent by using the prior knowledge of hand pose/structure Experiments conducted on (LSM-HPD) and NZSL were used to show the robustness and feasibility of the architecture. 88
  • 11. Selection of Game Engine: There are many game engines available to make AR applications. To chose one among them many factors must be kept in mind, they are: ● Scripting: The code written to provide the instructions. ● Rendering: The generation of the 3D scene, the factors to be measured are speed and accuracy. ● Animation: the change in render per frame to simulate movement. ● Artificial Intelligence: Ability of the computer to make decisions. ● Physics: Real world based calculations on simulated physical interactions. ● Audio: Audio feedback provided that can be spatially distinguished. ● Networking : Users have the ability to play with others online 1111
  • 12. ●A logical approach would be to go with a game engine which is widely used in the market, so we used 2 Surveys to determine a suitable game engine ●Survey 1: Scan through a database of approximately 200 academic publications for “serious game”, “educational game”, and “simulator”. ●Survey 2: Narrow down these selected engines from Survey 1 only based on “serious gaming” 1212
  • 13. GAME ENGINE AND FRAMEWORK 1313
  • 14. ●By analyzing given information we can clearly conclude that Unity and Unreal are the most beneficial game engines, we chose Unity as our preferred game engine as the only advantage of Unreal over Unity is C++ support, but the programming language to be used is not an restriction in our situation. 14
  • 15. Unity Engine Properties: ● Lighting can be done before hand or at run time, custom shaders can be created using shader graphs and shadder programming thus replicating realistic lighting conditions as well as textures which can be changed based on input to change the simulation environment. ● Unity uses C# as the programming language behind the implementation of any application built using the framework and thus complicated logical simulations and inputs can be generated. 1515
  • 16. The Machine Learning-Agents Toolkit provided by the game engine: ●This is an free to use SDK available to integrate ML models with Unity.The 3 main features in the SDK are ● Sensors, Agents, and Academy ●The agents collect, observe and execute actions. The agent is the component being trained by constantly optimizing it’s policy known as Brain. ●The Academy manages the simulation,it is a singleton and thus used to keep track of all the steps involved in the simulation. ●Singleton is defined as a software design pattern that restricts the instantiation of a class to one "single" instance, any other instances which are created are automatically destroyed. 1616
  • 18. Integration of the model with Unity: The study for Gesture Recognition for non RC Drone have successfully imported Leap Motion SDK and Gesture simple control package and used it for detection. This shows that any kind of trained model can be imported into Unity. The recommended method to import ML model into Unity: ● STEP 1: Save and export the model to ONNX format as this is the format supported by Unity. ● STEP 2: Import Unity ML Agents ● STEP 3: Import the model into Unity. ● STEP 4: Access the camera feed frame by frame. ● STEP 5: Resize the render texture if required. ● STEP 6: Read the Softmax layer output from the model. 18
  • 19. Selection of AR SDK ● SDK has multiple components within the application: identification, tracking, etc ● There are many AR SDKs to chose from like: Vuforia, Metatio, Wikitude, ARToolKit, D’Fusion, ARmedi, ARCore, ARKit. ● There is another study by Anasse HANAFI in which they are focusing on those SDKs which give a platform and support and function with hardware. ● They had tested these SDKs on the basis of licence type, target platform, development platform, tracking type, functionality. ● The most important feature as per our requirement is tracking type and the results are tabulated in Table 1919
  • 20. AR SDKs COMPARISON BASED ON TRACKING 2020
  • 21. Vuforia ●Vuforia is an augmented reality software development kit for mobile devices that enables the creation of augmented reality applications. ●It uses computer vision technology to recognize and track planar images and 3D objects in real time . ●Ms. Geetanjali Bhola and Amogh Bansal used Vuforia for Markerless Tracking in Retail Industry. Their observation is can be referenced for our work to find which will be more suitable for the process in hand . 2121
  • 22. VUFORIA RESULTS BASED ON DISTANCE 2222
  • 23. ● Thus we can see that the results are perfect for our scenario, so it can be considered to use Vuforia , but from Table our requirement is of Motion Tracking thus it would be a more suitable choice would be to go for ARCore or ARKit. 2323
  • 24. ARCore and ARKit: ●ARCore and ARKit are Google and Apple’s respective Augmented reality frameworks for bringing more AR apps to thier platforms . They use the smartphone cameras to add interactive elements to an existing environment. ●ARCore is used to build for Android Devices and ARKit is used to build for iOS devices. Unity offers a wrapper class known as ARFoundation which includes both of these SDKs and an added advantage is that it is not required to import these from an external source. ●Khalid Satori did a study to discover if ARFoundation is a suitable replacement for ARKit or ARCore. Their observations are shown in Table 2424
  • 26. ● As it can be seen from Table that ARFoundation either has all the features or they are under development. Thus to keep the implementation simple, future ready and cover all target platforms it is suitable to use ARFoundation as our SDK. ● ARFoundation can be imported from Unity’s Package Manager which comes along with Unity during installation. 2626
  • 27. ADVANTAGES ● Once the content is placed in a room, it is more flexible than marker-based alternatives. ● Marker less AR significantly increases the average range of motion. ● Marker based alternative relies on the image recognisability whereas maker less doesn’t. 2727
  • 28. DISADVANTAGES ● The augmented reality content may not make sense in certain context ● For better experience it is required that the surface has a texture for computer vision to recognize it. 2828
  • 29. CONCLUSION We had a look at various computer vision techniques for implementing marker less tracking on hands and also determined an appropriate game engine to import that model into. We also compared various Augmented reality SDKs which can be used to properly instantiate the desired objects. We found using ARfoundation to implement the tracking will be much more efficient because of its simplistic nature in implementation , future ready features and tracking and beacuse of its ablity to cover all target platforms . 2929
  • 30. REFERENCES 1. Fan Zhang, Valentin Bazarevsky, Andrey Vakunov, Andrei Tkachenka, George Sung, Chuo-Ling Chang, Matthias Grundmann. “MediaPipe Hands: On-device Real-time Hand Tracking” arXiv:2006.10214v1 [cs.CV] 18 Jun 2020. 2. HasCamillo Lugaresi, Jiuqiang Tang, Hadon Nash, Chris Mc-Clanahan, Esha Uboweja, Michael Hays, Fan Zhang, Chuo-Ling Chang, Ming Guang Yong, Juhyun Lee, Wan-TehChang, Wei Hua, Manfred Georg, and Matthias Grundmann.Mediapipe: A framework for building perception pipelines.volume abs/1906.08172, 2019 3. Oscar Koller, Hermann Ney, and Richard Bowden. “Deep Hand: How to Train a CNN on 1 Million Hand Images When Your Data is Continuous and Weakly Labelled”. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 4. Yuan Li, Xinggang Wang, Wenyu Liu and Bin Feng. “Pose Anchor: A Single-stage Hand Keypoint Detection Network”. IEEE Transactions on Circuits and Systems for Video Technology (July 2020) 3030
  • 31. 5 .S. Ren, K. He, R. Girshick, and J. Sun, “Faster r-cnn: Towards real-time object detection with region proposal networks,” in Advances in neural information processing systems, 2015, pp. 91–99 6. T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollar, and C. L. Zitnick, “Microsoft coco: Common objects in context,” in European conference on computer vision. Springer, 2014, pp. 740–755. 7. Brent Cowan and Bill Kapralos “A Survey of Frameworks and Game Engines for Serious Game Development” in 2014 IEEE 14th International Conference on Advanced Learning Technologies 8. Arthur Juliani,Vincent-Pierre Berges,Ervin Teng,Andrew Cohen,Jonathan Harper,Chris Elion,Chris Goy,Yuan Gao,Hunter Henry,Marwan Mattar,Danny Lange. “Unity: A General Platform for Intelligent Agents” in arxiv.org by Cornell University 9. K.J., Reshmaa, Selvin Peter Paul J., and Swetha V. "A Study on MQTT based Environmental Parameters, Monitoring and Alarming System". Eurasian Journal of Analytical Chemistry 13 no. SP (2018) 3131