SlideShare a Scribd company logo
ARJUN AK
S7IT
TKR18IT003. 1
CONTENTS
● INTRODUCTION
● LITURATURE SURVEY
● METHODOLOGY
● FLOWCHART
● NETWORK ARCHITECTURE
● SELECTION OF GAME ENGINE
● SELECTION OF AR SDK
● VUFORIA
● ARCore AND ARKit
● ADVANTAGE AND DISADVANTAGES
● CONCLUSION
● REFERENCES
22
INTRODUCTION
● User Experience is a very important factor of any software product and thus
companies spend a lot of time and money in development stage to make sure that
customers get a hassle-free experience.
● In today’s age it is observed that in order to further improve user experience the use
of Machine Learning is to be employed.
● We plan on surveying all the possible ways to implement Computer Vision based
marker less object tracking and integrate it with Augmented Reality Software
Development Kits to create an interface ready for customers to use.
● The main idea is to track hand landmarks using Computer Vision and use them as
placeholders for AR watch object.
33
LITERATURE SURVEY
SL NO PAPER AUTHOR YEAR
1 MediaPipe Hands: On-device Real-
time Hand Tracking
Fan Zhang, Valentin
Bazarevsky, Andrey
Vakunov, Andrei
Tkachenka, George Sung
18 Jun 2020.
2 Pose Anchor: A Single-stage Hand
Keypoint Detection Network
Yuan Li, Xinggang
Wang, Wenyu Liu and
Bin Feng
July 2020
3 A Study on MQTT based
Environmental Parameters,
Monitoring and Alarming System
K.J., Reshmaa, Selvin
Peter Paul J., and
Swetha V
2018
4 A Survey of Frameworks and Game
Engines for Serious Game
Development
Brent Cowan and Bill
Kapralos
2014
METHODOLOGY
“Researchers from Google “present’ on-device a real-time solution predicting a
human palm skeleton from a single RGB camera for AR products.
Two models comprise the pipeline:
•A palm detector responsible for bounding box generation across palm and provides it
to,
•A hand landmark model, that maps a palm sketch on the full hand. This is
implemented through MediaPipe ,ML solutions building platform.
55
FLOWCHART
666
● The graph can be divided into 2 subparts, one for hand detection and the other for
points of reference mapping.
● A key optimization here is that the hand detection only runs as needed, saving a lot
of processing power
● To accomplish this start by extracting the palm location in the present video frame
from the generated hand landmarks in the preceding frame, thus preventing the
palm detection each frame
● For toughness, the tracker even provides confidence score for captured hand.
whenever the confidence drops below the threshold then only the palm detector is
reapplied to the upcoming frame.
● “Koller, Hermann and Bowden’s” work shows a novel perspective to learning a
classifier which is on the basis of frames on poorly differently marked sequence
data by combining a Convolution Neural Net in an Expectation Maximisation
algorithm.
METHODOLOGY
77
● This permits the C.N.N. to be trained on a big set of trial images even if limited sequence
level knowledge is available for the source videos.
● This method is explained in situation related to hand shape detection, it could be of a
wider use to any video based recognition task which doesn’t have frame-level labelling
● Instead of using 2 stage architecture pipeline consisting of localizing hand and mapping
landmarks, Pose Anchor presents a single effective network architecture for hand
landmarks detection. An end-to-end C.N.N. is trained on a newly proposed pose anchor
network, which is based on RPN in Faster Region-based C.N.N
● Instead of manually designing hand pose anchors, they are generated using K-means
clustering based on OKS
● The main reason for using pose anchors is to mitigate occlusion to some extent by using
the prior knowledge of hand pose/structure Experiments conducted on (LSM-HPD) and
NZSL were used to show the robustness and feasibility of the architecture.
88
NETWORK DIAGRAM
99
HEAT MAP GENERATED
1010
Selection of Game Engine:
There are many game engines available to make AR applications. To chose one among
them many factors must be kept in mind, they are:
● Scripting: The code written to provide the instructions.
● Rendering: The generation of the 3D scene, the factors to be measured are speed and
accuracy.
● Animation: the change in render per frame to simulate movement.
● Artificial Intelligence: Ability of the computer to make decisions.
● Physics: Real world based calculations on simulated physical interactions.
● Audio: Audio feedback provided that can be spatially distinguished.
● Networking : Users have the ability to play with others online
1111
●A logical approach would be to go with a game engine which is widely used in the market,
so we used 2 Surveys to determine a suitable game engine
●Survey 1: Scan through a database of approximately 200 academic publications for
“serious game”, “educational game”, and “simulator”.
●Survey 2: Narrow down these selected engines from Survey 1 only based on “serious
gaming”
1212
GAME ENGINE AND FRAMEWORK
1313
●By analyzing given information we can clearly conclude that Unity and Unreal are
the most beneficial game engines, we chose Unity as our preferred game engine as the
only advantage of Unreal over Unity is C++ support, but the programming language
to be used is not an restriction in our situation.
14
Unity Engine Properties:
● Lighting can be done before hand or at run time, custom shaders can be created
using shader graphs and shadder programming thus replicating realistic lighting
conditions as well as textures which can be changed based on input to change the
simulation environment.
● Unity uses C# as the programming language behind the implementation of any
application built using the framework and thus complicated logical simulations and
inputs can be generated.
1515
The Machine Learning-Agents Toolkit
provided by the game engine:
●This is an free to use SDK available to integrate ML models with Unity.The 3 main features
in the SDK are
● Sensors, Agents, and Academy
●The agents collect, observe and execute actions. The agent is the component being trained by
constantly optimizing it’s policy known as Brain.
●The Academy manages the simulation,it is a singleton and thus used to keep track of all the
steps involved in the simulation.
●Singleton is defined as a software design pattern that restricts the instantiation of a class to
one "single" instance, any other instances which are created are automatically destroyed.
1616
ML AGENTS DOCUMENTATION
1717
Integration of the model with Unity:
The study for Gesture Recognition for non RC Drone have successfully imported Leap Motion
SDK and Gesture simple control package and used it for detection.
This shows that any kind of trained model can be imported into Unity.
The recommended method to import ML model into Unity:
● STEP 1: Save and export the model to ONNX format as this is the format supported by
Unity.
● STEP 2: Import Unity ML Agents
● STEP 3: Import the model into Unity.
● STEP 4: Access the camera feed frame by frame.
● STEP 5: Resize the render texture if required.
● STEP 6: Read the Softmax layer output from the model.
18
Selection of AR SDK
● SDK has multiple components within the application: identification, tracking, etc
● There are many AR SDKs to chose from like:
Vuforia, Metatio, Wikitude, ARToolKit, D’Fusion, ARmedi, ARCore, ARKit.
● There is another study by Anasse HANAFI in which they are focusing on those SDKs
which give a platform and support and function with hardware.
● They had tested these SDKs on the basis of licence type, target platform, development
platform, tracking type, functionality.
● The most important feature as per our requirement is tracking type and the results are
tabulated in Table
1919
AR SDKs COMPARISON BASED ON TRACKING
2020
Vuforia
●Vuforia is an augmented reality software development kit for mobile devices that enables the
creation of augmented reality applications.
●It uses computer vision technology to recognize and track planar images and 3D objects in real
time .
●Ms. Geetanjali Bhola and Amogh Bansal used Vuforia for Markerless Tracking in Retail
Industry. Their observation is can be referenced for our work to find which will be more suitable
for the process in hand .
2121
VUFORIA RESULTS BASED ON DISTANCE
2222
● Thus we can see that the results are perfect for our scenario, so it can be considered to
use Vuforia , but from Table our requirement is of Motion Tracking thus it would be a
more suitable choice would be to go for ARCore or ARKit.
2323
ARCore and ARKit:
●ARCore and ARKit are Google and Apple’s respective Augmented reality frameworks for
bringing more AR apps to thier platforms . They use the smartphone cameras to add
interactive elements to an existing environment.
●ARCore is used to build for Android Devices and ARKit is used to build for iOS devices.
Unity offers a wrapper class known as ARFoundation which includes both of these SDKs
and an added advantage is that it is not required to import these from an external source.
●Khalid Satori did a study to discover if ARFoundation is a suitable replacement for
ARKit or ARCore. Their observations are shown in Table
2424
AR FOUNDATION TABLE
2525
● As it can be seen from Table that ARFoundation either has all the features or
they are under development. Thus to keep the implementation simple, future
ready and cover all target platforms it is suitable to use ARFoundation as our
SDK.
● ARFoundation can be imported from Unity’s Package Manager which comes
along with Unity during installation.
2626
ADVANTAGES
● Once the content is placed in a room, it is more flexible than marker-based
alternatives.
● Marker less AR significantly increases the average range of motion.
● Marker based alternative relies on the image recognisability whereas
maker less doesn’t.
2727
DISADVANTAGES
● The augmented reality content may not make sense in certain context
● For better experience it is required that the surface has a texture for computer
vision to recognize it.
2828
CONCLUSION
We had a look at various computer vision techniques for implementing marker less
tracking on hands and also determined an appropriate game engine to import that
model into. We also compared various Augmented reality SDKs which can be used to
properly instantiate the desired objects. We found using ARfoundation to implement
the tracking will be much more efficient because of its simplistic nature in
implementation , future ready features and tracking and beacuse of its ablity to cover
all target platforms .
2929
REFERENCES
1. Fan Zhang, Valentin Bazarevsky, Andrey Vakunov, Andrei Tkachenka, George Sung, Chuo-Ling
Chang, Matthias Grundmann. “MediaPipe Hands: On-device Real-time Hand Tracking”
arXiv:2006.10214v1 [cs.CV] 18 Jun 2020.
2. HasCamillo Lugaresi, Jiuqiang Tang, Hadon Nash, Chris Mc-Clanahan, Esha Uboweja, Michael
Hays, Fan Zhang, Chuo-Ling Chang, Ming Guang Yong, Juhyun Lee, Wan-TehChang, Wei Hua,
Manfred Georg, and Matthias Grundmann.Mediapipe: A framework for building perception
pipelines.volume abs/1906.08172, 2019
3. Oscar Koller, Hermann Ney, and Richard Bowden. “Deep Hand: How to Train a CNN on 1 Million
Hand Images When Your Data is Continuous and Weakly Labelled”. 2016 IEEE Conference on
Computer Vision and Pattern Recognition (CVPR).
4. Yuan Li, Xinggang Wang, Wenyu Liu and Bin Feng. “Pose Anchor: A Single-stage Hand Keypoint
Detection Network”. IEEE Transactions on Circuits and Systems for Video Technology (July 2020)
3030
5 .S. Ren, K. He, R. Girshick, and J. Sun, “Faster r-cnn: Towards real-time object detection with region
proposal networks,” in Advances in neural information processing systems, 2015, pp. 91–99
6. T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollar, and C. L. Zitnick,
“Microsoft coco: Common objects in context,” in European conference on computer vision. Springer,
2014, pp. 740–755.
7. Brent Cowan and Bill Kapralos “A Survey of Frameworks and Game Engines for Serious Game
Development” in 2014 IEEE 14th International Conference on Advanced Learning Technologies
8. Arthur Juliani,Vincent-Pierre Berges,Ervin Teng,Andrew Cohen,Jonathan Harper,Chris Elion,Chris
Goy,Yuan Gao,Hunter Henry,Marwan Mattar,Danny Lange. “Unity: A General Platform for Intelligent
Agents” in arxiv.org by Cornell University
9. K.J., Reshmaa, Selvin Peter Paul J., and Swetha V. "A Study on MQTT based Environmental
Parameters, Monitoring and Alarming System". Eurasian Journal of Analytical Chemistry 13 no. SP
(2018)
3131
THANK
YOU
3232

More Related Content

Similar to Marker less augmentedd reality using computer vision

How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...
How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...
How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...
Unity Technologies
 
virtual_chess
virtual_chessvirtual_chess
virtual_chess
Chinar Patil
 
Doc muntation of android
Doc muntation of androidDoc muntation of android
Doc muntation of android
msramakrishna
 
Android Report
Android ReportAndroid Report
Android Report
Ganesh Waghmare
 
IRJET- Proposed Design for 3D Map Generation using UAV
IRJET- Proposed Design for 3D Map Generation using UAVIRJET- Proposed Design for 3D Map Generation using UAV
IRJET- Proposed Design for 3D Map Generation using UAV
IRJET Journal
 
Debug, Analyze and Optimize Games with Intel Tools
Debug, Analyze and Optimize Games with Intel Tools Debug, Analyze and Optimize Games with Intel Tools
Debug, Analyze and Optimize Games with Intel Tools
Matteo Valoriani
 
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Codemotion
 
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Codemotion
 
Kudan deck slide share e
Kudan deck slide share eKudan deck slide share e
Kudan deck slide share e
Tomoko Takeda
 
JIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdf
JIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdfJIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdf
JIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdf
SamiraKids
 
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
IRJET Journal
 
Computer-Vision_Integrating-Technology_MOB_17.06.16
Computer-Vision_Integrating-Technology_MOB_17.06.16Computer-Vision_Integrating-Technology_MOB_17.06.16
Computer-Vision_Integrating-Technology_MOB_17.06.16
Schuyler Kennedy
 
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloDeep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Linaro
 
F04402038042
F04402038042F04402038042
F04402038042
ijceronline
 
Iirdem design and implementation of finger writing in air by using open cv (c...
Iirdem design and implementation of finger writing in air by using open cv (c...Iirdem design and implementation of finger writing in air by using open cv (c...
Iirdem design and implementation of finger writing in air by using open cv (c...
Iaetsd Iaetsd
 
The Real Time Drowisness Detection Using Arm 9
The Real Time Drowisness Detection Using Arm 9The Real Time Drowisness Detection Using Arm 9
The Real Time Drowisness Detection Using Arm 9
IOSR Journals
 
IRJET-Implementation of Image Processing using Augmented Reality
IRJET-Implementation of Image Processing using Augmented RealityIRJET-Implementation of Image Processing using Augmented Reality
IRJET-Implementation of Image Processing using Augmented Reality
IRJET Journal
 
Mak product overview_no_video
Mak product overview_no_videoMak product overview_no_video
Mak product overview_no_video
Peter Swan
 
Leveraging Artificial Intelligence Processing on Edge Devices
Leveraging Artificial Intelligence Processing on Edge DevicesLeveraging Artificial Intelligence Processing on Edge Devices
Leveraging Artificial Intelligence Processing on Edge Devices
ICS
 
Final project report format
Final project report formatFinal project report format
Final project report format
Masud Sarkar
 

Similar to Marker less augmentedd reality using computer vision (20)

How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...
How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...
How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...
 
virtual_chess
virtual_chessvirtual_chess
virtual_chess
 
Doc muntation of android
Doc muntation of androidDoc muntation of android
Doc muntation of android
 
Android Report
Android ReportAndroid Report
Android Report
 
IRJET- Proposed Design for 3D Map Generation using UAV
IRJET- Proposed Design for 3D Map Generation using UAVIRJET- Proposed Design for 3D Map Generation using UAV
IRJET- Proposed Design for 3D Map Generation using UAV
 
Debug, Analyze and Optimize Games with Intel Tools
Debug, Analyze and Optimize Games with Intel Tools Debug, Analyze and Optimize Games with Intel Tools
Debug, Analyze and Optimize Games with Intel Tools
 
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
 
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
 
Kudan deck slide share e
Kudan deck slide share eKudan deck slide share e
Kudan deck slide share e
 
JIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdf
JIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdfJIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdf
JIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdf
 
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
 
Computer-Vision_Integrating-Technology_MOB_17.06.16
Computer-Vision_Integrating-Technology_MOB_17.06.16Computer-Vision_Integrating-Technology_MOB_17.06.16
Computer-Vision_Integrating-Technology_MOB_17.06.16
 
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloDeep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
 
F04402038042
F04402038042F04402038042
F04402038042
 
Iirdem design and implementation of finger writing in air by using open cv (c...
Iirdem design and implementation of finger writing in air by using open cv (c...Iirdem design and implementation of finger writing in air by using open cv (c...
Iirdem design and implementation of finger writing in air by using open cv (c...
 
The Real Time Drowisness Detection Using Arm 9
The Real Time Drowisness Detection Using Arm 9The Real Time Drowisness Detection Using Arm 9
The Real Time Drowisness Detection Using Arm 9
 
IRJET-Implementation of Image Processing using Augmented Reality
IRJET-Implementation of Image Processing using Augmented RealityIRJET-Implementation of Image Processing using Augmented Reality
IRJET-Implementation of Image Processing using Augmented Reality
 
Mak product overview_no_video
Mak product overview_no_videoMak product overview_no_video
Mak product overview_no_video
 
Leveraging Artificial Intelligence Processing on Edge Devices
Leveraging Artificial Intelligence Processing on Edge DevicesLeveraging Artificial Intelligence Processing on Edge Devices
Leveraging Artificial Intelligence Processing on Edge Devices
 
Final project report format
Final project report formatFinal project report format
Final project report format
 

Recently uploaded

CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECTCHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
jpsjournal1
 
Understanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine LearningUnderstanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine Learning
SUTEJAS
 
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
IJECEIAES
 
International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...
gerogepatton
 
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdfBPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
MIGUELANGEL966976
 
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODELDEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
gerogepatton
 
Textile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdfTextile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdf
NazakatAliKhoso2
 
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
IJECEIAES
 
A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...
nooriasukmaningtyas
 
Engineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdfEngineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdf
abbyasa1014
 
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEMTIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
HODECEDSIET
 
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMSA SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
IJNSA Journal
 
New techniques for characterising damage in rock slopes.pdf
New techniques for characterising damage in rock slopes.pdfNew techniques for characterising damage in rock slopes.pdf
New techniques for characterising damage in rock slopes.pdf
wisnuprabawa3
 
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
University of Maribor
 
Modelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdfModelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdf
camseq
 
Generative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of contentGenerative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of content
Hitesh Mohapatra
 
Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
Aditya Rajan Patra
 
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
ihlasbinance2003
 
basic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdfbasic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdf
NidhalKahouli2
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
Madan Karki
 

Recently uploaded (20)

CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECTCHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
 
Understanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine LearningUnderstanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine Learning
 
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
 
International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...
 
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdfBPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
 
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODELDEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
 
Textile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdfTextile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdf
 
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
 
A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...
 
Engineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdfEngineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdf
 
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEMTIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
 
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMSA SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
 
New techniques for characterising damage in rock slopes.pdf
New techniques for characterising damage in rock slopes.pdfNew techniques for characterising damage in rock slopes.pdf
New techniques for characterising damage in rock slopes.pdf
 
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
 
Modelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdfModelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdf
 
Generative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of contentGenerative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of content
 
Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
 
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
 
basic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdfbasic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdf
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
 

Marker less augmentedd reality using computer vision

  • 2. CONTENTS ● INTRODUCTION ● LITURATURE SURVEY ● METHODOLOGY ● FLOWCHART ● NETWORK ARCHITECTURE ● SELECTION OF GAME ENGINE ● SELECTION OF AR SDK ● VUFORIA ● ARCore AND ARKit ● ADVANTAGE AND DISADVANTAGES ● CONCLUSION ● REFERENCES 22
  • 3. INTRODUCTION ● User Experience is a very important factor of any software product and thus companies spend a lot of time and money in development stage to make sure that customers get a hassle-free experience. ● In today’s age it is observed that in order to further improve user experience the use of Machine Learning is to be employed. ● We plan on surveying all the possible ways to implement Computer Vision based marker less object tracking and integrate it with Augmented Reality Software Development Kits to create an interface ready for customers to use. ● The main idea is to track hand landmarks using Computer Vision and use them as placeholders for AR watch object. 33
  • 4. LITERATURE SURVEY SL NO PAPER AUTHOR YEAR 1 MediaPipe Hands: On-device Real- time Hand Tracking Fan Zhang, Valentin Bazarevsky, Andrey Vakunov, Andrei Tkachenka, George Sung 18 Jun 2020. 2 Pose Anchor: A Single-stage Hand Keypoint Detection Network Yuan Li, Xinggang Wang, Wenyu Liu and Bin Feng July 2020 3 A Study on MQTT based Environmental Parameters, Monitoring and Alarming System K.J., Reshmaa, Selvin Peter Paul J., and Swetha V 2018 4 A Survey of Frameworks and Game Engines for Serious Game Development Brent Cowan and Bill Kapralos 2014
  • 5. METHODOLOGY “Researchers from Google “present’ on-device a real-time solution predicting a human palm skeleton from a single RGB camera for AR products. Two models comprise the pipeline: •A palm detector responsible for bounding box generation across palm and provides it to, •A hand landmark model, that maps a palm sketch on the full hand. This is implemented through MediaPipe ,ML solutions building platform. 55
  • 7. ● The graph can be divided into 2 subparts, one for hand detection and the other for points of reference mapping. ● A key optimization here is that the hand detection only runs as needed, saving a lot of processing power ● To accomplish this start by extracting the palm location in the present video frame from the generated hand landmarks in the preceding frame, thus preventing the palm detection each frame ● For toughness, the tracker even provides confidence score for captured hand. whenever the confidence drops below the threshold then only the palm detector is reapplied to the upcoming frame. ● “Koller, Hermann and Bowden’s” work shows a novel perspective to learning a classifier which is on the basis of frames on poorly differently marked sequence data by combining a Convolution Neural Net in an Expectation Maximisation algorithm. METHODOLOGY 77
  • 8. ● This permits the C.N.N. to be trained on a big set of trial images even if limited sequence level knowledge is available for the source videos. ● This method is explained in situation related to hand shape detection, it could be of a wider use to any video based recognition task which doesn’t have frame-level labelling ● Instead of using 2 stage architecture pipeline consisting of localizing hand and mapping landmarks, Pose Anchor presents a single effective network architecture for hand landmarks detection. An end-to-end C.N.N. is trained on a newly proposed pose anchor network, which is based on RPN in Faster Region-based C.N.N ● Instead of manually designing hand pose anchors, they are generated using K-means clustering based on OKS ● The main reason for using pose anchors is to mitigate occlusion to some extent by using the prior knowledge of hand pose/structure Experiments conducted on (LSM-HPD) and NZSL were used to show the robustness and feasibility of the architecture. 88
  • 11. Selection of Game Engine: There are many game engines available to make AR applications. To chose one among them many factors must be kept in mind, they are: ● Scripting: The code written to provide the instructions. ● Rendering: The generation of the 3D scene, the factors to be measured are speed and accuracy. ● Animation: the change in render per frame to simulate movement. ● Artificial Intelligence: Ability of the computer to make decisions. ● Physics: Real world based calculations on simulated physical interactions. ● Audio: Audio feedback provided that can be spatially distinguished. ● Networking : Users have the ability to play with others online 1111
  • 12. ●A logical approach would be to go with a game engine which is widely used in the market, so we used 2 Surveys to determine a suitable game engine ●Survey 1: Scan through a database of approximately 200 academic publications for “serious game”, “educational game”, and “simulator”. ●Survey 2: Narrow down these selected engines from Survey 1 only based on “serious gaming” 1212
  • 13. GAME ENGINE AND FRAMEWORK 1313
  • 14. ●By analyzing given information we can clearly conclude that Unity and Unreal are the most beneficial game engines, we chose Unity as our preferred game engine as the only advantage of Unreal over Unity is C++ support, but the programming language to be used is not an restriction in our situation. 14
  • 15. Unity Engine Properties: ● Lighting can be done before hand or at run time, custom shaders can be created using shader graphs and shadder programming thus replicating realistic lighting conditions as well as textures which can be changed based on input to change the simulation environment. ● Unity uses C# as the programming language behind the implementation of any application built using the framework and thus complicated logical simulations and inputs can be generated. 1515
  • 16. The Machine Learning-Agents Toolkit provided by the game engine: ●This is an free to use SDK available to integrate ML models with Unity.The 3 main features in the SDK are ● Sensors, Agents, and Academy ●The agents collect, observe and execute actions. The agent is the component being trained by constantly optimizing it’s policy known as Brain. ●The Academy manages the simulation,it is a singleton and thus used to keep track of all the steps involved in the simulation. ●Singleton is defined as a software design pattern that restricts the instantiation of a class to one "single" instance, any other instances which are created are automatically destroyed. 1616
  • 18. Integration of the model with Unity: The study for Gesture Recognition for non RC Drone have successfully imported Leap Motion SDK and Gesture simple control package and used it for detection. This shows that any kind of trained model can be imported into Unity. The recommended method to import ML model into Unity: ● STEP 1: Save and export the model to ONNX format as this is the format supported by Unity. ● STEP 2: Import Unity ML Agents ● STEP 3: Import the model into Unity. ● STEP 4: Access the camera feed frame by frame. ● STEP 5: Resize the render texture if required. ● STEP 6: Read the Softmax layer output from the model. 18
  • 19. Selection of AR SDK ● SDK has multiple components within the application: identification, tracking, etc ● There are many AR SDKs to chose from like: Vuforia, Metatio, Wikitude, ARToolKit, D’Fusion, ARmedi, ARCore, ARKit. ● There is another study by Anasse HANAFI in which they are focusing on those SDKs which give a platform and support and function with hardware. ● They had tested these SDKs on the basis of licence type, target platform, development platform, tracking type, functionality. ● The most important feature as per our requirement is tracking type and the results are tabulated in Table 1919
  • 20. AR SDKs COMPARISON BASED ON TRACKING 2020
  • 21. Vuforia ●Vuforia is an augmented reality software development kit for mobile devices that enables the creation of augmented reality applications. ●It uses computer vision technology to recognize and track planar images and 3D objects in real time . ●Ms. Geetanjali Bhola and Amogh Bansal used Vuforia for Markerless Tracking in Retail Industry. Their observation is can be referenced for our work to find which will be more suitable for the process in hand . 2121
  • 22. VUFORIA RESULTS BASED ON DISTANCE 2222
  • 23. ● Thus we can see that the results are perfect for our scenario, so it can be considered to use Vuforia , but from Table our requirement is of Motion Tracking thus it would be a more suitable choice would be to go for ARCore or ARKit. 2323
  • 24. ARCore and ARKit: ●ARCore and ARKit are Google and Apple’s respective Augmented reality frameworks for bringing more AR apps to thier platforms . They use the smartphone cameras to add interactive elements to an existing environment. ●ARCore is used to build for Android Devices and ARKit is used to build for iOS devices. Unity offers a wrapper class known as ARFoundation which includes both of these SDKs and an added advantage is that it is not required to import these from an external source. ●Khalid Satori did a study to discover if ARFoundation is a suitable replacement for ARKit or ARCore. Their observations are shown in Table 2424
  • 26. ● As it can be seen from Table that ARFoundation either has all the features or they are under development. Thus to keep the implementation simple, future ready and cover all target platforms it is suitable to use ARFoundation as our SDK. ● ARFoundation can be imported from Unity’s Package Manager which comes along with Unity during installation. 2626
  • 27. ADVANTAGES ● Once the content is placed in a room, it is more flexible than marker-based alternatives. ● Marker less AR significantly increases the average range of motion. ● Marker based alternative relies on the image recognisability whereas maker less doesn’t. 2727
  • 28. DISADVANTAGES ● The augmented reality content may not make sense in certain context ● For better experience it is required that the surface has a texture for computer vision to recognize it. 2828
  • 29. CONCLUSION We had a look at various computer vision techniques for implementing marker less tracking on hands and also determined an appropriate game engine to import that model into. We also compared various Augmented reality SDKs which can be used to properly instantiate the desired objects. We found using ARfoundation to implement the tracking will be much more efficient because of its simplistic nature in implementation , future ready features and tracking and beacuse of its ablity to cover all target platforms . 2929
  • 30. REFERENCES 1. Fan Zhang, Valentin Bazarevsky, Andrey Vakunov, Andrei Tkachenka, George Sung, Chuo-Ling Chang, Matthias Grundmann. “MediaPipe Hands: On-device Real-time Hand Tracking” arXiv:2006.10214v1 [cs.CV] 18 Jun 2020. 2. HasCamillo Lugaresi, Jiuqiang Tang, Hadon Nash, Chris Mc-Clanahan, Esha Uboweja, Michael Hays, Fan Zhang, Chuo-Ling Chang, Ming Guang Yong, Juhyun Lee, Wan-TehChang, Wei Hua, Manfred Georg, and Matthias Grundmann.Mediapipe: A framework for building perception pipelines.volume abs/1906.08172, 2019 3. Oscar Koller, Hermann Ney, and Richard Bowden. “Deep Hand: How to Train a CNN on 1 Million Hand Images When Your Data is Continuous and Weakly Labelled”. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 4. Yuan Li, Xinggang Wang, Wenyu Liu and Bin Feng. “Pose Anchor: A Single-stage Hand Keypoint Detection Network”. IEEE Transactions on Circuits and Systems for Video Technology (July 2020) 3030
  • 31. 5 .S. Ren, K. He, R. Girshick, and J. Sun, “Faster r-cnn: Towards real-time object detection with region proposal networks,” in Advances in neural information processing systems, 2015, pp. 91–99 6. T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollar, and C. L. Zitnick, “Microsoft coco: Common objects in context,” in European conference on computer vision. Springer, 2014, pp. 740–755. 7. Brent Cowan and Bill Kapralos “A Survey of Frameworks and Game Engines for Serious Game Development” in 2014 IEEE 14th International Conference on Advanced Learning Technologies 8. Arthur Juliani,Vincent-Pierre Berges,Ervin Teng,Andrew Cohen,Jonathan Harper,Chris Elion,Chris Goy,Yuan Gao,Hunter Henry,Marwan Mattar,Danny Lange. “Unity: A General Platform for Intelligent Agents” in arxiv.org by Cornell University 9. K.J., Reshmaa, Selvin Peter Paul J., and Swetha V. "A Study on MQTT based Environmental Parameters, Monitoring and Alarming System". Eurasian Journal of Analytical Chemistry 13 no. SP (2018) 3131