SlideShare a Scribd company logo
Real Time
Object & Pose Detection
GUIDED BY: GROUP MEMBERS:
Prof. Nisha Bhati Ashwin Bicholiya
Aishwary Joshi
Arjun Soni
Anshul Sharma
INTRODUCTION
• Efficient and accurate object detection has been an important topic
in the advancement of computer vision systems.
• Our project aims to detect the object with the goal of achieving
high accuracy with a real-time performance.
• In this project, we use a completely deep learning based approach
to solve the problem of object detection.
• The input to the system will be a real time image, and the output
will be a bounding box corresponding to all the objects in the
image, along with the class of object in each box.
OBJECTIVE
• Develop a application that detects an object and it can be used for
vehicles counting, when the object is a vehicle such as a bicycle or
car, it can count how many vehicles have passed from a particular
area or road and it can recognize human activity too.
Problem Domain
• Humans can easily detect and identify objects present in an image
but for the computer or machine a classifying and finding an
unknown number of individual objects within an image is extremely
a difficult problem.
• Although there exist object detection software and application
they do not give an accurate result of an object because despite a
lot of research, real-time, and dynamic object detection methods
are still in process.
Solution Domain
• To classify image objects and also to determine the objects
positions we use two different methods object classification and
localization in the same time.
• The true purpose of this project is that it can be used in security,
surveillance and autonomous vehicle driving to detect pedestrians
walking or jogging on the street to avoid accidents.
• We use object classification and localization it provides accuracy,
speed for real time detection and also improves detection tasks are
optimized using one multi-task function and an object is compared
to the image’s true objects.
Required Resources
• As the application is mobile based and will be available for Android devices.
Software Requirements:
 An integrated development environment (IDE) Microsoft Visual Studio code.
 Flutter (UI software development kit ).
 Dart programming language
 Tensorflow
 YOLO Framework.
Hardware Requirements:
 To run Visual Studio code we need-
• Operating system of Microsoft Windows 10 (32-or 64-bit)
• RAM- 4 GB
• Hard Disk- Minimum 1GB disk space
Methodology to be adopted
PREPROCESSING
• The process of preprocessing improves the image intensity by
suppressing the unwanted features or enhancing them for further
processing.
• It resizes the image size to 448*448 and also normalizes the
contrast and brightness effects. The image is also cropped and
resized so that feature extraction can be performed easily. The
input images are pre-processed and very easily normalize the
contrasts and brightness.
Yolo Framework
This model divides the image into S×S grid and for each grid cell predicts B
bounding boxes, confidence for those boxes, and C class probabilities.
This predictions are encoded as an S×S × (B*5+C) tensor.
Use-case Diagram
System
User
Initialize
Real time image
Classification
&Localization
Recognisation
General
Output
Upload Image
Open camera
Cancel
Image-Grid S×S
S×S Grid to
bound box
Capture image
Activity Diagram
Upload Image or Open
Camera
Image
format
not
supported
Format Accepted
Error found Pre Processing
Classification & Localization
Recognization
Output Generated
Sequence Diagram
USER GUI System
1. Display Menu
2. Open camera/upload image
3. Process Image
Localization
5. Classification
4. Preprocessing
7. General Output
8. Display Output
&
Input image
Architecture of the System
Pre Processing
Classification and
Localization
Output
What are Localization and detection
Classification
Classification with
Localization Detection
Multiple Objects
Classification with localization
Vector features
Multiple Layers
Outputs the
predicted class
1. pedestrian
2. Car
3. Motorcycle
4. Background
Bx, by, bh, bw
Bounding Boxes
0,0
1,1Bx, by
bh
bw
Defining the target label Y
1. pedestrian
2. Car
3. Motorcycle
4. Background
Need to output bx, by, bh, bw, class label(1-4)
X=
PC
Bx
By
Bh
Bw
C1
C2
C3
1
Bx
By
Bh
Bw
0
1
0
0
?
?
?
?
?
?
?
Y=Y=
Y=
Overlapping Objects:
PC
Bx
By
Bh
Bw
C1
C2
C3
PC
Bx
By
Bh
Bw
C1
C2
C3
PC
Bx
By
Bh
Bw
C1
C2
C3
Y=
Y=
Anchor box 1: Anchor box 2:
1
Bx
By
Bh
Bw
1
0
0
1
Bx
By
Bh
Bw
0
1
0
Y=

More Related Content

What's hot

Object detection
Object detectionObject detection
Object detection
ROUSHAN RAJ KUMAR
 
Moving object detection
Moving object detectionMoving object detection
Moving object detection
Raviraj singh shekhawat
 
Object detection
Object detectionObject detection
Object detection
Jksuryawanshi
 
Machine Learning - Object Detection and Classification
Machine Learning - Object Detection and ClassificationMachine Learning - Object Detection and Classification
Machine Learning - Object Detection and Classification
Vikas Jain
 
Application of image processing
Application of image processingApplication of image processing
Application of image processing
University of Potsdam
 
Object tracking presentation
Object tracking  presentationObject tracking  presentation
Object tracking presentation
MrsShwetaBanait1
 
Image recognition
Image recognitionImage recognition
Image recognition
Aseed Usmani
 
Video object tracking with classification and recognition of objects
Video object tracking with classification and recognition of objectsVideo object tracking with classification and recognition of objects
Video object tracking with classification and recognition of objects
Manish Khare
 
Convolution Neural Network (CNN)
Convolution Neural Network (CNN)Convolution Neural Network (CNN)
Convolution Neural Network (CNN)
Suraj Aavula
 
Driver Drowsiness Detection report
Driver Drowsiness Detection reportDriver Drowsiness Detection report
Driver Drowsiness Detection report
PurvanshJain1
 
Yolo
YoloYolo
Traffic sign recognition
Traffic sign recognitionTraffic sign recognition
Traffic sign recognition
AKR Education
 
Face recognition attendance system
Face recognition attendance systemFace recognition attendance system
Face recognition attendance system
Naomi Kulkarni
 
face detection
face detectionface detection
face detection
Smriti Tikoo
 
OpenCV presentation series- part 1
OpenCV presentation series- part 1OpenCV presentation series- part 1
OpenCV presentation series- part 1
Sairam Adithya
 
Image segmentation ppt
Image segmentation pptImage segmentation ppt
Image segmentation ppt
Gichelle Amon
 
project ppt.pptx
project ppt.pptxproject ppt.pptx
project ppt.pptx
GYamini22
 
Real Time Object Tracking
Real Time Object TrackingReal Time Object Tracking
Real Time Object Tracking
Vanya Valindria
 
Image processing ppt
Image processing pptImage processing ppt
Image processing ppt
Raviteja Chowdary Adusumalli
 
Image Enhancement in Spatial Domain
Image Enhancement in Spatial DomainImage Enhancement in Spatial Domain
Image Enhancement in Spatial Domain
DEEPASHRI HK
 

What's hot (20)

Object detection
Object detectionObject detection
Object detection
 
Moving object detection
Moving object detectionMoving object detection
Moving object detection
 
Object detection
Object detectionObject detection
Object detection
 
Machine Learning - Object Detection and Classification
Machine Learning - Object Detection and ClassificationMachine Learning - Object Detection and Classification
Machine Learning - Object Detection and Classification
 
Application of image processing
Application of image processingApplication of image processing
Application of image processing
 
Object tracking presentation
Object tracking  presentationObject tracking  presentation
Object tracking presentation
 
Image recognition
Image recognitionImage recognition
Image recognition
 
Video object tracking with classification and recognition of objects
Video object tracking with classification and recognition of objectsVideo object tracking with classification and recognition of objects
Video object tracking with classification and recognition of objects
 
Convolution Neural Network (CNN)
Convolution Neural Network (CNN)Convolution Neural Network (CNN)
Convolution Neural Network (CNN)
 
Driver Drowsiness Detection report
Driver Drowsiness Detection reportDriver Drowsiness Detection report
Driver Drowsiness Detection report
 
Yolo
YoloYolo
Yolo
 
Traffic sign recognition
Traffic sign recognitionTraffic sign recognition
Traffic sign recognition
 
Face recognition attendance system
Face recognition attendance systemFace recognition attendance system
Face recognition attendance system
 
face detection
face detectionface detection
face detection
 
OpenCV presentation series- part 1
OpenCV presentation series- part 1OpenCV presentation series- part 1
OpenCV presentation series- part 1
 
Image segmentation ppt
Image segmentation pptImage segmentation ppt
Image segmentation ppt
 
project ppt.pptx
project ppt.pptxproject ppt.pptx
project ppt.pptx
 
Real Time Object Tracking
Real Time Object TrackingReal Time Object Tracking
Real Time Object Tracking
 
Image processing ppt
Image processing pptImage processing ppt
Image processing ppt
 
Image Enhancement in Spatial Domain
Image Enhancement in Spatial DomainImage Enhancement in Spatial Domain
Image Enhancement in Spatial Domain
 

Similar to Object detection presentation

Detection of a user-defined object in an image using feature extraction- Trai...
Detection of a user-defined object in an image using feature extraction- Trai...Detection of a user-defined object in an image using feature extraction- Trai...
Detection of a user-defined object in an image using feature extraction- Trai...
IRJET Journal
 
Real Time Moving Object Detection for Day-Night Surveillance using AI
Real Time Moving Object Detection for Day-Night Surveillance using AIReal Time Moving Object Detection for Day-Night Surveillance using AI
Real Time Moving Object Detection for Day-Night Surveillance using AI
IRJET Journal
 
Built-in Face Recognition for Smart Phone Devices
Built-in Face Recognition for Smart Phone DevicesBuilt-in Face Recognition for Smart Phone Devices
Built-in Face Recognition for Smart Phone Devices
IRJET Journal
 
Elderly Assistance- Deep Learning Theme detection
Elderly Assistance- Deep Learning Theme detectionElderly Assistance- Deep Learning Theme detection
Elderly Assistance- Deep Learning Theme detection
Tanvi Mittal
 
ppt - of a project will help you on your college projects
ppt - of a project will help you on your college projectsppt - of a project will help you on your college projects
ppt - of a project will help you on your college projects
vikaspandey0702
 
IRJET- Real-Time Object Detection System using Caffe Model
IRJET- Real-Time Object Detection System using Caffe ModelIRJET- Real-Time Object Detection System using Caffe Model
IRJET- Real-Time Object Detection System using Caffe Model
IRJET Journal
 
IRJET- Autonamy of Attendence using Face Recognition
IRJET- Autonamy of Attendence using Face RecognitionIRJET- Autonamy of Attendence using Face Recognition
IRJET- Autonamy of Attendence using Face Recognition
IRJET Journal
 
Introduction to Object recognition
Introduction to Object recognitionIntroduction to Object recognition
Introduction to Object recognition
Ashiq Ullah
 
2 D3 D Concersion Swaggmedia
2 D3 D Concersion   Swaggmedia2 D3 D Concersion   Swaggmedia
2 D3 D Concersion Swaggmedia
Craig Nobles
 
Object recognition
Object recognitionObject recognition
Object recognition
Aakanksha Singh
 
Dq4301702706
Dq4301702706Dq4301702706
Dq4301702706
IJERA Editor
 
IRJET - Face Recognition based Attendance System
IRJET -  	  Face Recognition based Attendance SystemIRJET -  	  Face Recognition based Attendance System
IRJET - Face Recognition based Attendance System
IRJET Journal
 
Project Face Detection
Project Face Detection Project Face Detection
Project Face Detection
Abu Saleh Musa
 
Face detection presentation slide
Face detection  presentation slideFace detection  presentation slide
Face detection presentation slide
Sanjoy Dutta
 
IRJET - Direct Me-Nevigation for Blind People
IRJET -  	  Direct Me-Nevigation for Blind PeopleIRJET -  	  Direct Me-Nevigation for Blind People
IRJET - Direct Me-Nevigation for Blind People
IRJET Journal
 
GP_Slides_V3 .pptx
GP_Slides_V3 .pptxGP_Slides_V3 .pptx
GP_Slides_V3 .pptx
AhmedEldairy
 
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNINGHANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
IRJET Journal
 
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNINGHANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
IRJET Journal
 
Face Recognition Based on Image Processing in an Advanced Robotic System
Face Recognition Based on Image Processing in an Advanced Robotic SystemFace Recognition Based on Image Processing in an Advanced Robotic System
Face Recognition Based on Image Processing in an Advanced Robotic System
IRJET Journal
 
Introduction talk to Computer Vision
Introduction talk to Computer Vision Introduction talk to Computer Vision
Introduction talk to Computer Vision
Chen Sagiv
 

Similar to Object detection presentation (20)

Detection of a user-defined object in an image using feature extraction- Trai...
Detection of a user-defined object in an image using feature extraction- Trai...Detection of a user-defined object in an image using feature extraction- Trai...
Detection of a user-defined object in an image using feature extraction- Trai...
 
Real Time Moving Object Detection for Day-Night Surveillance using AI
Real Time Moving Object Detection for Day-Night Surveillance using AIReal Time Moving Object Detection for Day-Night Surveillance using AI
Real Time Moving Object Detection for Day-Night Surveillance using AI
 
Built-in Face Recognition for Smart Phone Devices
Built-in Face Recognition for Smart Phone DevicesBuilt-in Face Recognition for Smart Phone Devices
Built-in Face Recognition for Smart Phone Devices
 
Elderly Assistance- Deep Learning Theme detection
Elderly Assistance- Deep Learning Theme detectionElderly Assistance- Deep Learning Theme detection
Elderly Assistance- Deep Learning Theme detection
 
ppt - of a project will help you on your college projects
ppt - of a project will help you on your college projectsppt - of a project will help you on your college projects
ppt - of a project will help you on your college projects
 
IRJET- Real-Time Object Detection System using Caffe Model
IRJET- Real-Time Object Detection System using Caffe ModelIRJET- Real-Time Object Detection System using Caffe Model
IRJET- Real-Time Object Detection System using Caffe Model
 
IRJET- Autonamy of Attendence using Face Recognition
IRJET- Autonamy of Attendence using Face RecognitionIRJET- Autonamy of Attendence using Face Recognition
IRJET- Autonamy of Attendence using Face Recognition
 
Introduction to Object recognition
Introduction to Object recognitionIntroduction to Object recognition
Introduction to Object recognition
 
2 D3 D Concersion Swaggmedia
2 D3 D Concersion   Swaggmedia2 D3 D Concersion   Swaggmedia
2 D3 D Concersion Swaggmedia
 
Object recognition
Object recognitionObject recognition
Object recognition
 
Dq4301702706
Dq4301702706Dq4301702706
Dq4301702706
 
IRJET - Face Recognition based Attendance System
IRJET -  	  Face Recognition based Attendance SystemIRJET -  	  Face Recognition based Attendance System
IRJET - Face Recognition based Attendance System
 
Project Face Detection
Project Face Detection Project Face Detection
Project Face Detection
 
Face detection presentation slide
Face detection  presentation slideFace detection  presentation slide
Face detection presentation slide
 
IRJET - Direct Me-Nevigation for Blind People
IRJET -  	  Direct Me-Nevigation for Blind PeopleIRJET -  	  Direct Me-Nevigation for Blind People
IRJET - Direct Me-Nevigation for Blind People
 
GP_Slides_V3 .pptx
GP_Slides_V3 .pptxGP_Slides_V3 .pptx
GP_Slides_V3 .pptx
 
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNINGHANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
 
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNINGHANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
 
Face Recognition Based on Image Processing in an Advanced Robotic System
Face Recognition Based on Image Processing in an Advanced Robotic SystemFace Recognition Based on Image Processing in an Advanced Robotic System
Face Recognition Based on Image Processing in an Advanced Robotic System
 
Introduction talk to Computer Vision
Introduction talk to Computer Vision Introduction talk to Computer Vision
Introduction talk to Computer Vision
 

Recently uploaded

Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 

Recently uploaded (20)

Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 

Object detection presentation

  • 1. Real Time Object & Pose Detection GUIDED BY: GROUP MEMBERS: Prof. Nisha Bhati Ashwin Bicholiya Aishwary Joshi Arjun Soni Anshul Sharma
  • 2. INTRODUCTION • Efficient and accurate object detection has been an important topic in the advancement of computer vision systems. • Our project aims to detect the object with the goal of achieving high accuracy with a real-time performance. • In this project, we use a completely deep learning based approach to solve the problem of object detection. • The input to the system will be a real time image, and the output will be a bounding box corresponding to all the objects in the image, along with the class of object in each box.
  • 3. OBJECTIVE • Develop a application that detects an object and it can be used for vehicles counting, when the object is a vehicle such as a bicycle or car, it can count how many vehicles have passed from a particular area or road and it can recognize human activity too.
  • 4. Problem Domain • Humans can easily detect and identify objects present in an image but for the computer or machine a classifying and finding an unknown number of individual objects within an image is extremely a difficult problem. • Although there exist object detection software and application they do not give an accurate result of an object because despite a lot of research, real-time, and dynamic object detection methods are still in process.
  • 5. Solution Domain • To classify image objects and also to determine the objects positions we use two different methods object classification and localization in the same time. • The true purpose of this project is that it can be used in security, surveillance and autonomous vehicle driving to detect pedestrians walking or jogging on the street to avoid accidents. • We use object classification and localization it provides accuracy, speed for real time detection and also improves detection tasks are optimized using one multi-task function and an object is compared to the image’s true objects.
  • 6. Required Resources • As the application is mobile based and will be available for Android devices. Software Requirements:  An integrated development environment (IDE) Microsoft Visual Studio code.  Flutter (UI software development kit ).  Dart programming language  Tensorflow  YOLO Framework. Hardware Requirements:  To run Visual Studio code we need- • Operating system of Microsoft Windows 10 (32-or 64-bit) • RAM- 4 GB • Hard Disk- Minimum 1GB disk space
  • 8. PREPROCESSING • The process of preprocessing improves the image intensity by suppressing the unwanted features or enhancing them for further processing. • It resizes the image size to 448*448 and also normalizes the contrast and brightness effects. The image is also cropped and resized so that feature extraction can be performed easily. The input images are pre-processed and very easily normalize the contrasts and brightness.
  • 9.
  • 10. Yolo Framework This model divides the image into S×S grid and for each grid cell predicts B bounding boxes, confidence for those boxes, and C class probabilities. This predictions are encoded as an S×S × (B*5+C) tensor.
  • 11. Use-case Diagram System User Initialize Real time image Classification &Localization Recognisation General Output Upload Image Open camera Cancel Image-Grid S×S S×S Grid to bound box Capture image
  • 12. Activity Diagram Upload Image or Open Camera Image format not supported Format Accepted Error found Pre Processing Classification & Localization Recognization Output Generated
  • 13. Sequence Diagram USER GUI System 1. Display Menu 2. Open camera/upload image 3. Process Image Localization 5. Classification 4. Preprocessing 7. General Output 8. Display Output &
  • 14. Input image Architecture of the System Pre Processing Classification and Localization Output
  • 15. What are Localization and detection Classification Classification with Localization Detection Multiple Objects
  • 16. Classification with localization Vector features Multiple Layers Outputs the predicted class 1. pedestrian 2. Car 3. Motorcycle 4. Background Bx, by, bh, bw Bounding Boxes 0,0 1,1Bx, by bh bw
  • 17. Defining the target label Y 1. pedestrian 2. Car 3. Motorcycle 4. Background Need to output bx, by, bh, bw, class label(1-4) X= PC Bx By Bh Bw C1 C2 C3 1 Bx By Bh Bw 0 1 0 0 ? ? ? ? ? ? ? Y=Y= Y=