The document proposes a 3-stage algorithm for real-time video object tracking on the DaVinci processor:
1. A novel object segmentation and background subtraction algorithm is designed to handle noise, illumination changes, and multiple moving objects.
2. Binary Large OBject (BLOB) detection is used to identify image clusters and solve problems of abrupt object shapes, sizes, and counts.
3. A centroid-based tracking method is used to improve robustness to occlusion and contour sliding.
Optimizations are applied at both the algorithm and code levels to reduce memory usage and access and improve execution speed, allowing the tracking of 30 frames per second in real-time. The algorithm provides at least a 2
FPGA Based Pattern Generation and Synchonization for High Speed Structured Li...TELKOMNIKA JOURNAL
Recently, structured light 3D imaging devices have gained a keen attention due to their potential
applications to robotics, industrial manufacturing and medical imaging. Most of these applications require
high 3D precision yet high speed in image capturing for hard and/or soft real time environments. This
paper presents a method of high speed image capturing for structured light 3D imaging sensors with FPGA
based structured light pattern generation and projector-camera synchronization. Suggested setup reduces
the time for pattern projection and camera triggering to 16msec from 100msec that should be required by
conventional methods.
Design and Analysis of Quantization Based Low Bit Rate Encoding Systemijtsrd
The objective of this paper is to develop a low bit rate encoding for VQ problems such as real time image coding.. The decision tree is generated by an offline process.. A new systolic architecture to realize the encoder of full search vector quantization VQ for high speed applications is presented here. Over past decades digital video compression technologies have become an integral part. Therefore the purpose is to improve image quality in Remote cardiac pulse measurement using Adaptive filter. It describes the approach to be used for feature extraction from many images.. This paper presents a real time application of compression of the image processing technique which can be efficiently used for the interfacing with any hardware. Therefore we have used Raspberry Pi in compression of image. We have developed an algorithm that is based on the endoscopic images that consist of the differential pulse code modulation. The compressors consist of a low cost YEF colour space converters and variable length predictive algorithm for lossless compression. Mr. Nilesh Bodne | Dr. Sunil Kumar "Design and Analysis of Quantization Based Low Bit Rate Encoding System" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-6 , October 2019, URL: https://www.ijtsrd.com/papers/ijtsrd29289.pdf Paper URL: https://www.ijtsrd.com/engineering/electronics-and-communication-engineering/29289/design-and-analysis-of-quantization-based-low-bit-rate-encoding-system/mr-nilesh-bodne
Dissertation synopsis for imagedenoising(noise reduction )using non local me...Arti Singh
Dissertation report for image denoising using non-local mean algorithm, discussion about subproblem of noise reduction,descrption for problem in image noise
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2021/10/person-re-identification-and-tracking-at-the-edge-challenges-and-techniques-a-presentation-from-the-university-of-auckland/
Morteza Biglari-Abhari, Senior Lecturer at the University of Auckland, presents the “Person Re-Identification and Tracking at the Edge: Challenges and Techniques” tutorial at the May 2021 Embedded Vision Summit.
Numerous video analytics applications require understanding how people are moving through a space, including the ability to recognize when the same person has moved outside of the camera’s view and then back into the camera’s view, or when a person has passed from the view of one camera to the view of another. This capability is referred to as person re-identification and tracking. It’s an essential technique for applications such as surveillance for security, health and safety monitoring in healthcare and industrial facilities, intelligent transportation systems and smart cities. It can also assist in gathering business intelligence such as monitoring customer behavior in shopping environments. Person re-identification is challenging.
In this talk, Biglari-Abhari discusses the key challenges and current approaches for person re-identification and tracking, as well as his initial work on multi-camera systems and techniques to improve accuracy, especially fusing appearance and spatio-temporal models. He also briefly discusses privacy-preserving techniques, which are critical for some applications, as well as challenges for real-time processing at the edge.
FPGA Based Pattern Generation and Synchonization for High Speed Structured Li...TELKOMNIKA JOURNAL
Recently, structured light 3D imaging devices have gained a keen attention due to their potential
applications to robotics, industrial manufacturing and medical imaging. Most of these applications require
high 3D precision yet high speed in image capturing for hard and/or soft real time environments. This
paper presents a method of high speed image capturing for structured light 3D imaging sensors with FPGA
based structured light pattern generation and projector-camera synchronization. Suggested setup reduces
the time for pattern projection and camera triggering to 16msec from 100msec that should be required by
conventional methods.
Design and Analysis of Quantization Based Low Bit Rate Encoding Systemijtsrd
The objective of this paper is to develop a low bit rate encoding for VQ problems such as real time image coding.. The decision tree is generated by an offline process.. A new systolic architecture to realize the encoder of full search vector quantization VQ for high speed applications is presented here. Over past decades digital video compression technologies have become an integral part. Therefore the purpose is to improve image quality in Remote cardiac pulse measurement using Adaptive filter. It describes the approach to be used for feature extraction from many images.. This paper presents a real time application of compression of the image processing technique which can be efficiently used for the interfacing with any hardware. Therefore we have used Raspberry Pi in compression of image. We have developed an algorithm that is based on the endoscopic images that consist of the differential pulse code modulation. The compressors consist of a low cost YEF colour space converters and variable length predictive algorithm for lossless compression. Mr. Nilesh Bodne | Dr. Sunil Kumar "Design and Analysis of Quantization Based Low Bit Rate Encoding System" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-6 , October 2019, URL: https://www.ijtsrd.com/papers/ijtsrd29289.pdf Paper URL: https://www.ijtsrd.com/engineering/electronics-and-communication-engineering/29289/design-and-analysis-of-quantization-based-low-bit-rate-encoding-system/mr-nilesh-bodne
Dissertation synopsis for imagedenoising(noise reduction )using non local me...Arti Singh
Dissertation report for image denoising using non-local mean algorithm, discussion about subproblem of noise reduction,descrption for problem in image noise
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2021/10/person-re-identification-and-tracking-at-the-edge-challenges-and-techniques-a-presentation-from-the-university-of-auckland/
Morteza Biglari-Abhari, Senior Lecturer at the University of Auckland, presents the “Person Re-Identification and Tracking at the Edge: Challenges and Techniques” tutorial at the May 2021 Embedded Vision Summit.
Numerous video analytics applications require understanding how people are moving through a space, including the ability to recognize when the same person has moved outside of the camera’s view and then back into the camera’s view, or when a person has passed from the view of one camera to the view of another. This capability is referred to as person re-identification and tracking. It’s an essential technique for applications such as surveillance for security, health and safety monitoring in healthcare and industrial facilities, intelligent transportation systems and smart cities. It can also assist in gathering business intelligence such as monitoring customer behavior in shopping environments. Person re-identification is challenging.
In this talk, Biglari-Abhari discusses the key challenges and current approaches for person re-identification and tracking, as well as his initial work on multi-camera systems and techniques to improve accuracy, especially fusing appearance and spatio-temporal models. He also briefly discusses privacy-preserving techniques, which are critical for some applications, as well as challenges for real-time processing at the edge.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Automated Traffic sign board classification system is one of the key technologies of Intelligent
Transportation Systems (ITS). Traffic Surveillance System is being more and important with improving
urban scale and increasing number of vehicles. This Paper presents an intelligent sign board
classification method based on blob analysis in traffic surveillance. Processing is done by three main
steps: moving object segmentation, blob analysis, and classifying. A Sign board is modelled as a
rectangular patch and classified via blob analysis. By processing the blob of sign boards, the meaningful
features are extracted. Tracking moving targets is achieved by comparing the extracted features with
training data. After classifying the sign boards the system will intimate to user in the form of alarms,
sound waves. The experimental results show that the proposed system can provide real-time and useful
information for traffic surveillance.
Moving object detection using background subtraction algorithm using simulinkeSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
Implementation of Object Tracking for Real Time VideoIDES Editor
Real-time tracking of object boundaries is an
important task in many vision applications. Here we propose
an approach to implement the level set method. This approach
does not need to solve any partial differential equations (PDFs),
thus reducing the computation dramatically compared with
optimized narrow band techniques proposed before. With our
approach, real-time level-set based video tracking can be
achieved.
A binarization technique for extraction of devanagari text from camera based ...sipij
This paper presents a binarization method for camera based natural scene (NS) images based on edge
analysis and morphological dilation. Image is converted to grey scale image and edge detection is carried
out using canny edge detection. The edge image is dilated using morphological dilation and analyzed to
remove edges corresponding to non-text regions. The image is binarized using mean and standard
deviation of edge pixels. Post processing of resulting images is done to fill gaps and to smooth text strokes.
The algorithm is tested on a variety of NS images captured using a digital camera under variable
resolutions, lightening conditions having text of different fonts, styles and backgrounds. The results are
compared with other standard techniques. The method is fast and works well for camera based natural
scene images.
A Review of Feature Extraction Techniques for CBIR based on SVMIJEEE
As with the advancement of multimedia technologies, users are not gratified with the conventional retrieval system techniques. So a application “Content Based Image Retrieval System” is introduced. CBIR is the application to retrieve the images or to search the digital images from the large database .The term “content” deals with the colour, shape, texture and all the information which is extracted from the image itself. This paper reviews the CBIR system which uses SVM classifier based algorithms for feature extraction phase.
PC-based Vision System for Operating Parameter Identification on a CNC MachineIDES Editor
Identification of suitable or optimum operating
parameters on a CNC machine is a non-trivial task. Especially
when the material of the component changes, operating
parameters need to be suitably varied. In this paper, a PCbased
vision system is presented for the automatic identification
of component material and appropriate selection of operating
parameters. The objective of this work is to develop a support
system to aid the operator in quick identification of machining
parameters
Improving Performance of Multileveled BTC Based CBIR Using Sundry Color SpacesCSCJournals
The paper presents an extension of content based image retrieval (CBIR) techniques based on multilevel Block Truncation Coding (BTC) using nine sundry color spaces. Block truncation coding based features is one of the CBIR methods proposed using color features of image. The approach basically considers red, green and blue planes of an image to compute feature vector. This BTC based CBIR can be extended as multileveled BTC for performance improvement in image retrieval. The paper extends the multileveled BTC using RGB color space to other nine color spaces. The CBIR techniques like BTC Level-1, BTC Level-2, BTC Level-3 and BTC Level-4 are applied using various color spaces to analyze and compare their performances. The CBIR techniques are tested on generic image database of 1000 images spread across 11 categories. For each CBIR technique, 55 queries (5 per category) are fired on extended Wang generic image database to compute average precision and recall for all queries. The results have shown the performance improvement (ie., higher precision and recall values) with BTC-CBIR methods using luminance-chrominance color spaces (YCgCb, Kekre’s LUV, YUV, YIQ, YCbCr) as compared to non-luminance (RGB, HSI, HSV, rgb , XYZ) Color spaces. The performance of multileveled BTC-CBIR increases gradually with increase in level up to certain extent (Level 3) and then increases slightly due to voids being created at higher levels. In all levels of BTC Kekre’s LUV color space gives best performance
An unsupervised method for real time video shot segmentationcsandit
Segmentation of a video into its constituent shots is a fundamental task for indexing and
analysis in content based video retrieval systems. In this paper, a novel approach is presented
for accurately detecting the shot boundaries in real time video streams, without any a priori
knowledge about the content or type of the video. The edges of objects in a video frame are
detected using a spatio-temporal fuzzy hostility index. These edges are treated as features of the
frame. The correlation between the features is computed for successive incoming frames of the
video. The mean and standard deviation of the correlation values obtained are updated as new
video frames are streamed in. This is done to dynamically set the threshold value using the
three-sigma rule for detecting the shot boundary (abrupt transition). A look back mechanism
forms an important part of the proposed algorithm to detect any missed hard cuts, especially
during the start of the video. The proposed method is shown to be applicable for online video
analysis and summarization systems. In an experimental evaluation on a heterogeneous test set,
consisting of videos from sports, movie songs and music albums, the proposed method achieves
99.24% recall and 99.35% precision on the average.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Automated Traffic sign board classification system is one of the key technologies of Intelligent
Transportation Systems (ITS). Traffic Surveillance System is being more and important with improving
urban scale and increasing number of vehicles. This Paper presents an intelligent sign board
classification method based on blob analysis in traffic surveillance. Processing is done by three main
steps: moving object segmentation, blob analysis, and classifying. A Sign board is modelled as a
rectangular patch and classified via blob analysis. By processing the blob of sign boards, the meaningful
features are extracted. Tracking moving targets is achieved by comparing the extracted features with
training data. After classifying the sign boards the system will intimate to user in the form of alarms,
sound waves. The experimental results show that the proposed system can provide real-time and useful
information for traffic surveillance.
Moving object detection using background subtraction algorithm using simulinkeSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
Implementation of Object Tracking for Real Time VideoIDES Editor
Real-time tracking of object boundaries is an
important task in many vision applications. Here we propose
an approach to implement the level set method. This approach
does not need to solve any partial differential equations (PDFs),
thus reducing the computation dramatically compared with
optimized narrow band techniques proposed before. With our
approach, real-time level-set based video tracking can be
achieved.
A binarization technique for extraction of devanagari text from camera based ...sipij
This paper presents a binarization method for camera based natural scene (NS) images based on edge
analysis and morphological dilation. Image is converted to grey scale image and edge detection is carried
out using canny edge detection. The edge image is dilated using morphological dilation and analyzed to
remove edges corresponding to non-text regions. The image is binarized using mean and standard
deviation of edge pixels. Post processing of resulting images is done to fill gaps and to smooth text strokes.
The algorithm is tested on a variety of NS images captured using a digital camera under variable
resolutions, lightening conditions having text of different fonts, styles and backgrounds. The results are
compared with other standard techniques. The method is fast and works well for camera based natural
scene images.
A Review of Feature Extraction Techniques for CBIR based on SVMIJEEE
As with the advancement of multimedia technologies, users are not gratified with the conventional retrieval system techniques. So a application “Content Based Image Retrieval System” is introduced. CBIR is the application to retrieve the images or to search the digital images from the large database .The term “content” deals with the colour, shape, texture and all the information which is extracted from the image itself. This paper reviews the CBIR system which uses SVM classifier based algorithms for feature extraction phase.
PC-based Vision System for Operating Parameter Identification on a CNC MachineIDES Editor
Identification of suitable or optimum operating
parameters on a CNC machine is a non-trivial task. Especially
when the material of the component changes, operating
parameters need to be suitably varied. In this paper, a PCbased
vision system is presented for the automatic identification
of component material and appropriate selection of operating
parameters. The objective of this work is to develop a support
system to aid the operator in quick identification of machining
parameters
Improving Performance of Multileveled BTC Based CBIR Using Sundry Color SpacesCSCJournals
The paper presents an extension of content based image retrieval (CBIR) techniques based on multilevel Block Truncation Coding (BTC) using nine sundry color spaces. Block truncation coding based features is one of the CBIR methods proposed using color features of image. The approach basically considers red, green and blue planes of an image to compute feature vector. This BTC based CBIR can be extended as multileveled BTC for performance improvement in image retrieval. The paper extends the multileveled BTC using RGB color space to other nine color spaces. The CBIR techniques like BTC Level-1, BTC Level-2, BTC Level-3 and BTC Level-4 are applied using various color spaces to analyze and compare their performances. The CBIR techniques are tested on generic image database of 1000 images spread across 11 categories. For each CBIR technique, 55 queries (5 per category) are fired on extended Wang generic image database to compute average precision and recall for all queries. The results have shown the performance improvement (ie., higher precision and recall values) with BTC-CBIR methods using luminance-chrominance color spaces (YCgCb, Kekre’s LUV, YUV, YIQ, YCbCr) as compared to non-luminance (RGB, HSI, HSV, rgb , XYZ) Color spaces. The performance of multileveled BTC-CBIR increases gradually with increase in level up to certain extent (Level 3) and then increases slightly due to voids being created at higher levels. In all levels of BTC Kekre’s LUV color space gives best performance
An unsupervised method for real time video shot segmentationcsandit
Segmentation of a video into its constituent shots is a fundamental task for indexing and
analysis in content based video retrieval systems. In this paper, a novel approach is presented
for accurately detecting the shot boundaries in real time video streams, without any a priori
knowledge about the content or type of the video. The edges of objects in a video frame are
detected using a spatio-temporal fuzzy hostility index. These edges are treated as features of the
frame. The correlation between the features is computed for successive incoming frames of the
video. The mean and standard deviation of the correlation values obtained are updated as new
video frames are streamed in. This is done to dynamically set the threshold value using the
three-sigma rule for detecting the shot boundary (abrupt transition). A look back mechanism
forms an important part of the proposed algorithm to detect any missed hard cuts, especially
during the start of the video. The proposed method is shown to be applicable for online video
analysis and summarization systems. In an experimental evaluation on a heterogeneous test set,
consisting of videos from sports, movie songs and music albums, the proposed method achieves
99.24% recall and 99.35% precision on the average.
IMAGE RECOGNITION USING MATLAB SIMULINK BLOCKSETIJCSEA Journal
The world over, image recognition are essential players in promoting quality object recognition especially in emergency and search-rescue operation. In this paper precise image recognition system using Matlab Simulink Blockset to detect selected object from crowd is presented. The process involves extracting object
features and then recognizes it considering illumination, direction and pose. A Simulink model has been developed to eliminate the tiny elements from the image, then creating segments for precise object recognition. Furthermore, the simulation explores image recognition from the coloured and gray-scale images through image processing techniques in Simulink environment. The tool employed for computation
and simulation is the Matlab image processing blockset. The process comprises morphological operation method which is effective for captured images and video. The results of extensive simulations indicate that this method is suitable for application identifying a person from a crow. The model can be used in emergency and search-rescue operation as well as in medicine, information security, access control, law enforcement, surveillance system, microscopy etc.
REGISTRATION TECHNOLOGIES and THEIR CLASSIFICATION IN AUGMENTED REALITY THE K...IJCSEA Journal
The registration in augmented reality is process which merges virtual objects generated by computer with
real world image caught by camera. This paper describes the knowledge-based registration, computer
vision-based registration and tracker-based registration technology. This paper mainly focused on trackerbased registration technology in augmented reality. Also described method in tracker- based technology,
problem and solution.
REGISTRATION TECHNOLOGIES and THEIR CLASSIFICATION IN AUGMENTED REALITY THE K...IJCSEA Journal
The registration in augmented reality is process which merges virtual objects generated by computer with real world image caught by camera. This paper describes the knowledge-based registration, computer vision-based registration and tracker-based registration technology. This paper mainly focused on trackerbased
registration technology in augmented reality. Also described method in tracker- based technology, problem and solution.
An Efficient Block Matching Algorithm Using Logical ImageIJERA Editor
Motion estimation, which has been widely used in various image sequence coding schemes, plays a key role in the transmission and storage of video signals at reduced bit rates. There are two classes of motion estimation methods, Block matching algorithms (BMA) and Pel-recursive algorithms (PRA). Due to its implementation simplicity, block matching algorithms have been widely adopted by various video coding standards such as CCITT H.261, ITU-T H.263, and MPEG. In BMA, the current image frame is partitioned into fixed-size rectangular blocks. The motion vector for each block is estimated by finding the best matching block of pixels within the search window in the previous frame according to matching criteria. The goal of this work is to find a fast method for motion estimation and motion segmentation using proposed model. Recent day Communication between ends is facilitated by the development in the area of wired and wireless networks. And it is a challenge to transmit large data file over limited bandwidth channel. Block matching algorithms are very useful in achieving the efficient and acceptable compression. Block matching algorithm defines the total computation cost and effective bit budget. To efficiently obtain motion estimation different approaches can be followed but above constraints should be kept in mind. This paper presents a novel method using three step and diamond algorithms with modified search pattern based on logical image for the block based motion estimation. It has been found that, the improved PSNR value obtained from proposed algorithm shows a better computation time (faster) as compared to original Three step Search (3SS/TSS ) method .The experimental results based on the number of video sequences were presented to demonstrate the advantages of proposed motion estimation technique.
Design and implementation of video tracking system based on camera field of viewsipij
The basic idea of this paper is to design and implement of video tracking system based on Camera Field of
View (CFOV), Otsu’s method was used to detect targets such as vehicles and people. Whereas most
algorithms were spent a lot of time to execute the process, an algorithm was developed to achieve it in a
little time. The histogram projection was used in both directional to detect target from search region,
which is robust to various light conditions in Charge Couple Device (CCD) camera images and saves
computation time.
Our algorithm based on background subtraction, and normalize cross correlation operation from a series
of sequential sub images can estimate the motion vector. Camera field of view (CFOV) was determined and
calibrated to find the relation between real distance and image distance. The system was tested by
measuring the real position of object in the laboratory and compares it with the result of computed one. So
these results are promising to develop the system in future.
Motion Object Detection Using BGS TechniqueMangaiK4
Abstract--- The detection of moving object is an important in many applications such as a vehicle identification in a traffic monitoring system,human detection in a crime branch.In this paper we identify a vehicle in a video sequence.This paper briefly explain the detection of moving vehicle in a video.We introduce a new algorithm BGS for idntifying vehicle in a video sequence.First, we differentiate the foreground from background in frames by learning the background.Then, the image is divided into many small nonoverlapped frames. The candidates of the vehicle part can be found from the frames if there is some change in gray level between the current image and the background.The extracted background subtraction method is used in subsequent analysis to detect a vehicle and classify moving vehicle
Motion Object Detection Using BGS TechniqueMangaiK4
Abstract--- The detection of moving object is an important in many applications such as a vehicle identification in a traffic monitoring system,human detection in a crime branch.In this paper we identify a vehicle in a video sequence.This paper briefly explain the detection of moving vehicle in a video.We introduce a new algorithm BGS for idntifying vehicle in a video sequence.First, we differentiate the foreground from background in frames by learning the background.Then, the image is divided into many small nonoverlapped frames. The candidates of the vehicle part can be found from the frames if there is some change in gray level between the current image and the background.The extracted background subtraction method is used in subsequent analysis to detect a vehicle and classify moving vehicle.
OBJECT DETECTION, EXTRACTION AND CLASSIFICATION USING IMAGE PROCESSING TECHNIQUEJournal For Research
Domestic refrigerators are widely used household appliances and a large extent of energy is consumed by this system. A phase change material is a substances that can store or release significant amount of heat energy by changing the phase liquid to vapour or vice versa. So, reduction of temperature fluctuation and improvement of system performances is that main reason of using PCM enhances the heat transfer rate thus improves the COP of refrigeration as well as the quality frozen food. The release and storage rate of a refrigerator is depends upon the characteristics of refrigerators and its properties using phase change material for a certain thermal load it is found that COP of conventional refrigerator is increased . The phase change material used in chamber built manually and which surrounds the evaporator chamber of a conventional refrigerator the whole heat transfer for load given to refrigerator cabin (to evaporator) evaporator to phase change material by conduction. This system hence improves the performances of household refrigerator by increasing its compressor cut-off time and thereby minimizing electrical energy usage. The main objective is to improve the performance, cooling time period, storage capacity and to maintain the constant cooling effect for more time during power cut off hours using phase change material.
International Journal of Engineering and Science Invention (IJESI)inventionjournals
International Journal of Engineering and Science Invention (IJESI) is an international journal intended for professionals and researchers in all fields of computer science and electronics. IJESI publishes research articles and reviews within the whole field Engineering Science and Technology, new teaching methods, assessment, validation and the impact of new technologies and it will continue to provide information on the latest trends and developments in this ever-expanding subject. The publications of papers are selected through double peer reviewed to ensure originality, relevance, and readability. The articles published in our journal can be accessed online.
Automated License Plate Recognition for Toll Booth ApplicationIJERA Editor
This paper describes the Smart Vehicle Screening System, which can be installed into a tollbooth for automated recognition of vehicle license plate information using a photograph of a vehicle. An automated system could then be implemented to control the payment of fees, parking areas, highways, bridges or tunnels, etc. There are considered an approach to identify vehicle through recognizing of it license plate using image fusion, neural networks and threshold techniques as well as some experimental results to recognize the license plate successfully.
Similar to Conference research paper_target_tracking (20)
Water scarcity is the lack of fresh water resources to meet the standard water demand. There are two type of water scarcity. One is physical. The other is economic water scarcity.
Student information management system project report ii.pdfKamal Acharya
Our project explains about the student management. This project mainly explains the various actions related to student details. This project shows some ease in adding, editing and deleting the student details. It also provides a less time consuming process for viewing, adding, editing and deleting the marks of the students.
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdffxintegritypublishin
Advancements in technology unveil a myriad of electrical and electronic breakthroughs geared towards efficiently harnessing limited resources to meet human energy demands. The optimization of hybrid solar PV panels and pumped hydro energy supply systems plays a pivotal role in utilizing natural resources effectively. This initiative not only benefits humanity but also fosters environmental sustainability. The study investigated the design optimization of these hybrid systems, focusing on understanding solar radiation patterns, identifying geographical influences on solar radiation, formulating a mathematical model for system optimization, and determining the optimal configuration of PV panels and pumped hydro storage. Through a comparative analysis approach and eight weeks of data collection, the study addressed key research questions related to solar radiation patterns and optimal system design. The findings highlighted regions with heightened solar radiation levels, showcasing substantial potential for power generation and emphasizing the system's efficiency. Optimizing system design significantly boosted power generation, promoted renewable energy utilization, and enhanced energy storage capacity. The study underscored the benefits of optimizing hybrid solar PV panels and pumped hydro energy supply systems for sustainable energy usage. Optimizing the design of solar PV panels and pumped hydro energy supply systems as examined across diverse climatic conditions in a developing country, not only enhances power generation but also improves the integration of renewable energy sources and boosts energy storage capacities, particularly beneficial for less economically prosperous regions. Additionally, the study provides valuable insights for advancing energy research in economically viable areas. Recommendations included conducting site-specific assessments, utilizing advanced modeling tools, implementing regular maintenance protocols, and enhancing communication among system components.
About
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
• Remote control: Parallel or serial interface.
• Compatible with MAFI CCR system.
• Compatible with IDM8000 CCR.
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
• Easy in configuration using DIP switches.
Technical Specifications
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
Key Features
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
• Remote control: Parallel or serial interface
• Compatible with MAFI CCR system
• Copatiable with IDM8000 CCR
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
Application
• Remote control: Parallel or serial interface.
• Compatible with MAFI CCR system.
• Compatible with IDM8000 CCR.
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
• Easy in configuration using DIP switches.
Final project report on grocery store management system..pdfKamal Acharya
In today’s fast-changing business environment, it’s extremely important to be able to respond to client needs in the most effective and timely manner. If your customers wish to see your business online and have instant access to your products or services.
Online Grocery Store is an e-commerce website, which retails various grocery products. This project allows viewing various products available enables registered users to purchase desired products instantly using Paytm, UPI payment processor (Instant Pay) and also can place order by using Cash on Delivery (Pay Later) option. This project provides an easy access to Administrators and Managers to view orders placed using Pay Later and Instant Pay options.
In order to develop an e-commerce website, a number of Technologies must be studied and understood. These include multi-tiered architecture, server and client-side scripting techniques, implementation technologies, programming language (such as PHP, HTML, CSS, JavaScript) and MySQL relational databases. This is a project with the objective to develop a basic website where a consumer is provided with a shopping cart website and also to know about the technologies used to develop such a website.
This document will discuss each of the underlying technologies to create and implement an e- commerce website.
Immunizing Image Classifiers Against Localized Adversary Attacksgerogepatton
This paper addresses the vulnerability of deep learning models, particularly convolutional neural networks
(CNN)s, to adversarial attacks and presents a proactive training technique designed to counter them. We
introduce a novel volumization algorithm, which transforms 2D images into 3D volumetric representations.
When combined with 3D convolution and deep curriculum learning optimization (CLO), itsignificantly improves
the immunity of models against localized universal attacks by up to 40%. We evaluate our proposed approach
using contemporary CNN architectures and the modified Canadian Institute for Advanced Research (CIFAR-10
and CIFAR-100) and ImageNet Large Scale Visual Recognition Challenge (ILSVRC12) datasets, showcasing
accuracy improvements over previous techniques. The results indicate that the combination of the volumetric
input and curriculum learning holds significant promise for mitigating adversarial attacks without necessitating
adversary training.
1. Design and implementation of novel image
segmentation and BLOB detection algorithm for
real-time video surveillance using DaVinci processor
Badri Narayana Patro
Dept. of Electrical Engineering
Indian Institute of Technology Bombay, India
Email: badriindia.1@gmail.com
Abstract—A video surveillance system is primarily designed
to track key objects, or people exhibiting suspicious behavior, as
they move from one position to another and record it for possible
future use. The critical parts of an object tracking algorithm are
object segmentation, image clusters detection, and identification
and tracking of these image clusters. The major roadblocks of the
tracking algorithm arise due to abrupt object shape, ambiguity
in number and size of objects, background and illumination
changes, noise in images, contour sliding, occlusions and real-
time processing.
This paper will explain a solution of the object tracking
problem, in 3 stages: In the first stage, design a novel object
segmentation and background subtraction algorithm, These al-
gorithm will take care of salt pepper noise, and changes in scene
illumination. In the second stage, solve the abrupt object shape
problems, objects size and count various objects present , using
image clusters detected and identified by the BLOBs (Binary
Large OBjects) in the image frame. In the third stage, design
a centroid based tracking method, to improve robustness w.r.t
occlusion and contour sliding.
A variety of optimizations, both at algorithm level and code
level, are applied to the video surveillance algorithm. At code level
optimization mechanisms significantly reduce memory access,
memory occupancy and improved operation execution speed.
Object tracking happens in real-time consuming 30 frames per
second(fps) and is robust to occlusion, contour sliding, back-
ground and illumination changes. Execution time for different
blocks of this object tracking algorithm were estimated and the
accuracy of the detection was verified using the debugger and the
profiler, which will provided by the TI(Texas Instrument) Code
Composer Studio (CCS). We demonstrate that this algorithm,
with code and algorithm level optimization on TIs DaVinci
multimedia processor (TMS320DM6437), provides at least two
times speedup and is able to track a moving object in real-time
as compared to without optimization.
Keywords—Centroid Based, Segmentation, BLOB, Tracking,
Optimization, Background Subtraction DaVinci Processor.
I. INTRODUCTION
Surveillance systems are used for monitoring, tracking
of objects and screening of activities in public places such
as banks, in order to ensure security. Various aspects like
screening objects and people, biometric identification and
video surveillance, maintaining the database of threats and
blackmails etc., which are used for monitoring the activity.
In this paper, we present our approach in each stage,
such as video object tracking approach based on background
subtraction, image segmentation [1], blob detection and iden-
tification, and center of mass based tracking, where these
techniques were implemented on TMS320DM6437. Our ap-
proach consists of improvements and novel ideas in each stage
of the object tracking algorithm, such as, threshold based
segmentation, centroid based tracking and a novel idea for
blob detection and identification. These individual improve-
ments combine together to improve video object tracking, and
provide fast, accurate and good video object tracking services.
Apart from presenting new and improved algorithms in the
different stages of the object tracking algorithm, also present
optimizations at the source code level. These consist of using
the inbuilt DSP instructions, the DSP pipeline technique, and
Temporary Variables, to achieve speed optimization and code
optimization (based on the characteristics of C64x+ DSP core).
Consequently, this algorithm can be applied to multiple moving
and still objects in the case of a fixed camera. Different steps
for video object tracking are shown in Figure 1.
Surveillance camera is used to capture input video data.
This input video frame is used to create a YUV-stream, after
correcting for bad pixels, color, lens distortion etc., from the
RGB stream. This input stream is then down sampled and
followed by resizing to 480x720, and low light is adjusted
using adjust auto focus, white balance and exposer. This front
end processing is collectively referred to as the Image Pipe.
The most computation intensive part of tracking algo-
rithm is background subtraction, which is based on difference
between a previous image or background image[3] and the
current image. There are various methods for background
subtraction[11], and each algorithm has its own merits and
demerits. However, algorithms based on the difference of
images have problems in the following cases: First case, when
still objects included in the tracking task exist. Second, when
more than one moving objects are present in the same frame.
Third, when the camera is moving. Fourth, when occlusion of
objects occurs. By considering real-time performance and com-
putational speed optimized method for background subtraction
is implemented in this paper which tries to avoid most of the
effects. The occlusion effect can be solved by using adjacency
pixel value and center of mass algorithm for object tracking.
Image segmentation process is identifying components of
homogeneous regions in the image. This algorithm is used to
extract various information of a particular object like persons,
car. Primer there are three image Segmentation algorithm[10],
1909978-1-4799-3080-7/14/$31.00 c 2014 IEEE
2. such as 1. threshold based technique, 2. boundary based
technique, 3. region based technique.. The basic threshold
technique is the process of reducing the gray levels in the
image and is based on pixel intensity level. problem in thresh-
old [10] technique requires additional filtering background
noise and clustering. It is very difficult to distinguish intensity
gradient, i.e., variance in contrast in case of edge based
technique. problem in region growing is to to find regions in
case of over stringent[10], blurred regions and merging of the
regions. A novel approach of image segmentation algorithm,
in order to extract all objects information in the input image,
is implemented in this paper which is optimized and overcome
most of the problems.
Fig. 1: Object tracking for visual surveillance system
The main problem of object tracking in continuous video
stream is Binary Large OBjects (BLOBs) detection and
identification[4]. Various approaches developed for blob detec-
tion can be grouped into following categories: scale-space anal-
ysis, matched filters or template matching, watershed detection
and sub-pixel precise blob detection which will have their
advantage and disadvantages. In this paper we will discusses
about various method for blob detection and implement best
method for BLOB Detection.
In case of BLOB identification process those pixel value
which is greater than threshold value are checked for their
adjacency pixels[4].Each blob is enclosed by rectangle which
is elastic in nature, i.e., it can stretch in vertical, and horizontal
directions until the whole blob is enclosed in a rectangle box.
The process is then repeated for all blobs that are present in
the image. This paper is based on a BLOB detection system
with adjacency pixels, a Bounding Box and a Center-of-Mass
based approach for tracking the computation of the center
points using DSP processor. Also in this paper will discuss
about design problems and performance gain. The statistical
feature[1] of each blob such as, the approximate location of
the center gravity, the size of the rectangular enclosure, the
actual size or pixel count of the blob, and volume bounded by
the membership function value are calculated.
The major problem for detection and tracking applications
is occlusion. In Kalman[8] filter, observed that tracking of
moving objects are partially occluded. Therefore the objects
need to be detected without occlusion or the model of the
occluding objects had to be taken into account as well. The
use of centroids introduces a competitive learning mechanisms
in the tracking algorithm which lead to improved robustness
with respect to occlusion and contour sliding[6].
Boundary Box and Center of mass are two approaches
for estimation of center points of the enclosed BLOB’s. The
problem in the boundary box[4] method is that center position
is strongly depends upon pixel presents at the blobs border
and Flicking at boundary due to detection of pixels based on
Threshold value. In order to achieve a higher precision for
the center point computation the application of a Center-of-
Mass (CoM) based method has been selected. For the CoM
based method the BLOB detection module needs to store more
information about the identified pixels. It covers the design
and implementation of a tracking solution for the estimated
BLOB center points in hardware and in software. Those two
approaches are compared with respect to precision[6] and
performance.
Video surveillance systems require high resolution video,
large bandwidth and higher computational speed at low cost
and power. DaVinci devices are suitable for surveillance ap-
plications as they provide, ASIC-like low cost and power
for the complex processing, programmable DSPs with high
performance. The DaVinci processor DM6437 is a fixed-point
DSP based on the third generation high performance, advanced
VelociTI[12] very-long-instruction-word (VLIW) structure of
clock rate 700 MHz and faces digital multimedia appli-
cations. It also provide function accelerators in the video
processing[10] subsystems(VPSS) for common video process-
ing tasks such as Image pipeline, encoding, decoding and
display.
II. DESIGN STATEMENT
From the Literature Review we can derive our problem
statement which can consider following points
• To have a better a background subtraction model for
object tracking.
• To develop a better object segmentation algorithm
with a good image filtering and edge detection which
will improve computation time and complexity for real
time applications.
• Design an better image clusters detection and identifi-
cation program which is not discussed in the most of
literature.
• Design simple, accurate and optimized tracking
method to improved robustness w.r.t occlusion and
contour sliding.
• Optimization of all algorithms for program flow de-
signed, memories access time and memory uses.
III. ALGORITHM DESIGN FOR OBJECT TRACKING
It proposes a novel algorithm, which consist of five stages:
In the first stage, input captured stream has to be pre-processed
for down-sampling, resizing and to creates a YUV-stream after
correcting for bad pixels, color, lens distortion. Second stage,
segregate the moving objects from the background form each
frame using back ground subtraction method all objects present
in the image can be detected irrespective of they are moving
or not. Third stage, with the help of image segmentation using
crisp fuzzier, smooth filter and median filter, the subtracted
image is filtered out and free from salt paper noise. In the
fourth stage, the segmented image is processed for detecting
and identifying the BLOBs present, which is going to be
tracked. In the fifth stage, the object tracking is carried out
1910 2014 International Conference on Advances in Computing,Communications and Informatics (ICACCI)
3. by feature extraction and center of mass calculation in feature
space of the BLOB detection results of successive frames. The
major steps for object tracking are as shown in Figure 2.
Fig. 2: TRACKING BLOCK DIAGRAM
The major steps for object tracking are such as:A. Image
Preprocessing, B. Background subtraction, C. Object seg-
mentation, D. Blob Detection and Identification, E. Feature
Extraction and Selection, F. Object tracking.
A. Camera control and Image Pipe
The goal of pre-processing is to process the captured input
stream and apply basic image processing logic to change row
input video into a specific format by removing noise using
smoothing and unwanted object by using morphological oper-
ation. Here captured frame is processed using down sampling,
resizing, preview engine and H3A in order to creates a YUV-
stream. The role of Image Preview[5] is for transforming
row uncompressed image and video data from camera into
YUV422 for compression or display. Capture Image stream is
converted from RGB to YUV after correcting for bad pixels,
color,dark frame, lens distortion, luminance enhancement and
chrominance suppression. Job of resizer is to make up sam-
ple for digital zoom using bi-cubic Interpolation and down
sampling to reduce image size. Auto focus (AF), auto white
balance(AWB), and auto exposure (AE), collectively known as
H3A is applied on the raw image data.
B. Background subtraction
The main motivation for the background subtraction is to
detect all the foreground objects in a frame sequence from
static camera. The rationale in this approach is that of detecting
the moving objects from the difference between the current
frame and a reference frame, often called the “background im-
age”, or “background model”. Four major steps in background
subtraction method are
• Pre-processing : Noise filtering
• Background Modeling: Mean filter or Median filter or
frame difference.
• Foreground detection: Detection Based on 2 frames or
3 frames or Gaussian model.
• Data validation
In order to detect the foreground objects, the difference be-
tween the current frame and an image of the scene’s static
background is compared with a threshold. The background
subtraction equation is expressed as:
|Frame(t) − Background(t)| > Th (1)
Background(x, y, t) = Image(x, y, t − 1) (2a)
|Image(x, y, t) − Image(x, y, t − 1)| > Th (2b)
There are some problems associated with this method[11]
are occlusion handling problem i.e. overlapping of moving
blobs, camera oscillations, lighting condition, shadow detec-
tion and illumination changes. There are various other simple
approaches aiming to, maximize speed, limit the memory
requirements, to achieve the highest possible accuracy un-
der any possible circumstances. These approaches include,
running gaussian average, temporal median filter, mixture of
gaussians, kernel density estimation (KDE), co-occurrence of
image variations, eigen backgrounds. By considering real-
time performance computational speed background subtraction
algorithm is implemented in this paper using running average
difference method as the background model and foreground
detection using 2 frame detection. the back ground modeling
equation as follows:
B(i + 1) = α ∗ F(i) + (1 − α) ∗ B(i) (3a)
|I(x, y, t) − B(x, y, t)| > Th (3b)
where α, the learning rate, is typically 0.05.
C. Object segmentation
In image segmentation process,an image is segmented
into various regions that have same characteristics and then
extracting the interested regions. The basic aspects in image
segmentation include threshold, clustering, edge detection and
region growing. This paper describe about a novel image
segment algorithm as follows:
1) Novel image segmentation approach:: The novel image
segmentation algorithm proposed uses crisp fuzzier, smooth
filter and median filter in the order as shown in figure below.
Fig. 3: Image segmentation Steps
In this approach, crisp fuzzier is used for finding out the
relevant gray value information and the output data of crisp
fuzzier is then processed in order to eliminate isolated points
and noise.Enhancing the segmented result using morphological
operations. Elimination of isolated points and noise removal
is done by using a binary smoothing filter and a median filter
respectively.
Steps for Image Segmentation approach are as follows:
2014 International Conference on Advances in Computing,Communications and Informatics (ICACCI) 1911
4. 1) The pixel data from the input frame and is passed
through a crisp fuzzyfier. The job of crisp fuzzyfier
is to first select a pixel value , say p, if the selected
pixel value p is in the range of PL and PH ( PL ≤ p
≤ PH), then the pixel data is assigned a value of 1.0.
Else the pixel data is assign to zero. where PL and
PH are the lower and upper limits of the pixel values.
This process will replete until all data are processed.
2) The image pixel data obtained in step-1 is processed
for smoothing filter which is used to a. remove
isolated points, b. fill in small holes, c. fill in small
notches in edge segments.
3) The binary image data obtained in step-2 is filtered
using a median filter. The median filter is a nonlinear
digital filtering technique, in which the center pixel
value is replaced with the median of all the pixel
values in the filter window. However, the performance
of Gaussian is much better than median blur for
high levels of noise, but it is used to remove salt
and pepper noise (impulsive noise) in the blobs and
separate out those points whose intensity is very
different form their neighbors and make a blob of
all pixel value whose intensity is nearly equal to
its neighbors. Finally the output of the median filter
consists of clusters or blobs.
D. Object Detection and Identification
The aim of the BLOB detection is to determine the
center point of the Blobs in the current frame encoded in
XY-coordinates. A BLOB consists of white pixels while the
background pixels are black. To simplify the problem, an upper
bound for the number of Blobs to detect has been defined.
There are Two simple criteria to decide if a pixel belongs to a
BLOB:1) Is brightness of the pixel greater than threshold? 2)
Is pixel adjacent to pixels of a detected BLOB? .Dependent on
the viewing angle of the user and the speed of his motion,the
intensity and shape of the Blobs can vary. if the user speed of
motion is less than the actual frame rate and resolution of the
camera which will eliminate blurring effect.
In case of BLOB identification process those pixel value
which is greater than threshold value are checked for their
adjacency pixels. To combine detected pixels to Blobs a test of
pixel adjacency needs to be performed. There are two common
methods to evaluate adjacent Pixels that is 4-pixel and 8-pixel
neighborhood as shown in fig.4 . 4-pixel neighborhood checks
for adjacent pixel only on vertical and horizontal axis of the the
current pixel. 8-pixel neighborhood check for adjacent pixel on
vertical, horizontal and diagonal axis of the the current pixel.
In this paper,to realize BLOB detection 8 pixel neighborhood
is used for adjacency check due to 8-pixel neighborhood is
more reliable than 4-pixel neighborhood.
The BLOBs with blur depends on the direction of the
objects movement. If the motion of the object goes along the
horizontal or vertical axis of the screen, the BLOB can show
an elliptical shape. In the detection approach, only the elliptical
shape has been taken into account.
E. Feature Extraction of Blob
In this step geometric and radiometric feature of the blob
are calculated. Boundary length, Blob area, color of each blob,
Fig. 4: 4 and 8 Connected Neighborhoods
shape, Distance between the blob, Geometric moments that is
co-ordinate of center of blob, and higher order moments are
the following significant features of blobs which are going to
extract.
F. Object Tracking: Center-Of-Mass method
After Blobs detection and feature selection method, the
next step is computation of their center points of Blobs. Bound-
ary Box and Center of mass are two approaches for estimation
of center points of the enclosed BLOBs. In Boundary Box
approaches, the center-point of the blob can be estimated with
the help of minimum and maximum coordinates of the pixels
on the horizontal and vertical axis.
The center positions of the BLOBs can be computed using
following formula :
X position =
maxX position + minX position
2
(4a)
Y position =
maxY position + minY position
2
(4b)
The computational complexity of mathematical operations
like addition and subtraction are not so high. Problem is
division operation, that is Divide by 2 can be achieve by using
bit shift operator. The problems in boundary box algorithms
are follows: 1. Center position is strongly depends upon pixel
presents at the blob’s border. 2. Flicking at the boundary due
to detection of pixels based on Threshold value. When Blobs
in motion, this Flickering effects become stronger and stronger
and cause motion Blur in the blobs shape. In order to reduce
flickering effect center-of-mass method(CoM is used for the
computation of the blob center point.
In this method, for the computation of the center point
All pixels of the detected blob are taken into account. The
formula for finding position value of the detected pixels as
weighted sum and calculates an averaged center coordinate.
BLOBs center position is follows
X position =
(X position of all BLOB pixels)
number of all BLOB pixels
(5a)
Y position =
(Y position of all BLOB pixels)
number of all BLOB pixels
(5b)
Above algorithm helps to find the center position of a
BLOB w. r. t. its mass. But it does not use the information
1912 2014 International Conference on Advances in Computing,Communications and Informatics (ICACCI)
5. about the brightness of the pixel, which belongs to the BLOB.
In order to increases the precision of the center-position of a
blob, CoM algorithm is modified by replacing pixel position
with the product of the pixel position and brightness of the
pixels is multiplied as weights to pixel position. CoM approach
shifts the estimated center position of the BLOB more to the
region concentrated with pixels with the highest brightness.
center position =
(pixel position ∗ pixel brightness)
(pixel brightness)
(6)
X position =
(X pixel position ∗ pixel brightness)
(pixel brightness)
(7a)
Y position =
(Y pixel position ∗ pixel brightness)
(pixel brightness)
(7b)
Since video frame is gray scale, flickering depends upon
threshold value and the color gradient. In order to avoid a
similar flickering in the computed center-position of the blob,
it is recommended to running average( a running summation
of the all pixel values during the detection phase followed by
division of the number of pixels). CoM and Boundary Box are
giving similar results for Blobs with perfect circular shape. But
for the Blobs with motion blur, the modified Center-of-Mass
based approach was closer to the expected optimal result.
IV. IMPLEMENTATION OF OBJECT TRACKING
ALGORITHM
A. Software Design Flow
DaVinci Software Architecture consist of three layers
that are Signal Processing Layer (SPL),Input Output Layer
(IOL) and Application Layer(APL). SPL take care of all the
processing functions or algorithms that run on the device.
Similarly, all the input and output, that is all the peripheral
drivers are grouped into another layer that is Input-Output
Layer (IOL).Also IOL generates buffers for input or output
data buffers which reside in shared memory. The IOL drivers
are integrated into an Operating System such as Linux OS or
DSP/BIAS. IOL contains Video Processing Subsystem(VPSS)
device driver used for video capturing and displaying. The
third layer is the Application Layer which interacts with IOL
and SPL. Also APL makes calls to IOL for data input and
output, and to SPL for processing. The APL generates a master
thread which is the highest priority level thread such as a
video thread or an audio thread. The job of master thread is
to handles the opening of I/O resources (through EPSI API),
the creation of processing algorithm instances (through VISA
API), as well as the freeing of these resources.
In case of video playback loop, video capture driver reads
data from a input video port and starts storing into a input
memory buffer. An interrupt is generated by the IOL to the
APL, When this input buffer is full and the fully filled input
buffer pointer is passed to the APL. Now APL generates an
interrupt and passes that buffer pointer to the SPL. Now SPL
layer processes this input buffer data and when data processing
completes, it generates an interrupt back to the APL and
transfers the pointer of the output buffer that it created. The
APL passes this output buffer pointer to the IOL and also APL
gives instruction to IOL for display output buffer using display
driver. As we know that overhead passing the buffer pointers is
negligible because here, only pointers to the memory location
are passed while the buffers remain in that memory location.
Fig. 5: Software Processor Flow
B. Sequence Diagram and API Design Flow
The control flow of the video tracking application is
shown in Figure 7. The following is the sequence of oper-
ations: Initialize video capture device using VPFE open() and
VPBE open(). Configure the video capture device using this
API.The FVID handle is then used to configure the video
input (composite, component, s-video), video standard (ntsc,
pal, secam, auto), video file format (UYVY,YUYV, YUV420,
etc.).
VPSS getbuffer ,The VPSS driver has a queue of buffers
for capturing video frames. Whenever a buffer is filled up, it
is moved to the back of the queue.The application can obtain
a buffer using FVID dequeue() API. If a buffer is dequeued
from the VPFE queue, the buffer depth at the VPFE driver
decreases.
FVID exchange() API removes a buffer from the VPFE
buffer queue, takes a buffer from application and adds it buffer
to the VPFE buffer queue.The buffer dequeued from the VPFE
buffer queue is returned to the application as shown in Figure.6
. Close all VPFE and VPBF device using VPFE close() and
VPBE close().
C. Object Tracking Function Flow
The steps involved in video surveillance tracking system
are as shown in Figure. 8 as follows:
• Capture video sequence by CCD camera.
• Pre-process of the video frame free form noise.
• Segregating input image frame sequences from the
input video frame.
2014 International Conference on Advances in Computing,Communications and Informatics (ICACCI) 1913
6. Fig. 6: EPSI APIs for DSP/BIOS
Fig. 7: Sequential Flow diagram object tracking algorithm
• Convert image from one domain to another domain
and Down sample it.
• Delay Image frame for background subtraction.
• Finding of the moving objects for consecutive frames
using background subtraction.
• Segmenting the regions where there is movement
using fuzzyfier, mean and median filter operation.
• Blob detection using adjacency pixel value method.
• extracting features and marking objects tracking using
edge or boundary box method.
• Tracking of particular object using center of mass
method.
Fig. 8: Steps for implement object tracking
D. Optimization techniques
1) Algorithm Optimization:
• Accuracy and efficient way for object segment using
crisp fuzzyfier, smooth and median filter.
• Weighted Average method for Back ground subtrac-
tion.
• Boundary box matching for BLOB detection
• Center of mass based tracking.
2) Code Optimization:
• Changing Data type of Variable: using short data type
instead of word data type.
• Speed Optimization is done using Temporary Variable
which will avoid repeated computations
• Inbuilt Instruction: using Intrinsic operators like
ADD2, SUB2,MPY, LDW and STW.
• Change Compiler Setting: -pm , -mt and -on
• DSP Pipeline Concept : using MUST ITERATE
V. PROFILING AND DEBUGGING RESULTS
A. Profiling results
Profiling is used to measure code performance and make
sure for efficient use of the DSP targets. Profiling is used on
1914 2014 International Conference on Advances in Computing,Communications and Informatics (ICACCI)
7. TABLE I: Object tracking profiler data
Function name Cycle Without OPT With TempVar With Inbuilt
Extract uyvy fun 1340201 1326798.99 1299995
Copy frame fun 94419 93474.81 91586
Frame subtract fun 28340201 28056798.99 27489995
Segmentation fun 48849160 48360668.4 47383685
Blob detection fun 25124071 24872830.29 24370349
Centriod loop fun 15644100 15487659 15174777
Write uyvy fun 1234020 1221679.8 1196999
Total Cycle 120626172 119419910 117007387
TABLE II: Object tracking profiler data
Function name Cycle Without OPT With Chagedatatype With pipeline
Extract uyvy fun 1340201 1031955 1246387
Copy frame fun 94419 72703 87810
Frame subtract fun 28340201 21821955 26356387
Segmentation fun 48849160 37613853 45429719
Blob detection fun 25124071 19345535 23365386
Centriod loop fun 15644100 12045957 14549013
Write uyvy fun 1234020 950195 1147639
Total Cycle 120626172 92882153 112182340
different function of Object tracking and the time taken to exe-
cute each function is measured by considering its inclusive and
exclusive cycle, access count and processor clock frequency as
shown in profiler data table I and II.
B. Debugging results
Debugging results of object tracking is shown in the
following Figure 9 and Figure 10. Result of the background
subtraction with filtering, segmentation, Blob detection and
center of mass tracking is shown in Figures.
(a) (b) (c)
Figure 9: Debugging results : (a)Two ball tracking, (b)Two object tracking, and
(c)Background subtraction with filtering
(a) (b) (c)
Figure 10: Debugging results : (a) Segmentation with filtering, (b)Blob Detection
(c)Center of mass based tracking
VI. CONCLUSION
An object tracking algorithm is developed and implemented
on DaVinci Processor (DM6437), using weighted average
based on background subtraction, BLOBs detection using
adjacent pixel value, identification using boundary box algo-
rithm, and tracking of the object based on center of mass
method. Presence of crisp fuzzier, smoothing and median filters
processes play a major role in processing pixel data in image
segmentation, which solved the salt pepper noise. Background
and illumination changes problem is solved using weighted
average based on background subtraction. Number of object,
size of object and abrupt object shape problems of tracking
are solved using BLOBs detection using adjacent pixel value,
identification using boundary box algorithm. Tracking of the
object is based on center of mass method, which improves ro-
bustness w.r.t occlusion and contour sliding. These approaches
were optimized, both in code level and also in the algorithm.
Based on profiling results, this object tracking approach as
implemented on DaVinci processor is efficient and much faster
for real time scenarios.
ACKNOWLEDGMENT
I express my sincere thanks and deep sense of gratitude to
my supervisor Prof. V Rajbabu for his invaluable guidance,
inspiration, unremitting support,encouragement and for his
stimulating suggestions during experiment. I acknowledge with
thanks to Mr. Vijay, Mr. Alok, Mr. Sarat and My family
who have directly or indirectly co-operate me throughout my
research activity.
REFERENCES
[1] Hinz. S., ‘‘FAST AND SUBPIXEL PRECISE BLOB DETECTION
AND ATTRIBUTION”,Technical University Munich, Germany.
[2] Uy. D. L.,‘‘An algorithm for image clusters detection and identification
based on color for an autonomous mobile robot”, Research report
submitted to Hampton university, Verginia, 1994
[3] Patro. B. N. ,‘‘Real-Time Video and Image Processing for Video Surveil-
lance using DaVinci Processor”, Master of Technology, Indian Institute
of Technology Bombay, Mumbai,India, June 2012.
[4] Bochem. A., Herpers. R., and Kent. K. B., ‘‘Hardware Acceleration of
BLOB Detection for Image Processing”, Third International Conference
on Advances in Circuits, 2010, pp. 28-33.
[5] Pawate. B. I. , ‘‘Developing Embedded Software using DaVinci&OMAP
Technology” Margon & Claypool, 2009.
[6] Nascimento. J. C., Abrantes. A. J., Marques. J. S., ‘‘AN ALGORITHM
FOR CENTROID -BASEDTRACKING OF MOVING OBJECT. ”, Lis-
boa, portugal, October 1994.
[7] Xiang. H. I., Xiao-ning. F.,Guo-qiang. H. , ‘‘An Optimization for
Distance Estimation in Real-Time with DM6437 ”, 4th International
Conference on Signal Processing Systems,2012.
[8] Frank.T., Haag.M., Kollnig.H., and Nagel. H.-H., ‘‘Tracking of occluded
vehicles in traffic scenes.”, European Conference on Computer Vision-
Volume II, pages 485494, London, UK, 1996.
[9] Agarwal. S., Mishra. S., ‘‘A STUDY OF MULTIPLE HUMAN TRACK-
ING FOR VISUAL.”, IJAET,ISSN: 2231-1963,Nov. 2012.
[10] Hedberg. H. , ‘‘A Survey of Various Image Segmentation Techniques.”,
Dept. of Electroscience, Lund University, Sweden.
[11] Piccardi. M. , ‘‘Background subtraction techniques: a review.”, IEEE
International Conference on Systems, Man and Cybernetic, 2004.
[12] Texas Instrument Inc., ‘‘TMS320C6000 Optimizing Compiler v7.0
User’s Guide.”, Spru187Q, 2010.2.
2014 International Conference on Advances in Computing,Communications and Informatics (ICACCI) 1915