SlideShare a Scribd company logo
1 of 3
Abstract
Interest in the area of vision-based surveillance is growing rapidly with the continuous
evolution of Computer Vision technologies. A number of applications in security guard for
communities and important buildings, traffic surveillance in cities and expressways, detection
of military targets, detection of anomalous behaviors etc. are expected to use Vision-based
surveillance system. In order to make use of this vision-based surveillance system, efficient
and effective techniques to analyze and extract feature information from image sequences are
required to be developed. This thesis is dedicated to the finding a solution that can be
integrated to an existing closed circuit television systems by using an intelligent algorithm
that can detect unusual activity and alert human operator in real time with the help of Human
Activity Recognition (HAR).
In human activity recognition on video surveillance, there is a wide range of
applications taken over the function with a different activity. They are three methods, namely
physical method, partial self-determination, and entirely autonomous structure. In the
physical method, the creature itself performs the examinations, which give the video as an
input, which by the removal of background activities and the particular object alone with the
action is taken. Then by following the movement of the object and recognition of their
particular activity with the final decision is formed. At the partial self-determination, the
input video is breaking down with processing and then given as free from any intervention.
Automated human activity analysis has been, and remains, a challenging problem. Security
and surveillance are essential issues in today's world. Any behavior which is uncommon in
occurrence and deviates from customarily understood action could be termed as suspicious.
For different application regions, while identifying human exercises, fundamentally three
angles are taking in worry for human movement recognition system: Segmentation, feature
extraction, and activity classification. This model aimsat the automatic detection of abnormal
behavior in surveillance videos.
This research work has three stages. The first stage of work is focused for automatic
human activity recognition in video surveillance system for healthcare monitoring. With a
special focus on elderly patients’ care, safety arrangements and supervision areas and in
applications designed for smart homes. Sensor and visual devices enable HAR, and there is a
multitude of sensor classifications, such as sensors that can be worn, sensors tagged to a
target and sensors tagged to the background. The automated learning methodologies in HAR
are either handcrafted or deep learning or a combination of both. Handcrafted models can be
regional or wholesome recognition models such as RGB, 3D mapping and skeleton data
models, and deep learning models are categorized into generative models such as LSTM
(long short-term memory), discriminative models such as convolutional neural networks
(CNNs) or a synthesis of such models. Several datasets are available for undertaking HAR
analysis and representation. The hierarchy of processes in HAR is classified into gathering
information, preliminary processing, property derivation and guiding based on framed
models. The proposed study considers the role of smartphones in HARs with a particular
interest in keeping a tab on the lifestyle of subjects. Smartphones act as HAR devices with
inbuilt sensors with custom-made applications, and the merits of both handcrafted and deep
learning models are considered in framing a model that can enable lifestyle tracking in real
time. This performance-enhanced real-time tracking human activity recognition (PERT-HAR)
model is economical and effective in accurate identification and representation of actions of
the subjects and thereby provides more accurate data for real-time investigation and remedial
measures. This model achieves an accuracy of 97–99% in a properly controlled environment.
Despite the benefits of HPE, it is still a challenging process due to the variations in
visual appearances, lighting, occlusions, dimensionality, etc. To resolve these issues, the
second research work presents a squirrel search optimization with a deep convolutional
neural network for HPE (SSDCNN-HPE) technique. The major intention of the SSDCNN-
HPE technique is to identify the human pose accurately and efficiently. Primarily, the video
frame conversion process is performed and pre-processing takes place via bilateral filtering-
based noise removal process. Then, the EfficientNet model is applied to identify the body
points of a person with no problem constraints. Besides, the hyperparameter tuning of the
EfficientNet model takes place by the use of the squirrel search algorithm (SSA). In the final
stage, the multiclass support vector machine (M-SVM) technique was utilized for the
identification and classification of human poses. The design of bilateral filtering followed by
SSA based EfficientNet model for HPE depicts the novelty of the work. To demonstrate the
enhanced outcomes of the SSDCNN-HPE approach, a series of simulations are executed.
Finally, the SSDCNN-HPE methodology has accomplished maximum performance with
higher accuracy of 0.993 on Penn action dataset, where the existing models achieved nearly
0.98 and 0.99 of accuracy on the same dataset.
From the literature it is observed that HAR models developed in an unconstrained
environment have several limitations like Personal Interference (PI), Electromagnetic (EM)
noise, in-band noise or human movements and outliers involved while capturing the input
data. These noises are affecting the overall performance and robustness of the model. In order
to improve the model performance, noise removal techniques are introduced in this work.
Noises like salt and pepper noise, Gaussian noise, and blurring of the boundaries and outlier
treatment are processed for the hybrid data acquired using video and sensors in line-of-sight
mode in this paper. For removing these noises, a combination of filters like Kalman filter,
MOSSE filter, Butter-worth and J filter, are applied to the input data. By doing this noise
removal technique, it is observed that Peak to Sidelobe Ratio (PSR) is reduced from the raw
data. After removing these noises, features are extracted using the top layers of CNN
Inceptionv3 model for video data. Similarly, by using inertial sensor, features like tri-axial
accelerometer, gyroscopes and magnetometers are collected and a feature vector is created.
Pyramidal flow feature fusion (PFFF) technique is used to fuse the extracted features from
video and inertial sensor data. Finally, the fused features are given to the SVM classifier to
perform activity recognition. The proposed experimental methodology has been tested on
UCF sports data-set and the results have been obtained. From the results obtained, it is
observed that by performing a noise removal process and introducing hybrid features, the
robustness of HAR model have been improved and able to achieve an accuracy of 97.4% in
noisy data and 98.7% in noiseless data.

More Related Content

Similar to Abstract.docx

Chapter 1_Introduction.docx
Chapter 1_Introduction.docxChapter 1_Introduction.docx
Chapter 1_Introduction.docx
KISHWARYA2
Β 
A Framework for Human Action Detection via Extraction of Multimodal Features
A Framework for Human Action Detection via Extraction of Multimodal FeaturesA Framework for Human Action Detection via Extraction of Multimodal Features
A Framework for Human Action Detection via Extraction of Multimodal Features
CSCJournals
Β 
Unconstrained Activity Recognition in an Office Environment
Unconstrained Activity Recognition in an Office EnvironmentUnconstrained Activity Recognition in an Office Environment
Unconstrained Activity Recognition in an Office Environment
Christopher Ramirez
Β 
PS_Unconstrained_Activity
PS_Unconstrained_ActivityPS_Unconstrained_Activity
PS_Unconstrained_Activity
Parker Sankey
Β 
Proposed Multi-object Tracking Algorithm Using Sobel Edge Detection operator
Proposed Multi-object Tracking Algorithm Using Sobel Edge Detection operatorProposed Multi-object Tracking Algorithm Using Sobel Edge Detection operator
Proposed Multi-object Tracking Algorithm Using Sobel Edge Detection operator
QUESTJOURNAL
Β 
FUSION OF GAIT AND FINGERPRINT FOR USER AUTHENTICATION ON MOBILE DEVICES
FUSION OF GAIT AND FINGERPRINT FOR USER AUTHENTICATION ON MOBILE DEVICESFUSION OF GAIT AND FINGERPRINT FOR USER AUTHENTICATION ON MOBILE DEVICES
FUSION OF GAIT AND FINGERPRINT FOR USER AUTHENTICATION ON MOBILE DEVICES
vasim hasina
Β 

Similar to Abstract.docx (20)

Chapter 1_Introduction.docx
Chapter 1_Introduction.docxChapter 1_Introduction.docx
Chapter 1_Introduction.docx
Β 
D018112429
D018112429D018112429
D018112429
Β 
A Framework for Human Action Detection via Extraction of Multimodal Features
A Framework for Human Action Detection via Extraction of Multimodal FeaturesA Framework for Human Action Detection via Extraction of Multimodal Features
A Framework for Human Action Detection via Extraction of Multimodal Features
Β 
Survey on Human Behavior Recognition using CNN
Survey on Human Behavior Recognition using CNNSurvey on Human Behavior Recognition using CNN
Survey on Human Behavior Recognition using CNN
Β 
Human activity recognition updated 1 - Copy.pptx
Human activity recognition updated 1 - Copy.pptxHuman activity recognition updated 1 - Copy.pptx
Human activity recognition updated 1 - Copy.pptx
Β 
IRJET- Survey on Detection of Crime
IRJET-  	  Survey on Detection of CrimeIRJET-  	  Survey on Detection of Crime
IRJET- Survey on Detection of Crime
Β 
Unconstrained Activity Recognition in an Office Environment
Unconstrained Activity Recognition in an Office EnvironmentUnconstrained Activity Recognition in an Office Environment
Unconstrained Activity Recognition in an Office Environment
Β 
PS_Unconstrained_Activity
PS_Unconstrained_ActivityPS_Unconstrained_Activity
PS_Unconstrained_Activity
Β 
Human Activity Recognition Using Smartphone
Human Activity Recognition Using SmartphoneHuman Activity Recognition Using Smartphone
Human Activity Recognition Using Smartphone
Β 
F0932733
F0932733F0932733
F0932733
Β 
BIOMETRIC AUTHORIZATION SYSTEM USING GAIT BIOMETRY
BIOMETRIC AUTHORIZATION SYSTEM USING GAIT BIOMETRYBIOMETRIC AUTHORIZATION SYSTEM USING GAIT BIOMETRY
BIOMETRIC AUTHORIZATION SYSTEM USING GAIT BIOMETRY
Β 
IRJET - Creating a Security Alert for the Care Takers Implementing a Vast Dee...
IRJET - Creating a Security Alert for the Care Takers Implementing a Vast Dee...IRJET - Creating a Security Alert for the Care Takers Implementing a Vast Dee...
IRJET - Creating a Security Alert for the Care Takers Implementing a Vast Dee...
Β 
Proposed Multi-object Tracking Algorithm Using Sobel Edge Detection operator
Proposed Multi-object Tracking Algorithm Using Sobel Edge Detection operatorProposed Multi-object Tracking Algorithm Using Sobel Edge Detection operator
Proposed Multi-object Tracking Algorithm Using Sobel Edge Detection operator
Β 
Comparative study of various enhancement techniques for finger print images
Comparative study of various enhancement techniques for finger print imagesComparative study of various enhancement techniques for finger print images
Comparative study of various enhancement techniques for finger print images
Β 
Comparative study of various enhancement techniques for finger print images
Comparative study of various enhancement techniques for finger print imagesComparative study of various enhancement techniques for finger print images
Comparative study of various enhancement techniques for finger print images
Β 
IRJET= Air Writing: Gesture Recognition using Ultrasound Sensors and Grid-Eye...
IRJET= Air Writing: Gesture Recognition using Ultrasound Sensors and Grid-Eye...IRJET= Air Writing: Gesture Recognition using Ultrasound Sensors and Grid-Eye...
IRJET= Air Writing: Gesture Recognition using Ultrasound Sensors and Grid-Eye...
Β 
FUSION OF GAIT AND FINGERPRINT FOR USER AUTHENTICATION ON MOBILE DEVICES
FUSION OF GAIT AND FINGERPRINT FOR USER AUTHENTICATION ON MOBILE DEVICESFUSION OF GAIT AND FINGERPRINT FOR USER AUTHENTICATION ON MOBILE DEVICES
FUSION OF GAIT AND FINGERPRINT FOR USER AUTHENTICATION ON MOBILE DEVICES
Β 
IRJET- Recognition of Human Action Interaction using Motion History Image
IRJET-  	  Recognition of Human Action Interaction using Motion History ImageIRJET-  	  Recognition of Human Action Interaction using Motion History Image
IRJET- Recognition of Human Action Interaction using Motion History Image
Β 
Gj3511231126
Gj3511231126Gj3511231126
Gj3511231126
Β 
IRJET- Recurrent Neural Network for Human Action Recognition using Star S...
IRJET-  	  Recurrent Neural Network for Human Action Recognition using Star S...IRJET-  	  Recurrent Neural Network for Human Action Recognition using Star S...
IRJET- Recurrent Neural Network for Human Action Recognition using Star S...
Β 

Recently uploaded

Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
Epec Engineered Technologies
Β 
Query optimization and processing for advanced database systems
Query optimization and processing for advanced database systemsQuery optimization and processing for advanced database systems
Query optimization and processing for advanced database systems
meharikiros2
Β 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
ssuser89054b
Β 

Recently uploaded (20)

S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
Β 
Augmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxAugmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptx
Β 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
Β 
Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)
Β 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
Β 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and properties
Β 
AIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsAIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech students
Β 
Electromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptxElectromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptx
Β 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
Β 
Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...
Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...
Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...
Β 
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Β 
Query optimization and processing for advanced database systems
Query optimization and processing for advanced database systemsQuery optimization and processing for advanced database systems
Query optimization and processing for advanced database systems
Β 
πŸ‘‰ Yavatmal Call Girls Service Just Call πŸ‘πŸ‘„6378878445 πŸ‘πŸ‘„ Top Class Call Girl S...
πŸ‘‰ Yavatmal Call Girls Service Just Call πŸ‘πŸ‘„6378878445 πŸ‘πŸ‘„ Top Class Call Girl S...πŸ‘‰ Yavatmal Call Girls Service Just Call πŸ‘πŸ‘„6378878445 πŸ‘πŸ‘„ Top Class Call Girl S...
πŸ‘‰ Yavatmal Call Girls Service Just Call πŸ‘πŸ‘„6378878445 πŸ‘πŸ‘„ Top Class Call Girl S...
Β 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
Β 
fitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .pptfitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .ppt
Β 
Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)
Β 
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
Β 
Worksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptxWorksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptx
Β 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdf
Β 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Β 

Abstract.docx

  • 1. Abstract Interest in the area of vision-based surveillance is growing rapidly with the continuous evolution of Computer Vision technologies. A number of applications in security guard for communities and important buildings, traffic surveillance in cities and expressways, detection of military targets, detection of anomalous behaviors etc. are expected to use Vision-based surveillance system. In order to make use of this vision-based surveillance system, efficient and effective techniques to analyze and extract feature information from image sequences are required to be developed. This thesis is dedicated to the finding a solution that can be integrated to an existing closed circuit television systems by using an intelligent algorithm that can detect unusual activity and alert human operator in real time with the help of Human Activity Recognition (HAR). In human activity recognition on video surveillance, there is a wide range of applications taken over the function with a different activity. They are three methods, namely physical method, partial self-determination, and entirely autonomous structure. In the physical method, the creature itself performs the examinations, which give the video as an input, which by the removal of background activities and the particular object alone with the action is taken. Then by following the movement of the object and recognition of their particular activity with the final decision is formed. At the partial self-determination, the input video is breaking down with processing and then given as free from any intervention. Automated human activity analysis has been, and remains, a challenging problem. Security and surveillance are essential issues in today's world. Any behavior which is uncommon in occurrence and deviates from customarily understood action could be termed as suspicious. For different application regions, while identifying human exercises, fundamentally three angles are taking in worry for human movement recognition system: Segmentation, feature extraction, and activity classification. This model aimsat the automatic detection of abnormal behavior in surveillance videos. This research work has three stages. The first stage of work is focused for automatic human activity recognition in video surveillance system for healthcare monitoring. With a special focus on elderly patients’ care, safety arrangements and supervision areas and in applications designed for smart homes. Sensor and visual devices enable HAR, and there is a multitude of sensor classifications, such as sensors that can be worn, sensors tagged to a target and sensors tagged to the background. The automated learning methodologies in HAR
  • 2. are either handcrafted or deep learning or a combination of both. Handcrafted models can be regional or wholesome recognition models such as RGB, 3D mapping and skeleton data models, and deep learning models are categorized into generative models such as LSTM (long short-term memory), discriminative models such as convolutional neural networks (CNNs) or a synthesis of such models. Several datasets are available for undertaking HAR analysis and representation. The hierarchy of processes in HAR is classified into gathering information, preliminary processing, property derivation and guiding based on framed models. The proposed study considers the role of smartphones in HARs with a particular interest in keeping a tab on the lifestyle of subjects. Smartphones act as HAR devices with inbuilt sensors with custom-made applications, and the merits of both handcrafted and deep learning models are considered in framing a model that can enable lifestyle tracking in real time. This performance-enhanced real-time tracking human activity recognition (PERT-HAR) model is economical and effective in accurate identification and representation of actions of the subjects and thereby provides more accurate data for real-time investigation and remedial measures. This model achieves an accuracy of 97–99% in a properly controlled environment. Despite the benefits of HPE, it is still a challenging process due to the variations in visual appearances, lighting, occlusions, dimensionality, etc. To resolve these issues, the second research work presents a squirrel search optimization with a deep convolutional neural network for HPE (SSDCNN-HPE) technique. The major intention of the SSDCNN- HPE technique is to identify the human pose accurately and efficiently. Primarily, the video frame conversion process is performed and pre-processing takes place via bilateral filtering- based noise removal process. Then, the EfficientNet model is applied to identify the body points of a person with no problem constraints. Besides, the hyperparameter tuning of the EfficientNet model takes place by the use of the squirrel search algorithm (SSA). In the final stage, the multiclass support vector machine (M-SVM) technique was utilized for the identification and classification of human poses. The design of bilateral filtering followed by SSA based EfficientNet model for HPE depicts the novelty of the work. To demonstrate the enhanced outcomes of the SSDCNN-HPE approach, a series of simulations are executed. Finally, the SSDCNN-HPE methodology has accomplished maximum performance with higher accuracy of 0.993 on Penn action dataset, where the existing models achieved nearly 0.98 and 0.99 of accuracy on the same dataset. From the literature it is observed that HAR models developed in an unconstrained environment have several limitations like Personal Interference (PI), Electromagnetic (EM)
  • 3. noise, in-band noise or human movements and outliers involved while capturing the input data. These noises are affecting the overall performance and robustness of the model. In order to improve the model performance, noise removal techniques are introduced in this work. Noises like salt and pepper noise, Gaussian noise, and blurring of the boundaries and outlier treatment are processed for the hybrid data acquired using video and sensors in line-of-sight mode in this paper. For removing these noises, a combination of filters like Kalman filter, MOSSE filter, Butter-worth and J filter, are applied to the input data. By doing this noise removal technique, it is observed that Peak to Sidelobe Ratio (PSR) is reduced from the raw data. After removing these noises, features are extracted using the top layers of CNN Inceptionv3 model for video data. Similarly, by using inertial sensor, features like tri-axial accelerometer, gyroscopes and magnetometers are collected and a feature vector is created. Pyramidal flow feature fusion (PFFF) technique is used to fuse the extracted features from video and inertial sensor data. Finally, the fused features are given to the SVM classifier to perform activity recognition. The proposed experimental methodology has been tested on UCF sports data-set and the results have been obtained. From the results obtained, it is observed that by performing a noise removal process and introducing hybrid features, the robustness of HAR model have been improved and able to achieve an accuracy of 97.4% in noisy data and 98.7% in noiseless data.