Video compression is an emerging research topic in the field of block based video encoders. Due to the growth of video coding technologies, high efficiency video coding (HEVC) delivers superior coding performance. With the increased encoding complexity, the HEVC enhances the rate-distortion (RD) performance. In the video compression, the out-sized coding units (CUs) have higher encoding complexity. Therefore, the computational encoding cost and complexity remain vital concerns, which need to be considered as an optimization task. In this manuscript, an enhanced whale optimization algorithm (EWOA) is implemented to reduce the computational time and complexity of the HEVC. In the EWOA, a cosine function is incorporated with the controlling parameter A and two correlation factors are included in the WOA for controlling the position of whales and regulating the movement of search mechanism during the optimization and search processes. The bit streams in the Luma-coding tree block are selected using EWOA that defines the CU neighbors and is used in the HEVC. The results indicate that the EWOA achieves best bit rate (BR), time saving, and peak signal to noise ratio (PSNR). The EWOA showed 0.006-0.012 dB higher PSNR than the existing models in the real-time videos.
International Journal of Engineering Research and Development (IJERD)IJERD Editor
journal publishing, how to publish research paper, Call For research paper, international journal, publishing a paper, IJERD, journal of science and technology, how to get a research paper published, publishing a paper, publishing of journal, publishing of research paper, reserach and review articles, IJERD Journal, How to publish your research paper, publish research paper, open access engineering journal, Engineering journal, Mathemetics journal, Physics journal, Chemistry journal, Computer Engineering, Computer Science journal, how to submit your paper, peer reviw journal, indexed journal, reserach and review articles, engineering journal, www.ijerd.com, research journals,
yahoo journals, bing journals, International Journal of Engineering Research and Development, google journals, hard copy of journal
An efficient modified video compression HEVC technique based on high quality assessment saliency features presented for the assessment of high quality videos. To create an efficient saliency map we extract global temporal alignment component and robust spatial components. To obtain high quality saliency here, we combine spatial saliency features and temporal saliency features together for different macroblocks in association with transformed residuals. In this way, our saliency model outperforms all the existing techniques. In this paper, we have generated high reconstruction quality video after compression considering SFU dataset. Our experimental result outperforms all the existing techniques in terms of saliency map detection, PSNR and high-resolution quality.
An efficient modified video compression HEVC technique based on high quality assessment saliency features presented for the assessment of high quality videos. To create an efficient saliency map we extract global temporal alignment component and robust spatial components. To obtain high quality saliency here, we combine spatial saliency features and temporal saliency features together for different macroblocks in association with transformed residuals. In this way, our saliency model outperforms all the existing techniques. In this paper, we have generated high reconstruction quality video after compression considering SFU dataset. Our experimental result outperforms all the existing techniques in terms of saliency map detection, PSNR and high-resolution quality.
This document presents a real-time H.264 video compression algorithm for underwater channels. It proposes a hardware/software encoder design using a multi-core processor and graphics accelerator. The software uses uneven multi-hexagonal search for efficient motion estimation and Trellis 1 quantization. The encoder was tested on an underwater video achieving a 210:1 compression ratio and 41.37dB PSNR at 17.5 frames/second, satisfying real-time underwater video transmission requirements.
Deep learning-based switchable network for in-loop filtering in high efficie...IJECEIAES
The video codecs are focusing on a smart transition in this era. A future area of research that has not yet been fully investigated is the effect of deep learning on video compression. The paper’s goal is to reduce the ringing and artifacts that loop filtering causes when high-efficiency video compression is used. Even though there is a lot of research being done to lessen this effect, there are still many improvements that can be made. In This paper we have focused on an intelligent solution for improvising in-loop filtering in high efficiency video coding (HEVC) using a deep convolutional neural network (CNN). The paper proposes the design and implementation of deep CNN-based loop filtering using a series of 15 CNN networks followed by a combine and squeeze network that improves feature extraction. The resultant output is free from double enhancement and the peak signal-to-noise ratio is improved by 0.5 dB compared to existing techniques. The experiments then demonstrate that improving the coding efficiency by pipelining this network to the current network and using it for higher quantization parameters (QP) is more effective than using it separately. Coding efficiency is improved by an average of 8.3% with the switching based deep CNN in-loop filtering.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
The impact of jitter on the HEVC video streaming with Multiple CodingHakimSahour
This document discusses the impact of jitter on video quality when streaming HEVC encoded video over wireless networks. It presents a study evaluating the effects of quantization parameter (QP) values, video content, and jitter on quality of experience (QoE). The study finds that using higher QP values, which lowers bitrate and increases compression, degrades video quality as measured by PSNR. It also finds that different video content results in varying PSNR values for the same encoding settings. Additionally, the results show that adjusting the QP value can help recover from the negative effects of jitter on received video quality. The document proposes using multiple description coding (MDC) to further improve transmission over error-prone wireless channels.
International Journal of Engineering Research and Development (IJERD)IJERD Editor
journal publishing, how to publish research paper, Call For research paper, international journal, publishing a paper, IJERD, journal of science and technology, how to get a research paper published, publishing a paper, publishing of journal, publishing of research paper, reserach and review articles, IJERD Journal, How to publish your research paper, publish research paper, open access engineering journal, Engineering journal, Mathemetics journal, Physics journal, Chemistry journal, Computer Engineering, Computer Science journal, how to submit your paper, peer reviw journal, indexed journal, reserach and review articles, engineering journal, www.ijerd.com, research journals,
yahoo journals, bing journals, International Journal of Engineering Research and Development, google journals, hard copy of journal
An efficient modified video compression HEVC technique based on high quality assessment saliency features presented for the assessment of high quality videos. To create an efficient saliency map we extract global temporal alignment component and robust spatial components. To obtain high quality saliency here, we combine spatial saliency features and temporal saliency features together for different macroblocks in association with transformed residuals. In this way, our saliency model outperforms all the existing techniques. In this paper, we have generated high reconstruction quality video after compression considering SFU dataset. Our experimental result outperforms all the existing techniques in terms of saliency map detection, PSNR and high-resolution quality.
An efficient modified video compression HEVC technique based on high quality assessment saliency features presented for the assessment of high quality videos. To create an efficient saliency map we extract global temporal alignment component and robust spatial components. To obtain high quality saliency here, we combine spatial saliency features and temporal saliency features together for different macroblocks in association with transformed residuals. In this way, our saliency model outperforms all the existing techniques. In this paper, we have generated high reconstruction quality video after compression considering SFU dataset. Our experimental result outperforms all the existing techniques in terms of saliency map detection, PSNR and high-resolution quality.
This document presents a real-time H.264 video compression algorithm for underwater channels. It proposes a hardware/software encoder design using a multi-core processor and graphics accelerator. The software uses uneven multi-hexagonal search for efficient motion estimation and Trellis 1 quantization. The encoder was tested on an underwater video achieving a 210:1 compression ratio and 41.37dB PSNR at 17.5 frames/second, satisfying real-time underwater video transmission requirements.
Deep learning-based switchable network for in-loop filtering in high efficie...IJECEIAES
The video codecs are focusing on a smart transition in this era. A future area of research that has not yet been fully investigated is the effect of deep learning on video compression. The paper’s goal is to reduce the ringing and artifacts that loop filtering causes when high-efficiency video compression is used. Even though there is a lot of research being done to lessen this effect, there are still many improvements that can be made. In This paper we have focused on an intelligent solution for improvising in-loop filtering in high efficiency video coding (HEVC) using a deep convolutional neural network (CNN). The paper proposes the design and implementation of deep CNN-based loop filtering using a series of 15 CNN networks followed by a combine and squeeze network that improves feature extraction. The resultant output is free from double enhancement and the peak signal-to-noise ratio is improved by 0.5 dB compared to existing techniques. The experiments then demonstrate that improving the coding efficiency by pipelining this network to the current network and using it for higher quantization parameters (QP) is more effective than using it separately. Coding efficiency is improved by an average of 8.3% with the switching based deep CNN in-loop filtering.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
The impact of jitter on the HEVC video streaming with Multiple CodingHakimSahour
This document discusses the impact of jitter on video quality when streaming HEVC encoded video over wireless networks. It presents a study evaluating the effects of quantization parameter (QP) values, video content, and jitter on quality of experience (QoE). The study finds that using higher QP values, which lowers bitrate and increases compression, degrades video quality as measured by PSNR. It also finds that different video content results in varying PSNR values for the same encoding settings. Additionally, the results show that adjusting the QP value can help recover from the negative effects of jitter on received video quality. The document proposes using multiple description coding (MDC) to further improve transmission over error-prone wireless channels.
Distributed Video Coding (DVC) has become increasingly popular in recent times among the researchers in video coding due to its attractive and promising features. DVC primarily has a modified complexity balance between the encoder and decoder, in contrast to conventional video codecs. However, Most of the reported DVC schemes have a high time-delay in decoder which hinders its practical application in real-time systems. In this work, we focus on speed up the Side Information(SI) generation module in DVC, which is a major function in the DVC coding algorithm and one of the time-consuming factor at the decoder. By applied it through Compute Unified Device Architecture (CUDA) based on General-Purpose Graphics Processing Unit (GPGPU), the experimental results show that a considerable speedup can be obtained by using the proposed parallelized SI generation algorithm.
Video Compression Algorithm Based on Frame Difference Approaches ijsc
The huge usage of digital multimedia via communications, wireless communications, Internet, Intranet and cellular mobile leads to incurable growth of data flow through these Media. The researchers go deep in developing efficient techniques in these fields such as compression of data, image and video. Recently, video compression techniques and their applications in many areas (educational, agriculture, medical …) cause this field to be one of the most interested fields. Wavelet transform is an efficient method that can be used to perform an efficient compression technique. This work deals with the developing of an efficient video compression approach based on frames difference approaches that concentrated on the calculation of frame near distance (difference between frames). The
selection of the meaningful frame depends on many factors such as compression performance, frame details, frame size and near distance between frames. Three different approaches are applied for removing the lowest frame difference. In this paper, many videos are tested to insure the efficiency of this technique, in addition a good performance results has been obtained.
Novel Approach for Evaluating Video Transmission using Combined Scalable Vide...IJECEIAES
One of the main problems in video transmission is the bandwidth fluctuation in wireless channel. Therefore, there is an urgent need to find an efficient bandwidth utilization and method. This research utilizes the Combined Scalable Video Coding (CSVC) which comes from Joint Scalable Video Model (JSVM). In the combined scalable video coding, we implement Coarse Grain Scalability (CGS) and Medium Grain Scalability (MGS). We propose a new scheme in which it can be implemented on Network Simulator II (NS-2) over wireless broadband network. The advantages of this new scheme over the other schemes are more realistic and based on open source program. The result shows that CSVC implementation on MGS mode outperforms CGS mode.
IMPROVING PSNR AND PROCESSING SPEED FOR HEVC USING HYBRID PSO FOR INTRA FRAME...ijma
High efficiency video coding (HEVC) is the newest video codec to increase significantly the coding
efficiency of its ancestor H.264/Advance Video Coding. However, the HEVC delivers a highly increased
computation complexity. In this paper, a coding unit partitioning pattern optimization method based on
particle swarm optimization (PSO) is proposed to reduce the computational complexity of hierarchical
quadtree-based coding unit partitioning. The required coding unit partitioning pattern for exhaustive
partitioning and the rate distortion cost are efficiently considered as the chromosome and the fitness
function of the PSO, respectively. To reduce the computational time, the cellular automata-based (CA)
rule based time limit is used in order to find out the best possible modes of operation. Compared to the
current state of the art algorithms, this scheme is computationally simple and achieves superior
reconstructed video quality (12% increase in PSNR compared to existing methods) at less computational
complexity (overall delay by 40%), Increasing the bandwidth and reducing the errors.
This document discusses video quality analysis for H.264 based on the human visual system. It proposes an improved video quality assessment method that adds color comparison to structural similarity measurement. The method separates similarity measurement into four comparisons: luminance, contrast, structure, and color. Experimental results on video sets with two distortion types show the proposed method's quality scores are more consistent with visual quality than classical methods. It also discusses the H.264 video coding standard and provides examples of encoding and decoding experimental results.
Energy-efficient Adaptive Video Streaming with Latency-Aware Dynamic Resoluti...Vignesh V Menon
Presented at Mile High Video (MHV'24)
Abstract: Traditional per-title encoding schemes aim to optimize encoding resolutions to deliver the highest perceptual quality for each representation. However, keeping the encoding time within an acceptable
threshold for a smooth user experience is important to reduce the carbon footprint and energy consumption on encoding servers in video streaming applications. Toward this realization, we introduce
an encoding latency-aware dynamic resolution encoding scheme (LADRE) for adaptive video streaming applications. LADRE determines the encoding resolution for each target bitrate by utilizing a random forest-based prediction model for every video segment based on spatiotemporal features and the acceptable target latency.
Experimental results show that LADRE achieves an overall average quality improvement of 0.58 dB PSNR and 0.43 dB XPSNR while maintaining the same bitrate, compared to the HTTP Live Streaming (HLS) bitrate ladder encoding of 200 s segments using the VVenC
encoder, when the encoding latency for each representation is set to remain below the 200 s threshold. This is accompanied by a 84.17 % reduction in overall encoding energy consumption.
Abstract: With the emergence of multiple modern video codecs, streaming service providers are forced to encode, store, and transmit bitrate ladders of multiple codecs separately, consequently suffering from additional energy costs for encoding, storage, and transmission. To tackle this issue, we introduce an online energy-efficient Multi-Codec Bitrate ladder Estimation scheme (MCBE) for adaptive video streaming applications. In MCBE, quality representations within the bitrate ladder of new-
generation codecs (e.g., High Efficiency Video Coding (HEVC),
Alliance for Open Media Video 1 (AV1)) that lie below the predicted rate-distortion curve of the Advanced Video Coding (AVC) codec are removed. Moreover, perceptual redundancy between representations of the bitrate ladders of the considered codecs is also minimized based on a Just Noticeable Difference (JND) threshold. Therefore, random forest-based models predict the VMAF score of bitrate ladder representations of each codec. In a live streaming session where all clients support the decoding of AVC, HEVC, and AV1, MCBE achieves impressive results, reducing cumulative encoding energy by 56.45%, storage energy usage by 94.99%, and transmission energy usage by 77.61% (considering a JND of six VMAF points). These energy reductions are in comparison to a baseline bitrate ladder encoding based on current industry practice.
Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...Alpen-Adria-Universität
With the emergence of multiple modern video codecs, streaming service providers are forced to encode, store, and transmit bitrate ladders of multiple codecs separately, consequently suffering from additional energy costs for encoding, storage, and transmission.
To tackle this issue, we introduce an online energy-efficient Multi-Codec Bitrate ladder Estimation scheme (MCBE) for adaptive video streaming applications. In MCBE, quality representations within the bitrate ladder of new-generation codecs (e.g., HEVC, AV1) that lie below the predicted rate-distortion curve of the AVC codec are removed. Moreover, perceptual redundancy between representations of the bitrate ladders of the considered codecs is also minimized based on a Just Noticeable Difference (JND) threshold. Therefore, random forest-based models predict the VMAF of bitrate ladder representations of each codec. In a live streaming session where all clients support the decoding of AVC, HEVC, and AV1, MCBE achieves impressive results, reducing cumulative encoding energy by 56.45%, storage energy usage by 94.99%, and transmission energy usage by 77.61% (considering a JND of six VMAF points). These energy reductions are in comparison to a baseline bitrate ladder encoding based on current industry practice.
IMPROVING PSNR AND PROCESSING SPEED FOR HEVC USING HYBRID PSO FOR INTRA FRAME...ijma
This document summarizes a research paper that proposes a new method to improve the efficiency of the HEVC video coding standard. The proposed method uses particle swarm optimization (PSO) to optimize the coding unit partitioning patterns in HEVC in order to reduce computational complexity. Compared to existing algorithms, the proposed PSO-based method achieves a 12% increase in PSNR video quality while reducing computational complexity by 40% and overall encoding delay. The method integrates this optimized intra-frame prediction into the HEVC encoding and decoding process.
IMPROVING PSNR AND PROCESSING SPEED FOR HEVC USING HYBRID PSO FOR INTRA FRAME...ijma
High efficiency video coding (HEVC) is the newest video codec to increase significantly the coding
efficiency of its ancestor H.264/Advance Video Coding. However, the HEVC delivers a highly increased
computation complexity. In this paper, a coding unit partitioning pattern optimization method based on
particle swarm optimization (PSO) is proposed to reduce the computational complexity of hierarchical
quadtree-based coding unit partitioning. The required coding unit partitioning pattern for exhaustive
partitioning and the rate distortion cost are efficiently considered as the chromosome and the fitness
function of the PSO, respectively. To reduce the computational time, the cellular automata-based (CA)
rule based time limit is used in order to find out the best possible modes of operation. Compared to the
current state of the art algorithms, this scheme is computationally simple and achieves superior
reconstructed video quality (12% increase in PSNR compared to existing methods) at less computational
complexity (overall delay by 40%), Increasing the bandwidth and reducing the errors..
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
International Journal of Engineering Research and DevelopmentIJERD Editor
The document summarizes an emerging VP8 video codec that is designed for mobile devices. It aims to significantly reduce computational complexity through several techniques while maintaining good video quality. The key techniques include a predictive algorithm for motion estimation that reduces computation by 18.5-20x compared to full search, using integer discrete cosine transform instead of floating point to achieve 2.6-3.5x speed improvement, and skipping DCT and quantization for some macroblocks to reduce computations. Experimental results on test sequences show negligible quality degradation of 0.2-0.5dB for integer DCT and 0.5dB on average for the full codec, while achieving real-time encoding rates on mobile devices. The proposed low-complexity
High Definition (HD) devices requires HD-videos for the effective uses of HD devices. However, it consists of some issues such as high storage capacity, limited battery power of high definition devices, long encoding time, and high computational complexity when it comes to the transmission, broadcasting and internet traffic. Many existing techniques consists these above-mentioned issues. Therefore, there is a need of an efficient technique, which reduces unnecessary amount of space, provides high compression rate and requires low bandwidth spectrum. Therefore, in the paper we have introduced an efficient video compression technique as modified HEVC coding based on saliency features to counter these existing drawbacks. We highlight first, on extracting features on the raw data and then compressed it largely. This technique makes our model powerful and provides effective performance in terms of compression. Our experiment results proves that our model provide better efficiency in terms of average PSNR, MSE and bitrate. Our experimental results outperforms all the existing techniques in terms of saliency map detection, AUC, NSS, KLD and JSD. The average AUC, NSS and KLD value by our proposed method are 0.846, 1.702 and 0.532 respectively which is very high compare to other existing technique.
In the current scenario compression of video files is in high demand. Color video compression has become a significant technology to lessen the memory space and to decrease transmission time. Video compression using fractal technique is based on self similarity concept by comparing the range block and domain block. However, its computational complexity is very high. In this paper we presented hybrid video compression technique to compress Audio/Video Interleaved file and overcome the problem of Computational complexity. We implemented Discrete Wavelet Transform and hybrid fractal HV partition technique using Particle Swarm Optimization (called mapping of PSO) for compression of videos. The analysis demonstrate that hybrid technique gives a very good speed up to compress video and achieve Peak Signal to Noise Ratio.
Scanned document compression using block based hybrid video codecMuthu Samy
Sybian Technologies Pvt Ltd
Final Year Projects & Real Time live Projects
JAVA(All Domains)
DOTNET(All Domains)
ANDROID
EMBEDDED
VLSI
MATLAB
Project Support
Abstract, Diagrams, Review Details, Relevant Materials, Presentation,
Supporting Documents, Software E-Books,
Software Development Standards & Procedure
E-Book, Theory Classes, Lab Working Programs, Project Design & Implementation
24/7 lab session
Final Year Projects For BE,ME,B.Sc,M.Sc,B.Tech,BCA,MCA
PROJECT DOMAIN:
Cloud Computing
Networking
Network Security
PARALLEL AND DISTRIBUTED SYSTEM
Data Mining
Mobile Computing
Service Computing
Software Engineering
Image Processing
Bio Medical / Medical Imaging
Contact Details:
Sybian Technologies Pvt Ltd,
No,33/10 Meenakshi Sundaram Building,
Sivaji Street,
(Near T.nagar Bus Terminus)
T.Nagar,
Chennai-600 017
Ph:044 42070551
Mobile No:9790877889,9003254624,7708845605
Mail Id:sybianprojects@gmail.com,sunbeamvijay@yahoo.com
Scanned document compression using block based hybrid video codecMuthu Samy
Sybian Technologies Pvt Ltd
Final Year Projects & Real Time live Projects
JAVA(All Domains)
DOTNET(All Domains)
ANDROID
EMBEDDED
VLSI
MATLAB
Project Support
Abstract, Diagrams, Review Details, Relevant Materials, Presentation,
Supporting Documents, Software E-Books,
Software Development Standards & Procedure
E-Book, Theory Classes, Lab Working Programs, Project Design & Implementation
24/7 lab session
Final Year Projects For BE,ME,B.Sc,M.Sc,B.Tech,BCA,MCA
PROJECT DOMAIN:
Cloud Computing
Networking
Network Security
PARALLEL AND DISTRIBUTED SYSTEM
Data Mining
Mobile Computing
Service Computing
Software Engineering
Image Processing
Bio Medical / Medical Imaging
Contact Details:
Sybian Technologies Pvt Ltd,
No,33/10 Meenakshi Sundaram Building,
Sivaji Street,
(Near T.nagar Bus Terminus)
T.Nagar,
Chennai-600 017
Ph:044 42070551
Mobile No:9790877889,9003254624,7708845605
Mail Id:sybianprojects@gmail.com,sunbeamvijay@yahoo.com
Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...Ijripublishers Ijri
Global interconnect planning becomes a challenge as semiconductor technology continuously scales. Because of the increasing wire resistance and higher capacitive coupling in smaller features, the delay of global interconnects becomes large compared with the delay of a logic gate, introducing a huge performance gap that needs to be resolved A novel equalized global link architecture and driver– receiver co design flow are proposed for high-speed and low-energy on-chip communication by utilizing a continuous-time linear equalizer (CTLE). The proposed global link is analyzed using a linear system method, and the formula of CTLE eye opening is derived to provide high-level design guidelines and insights.
Compared with the separate driver–receiver design flow, over 50% energy reduction is observed.
HARDWARE SOFTWARE CO-SIMULATION OF MOTION ESTIMATION IN H.264 ENCODERcscpconf
This paper proposes about motion estimation in H.264/AVC encoder. Compared with standards
such as MPEG-2 and MPEG-4 Visual, H.264 can deliver better image quality at the same
compressed bit rate or at a lower bit rate. The increase in compression efficiency comes at the
expense of increase in complexity, which is a fact that must be overcome. An efficient Co-design
methodology is required, where the encoder software application is highly optimized and
structured in a very modular and efficient manner, so as to allow its most complex and time
consuming operations to be offloaded to dedicated hardware accelerators. The Motion
Estimation algorithm is the most computationally intensive part of the encoder which is simulated using MATLAB. The hardware/software co-simulation is done using system generator tool and implemented using Xilinx FPGA Spartan 3E for different scanning methods.
Efficient Architecture for Variable Block Size Motion Estimation in H.264/AVCIDES Editor
This paper proposes an efficient VLSI architecture
for the implementation of variable block size motion
estimation (VBSME). To improve the performance video
compression the Variable Block Size Motion Estimation
(VBSME) is the critical path. Variable Block Size Motion
Estimation feature has been introduced in to the H.264/AVC.
This feature induces significant complexities into the design
of the H.264/AVC video codec. This paper we compare the
existing architectures for VBSME. An efficient architecture
to improve the performance of Spiral Search for Variable Size
Motion Estimation in H.264/AVC is proposed. Among various
architectures available for VBSME spiral search provides
hardware friendly data flow with efficient utilization of
resources. The proposed implementation is verified using the
MATLAB on foreman, coastguard and train sequences. The
proposed Adaptive thresholding technique reduces the average
number of computations significantly with negligible effect
on the video quality. The results are verified using hardware
implementation on Xilinx Virtex 4 it was able to achieve real
time video coding of 60 fps at 95.56 MHz CLK frequency.
A Novel Approaches For Chromatic Squander Less Visceral Coding Techniques Usi...IJERA Editor
Recent advances in video capturing and display technologies, along with the exponentially increasing demand of
video services, challenge the video coding research community to design new algorithms able to significantly
improve the compression performance of the current H.264/AVC standard. This target is currently gaining
evidence with the standardization activities in the High Efficiency Video Coding (HEVC) project. The distortion
models used in HEVC are mean squared error (MSE) and sum of absolute difference (SAD). However, they are
widely criticized for not correlating well with perceptual image quality. The structural similarity (SSIM) index
has been found to be a good indicator of perceived image quality. Meanwhile, it is computationally simple
compared with other state-of-the-art perceptual quality measures and has a number of desirable mathematical
properties for optimization tasks. We propose a perceptual video coding method to improve upon the current
HEVC based on an SSIM-inspired divisive normalization scheme as an attempt to transform the DCT domain
frame prediction residuals to a perceptually uniform space before encoding.
Based on the residual divisive normalization process, we define a distortion model for mode selection and show
that such a divisive normalization strategy largely simplifies the subsequent perceptual rate-distortion
optimization procedure. We further adjust the divisive normalization factors based on local content of the video
frame. Experiments show that the scheme can achieve significant gain in terms of rate-SSIM performance and
better visual quality when compared with HEVC
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...IJECEIAES
Medical image analysis has witnessed significant advancements with deep learning techniques. In the domain of brain tumor segmentation, the ability to
precisely delineate tumor boundaries from magnetic resonance imaging (MRI)
scans holds profound implications for diagnosis. This study presents an ensemble convolutional neural network (CNN) with transfer learning, integrating
the state-of-the-art Deeplabv3+ architecture with the ResNet18 backbone. The
model is rigorously trained and evaluated, exhibiting remarkable performance
metrics, including an impressive global accuracy of 99.286%, a high-class accuracy of 82.191%, a mean intersection over union (IoU) of 79.900%, a weighted
IoU of 98.620%, and a Boundary F1 (BF) score of 83.303%. Notably, a detailed comparative analysis with existing methods showcases the superiority of
our proposed model. These findings underscore the model’s competence in precise brain tumor localization, underscoring its potential to revolutionize medical
image analysis and enhance healthcare outcomes. This research paves the way
for future exploration and optimization of advanced CNN models in medical
imaging, emphasizing addressing false positives and resource efficiency.
Embedded machine learning-based road conditions and driving behavior monitoringIJECEIAES
Car accident rates have increased in recent years, resulting in losses in human lives, properties, and other financial costs. An embedded machine learning-based system is developed to address this critical issue. The system can monitor road conditions, detect driving patterns, and identify aggressive driving behaviors. The system is based on neural networks trained on a comprehensive dataset of driving events, driving styles, and road conditions. The system effectively detects potential risks and helps mitigate the frequency and impact of accidents. The primary goal is to ensure the safety of drivers and vehicles. Collecting data involved gathering information on three key road events: normal street and normal drive, speed bumps, circular yellow speed bumps, and three aggressive driving actions: sudden start, sudden stop, and sudden entry. The gathered data is processed and analyzed using a machine learning system designed for limited power and memory devices. The developed system resulted in 91.9% accuracy, 93.6% precision, and 92% recall. The achieved inference time on an Arduino Nano 33 BLE Sense with a 32-bit CPU running at 64 MHz is 34 ms and requires 2.6 kB peak RAM and 139.9 kB program flash memory, making it suitable for resource-constrained embedded systems.
More Related Content
Similar to Optimal coding unit decision for early termination in high efficiency video coding using enhanced whale optimization algorithm
Distributed Video Coding (DVC) has become increasingly popular in recent times among the researchers in video coding due to its attractive and promising features. DVC primarily has a modified complexity balance between the encoder and decoder, in contrast to conventional video codecs. However, Most of the reported DVC schemes have a high time-delay in decoder which hinders its practical application in real-time systems. In this work, we focus on speed up the Side Information(SI) generation module in DVC, which is a major function in the DVC coding algorithm and one of the time-consuming factor at the decoder. By applied it through Compute Unified Device Architecture (CUDA) based on General-Purpose Graphics Processing Unit (GPGPU), the experimental results show that a considerable speedup can be obtained by using the proposed parallelized SI generation algorithm.
Video Compression Algorithm Based on Frame Difference Approaches ijsc
The huge usage of digital multimedia via communications, wireless communications, Internet, Intranet and cellular mobile leads to incurable growth of data flow through these Media. The researchers go deep in developing efficient techniques in these fields such as compression of data, image and video. Recently, video compression techniques and their applications in many areas (educational, agriculture, medical …) cause this field to be one of the most interested fields. Wavelet transform is an efficient method that can be used to perform an efficient compression technique. This work deals with the developing of an efficient video compression approach based on frames difference approaches that concentrated on the calculation of frame near distance (difference between frames). The
selection of the meaningful frame depends on many factors such as compression performance, frame details, frame size and near distance between frames. Three different approaches are applied for removing the lowest frame difference. In this paper, many videos are tested to insure the efficiency of this technique, in addition a good performance results has been obtained.
Novel Approach for Evaluating Video Transmission using Combined Scalable Vide...IJECEIAES
One of the main problems in video transmission is the bandwidth fluctuation in wireless channel. Therefore, there is an urgent need to find an efficient bandwidth utilization and method. This research utilizes the Combined Scalable Video Coding (CSVC) which comes from Joint Scalable Video Model (JSVM). In the combined scalable video coding, we implement Coarse Grain Scalability (CGS) and Medium Grain Scalability (MGS). We propose a new scheme in which it can be implemented on Network Simulator II (NS-2) over wireless broadband network. The advantages of this new scheme over the other schemes are more realistic and based on open source program. The result shows that CSVC implementation on MGS mode outperforms CGS mode.
IMPROVING PSNR AND PROCESSING SPEED FOR HEVC USING HYBRID PSO FOR INTRA FRAME...ijma
High efficiency video coding (HEVC) is the newest video codec to increase significantly the coding
efficiency of its ancestor H.264/Advance Video Coding. However, the HEVC delivers a highly increased
computation complexity. In this paper, a coding unit partitioning pattern optimization method based on
particle swarm optimization (PSO) is proposed to reduce the computational complexity of hierarchical
quadtree-based coding unit partitioning. The required coding unit partitioning pattern for exhaustive
partitioning and the rate distortion cost are efficiently considered as the chromosome and the fitness
function of the PSO, respectively. To reduce the computational time, the cellular automata-based (CA)
rule based time limit is used in order to find out the best possible modes of operation. Compared to the
current state of the art algorithms, this scheme is computationally simple and achieves superior
reconstructed video quality (12% increase in PSNR compared to existing methods) at less computational
complexity (overall delay by 40%), Increasing the bandwidth and reducing the errors.
This document discusses video quality analysis for H.264 based on the human visual system. It proposes an improved video quality assessment method that adds color comparison to structural similarity measurement. The method separates similarity measurement into four comparisons: luminance, contrast, structure, and color. Experimental results on video sets with two distortion types show the proposed method's quality scores are more consistent with visual quality than classical methods. It also discusses the H.264 video coding standard and provides examples of encoding and decoding experimental results.
Energy-efficient Adaptive Video Streaming with Latency-Aware Dynamic Resoluti...Vignesh V Menon
Presented at Mile High Video (MHV'24)
Abstract: Traditional per-title encoding schemes aim to optimize encoding resolutions to deliver the highest perceptual quality for each representation. However, keeping the encoding time within an acceptable
threshold for a smooth user experience is important to reduce the carbon footprint and energy consumption on encoding servers in video streaming applications. Toward this realization, we introduce
an encoding latency-aware dynamic resolution encoding scheme (LADRE) for adaptive video streaming applications. LADRE determines the encoding resolution for each target bitrate by utilizing a random forest-based prediction model for every video segment based on spatiotemporal features and the acceptable target latency.
Experimental results show that LADRE achieves an overall average quality improvement of 0.58 dB PSNR and 0.43 dB XPSNR while maintaining the same bitrate, compared to the HTTP Live Streaming (HLS) bitrate ladder encoding of 200 s segments using the VVenC
encoder, when the encoding latency for each representation is set to remain below the 200 s threshold. This is accompanied by a 84.17 % reduction in overall encoding energy consumption.
Abstract: With the emergence of multiple modern video codecs, streaming service providers are forced to encode, store, and transmit bitrate ladders of multiple codecs separately, consequently suffering from additional energy costs for encoding, storage, and transmission. To tackle this issue, we introduce an online energy-efficient Multi-Codec Bitrate ladder Estimation scheme (MCBE) for adaptive video streaming applications. In MCBE, quality representations within the bitrate ladder of new-
generation codecs (e.g., High Efficiency Video Coding (HEVC),
Alliance for Open Media Video 1 (AV1)) that lie below the predicted rate-distortion curve of the Advanced Video Coding (AVC) codec are removed. Moreover, perceptual redundancy between representations of the bitrate ladders of the considered codecs is also minimized based on a Just Noticeable Difference (JND) threshold. Therefore, random forest-based models predict the VMAF score of bitrate ladder representations of each codec. In a live streaming session where all clients support the decoding of AVC, HEVC, and AV1, MCBE achieves impressive results, reducing cumulative encoding energy by 56.45%, storage energy usage by 94.99%, and transmission energy usage by 77.61% (considering a JND of six VMAF points). These energy reductions are in comparison to a baseline bitrate ladder encoding based on current industry practice.
Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...Alpen-Adria-Universität
With the emergence of multiple modern video codecs, streaming service providers are forced to encode, store, and transmit bitrate ladders of multiple codecs separately, consequently suffering from additional energy costs for encoding, storage, and transmission.
To tackle this issue, we introduce an online energy-efficient Multi-Codec Bitrate ladder Estimation scheme (MCBE) for adaptive video streaming applications. In MCBE, quality representations within the bitrate ladder of new-generation codecs (e.g., HEVC, AV1) that lie below the predicted rate-distortion curve of the AVC codec are removed. Moreover, perceptual redundancy between representations of the bitrate ladders of the considered codecs is also minimized based on a Just Noticeable Difference (JND) threshold. Therefore, random forest-based models predict the VMAF of bitrate ladder representations of each codec. In a live streaming session where all clients support the decoding of AVC, HEVC, and AV1, MCBE achieves impressive results, reducing cumulative encoding energy by 56.45%, storage energy usage by 94.99%, and transmission energy usage by 77.61% (considering a JND of six VMAF points). These energy reductions are in comparison to a baseline bitrate ladder encoding based on current industry practice.
IMPROVING PSNR AND PROCESSING SPEED FOR HEVC USING HYBRID PSO FOR INTRA FRAME...ijma
This document summarizes a research paper that proposes a new method to improve the efficiency of the HEVC video coding standard. The proposed method uses particle swarm optimization (PSO) to optimize the coding unit partitioning patterns in HEVC in order to reduce computational complexity. Compared to existing algorithms, the proposed PSO-based method achieves a 12% increase in PSNR video quality while reducing computational complexity by 40% and overall encoding delay. The method integrates this optimized intra-frame prediction into the HEVC encoding and decoding process.
IMPROVING PSNR AND PROCESSING SPEED FOR HEVC USING HYBRID PSO FOR INTRA FRAME...ijma
High efficiency video coding (HEVC) is the newest video codec to increase significantly the coding
efficiency of its ancestor H.264/Advance Video Coding. However, the HEVC delivers a highly increased
computation complexity. In this paper, a coding unit partitioning pattern optimization method based on
particle swarm optimization (PSO) is proposed to reduce the computational complexity of hierarchical
quadtree-based coding unit partitioning. The required coding unit partitioning pattern for exhaustive
partitioning and the rate distortion cost are efficiently considered as the chromosome and the fitness
function of the PSO, respectively. To reduce the computational time, the cellular automata-based (CA)
rule based time limit is used in order to find out the best possible modes of operation. Compared to the
current state of the art algorithms, this scheme is computationally simple and achieves superior
reconstructed video quality (12% increase in PSNR compared to existing methods) at less computational
complexity (overall delay by 40%), Increasing the bandwidth and reducing the errors..
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
International Journal of Engineering Research and DevelopmentIJERD Editor
The document summarizes an emerging VP8 video codec that is designed for mobile devices. It aims to significantly reduce computational complexity through several techniques while maintaining good video quality. The key techniques include a predictive algorithm for motion estimation that reduces computation by 18.5-20x compared to full search, using integer discrete cosine transform instead of floating point to achieve 2.6-3.5x speed improvement, and skipping DCT and quantization for some macroblocks to reduce computations. Experimental results on test sequences show negligible quality degradation of 0.2-0.5dB for integer DCT and 0.5dB on average for the full codec, while achieving real-time encoding rates on mobile devices. The proposed low-complexity
High Definition (HD) devices requires HD-videos for the effective uses of HD devices. However, it consists of some issues such as high storage capacity, limited battery power of high definition devices, long encoding time, and high computational complexity when it comes to the transmission, broadcasting and internet traffic. Many existing techniques consists these above-mentioned issues. Therefore, there is a need of an efficient technique, which reduces unnecessary amount of space, provides high compression rate and requires low bandwidth spectrum. Therefore, in the paper we have introduced an efficient video compression technique as modified HEVC coding based on saliency features to counter these existing drawbacks. We highlight first, on extracting features on the raw data and then compressed it largely. This technique makes our model powerful and provides effective performance in terms of compression. Our experiment results proves that our model provide better efficiency in terms of average PSNR, MSE and bitrate. Our experimental results outperforms all the existing techniques in terms of saliency map detection, AUC, NSS, KLD and JSD. The average AUC, NSS and KLD value by our proposed method are 0.846, 1.702 and 0.532 respectively which is very high compare to other existing technique.
In the current scenario compression of video files is in high demand. Color video compression has become a significant technology to lessen the memory space and to decrease transmission time. Video compression using fractal technique is based on self similarity concept by comparing the range block and domain block. However, its computational complexity is very high. In this paper we presented hybrid video compression technique to compress Audio/Video Interleaved file and overcome the problem of Computational complexity. We implemented Discrete Wavelet Transform and hybrid fractal HV partition technique using Particle Swarm Optimization (called mapping of PSO) for compression of videos. The analysis demonstrate that hybrid technique gives a very good speed up to compress video and achieve Peak Signal to Noise Ratio.
Scanned document compression using block based hybrid video codecMuthu Samy
Sybian Technologies Pvt Ltd
Final Year Projects & Real Time live Projects
JAVA(All Domains)
DOTNET(All Domains)
ANDROID
EMBEDDED
VLSI
MATLAB
Project Support
Abstract, Diagrams, Review Details, Relevant Materials, Presentation,
Supporting Documents, Software E-Books,
Software Development Standards & Procedure
E-Book, Theory Classes, Lab Working Programs, Project Design & Implementation
24/7 lab session
Final Year Projects For BE,ME,B.Sc,M.Sc,B.Tech,BCA,MCA
PROJECT DOMAIN:
Cloud Computing
Networking
Network Security
PARALLEL AND DISTRIBUTED SYSTEM
Data Mining
Mobile Computing
Service Computing
Software Engineering
Image Processing
Bio Medical / Medical Imaging
Contact Details:
Sybian Technologies Pvt Ltd,
No,33/10 Meenakshi Sundaram Building,
Sivaji Street,
(Near T.nagar Bus Terminus)
T.Nagar,
Chennai-600 017
Ph:044 42070551
Mobile No:9790877889,9003254624,7708845605
Mail Id:sybianprojects@gmail.com,sunbeamvijay@yahoo.com
Scanned document compression using block based hybrid video codecMuthu Samy
Sybian Technologies Pvt Ltd
Final Year Projects & Real Time live Projects
JAVA(All Domains)
DOTNET(All Domains)
ANDROID
EMBEDDED
VLSI
MATLAB
Project Support
Abstract, Diagrams, Review Details, Relevant Materials, Presentation,
Supporting Documents, Software E-Books,
Software Development Standards & Procedure
E-Book, Theory Classes, Lab Working Programs, Project Design & Implementation
24/7 lab session
Final Year Projects For BE,ME,B.Sc,M.Sc,B.Tech,BCA,MCA
PROJECT DOMAIN:
Cloud Computing
Networking
Network Security
PARALLEL AND DISTRIBUTED SYSTEM
Data Mining
Mobile Computing
Service Computing
Software Engineering
Image Processing
Bio Medical / Medical Imaging
Contact Details:
Sybian Technologies Pvt Ltd,
No,33/10 Meenakshi Sundaram Building,
Sivaji Street,
(Near T.nagar Bus Terminus)
T.Nagar,
Chennai-600 017
Ph:044 42070551
Mobile No:9790877889,9003254624,7708845605
Mail Id:sybianprojects@gmail.com,sunbeamvijay@yahoo.com
Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...Ijripublishers Ijri
Global interconnect planning becomes a challenge as semiconductor technology continuously scales. Because of the increasing wire resistance and higher capacitive coupling in smaller features, the delay of global interconnects becomes large compared with the delay of a logic gate, introducing a huge performance gap that needs to be resolved A novel equalized global link architecture and driver– receiver co design flow are proposed for high-speed and low-energy on-chip communication by utilizing a continuous-time linear equalizer (CTLE). The proposed global link is analyzed using a linear system method, and the formula of CTLE eye opening is derived to provide high-level design guidelines and insights.
Compared with the separate driver–receiver design flow, over 50% energy reduction is observed.
HARDWARE SOFTWARE CO-SIMULATION OF MOTION ESTIMATION IN H.264 ENCODERcscpconf
This paper proposes about motion estimation in H.264/AVC encoder. Compared with standards
such as MPEG-2 and MPEG-4 Visual, H.264 can deliver better image quality at the same
compressed bit rate or at a lower bit rate. The increase in compression efficiency comes at the
expense of increase in complexity, which is a fact that must be overcome. An efficient Co-design
methodology is required, where the encoder software application is highly optimized and
structured in a very modular and efficient manner, so as to allow its most complex and time
consuming operations to be offloaded to dedicated hardware accelerators. The Motion
Estimation algorithm is the most computationally intensive part of the encoder which is simulated using MATLAB. The hardware/software co-simulation is done using system generator tool and implemented using Xilinx FPGA Spartan 3E for different scanning methods.
Efficient Architecture for Variable Block Size Motion Estimation in H.264/AVCIDES Editor
This paper proposes an efficient VLSI architecture
for the implementation of variable block size motion
estimation (VBSME). To improve the performance video
compression the Variable Block Size Motion Estimation
(VBSME) is the critical path. Variable Block Size Motion
Estimation feature has been introduced in to the H.264/AVC.
This feature induces significant complexities into the design
of the H.264/AVC video codec. This paper we compare the
existing architectures for VBSME. An efficient architecture
to improve the performance of Spiral Search for Variable Size
Motion Estimation in H.264/AVC is proposed. Among various
architectures available for VBSME spiral search provides
hardware friendly data flow with efficient utilization of
resources. The proposed implementation is verified using the
MATLAB on foreman, coastguard and train sequences. The
proposed Adaptive thresholding technique reduces the average
number of computations significantly with negligible effect
on the video quality. The results are verified using hardware
implementation on Xilinx Virtex 4 it was able to achieve real
time video coding of 60 fps at 95.56 MHz CLK frequency.
A Novel Approaches For Chromatic Squander Less Visceral Coding Techniques Usi...IJERA Editor
Recent advances in video capturing and display technologies, along with the exponentially increasing demand of
video services, challenge the video coding research community to design new algorithms able to significantly
improve the compression performance of the current H.264/AVC standard. This target is currently gaining
evidence with the standardization activities in the High Efficiency Video Coding (HEVC) project. The distortion
models used in HEVC are mean squared error (MSE) and sum of absolute difference (SAD). However, they are
widely criticized for not correlating well with perceptual image quality. The structural similarity (SSIM) index
has been found to be a good indicator of perceived image quality. Meanwhile, it is computationally simple
compared with other state-of-the-art perceptual quality measures and has a number of desirable mathematical
properties for optimization tasks. We propose a perceptual video coding method to improve upon the current
HEVC based on an SSIM-inspired divisive normalization scheme as an attempt to transform the DCT domain
frame prediction residuals to a perceptually uniform space before encoding.
Based on the residual divisive normalization process, we define a distortion model for mode selection and show
that such a divisive normalization strategy largely simplifies the subsequent perceptual rate-distortion
optimization procedure. We further adjust the divisive normalization factors based on local content of the video
frame. Experiments show that the scheme can achieve significant gain in terms of rate-SSIM performance and
better visual quality when compared with HEVC
Similar to Optimal coding unit decision for early termination in high efficiency video coding using enhanced whale optimization algorithm (20)
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...IJECEIAES
Medical image analysis has witnessed significant advancements with deep learning techniques. In the domain of brain tumor segmentation, the ability to
precisely delineate tumor boundaries from magnetic resonance imaging (MRI)
scans holds profound implications for diagnosis. This study presents an ensemble convolutional neural network (CNN) with transfer learning, integrating
the state-of-the-art Deeplabv3+ architecture with the ResNet18 backbone. The
model is rigorously trained and evaluated, exhibiting remarkable performance
metrics, including an impressive global accuracy of 99.286%, a high-class accuracy of 82.191%, a mean intersection over union (IoU) of 79.900%, a weighted
IoU of 98.620%, and a Boundary F1 (BF) score of 83.303%. Notably, a detailed comparative analysis with existing methods showcases the superiority of
our proposed model. These findings underscore the model’s competence in precise brain tumor localization, underscoring its potential to revolutionize medical
image analysis and enhance healthcare outcomes. This research paves the way
for future exploration and optimization of advanced CNN models in medical
imaging, emphasizing addressing false positives and resource efficiency.
Embedded machine learning-based road conditions and driving behavior monitoringIJECEIAES
Car accident rates have increased in recent years, resulting in losses in human lives, properties, and other financial costs. An embedded machine learning-based system is developed to address this critical issue. The system can monitor road conditions, detect driving patterns, and identify aggressive driving behaviors. The system is based on neural networks trained on a comprehensive dataset of driving events, driving styles, and road conditions. The system effectively detects potential risks and helps mitigate the frequency and impact of accidents. The primary goal is to ensure the safety of drivers and vehicles. Collecting data involved gathering information on three key road events: normal street and normal drive, speed bumps, circular yellow speed bumps, and three aggressive driving actions: sudden start, sudden stop, and sudden entry. The gathered data is processed and analyzed using a machine learning system designed for limited power and memory devices. The developed system resulted in 91.9% accuracy, 93.6% precision, and 92% recall. The achieved inference time on an Arduino Nano 33 BLE Sense with a 32-bit CPU running at 64 MHz is 34 ms and requires 2.6 kB peak RAM and 139.9 kB program flash memory, making it suitable for resource-constrained embedded systems.
Advanced control scheme of doubly fed induction generator for wind turbine us...IJECEIAES
This paper describes a speed control device for generating electrical energy on an electricity network based on the doubly fed induction generator (DFIG) used for wind power conversion systems. At first, a double-fed induction generator model was constructed. A control law is formulated to govern the flow of energy between the stator of a DFIG and the energy network using three types of controllers: proportional integral (PI), sliding mode controller (SMC) and second order sliding mode controller (SOSMC). Their different results in terms of power reference tracking, reaction to unexpected speed fluctuations, sensitivity to perturbations, and resilience against machine parameter alterations are compared. MATLAB/Simulink was used to conduct the simulations for the preceding study. Multiple simulations have shown very satisfying results, and the investigations demonstrate the efficacy and power-enhancing capabilities of the suggested control system.
Neural network optimizer of proportional-integral-differential controller par...IJECEIAES
Wide application of proportional-integral-differential (PID)-regulator in industry requires constant improvement of methods of its parameters adjustment. The paper deals with the issues of optimization of PID-regulator parameters with the use of neural network technology methods. A methodology for choosing the architecture (structure) of neural network optimizer is proposed, which consists in determining the number of layers, the number of neurons in each layer, as well as the form and type of activation function. Algorithms of neural network training based on the application of the method of minimizing the mismatch between the regulated value and the target value are developed. The method of back propagation of gradients is proposed to select the optimal training rate of neurons of the neural network. The neural network optimizer, which is a superstructure of the linear PID controller, allows increasing the regulation accuracy from 0.23 to 0.09, thus reducing the power consumption from 65% to 53%. The results of the conducted experiments allow us to conclude that the created neural superstructure may well become a prototype of an automatic voltage regulator (AVR)-type industrial controller for tuning the parameters of the PID controller.
An improved modulation technique suitable for a three level flying capacitor ...IJECEIAES
This research paper introduces an innovative modulation technique for controlling a 3-level flying capacitor multilevel inverter (FCMLI), aiming to streamline the modulation process in contrast to conventional methods. The proposed
simplified modulation technique paves the way for more straightforward and
efficient control of multilevel inverters, enabling their widespread adoption and
integration into modern power electronic systems. Through the amalgamation of
sinusoidal pulse width modulation (SPWM) with a high-frequency square wave
pulse, this controlling technique attains energy equilibrium across the coupling
capacitor. The modulation scheme incorporates a simplified switching pattern
and a decreased count of voltage references, thereby simplifying the control
algorithm.
A review on features and methods of potential fishing zoneIJECEIAES
This review focuses on the importance of identifying potential fishing zones in seawater for sustainable fishing practices. It explores features like sea surface temperature (SST) and sea surface height (SSH), along with classification methods such as classifiers. The features like SST, SSH, and different classifiers used to classify the data, have been figured out in this review study. This study underscores the importance of examining potential fishing zones using advanced analytical techniques. It thoroughly explores the methodologies employed by researchers, covering both past and current approaches. The examination centers on data characteristics and the application of classification algorithms for classification of potential fishing zones. Furthermore, the prediction of potential fishing zones relies significantly on the effectiveness of classification algorithms. Previous research has assessed the performance of models like support vector machines, naïve Bayes, and artificial neural networks (ANN). In the previous result, the results of support vector machine (SVM) were 97.6% more accurate than naive Bayes's 94.2% to classify test data for fisheries classification. By considering the recent works in this area, several recommendations for future works are presented to further improve the performance of the potential fishing zone models, which is important to the fisheries community.
Electrical signal interference minimization using appropriate core material f...IJECEIAES
As demand for smaller, quicker, and more powerful devices rises, Moore's law is strictly followed. The industry has worked hard to make little devices that boost productivity. The goal is to optimize device density. Scientists are reducing connection delays to improve circuit performance. This helped them understand three-dimensional integrated circuit (3D IC) concepts, which stack active devices and create vertical connections to diminish latency and lower interconnects. Electrical involvement is a big worry with 3D integrates circuits. Researchers have developed and tested through silicon via (TSV) and substrates to decrease electrical wave involvement. This study illustrates a novel noise coupling reduction method using several electrical involvement models. A 22% drop in electrical involvement from wave-carrying to victim TSVs introduces this new paradigm and improves system performance even at higher THz frequencies.
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...IJECEIAES
Climate change's impact on the planet forced the United Nations and governments to promote green energies and electric transportation. The deployments of photovoltaic (PV) and electric vehicle (EV) systems gained stronger momentum due to their numerous advantages over fossil fuel types. The advantages go beyond sustainability to reach financial support and stability. The work in this paper introduces the hybrid system between PV and EV to support industrial and commercial plants. This paper covers the theoretical framework of the proposed hybrid system including the required equation to complete the cost analysis when PV and EV are present. In addition, the proposed design diagram which sets the priorities and requirements of the system is presented. The proposed approach allows setup to advance their power stability, especially during power outages. The presented information supports researchers and plant owners to complete the necessary analysis while promoting the deployment of clean energy. The result of a case study that represents a dairy milk farmer supports the theoretical works and highlights its advanced benefits to existing plants. The short return on investment of the proposed approach supports the paper's novelty approach for the sustainable electrical system. In addition, the proposed system allows for an isolated power setup without the need for a transmission line which enhances the safety of the electrical network
Bibliometric analysis highlighting the role of women in addressing climate ch...IJECEIAES
Fossil fuel consumption increased quickly, contributing to climate change
that is evident in unusual flooding and draughts, and global warming. Over
the past ten years, women's involvement in society has grown dramatically,
and they succeeded in playing a noticeable role in reducing climate change.
A bibliometric analysis of data from the last ten years has been carried out to
examine the role of women in addressing the climate change. The analysis's
findings discussed the relevant to the sustainable development goals (SDGs),
particularly SDG 7 and SDG 13. The results considered contributions made
by women in the various sectors while taking geographic dispersion into
account. The bibliometric analysis delves into topics including women's
leadership in environmental groups, their involvement in policymaking, their
contributions to sustainable development projects, and the influence of
gender diversity on attempts to mitigate climate change. This study's results
highlight how women have influenced policies and actions related to climate
change, point out areas of research deficiency and recommendations on how
to increase role of the women in addressing the climate change and
achieving sustainability. To achieve more successful results, this initiative
aims to highlight the significance of gender equality and encourage
inclusivity in climate change decision-making processes.
Voltage and frequency control of microgrid in presence of micro-turbine inter...IJECEIAES
The active and reactive load changes have a significant impact on voltage
and frequency. In this paper, in order to stabilize the microgrid (MG) against
load variations in islanding mode, the active and reactive power of all
distributed generators (DGs), including energy storage (battery), diesel
generator, and micro-turbine, are controlled. The micro-turbine generator is
connected to MG through a three-phase to three-phase matrix converter, and
the droop control method is applied for controlling the voltage and
frequency of MG. In addition, a method is introduced for voltage and
frequency control of micro-turbines in the transition state from gridconnected mode to islanding mode. A novel switching strategy of the matrix
converter is used for converting the high-frequency output voltage of the
micro-turbine to the grid-side frequency of the utility system. Moreover,
using the switching strategy, the low-order harmonics in the output current
and voltage are not produced, and consequently, the size of the output filter
would be reduced. In fact, the suggested control strategy is load-independent
and has no frequency conversion restrictions. The proposed approach for
voltage and frequency regulation demonstrates exceptional performance and
favorable response across various load alteration scenarios. The suggested
strategy is examined in several scenarios in the MG test systems, and the
simulation results are addressed.
Enhancing battery system identification: nonlinear autoregressive modeling fo...IJECEIAES
Precisely characterizing Li-ion batteries is essential for optimizing their
performance, enhancing safety, and prolonging their lifespan across various
applications, such as electric vehicles and renewable energy systems. This
article introduces an innovative nonlinear methodology for system
identification of a Li-ion battery, employing a nonlinear autoregressive with
exogenous inputs (NARX) model. The proposed approach integrates the
benefits of nonlinear modeling with the adaptability of the NARX structure,
facilitating a more comprehensive representation of the intricate
electrochemical processes within the battery. Experimental data collected
from a Li-ion battery operating under diverse scenarios are employed to
validate the effectiveness of the proposed methodology. The identified
NARX model exhibits superior accuracy in predicting the battery's behavior
compared to traditional linear models. This study underscores the
importance of accounting for nonlinearities in battery modeling, providing
insights into the intricate relationships between state-of-charge, voltage, and
current under dynamic conditions.
Smart grid deployment: from a bibliometric analysis to a surveyIJECEIAES
Smart grids are one of the last decades' innovations in electrical energy.
They bring relevant advantages compared to the traditional grid and
significant interest from the research community. Assessing the field's
evolution is essential to propose guidelines for facing new and future smart
grid challenges. In addition, knowing the main technologies involved in the
deployment of smart grids (SGs) is important to highlight possible
shortcomings that can be mitigated by developing new tools. This paper
contributes to the research trends mentioned above by focusing on two
objectives. First, a bibliometric analysis is presented to give an overview of
the current research level about smart grid deployment. Second, a survey of
the main technological approaches used for smart grid implementation and
their contributions are highlighted. To that effect, we searched the Web of
Science (WoS), and the Scopus databases. We obtained 5,663 documents
from WoS and 7,215 from Scopus on smart grid implementation or
deployment. With the extraction limitation in the Scopus database, 5,872 of
the 7,215 documents were extracted using a multi-step process. These two
datasets have been analyzed using a bibliometric tool called bibliometrix.
The main outputs are presented with some recommendations for future
research.
Use of analytical hierarchy process for selecting and prioritizing islanding ...IJECEIAES
One of the problems that are associated to power systems is islanding
condition, which must be rapidly and properly detected to prevent any
negative consequences on the system's protection, stability, and security.
This paper offers a thorough overview of several islanding detection
strategies, which are divided into two categories: classic approaches,
including local and remote approaches, and modern techniques, including
techniques based on signal processing and computational intelligence.
Additionally, each approach is compared and assessed based on several
factors, including implementation costs, non-detected zones, declining
power quality, and response times using the analytical hierarchy process
(AHP). The multi-criteria decision-making analysis shows that the overall
weight of passive methods (24.7%), active methods (7.8%), hybrid methods
(5.6%), remote methods (14.5%), signal processing-based methods (26.6%),
and computational intelligent-based methods (20.8%) based on the
comparison of all criteria together. Thus, it can be seen from the total weight
that hybrid approaches are the least suitable to be chosen, while signal
processing-based methods are the most appropriate islanding detection
method to be selected and implemented in power system with respect to the
aforementioned factors. Using Expert Choice software, the proposed
hierarchy model is studied and examined.
Enhancing of single-stage grid-connected photovoltaic system using fuzzy logi...IJECEIAES
The power generated by photovoltaic (PV) systems is influenced by
environmental factors. This variability hampers the control and utilization of
solar cells' peak output. In this study, a single-stage grid-connected PV
system is designed to enhance power quality. Our approach employs fuzzy
logic in the direct power control (DPC) of a three-phase voltage source
inverter (VSI), enabling seamless integration of the PV connected to the
grid. Additionally, a fuzzy logic-based maximum power point tracking
(MPPT) controller is adopted, which outperforms traditional methods like
incremental conductance (INC) in enhancing solar cell efficiency and
minimizing the response time. Moreover, the inverter's real-time active and
reactive power is directly managed to achieve a unity power factor (UPF).
The system's performance is assessed through MATLAB/Simulink
implementation, showing marked improvement over conventional methods,
particularly in steady-state and varying weather conditions. For solar
irradiances of 500 and 1,000 W/m2
, the results show that the proposed
method reduces the total harmonic distortion (THD) of the injected current
to the grid by approximately 46% and 38% compared to conventional
methods, respectively. Furthermore, we compare the simulation results with
IEEE standards to evaluate the system's grid compatibility.
Enhancing photovoltaic system maximum power point tracking with fuzzy logic-b...IJECEIAES
Photovoltaic systems have emerged as a promising energy resource that
caters to the future needs of society, owing to their renewable, inexhaustible,
and cost-free nature. The power output of these systems relies on solar cell
radiation and temperature. In order to mitigate the dependence on
atmospheric conditions and enhance power tracking, a conventional
approach has been improved by integrating various methods. To optimize
the generation of electricity from solar systems, the maximum power point
tracking (MPPT) technique is employed. To overcome limitations such as
steady-state voltage oscillations and improve transient response, two
traditional MPPT methods, namely fuzzy logic controller (FLC) and perturb
and observe (P&O), have been modified. This research paper aims to
simulate and validate the step size of the proposed modified P&O and FLC
techniques within the MPPT algorithm using MATLAB/Simulink for
efficient power tracking in photovoltaic systems.
Adaptive synchronous sliding control for a robot manipulator based on neural ...IJECEIAES
Robot manipulators have become important equipment in production lines, medical fields, and transportation. Improving the quality of trajectory tracking for
robot hands is always an attractive topic in the research community. This is a
challenging problem because robot manipulators are complex nonlinear systems
and are often subject to fluctuations in loads and external disturbances. This
article proposes an adaptive synchronous sliding control scheme to improve trajectory tracking performance for a robot manipulator. The proposed controller
ensures that the positions of the joints track the desired trajectory, synchronize
the errors, and significantly reduces chattering. First, the synchronous tracking
errors and synchronous sliding surfaces are presented. Second, the synchronous
tracking error dynamics are determined. Third, a robust adaptive control law is
designed,the unknown components of the model are estimated online by the neural network, and the parameters of the switching elements are selected by fuzzy
logic. The built algorithm ensures that the tracking and approximation errors
are ultimately uniformly bounded (UUB). Finally, the effectiveness of the constructed algorithm is demonstrated through simulation and experimental results.
Simulation and experimental results show that the proposed controller is effective with small synchronous tracking errors, and the chattering phenomenon is
significantly reduced.
Remote field-programmable gate array laboratory for signal acquisition and de...IJECEIAES
A remote laboratory utilizing field-programmable gate array (FPGA) technologies enhances students’ learning experience anywhere and anytime in embedded system design. Existing remote laboratories prioritize hardware access and visual feedback for observing board behavior after programming, neglecting comprehensive debugging tools to resolve errors that require internal signal acquisition. This paper proposes a novel remote embeddedsystem design approach targeting FPGA technologies that are fully interactive via a web-based platform. Our solution provides FPGA board access and debugging capabilities beyond the visual feedback provided by existing remote laboratories. We implemented a lab module that allows users to seamlessly incorporate into their FPGA design. The module minimizes hardware resource utilization while enabling the acquisition of a large number of data samples from the signal during the experiments by adaptively compressing the signal prior to data transmission. The results demonstrate an average compression ratio of 2.90 across three benchmark signals, indicating efficient signal acquisition and effective debugging and analysis. This method allows users to acquire more data samples than conventional methods. The proposed lab allows students to remotely test and debug their designs, bridging the gap between theory and practice in embedded system design.
Detecting and resolving feature envy through automated machine learning and m...IJECEIAES
Efficiently identifying and resolving code smells enhances software project quality. This paper presents a novel solution, utilizing automated machine learning (AutoML) techniques, to detect code smells and apply move method refactoring. By evaluating code metrics before and after refactoring, we assessed its impact on coupling, complexity, and cohesion. Key contributions of this research include a unique dataset for code smell classification and the development of models using AutoGluon for optimal performance. Furthermore, the study identifies the top 20 influential features in classifying feature envy, a well-known code smell, stemming from excessive reliance on external classes. We also explored how move method refactoring addresses feature envy, revealing reduced coupling and complexity, and improved cohesion, ultimately enhancing code quality. In summary, this research offers an empirical, data-driven approach, integrating AutoML and move method refactoring to optimize software project quality. Insights gained shed light on the benefits of refactoring on code quality and the significance of specific features in detecting feature envy. Future research can expand to explore additional refactoring techniques and a broader range of code metrics, advancing software engineering practices and standards.
Smart monitoring technique for solar cell systems using internet of things ba...IJECEIAES
Rapidly and remotely monitoring and receiving the solar cell systems status parameters, solar irradiance, temperature, and humidity, are critical issues in enhancement their efficiency. Hence, in the present article an improved smart prototype of internet of things (IoT) technique based on embedded system through NodeMCU ESP8266 (ESP-12E) was carried out experimentally. Three different regions at Egypt; Luxor, Cairo, and El-Beheira cities were chosen to study their solar irradiance profile, temperature, and humidity by the proposed IoT system. The monitoring data of solar irradiance, temperature, and humidity were live visualized directly by Ubidots through hypertext transfer protocol (HTTP) protocol. The measured solar power radiation in Luxor, Cairo, and El-Beheira ranged between 216-1000, 245-958, and 187-692 W/m 2 respectively during the solar day. The accuracy and rapidity of obtaining monitoring results using the proposed IoT system made it a strong candidate for application in monitoring solar cell systems. On the other hand, the obtained solar power radiation results of the three considered regions strongly candidate Luxor and Cairo as suitable places to build up a solar cells system station rather than El-Beheira.
An efficient security framework for intrusion detection and prevention in int...IJECEIAES
Over the past few years, the internet of things (IoT) has advanced to connect billions of smart devices to improve quality of life. However, anomalies or malicious intrusions pose several security loopholes, leading to performance degradation and threat to data security in IoT operations. Thereby, IoT security systems must keep an eye on and restrict unwanted events from occurring in the IoT network. Recently, various technical solutions based on machine learning (ML) models have been derived towards identifying and restricting unwanted events in IoT. However, most ML-based approaches are prone to miss-classification due to inappropriate feature selection. Additionally, most ML approaches applied to intrusion detection and prevention consider supervised learning, which requires a large amount of labeled data to be trained. Consequently, such complex datasets are impossible to source in a large network like IoT. To address this problem, this proposed study introduces an efficient learning mechanism to strengthen the IoT security aspects. The proposed algorithm incorporates supervised and unsupervised approaches to improve the learning models for intrusion detection and mitigation. Compared with the related works, the experimental outcome shows that the model performs well in a benchmark dataset. It accomplishes an improved detection accuracy of approximately 99.21%.
Blood finder application project report (1).pdfKamal Acharya
Blood Finder is an emergency time app where a user can search for the blood banks as
well as the registered blood donors around Mumbai. This application also provide an
opportunity for the user of this application to become a registered donor for this user have
to enroll for the donor request from the application itself. If the admin wish to make user
a registered donor, with some of the formalities with the organization it can be done.
Specialization of this application is that the user will not have to register on sign-in for
searching the blood banks and blood donors it can be just done by installing the
application to the mobile.
The purpose of making this application is to save the user’s time for searching blood of
needed blood group during the time of the emergency.
This is an android application developed in Java and XML with the connectivity of
SQLite database. This application will provide most of basic functionality required for an
emergency time application. All the details of Blood banks and Blood donors are stored
in the database i.e. SQLite.
This application allowed the user to get all the information regarding blood banks and
blood donors such as Name, Number, Address, Blood Group, rather than searching it on
the different websites and wasting the precious time. This application is effective and
user friendly.
Sri Guru Hargobind Ji - Bandi Chor Guru.pdfBalvir Singh
Sri Guru Hargobind Ji (19 June 1595 - 3 March 1644) is revered as the Sixth Nanak.
• On 25 May 1606 Guru Arjan nominated his son Sri Hargobind Ji as his successor. Shortly
afterwards, Guru Arjan was arrested, tortured and killed by order of the Mogul Emperor
Jahangir.
• Guru Hargobind's succession ceremony took place on 24 June 1606. He was barely
eleven years old when he became 6th Guru.
• As ordered by Guru Arjan Dev Ji, he put on two swords, one indicated his spiritual
authority (PIRI) and the other, his temporal authority (MIRI). He thus for the first time
initiated military tradition in the Sikh faith to resist religious persecution, protect
people’s freedom and independence to practice religion by choice. He transformed
Sikhs to be Saints and Soldier.
• He had a long tenure as Guru, lasting 37 years, 9 months and 3 days
This study Examines the Effectiveness of Talent Procurement through the Imple...DharmaBanothu
In the world with high technology and fast
forward mindset recruiters are walking/showing interest
towards E-Recruitment. Present most of the HRs of
many companies are choosing E-Recruitment as the best
choice for recruitment. E-Recruitment is being done
through many online platforms like Linkedin, Naukri,
Instagram , Facebook etc. Now with high technology E-
Recruitment has gone through next level by using
Artificial Intelligence too.
Key Words : Talent Management, Talent Acquisition , E-
Recruitment , Artificial Intelligence Introduction
Effectiveness of Talent Acquisition through E-
Recruitment in this topic we will discuss about 4important
and interlinked topics which are
Tools & Techniques for Commissioning and Maintaining PV Systems W-Animations ...Transcat
Join us for this solutions-based webinar on the tools and techniques for commissioning and maintaining PV Systems. In this session, we'll review the process of building and maintaining a solar array, starting with installation and commissioning, then reviewing operations and maintenance of the system. This course will review insulation resistance testing, I-V curve testing, earth-bond continuity, ground resistance testing, performance tests, visual inspections, ground and arc fault testing procedures, and power quality analysis.
Fluke Solar Application Specialist Will White is presenting on this engaging topic:
Will has worked in the renewable energy industry since 2005, first as an installer for a small east coast solar integrator before adding sales, design, and project management to his skillset. In 2022, Will joined Fluke as a solar application specialist, where he supports their renewable energy testing equipment like IV-curve tracers, electrical meters, and thermal imaging cameras. Experienced in wind power, solar thermal, energy storage, and all scales of PV, Will has primarily focused on residential and small commercial systems. He is passionate about implementing high-quality, code-compliant installation techniques.
Applications of artificial Intelligence in Mechanical Engineering.pdfAtif Razi
Historically, mechanical engineering has relied heavily on human expertise and empirical methods to solve complex problems. With the introduction of computer-aided design (CAD) and finite element analysis (FEA), the field took its first steps towards digitization. These tools allowed engineers to simulate and analyze mechanical systems with greater accuracy and efficiency. However, the sheer volume of data generated by modern engineering systems and the increasing complexity of these systems have necessitated more advanced analytical tools, paving the way for AI.
AI offers the capability to process vast amounts of data, identify patterns, and make predictions with a level of speed and accuracy unattainable by traditional methods. This has profound implications for mechanical engineering, enabling more efficient design processes, predictive maintenance strategies, and optimized manufacturing operations. AI-driven tools can learn from historical data, adapt to new information, and continuously improve their performance, making them invaluable in tackling the multifaceted challenges of modern mechanical engineering.
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...PriyankaKilaniya
Energy efficiency has been important since the latter part of the last century. The main object of this survey is to determine the energy efficiency knowledge among consumers. Two separate districts in Bangladesh are selected to conduct the survey on households and showrooms about the energy and seller also. The survey uses the data to find some regression equations from which it is easy to predict energy efficiency knowledge. The data is analyzed and calculated based on five important criteria. The initial target was to find some factors that help predict a person's energy efficiency knowledge. From the survey, it is found that the energy efficiency awareness among the people of our country is very low. Relationships between household energy use behaviors are estimated using a unique dataset of about 40 households and 20 showrooms in Bangladesh's Chapainawabganj and Bagerhat districts. Knowledge of energy consumption and energy efficiency technology options is found to be associated with household use of energy conservation practices. Household characteristics also influence household energy use behavior. Younger household cohorts are more likely to adopt energy-efficient technologies and energy conservation practices and place primary importance on energy saving for environmental reasons. Education also influences attitudes toward energy conservation in Bangladesh. Low-education households indicate they primarily save electricity for the environment while high-education households indicate they are motivated by environmental concerns.
A high-Speed Communication System is based on the Design of a Bi-NoC Router, ...DharmaBanothu
The Network on Chip (NoC) has emerged as an effective
solution for intercommunication infrastructure within System on
Chip (SoC) designs, overcoming the limitations of traditional
methods that face significant bottlenecks. However, the complexity
of NoC design presents numerous challenges related to
performance metrics such as scalability, latency, power
consumption, and signal integrity. This project addresses the
issues within the router's memory unit and proposes an enhanced
memory structure. To achieve efficient data transfer, FIFO buffers
are implemented in distributed RAM and virtual channels for
FPGA-based NoC. The project introduces advanced FIFO-based
memory units within the NoC router, assessing their performance
in a Bi-directional NoC (Bi-NoC) configuration. The primary
objective is to reduce the router's workload while enhancing the
FIFO internal structure. To further improve data transfer speed,
a Bi-NoC with a self-configurable intercommunication channel is
suggested. Simulation and synthesis results demonstrate
guaranteed throughput, predictable latency, and equitable
network access, showing significant improvement over previous
designs
Properties of Fluids, Fluid Statics, Pressure MeasurementIndrajeet sahu
Properties of Fluids: Density, viscosity, surface tension, compressibility, and specific gravity define fluid behavior.
Fluid Statics: Studies pressure, hydrostatic pressure, buoyancy, and fluid forces on surfaces.
Pressure at a Point: In a static fluid, the pressure at any point is the same in all directions. This is known as Pascal's principle. The pressure increases with depth due to the weight of the fluid above.
Hydrostatic Pressure: The pressure exerted by a fluid at rest due to the force of gravity. It can be calculated using the formula P=ρghP=ρgh, where PP is the pressure, ρρ is the fluid density, gg is the acceleration due to gravity, and hh is the height of the fluid column above the point in question.
Buoyancy: The upward force exerted by a fluid on a submerged or partially submerged object. This force is equal to the weight of the fluid displaced by the object, as described by Archimedes' principle. Buoyancy explains why objects float or sink in fluids.
Fluid Pressure on Surfaces: The analysis of pressure forces on surfaces submerged in fluids. This includes calculating the total force and the center of pressure, which is the point where the resultant pressure force acts.
Pressure Measurement: Manometers, barometers, pressure gauges, and differential pressure transducers measure fluid pressure.
Open Channel Flow: fluid flow with a free surfaceIndrajeet sahu
Open Channel Flow: This topic focuses on fluid flow with a free surface, such as in rivers, canals, and drainage ditches. Key concepts include the classification of flow types (steady vs. unsteady, uniform vs. non-uniform), hydraulic radius, flow resistance, Manning's equation, critical flow conditions, and energy and momentum principles. It also covers flow measurement techniques, gradually varied flow analysis, and the design of open channels. Understanding these principles is vital for effective water resource management and engineering applications.
We have designed & manufacture the Lubi Valves LBF series type of Butterfly Valves for General Utility Water applications as well as for HVAC applications.
Optimal coding unit decision for early termination in high efficiency video coding using enhanced whale optimization algorithm
1. International Journal of Electrical and Computer Engineering (IJECE)
Vol. 13, No. 6, December 2023, pp. 6378~6387
ISSN: 2088-8708, DOI: 10.11591/ijece.v13i6.pp6378-6387 6378
Journal homepage: http://ijece.iaescore.com
Optimal coding unit decision for early termination in high
efficiency video coding using enhanced whale optimization
algorithm
Suhas Shankarnahalli Krishnegowda, Hosanna Princye Periapandi
Department of Electronics and Communication Engineering, S.E.A. College of Engineering and Technology, Visvesvaraya
Technological University, Belagavi, India
Article Info ABSTRACT
Article history:
Received Jan 28, 2023
Revised Jul 8, 2023
Accepted Jul 17, 2023
Video compression is an emerging research topic in the field of block based
video encoders. Due to the growth of video coding technologies, high
efficiency video coding (HEVC) delivers superior coding performance. With
the increased encoding complexity, the HEVC enhances the rate-distortion
(RD) performance. In the video compression, the out-sized coding units (CUs)
have higher encoding complexity. Therefore, the computational encoding cost
and complexity remain vital concerns, which need to be considered as an
optimization task. In this manuscript, an enhanced whale optimization
algorithm (EWOA) is implemented to reduce the computational time and
complexity of the HEVC. In the EWOA, a cosine function is incorporated
with the controlling parameter A and two correlation factors are included in
the WOA for controlling the position of whales and regulating the movement
of search mechanism during the optimization and search processes. The bit
streams in the Luma-coding tree block are selected using EWOA that defines
the CU neighbors and is used in the HEVC. The results indicate that the
EWOA achieves best bit rate (BR), time saving, and peak signal to noise ratio
(PSNR). The EWOA showed 0.006-0.012 dB higher PSNR than the existing
models in the real-time videos.
Keywords:
Coding units
Discrete cosine transform
Faster encoding
High efficiency video coding
Prediction units
Whale optimization algorithm
This is an open access article under the CC BY-SA license.
Corresponding Author:
Suhas Shankarnahalli Krishnegowda
Department of Electronics and Communication Engineering, S.E.A. College of Engineering and Technology,
Visvesvaraya Technological University
Belagavi-590018, India
Email: suhashoodi@gmail.com
1. INTRODUCTION
In recent times, the demand for higher-definition video services has increased in the applications like
digital broadcast and internet streaming [1], [2]. To meet the need for transmission and storage of higher
resolution videos, a new video coding standard is developed named high efficiency video coding (HEVC)
[3]–[5]. HEVC is effectively related to the conventional video coding standards developed prior such as moving
picture experts group (MPEG)-2, H.264, and MPEG-4 part 2 [6], [7]. In HEVC, the compression improvement
is usually based on the implementation of new encoding methodologies like asymmetric motion partition, intra-
prediction modes, and so on. Among the available methodologies, the flexible quad-tree partitioning
methodology of coding tree unit (CTU) is efficient [8]. When the size of CTU is 64 × 64, the size of coding
unit (CU) is 8 × 8, 32 × 32, 64 × 64, and 16 × 16 and its depth size will be 0, 1, 2, or 3. The CUs in the CTUs
are partitioned into 4 blocks based on depth range. In HEVC, the optimum CU partition is selected according
to the rate distortion (RD) costs [9], [10]. In HEVC, CU is further-partitioned into several prediction-units
2. Int J Elec & Comp Eng ISSN: 2088-8708
Optimal coding unit decision for early termination in high … (Suhas Shankarnahalli Krishnegowda)
6379
(PUs). Hence, the optimal PUs modes are considered as the modes with minimal RD costs among several inter
and intra PUs modes. In the higher resolution videos, the mode decision process ensures compression
efficiency. In a larger space HEVC, the search for the best PU and CU decision results in high computational
time and complexity, where it limits the usage of HEVC encoders in real time applications [11]–[13].
Some of the conventional methods used for video compression are mentioned as follows;
convolutional neural networks (CNN) [14], and adaptive switching neural networks [15]. Duvar et al. [16]
presented an effective decision algorithm for reducing the encoding time of the HEVC. Initially, the intra block
similarity was carried out at the PU level by using the integral images. In addition, at the CU level, an early
termination mode was developed. Hence, the developed fast inter mode decision algorithm significantly
bypasses the PU modes in PU phase, and also removes the unnecessary controls in the CU phase at a lower
depth. In this literature study, the efficacy of the developed algorithm was investigated by means of bit rate
and peak signal to noise ratio (PSNR). The experimental outcomes demonstrated that the developed algorithm
significantly improves the coding efficacy with low system complexity. In addition, the developed algorithm
delivers a good relationship between time savings and coding efficacy related to the earlier approaches. The
negative side lobes and artifacts edges were generated in the developed algorithm that affects the system
performances. Cen et al. [17] developed a new fast CU depth decision framework for decreasing the
computational complexity in the HEVC. The developed framework includes a CU depth range determination
and a new CU depth comparison algorithm. In this study, the CU depth range was identified based on CU-
depth’s distribution in similar sequences. The experimental results confirmed that the computational
complexity of the developed framework was low compared to the existing works. The developed algorithm
lacks in retaining higher quality videos at the receiver side.
At dissimilar levels of coding abstraction, Jiang and Nooshabadi [18] presented a series of
optimization methods for multi-view HEVC. In this literature, the optimized resource scheduled wavefront
parallel processing and the quantization parameters based on the early termination of CTU were performed for
disparity estimation and parallel motion estimation. From the experimental investigation, the developed
optimization methods achieved better experimental results compared to the previous research work in light of
PSNR and bit error rate. The developed algorithm effectively reduces the system complexity, but it did not
concentrate on the major issue of poor video resolution. Bouaafia et al. [19] presented deep convolutional
neural network (DCNN) and support vector machine (SVM) in the inter mode HEVC for optimizing the
complexity allocations at the CU level. Initially, the SVM based fast CU model was developed for decreasing
the HEVC complexity, and further, the DCNN model was utilized for predicting the CU partition. The
experimental outcome indicates that the developed online SVM and DCNN models achieved better results in
light of time saving and bit rate. In contrast, the developed algorithm reduces the importance of the color
components in the compressed videos.
Ma et al. [20] introduced a new faster intra-coding algorithm for speeding up the encoding
mechanism. At first, a faster CU-sized decision model was implemented for selecting dissimilar depth decision
algorithms for every coding unit. Then, a faster directional mode decision technique was employed, which
compares the directional modes of the parent units. The best directional mode of the parent units and the RD
cost of the first directional mode was integrated for selecting the best directional mode for the current unit
efficiently. The experimental outcome represents that the developed algorithm attained good performance in
the video encoding in light of Bjontegaard delta bit rate (BDBR) and time-saving. The developed algorithm
was not able to handle massive workloads at higher speeds. Kuanar et al. [21] implemented a new CNN model
for effective CU mode selection in the HEVC. Hence, the extensive experimental investigation showed that
the developed CNN model has significantly decreased the encoding time related to other state-of-the-art
machine learning models, but it was computationally expensive.
Hassan et al. [22] developed a surgical telemonitoring system based on HEVC by implementing a
shallow CNN model. The experimental investigation confirmed that the shallow CNN model maintains higher
visual quality with a better bit rate. Compared to the state-of-the-art models, the developed shallow-based CNN
model was effective and efficient for surgical tele-monitoring systems. He et al. [23] developed a new fuzzy based
SVM classifier for improving the compression efficiency of the HEVC. In addition to this, the fuzzy based SVM
classifier was improved by utilizing the information entropy measure for solving the outliers and the negative
impact of data noise problems. However, the undertaken CNN model and the fuzzy based SVM classifier was
computationally complex and needed high-end specification systems. Imen et al. [24] integrated modified
AlexNet and modified LeNet-5 for predicting the HEVC’s CU partition. The experimental analysis states that the
developed model was computationally complex. The key contributions of this research paper are given below;
a. Proposed enhanced whale optimization algorithm (EWOA) to decrease the computational time and
complexity of the HEVC, which selects the bit streams in the luma coding tree block for effectively
determining the CU neighbors. The EWOA is effective in the optimization problems related to the
conventional optimization algorithms such as puzzle optimization algorithm [25] and stochastic Komodo
algorithm [26].
3. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 6, December 2023: 6378-6387
6380
b. Implemented discrete cosine transform (DCT) for generating the residuals by subtracting the prediction
values from the input values. The efficacy of the EWOA is analyzed in light of ∆BR, time-saving, and
∆PSNR. This paper is prepared in this manner: methodology details, results and discussion, and the
conclusion of the EWOA are depicted in sections 2 to 4, respectively.
2. RESEARCH METHOD
In this research, the efficacy of the EWOA is tested on a few online videos: PeopleOnStreet, Traffic,
kimono, ParkScene, Cactus, BQTerrace, FourPeople, PartyScene, BasketballDrive, Johnny, BasketballDrill,
BQMall, RaceHorses, BasketballPass, BQSquare, BlowingBubbles, and KristenAndSara. The sample video
frames are graphically indicated in Figure 1. The proposed framework includes three major steps like optimal
bit-streams selection in HEVC using EWOA, inter and intra prediction in HEVC, and data transformation by
DCT. The workflow of the proposed framework is represented in Figure 2.
Figure 1. Sample collected video frames
Figure 2. Workflow of the proposed framework
Initially, the frames from the videos are extracted and the separated frames are given to the HEVC for
predicting the motion from the video sequences. The basic design of the HEVC is very similar to the H.264.
In this scenario, the block-based coding approach significantly exploits both spatial and temporal statistical
dependencies. Generally, the HEVC utilizes flexible and adaptive quad-tree coding block partitions for
effective coding, transformation and prediction. The basic information about HEVC is given as follows.
2.1. Prediction structure
Generally, the quad-tree block partition works based on the CTU structure, which is more similar to
the macro-block. The sequence of frames is named a video, and in HEVC, every coded video frame is
categorized into slices and CTUs. Further, the CTUs are sub-categorized into the square regions named as CU.
4. Int J Elec & Comp Eng ISSN: 2088-8708
Optimal coding unit decision for early termination in high … (Suhas Shankarnahalli Krishnegowda)
6381
In the HEVC, the CU is predicted using inter or intra-predictions, and the 1st frame of the video sequence at
each random access point is coded utilizing intra-predictions. The residual video frames are coded by
performing inter-predictions, and further, the residual frames are transformed into transform units (TU) by
implementing the DCT algorithm. Usually, the CTUs are made up of two chroma coding tree blocks (CTB),
quadtree syntax and luma CTB, where every Chroma CTBs has the block size of (𝑁/2) × (𝑁/2) and luma
CTB has the block size of 𝑁 × 𝑁. The CTB size is the same as the size of Coding Blocks (CBs), where the
CTB has more CUs and is associated with the Tus and PUs. The inter prediction, intra prediction, and coding
modes are selected at the CU level, where 𝑁 is represented as bit-streams and it will be 64, 32, 16, and 8 bits.
In this research manuscript, the bit-streams are chosen by implementing an effective meta-heuristics
algorithm named EWOA, which generally follows the behavior of humpback whales, here, error rate is
considered as the objective function. After creating an initial population, the humpback whale improves its
location based on the encircling method, and it is mathematically defined in (1) and (2) [27], [28]. Where,
𝑡 indicates iteration number, 𝐷 indicates the distance between a prey 𝑃′(𝑡) and the humpback whale’s
position 𝑃(𝑡) and 𝐴 and 𝐵 states coefficient values, which are determined in (3) and (4).
𝐷 = |𝐵 ⊙ 𝑃′(𝑡) − 𝑃(𝑡)| (1)
𝑃(𝑡 + 1) = |𝑃′(𝑡) − 𝐴 ⊙ 𝐷| (2)
𝐴 = 2𝑙 ⊙ 𝑟 − 𝑙 (3)
𝐵 = 2𝑟 (4)
where, 𝑟 represents a random vector, which usually ranges between 0 to 1, and 𝑙 represents the linearity values,
which range between 0 to 2. On the other hand, the bubble-net method is accomplished based on shrinking
encircling and spiral updating position, as shown in (5) and (6). Where, ⊙ indicates element by element
multiplication process, 𝑏 represents constant value that is used to determine the logarithmic spiral shape,
𝑎 denotes random value, which ranges between [-1, 1], and 𝐷
́ = |𝑃′(𝑡) − 𝑃(𝑡)| represents the distance between
the humpback whales and the prey.
𝑃 (𝑡 + 1) = 𝐷
́ ⊙ 𝑒𝑏𝑎
⊙ 𝑐𝑜𝑠(2𝜋𝑎) + 𝑃′(𝑡) (5)
𝑃(𝑡 + 1) = {
𝑃′(𝑡) − 𝐴 ⊙ 𝐷 𝑖𝑓 𝑝 ≥ 0.5
𝐷
́ ⊙ 𝑒𝑏𝑎
⊙ 𝑐𝑜𝑠(2𝜋𝑎) + 𝑃′(𝑡) 𝑖𝑓 𝑝 < 0.5
(6)
where 𝑝 ∈ [0,1] indicates the probability of choosing the shrinking encircling method or spiral method to adjust
the whales' position. The humpback whales search for their prey in the exploration section. The position of the
humpback whales is updated by computing the random search agents and then finding the best search agents.
This process is mathematically indicated in (7) and (8) [29], [30].
𝐷 = |𝐵 ⊙ 𝑃𝑟𝑎𝑛𝑑 − 𝑃(𝑡)| (7)
𝑃(𝑡 + 1) = |𝑃𝑟𝑎𝑛𝑑 − 𝐴 ⊙ 𝐷| (8)
where, 𝑃𝑟𝑎𝑛𝑑 indicates random position, which is determined based on the current population. Due to the lack
of prior knowledge, updating the positions of search agents is trapped into local optima problems in the existing
WOA. Therefore, a novel cosine function is added with the control parameter 𝐴 for controlling the whale’s
position. The inclusion of cosine function in the control parameter provides a better balance of exploitation and
exploration and it is mathematically indicated in (9).
𝐴 = 1 + 0.5 × 𝑐𝑜𝑠 (𝜋
𝑡
𝐼𝑡𝑒𝑟𝑚𝑎𝑥
) (9)
During the search process, two correlation factors 𝐶𝐹1 and 𝐶𝐹2 are used for regulating the movement
of the search agents. As shown in (7) and (8) are updated as in (10) and (11). The assumed parameters of the
EWOA are: number of agents is represented as 100, 𝑡 indicates current iteration, 𝐶𝐹1 = 2.5, 𝐶𝐹2 = 1.5 and
𝐼𝑡𝑒𝑟𝑚𝑎𝑥 = 100 represents maximum iteration. Once the maximum iteration is reached, the EWOA
automatically terminates.
𝐷 = |𝐵 ⊙ 𝑃𝑟𝑎𝑛𝑑 − 𝑃(𝑡)|/𝐶𝐹1 (10)
5. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 6, December 2023: 6378-6387
6382
𝑃(𝑡 + 1) = |𝑃𝑟𝑎𝑛𝑑 − 𝐴 ⊙ 𝐷|/𝐶𝐹2 (11)
2.2. Inter-prediction in HEVC
In the HEVC, the inter-predictions support the division of prediction blocks (PBs) related to the intra-
predictions. Generally, the inter-coded PUs have numerous motion parameters that include reference image
indexes, usage flags, motion vectors, and reference image lists. The CU is indicated as one PU, while the CU
is coded with a skip model, and it has no efficient motion parameters and transformation coefficients obtained
by merging the modes. The encoder utilizes explicit transmission or merges mode of motion parameters for
every PU in the inter-coded PUs. Hence, the merged model is employed in the skip mode and inter-coded PU.
In the HEVC, the merge mode is utilized for identifying the neighbor inter-coded PUs. The inter-prediction in
HEVC has motion vectors with units of one-eight and one-quarter for determining the distance between chroma
samples and luma samples.
2.3. Intra-prediction in HEVC
In the HEVC, the intra-units generally exploit the spatial correlation of PU and its neighborhood image
pixels for effective prediction. The new features like TU, PU, CU, and CTU are defined in the HEVC for
achieving higher compression and removing spatial redundancy. On the other hand, the optimization of rate-
distortion is carried-out to identify the superiorly best prediction mode of every CU. The RD cost function of
intra-prediction in HEVC is defined in (12).
𝑅𝐷 = 𝑆𝑆𝐸 + 𝜆 × 𝑅 (12)
where, 𝑅 indicates bit-rate, 𝑆𝑆𝐸 represents the sum of squared distance between the original and reconstructed
pixels, and 𝜆 denotes the quantization parameter. Additionally, the HEVC uses a recursive structure and squad
tree for CUs splitting. Every CU is categorized into 4 PUs and further, the intra-prediction is carried-out for
every PUs. The CUs size ranges from 8 × 8 to 64 × 64 pixels and the PUs size ranges from 4 × 4 to 64 × 64
pixels. Subsequently, the HEVC performs the intra block predictions for 4 × 4 to 64 × 64 pixels. Generally,
the HEVC supports 35 intra-predictions and it includes 33 angular predictions. From the reconstructed PUs,
two reference array sets are used for intra-prediction in the HEVC. The present image pixel 𝐶𝑥,𝑦 is projected
towards the reference image pixels with a fixed displacement parameter 𝑑 that helps in defining the angularity
of vertical and horizontal prediction modes. The interpolation is carried out at an accuracy of 1/32, once the
reference samples 𝑅𝑖 and 𝑅𝑖+1 are determined and it is mathematically represented in (13).
𝐶𝑥,𝑦 = ((32 − 𝑑) × 𝑅𝑖 + 𝑑 × 𝑅𝑖+1 + 16) ≫ 5 (13)
In HEVC, the prediction of the angular modes delivers effective intra-prediction, while more edges
are presented. The DC predictions are extensively used for predicting the flat-surfaces. The block prediction is
generated by a weighted average of four reference samples in the planar prediction, which is determined in (14).
𝑃ℎ (𝑥,𝑦) = 𝑑 × (𝑥 + 1) + 𝑏 × (𝑁 − (𝑥 + 1))
𝑃𝑣 (𝑥,𝑦) = 𝑎 × (𝑦 + 1) + 𝑐 × (𝑁 − (𝑦 + 1))
𝑃𝑃𝐿 (𝑥,𝑦) = (𝑃ℎ (𝑥,𝑦) + 𝑃𝑣 (𝑥,𝑦) + 𝑁) ≫ (𝑙𝑜𝑔2 𝑁 + 1) (14)
where, a and d indicate bottom left and top right samples. In the HEVC, the filtering process is managed by
TB size and intra-prediction mode. The neighborhood samples are not filtered, when the DC-intra prediction
mode is chosen. The bi-linear filter is enabled when the distance between the horizontal/vertical mode and
intra-prediction mode is higher than that of the threshold value. The sample 35 intra-prediction modes are
represented in Figure 3.
2.4. Transformation
In the transformation procedure, the residuals are transformed into TU utilizing DCT. In video
compression, the DCT is an extensively utilized transformation technique, which is effective in energy
compaction, computation efficiency, and correlation reduction. The DCT includes 16 members, and the one-
dimensional DCT of 1 × 𝑁 vector 𝑥(𝑛) is determined in (15) and (16).
𝑌[𝑘] = 𝐶[𝑘] ∑ 𝑥[𝑛]𝑐𝑜𝑠 [
(2𝑛+1)𝑘𝜋
2𝑁
]
𝑁−1
𝑛=0 (15)
where 𝑘 = 0,1,2, … 𝑁 − 1.
6. Int J Elec & Comp Eng ISSN: 2088-8708
Optimal coding unit decision for early termination in high … (Suhas Shankarnahalli Krishnegowda)
6383
𝐶[𝑘] =
[
√
1
𝑁
𝑓𝑜𝑟 𝑘 = 0
√
1
𝑁
𝑓𝑜𝑟 𝑘 = 1,2, … 𝑁 − 1
]
(16)
Figure 3. Quad-tree structure of the CUs
The original feature vectors 𝑥(𝑛) are re-constructed from the DCT coefficients 𝑌[𝑘] utilizing the
Inverse DCT operation, and it is mathematically denoted in (17). Then, the DCT is extended to the
transformation of the image, which is achieved by computing the individual rows and columns of the two-
dimensional image. The mathematical equation of two dimensional DCT is indicated in (18) and (19).
𝑥[𝑛] = ∑ 𝐶[𝑘]𝑌[𝑘]𝑐𝑜𝑠 [
(2𝑛+1)𝑘𝜋
2𝑁
]
𝑁−1
𝑘=0 (17)
where, 𝑛 = 0,1,2, … . 𝑁 − 1
[𝑗, 𝑘] = 𝐶[𝑗]𝐶[𝑘] ∑ ∑ 𝑥[𝑚, 𝑛] 𝑐𝑜𝑠 (
(2𝑚+1)𝑗𝜋
2𝑁
)
𝑁−1
𝑛=0
𝑁−1
𝑚=0 𝑐𝑜𝑠 (
(2𝑛+1)𝑘𝜋
2𝑁
) (18)
where the size of the image is represented as 𝑥(𝑛1, 𝑛2), and 𝑗, 𝑘, 𝑚, 𝑛 = 0,1,2, … 𝑁 − 1.
𝐶[𝑗] 𝑎𝑛𝑑 𝐶[𝑘] =
[
√
1
𝑁
𝑓𝑜𝑟 𝑗, 𝑘 = 0
√
1
𝑁
𝑓𝑜𝑟 𝑗, 𝑘 = 1,2, … 𝑁 − 1
]
(19)
Correspondingly, the two-dimensional inverse DCT is determined in (20).
𝑥[𝑚, 𝑛] = ∑ ∑ 𝐶[𝑗]𝐶[𝑘]𝑌[𝑗, 𝑘] 𝑐𝑜𝑠 (
(2𝑚+1)𝑗𝜋
2𝑁
)
𝑁−1
𝑘=0
𝑁−1
𝑗=0 𝑐𝑜𝑠 (
(2𝑛+1)𝑘𝜋
2𝑁
) (20)
The DCT represented in (18) and (20) are orthonormal and perfectly reconstructed the coefficients for
achieving infinite precision. At last, the reconstructed samples are achieved from the inverse transformation,
and the reconstructed CTUs are arranged for constructing a final image. The experimental results of the EWOA
are depicted in section 3.
3. RESULTS AND DISCUSSION
In this research, the EWOA is implemented using MATLAB R2020a software. The simulation is
performed with an i7 processor system with 8 GB random access memory, and 1 TB hard disk. This research
study mainly uses HEVC/H.265 for motion estimation. The performance of the EWOA is analyzed in light of
∆BR, time saving, and ∆PSNR. Additionally, the effectiveness of the EWOA is compared to the prior research
model: online SVM+DCNN [19]. The most crucial performance measures of fast encoding: time saving ∆𝑇
and ∆𝐵𝑅 are mathematically denoted in (21) and (22).
7. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 6, December 2023: 6378-6387
6384
∆𝑇 =
𝑇𝑝−𝑇𝑂
𝑇𝑂
× 100 (21)
∆𝑅 =
𝑅𝑝−𝑅𝑜
𝑅𝑂
× 100 (22)
where, 𝑇𝑝 and 𝑅𝑝 are denoted as computational time and bit rate of the EWOA, 𝑇𝑂 and 𝑅𝑂 are stated as
computational time and bit rate of the existing models. Similarly, PSNR is utilized for measuring the quality
of original and compressed frames which is mathematically denoted in (23). Where, the PSNR value of the
EWOA is indicated as 𝑃𝑆𝑁𝑅𝑝 and the PSNR value of the existing model is specified as 𝑃𝑆𝑁𝑅𝑜.
∆𝑃𝑆𝑁𝑅 = 𝑃𝑆𝑁𝑅𝑝 − 𝑃𝑆𝑁𝑅𝑜 (𝑑𝐵) (23)
3.1. Quantitative analysis
By viewing Table 1, the effectiveness of the EWOA is validated with the existing models such as
deep CNN [19], online SVM [19] and conventional WOA by means of 𝛥𝐵𝑅. Here, the EWOA’s performance
is evaluated on seventeen real time videos. From the experimental investigation, the overall performance shows
that the EWOA outperforms the online SVM [19], deep CNN [19] and WOA in terms of 𝛥𝐵𝑅, as shown in
Table 1. It implies that the EWOA is more robust in diminishing complexity of the inter-mode HEVC related
to other models online SVM [19], deep CNN [19], and WOA. Correspondingly, in Table 2, the experimental
investigation of the EWOA is done in terms of ∆PSNR value. From the inspection, ∆PSNR value of the EWOA
is higher than the prior models: deep CNN [19], online SVM [19] and WOA. In this scenario, the EWOA
almost showed 0.006-0.012 dB value higher than the existing models in the real time videos.
Table 1. Performance investigation of the EWOA and the existing models in light of Δ𝐵𝑅
ΔBR (%)
Videos Deep CNN [19] Online SVM [19] WOA EWOA
PeopleOnStreet 0.57200 2.23500 0.55734 0.51824
Traffic -0.57100 2.02200 -0.58390 -0.71332
Kimono 0.72800 0.44900 0.71962 0.62093
ParkScene -0.40100 0.79000 -0.41199 -0.87320
Cactus 0.41200 0.71700 0.40141 0.29312
BQTerrace -2.60600 0.32800 -2.61793 -2.87302
BasketballDrive 1.13000 0.58300 1.11834 1.00923
BasketballDrill -0.04400 1.44000 -0.04866 -0.08392
BQMall 1.01900 3.31300 1.00660 -0.30293
PartyScene -0.70900 2.41500 -0.72106 -0.89203
RaceHorses 0.52900 2.65000 0.52264 0.33025
BasketballPass 0.69000 3.16700 0.68472 0.62039
BQSquare -2.64700 3.69200 -2.65648 -2.77823
BlowingBubbles -0.32700 2.20700 -0.32899 -0.30977
FourPeople -0.65900 1.01700 -0.66884 -0.67898
Johnny -0.36100 1.72900 -0.37648 -1.09797
KristenAndSara -1.02600 1.35700 -1.04073 -1.20393
Table 2. Performance investigation of the EWOA and the existing models in light of ∆PSNR
∆PSNR (dB)
Videos Deep CNN [19] Online SVM [19] WOA EWOA
PeopleOnStreet -0.03000 -0.05500 -0.03703 -0.02883
Traffic -0.06100 -0.04780 -0.06254 -0.05009
Kimono -0.02500 -0.02000 -0.03453 -0.02456
ParkScene -0.05200 -0.02300 -0.05741 -0.04565
Cactus -0.05200 -0.01900 -0.05880 -0.04446
BQTerrace -0.07100 -0.02200 -0.07137 -0.06754
BasketballDrive -0.03100 -0.02300 -0.03909 -0.02157
BasketballDrill -0.05600 -0.04700 -0.06349 -0.04876
BQMall -0.05100 -0.05800 -0.05220 -0.04267
PartyScene -0.08500 -0.06200 -0.09025 -0.07066
RaceHorses -0.04000 -0.05300 -0.04326 -0.03355
BasketballPass -0.05400 -0.06700 -0.05946 -0.04953
BQSquare -0.16200 -0.08600 -0.16599 -0.12577
BlowingBubbles -0.07100 -0.06700 -0.07515 -0.05518
FourPeople -0.06200 -0.01400 -0.06455 -0.05459
Johnny -0.12400 -0.03600 -0.12421 -0.10467
KristenAndSara -0.07000 -0.02800 -0.07924 -0.06965
8. Int J Elec & Comp Eng ISSN: 2088-8708
Optimal coding unit decision for early termination in high … (Suhas Shankarnahalli Krishnegowda)
6385
As represented in Table 3, seventeen online real time videos are utilized for investigating the
effectiveness of the EWOA. In Table 3, the EWOA’s efficacy is validated in light of time saving ΔT. From the
inspection, the EWOA achieved better results compared to the online SVM [19], deep CNN [19] and WOA in
light of time saving ΔT on the real time videos, which is the major problem highlighted in the literature section.
Table 3. Performance investigation of the EWOA and the existing models in light of ∆T
ΔT (%)
Videos Deep CNN [19] Online SVM [19] WOA EWOA
PeopleOnStreet -50.67000 -56.56000 -51.64113 -49.73822
Traffic -57.90000 -58.07000 -59.64715 -56.20333
Kimono -43.26000 -44.18000 -44.21259 -42.48243
ParkScene -64.14000 -52.60000 -65.91336 -63.93794
Cactus -52.57000 -41.38000 -53.87029 -51.08473
BQTerrace -58.43000 -41.45000 -59.64306 -57.46738
BasketballDrive -51.30000 -51.17000 -52.69093 -51.84837
BasketballDrill -53.54000 -55.87000 -54.24158 -52.46364
BQMall -52.25000 -55.96000 -53.06197 -52.84949
PartyScene -51.54000 -52.93000 -52.63542 -50.63434
RaceHorses -42.22000 -51.25000 -42.91538 -40.93940
BasketballPass -52.42000 -55.49000 -53.54240 -51.54234
BQSquare -52.79000 -55.92000 -54.45821 -51.44434
BlowingBubbles -46.55000 -51.87000 -47.48938 -44.48394
FourPeople -67.54000 -51.90000 -68.56404 -66.56344
Johnny -69.66000 -60.12000 -70.43801 -68.43444
KristenAndSara -67.20000 -53.57000 -68.15833 -65.15849
3.2. Discussion
In the present decade, the HEVC has better coding efficiency, because of the rapid growth of video
coding technology. The encoding complexity is increased in the HEVC, while improving the performance of
RD. In addition, the emerging HEVC uses new coding structures, which are characterized by TU, PU and CU.
It enhances the coding efficiency superiorly, but increases computational complexity on the decision of optimal
TU, PU, and CU sizes. Computational complexity remains a vital problem, and it should be considered in the
optimization task. As discussed in the previous sections, we proposed a EWOA with quad tree coding and DCT
for fast CU partition that superiorly decreases the complexity of HEVC at inter-mode. The proposed framework
achieved a good trade-off between the RD performance and complexity reduction. The EWOA with quad tree
coding and DCT not only predicts the HEVC CU partition at inter-mode and reduces the HEVC complexity
with minimal error value. In this manuscript, almost seventeen online real time videos are used for analyzing
the effectiveness of the proposed framework in light of PSNR, time saving ∆𝑇 and ∆𝐵𝑅.
4. CONCLUSION
In this study, an efficient video compression is achieved by implementing HEVC with an optimization
algorithm: EWOA. The EWOA is utilized for estimating the motions from the video sequences. In the proposed
framework, the quad tree coding block is employed for partitioning the structures, and DCT is applied to the
extracted video frames for improving the coding efficiency. The performance of the EWOA is investigated by
comparing the input video sequences with decompressed video sequences in terms of ΔBR, ΔT, and ΔPSNR.
The simulation analysis concluded that the EWOA attained better performance in the video compression, and
it showed 0.006-0.012 dB higher PSNR than the existing models in the real time videos like asketballPass,
BQTerrace, BasketballDrive, RaceHorses, BQMall, BlowingBubbles, Cactus, FourPeople, PartyScene,
PeopleOnStreet, Johnny, Kimono, KristenAndSara, Traffic, ParkScene, BasketballDrill, and BQSquare. The
proposed framework is specifically used for surveillance or conversational videos that largely reduces the
bandwidth without degrading the visual quality. The future studies will focus on video coding optimization or
perceptual based medical image. Additionally, a novel algorithm can be developed for fast mode selection
based on the pattern directions of the neighboring PU.
AUTHOR CONTRIBUTIONS
The paper conceptualization, methodology, software, validation, formal analysis, investigation,
resources, data curation, writing—original draft preparation, writing—review and editing, visualization, have
been done by 1st
author. The supervision and project administration, have been done by 2nd
author.
9. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 6, December 2023: 6378-6387
6386
REFERENCES
[1] W. El-Shafai, I. M. Almomani, and A. Alkhayer, “Optical bit-plane-based 3D-JST cryptography algorithm with cascaded 2D-FrFT
encryption for efficient and secure HEVC communication,” IEEE Access, vol. 9, pp. 35004–35026, 2021, doi:
10.1109/ACCESS.2021.3062403.
[2] Y. Zhang, C. Zhang, R. Fan, S. Ma, Z. Chen, and C.-C. J. Kuo, “Recent advances on HEVC inter-frame coding: From optimization
to implementation and beyond,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 30, no. 11, pp. 4321–4339,
Nov. 2020, doi: 10.1109/TCSVT.2019.2954474.
[3] A. A. Elrowayati, M. A. Alrshah, M. F. L. Abdullah, and R. Latip, “HEVC watermarking techniques for authentication and
copyright applications: Challenges and opportunities,” IEEE Access, vol. 8, pp. 114172–114189, 2020, doi:
10.1109/ACCESS.2020.3004049.
[4] W. Zhu, Y. Yi, H. Zhang, P. Chen, and H. Zhang, “Fast mode decision algorithm for HEVC intra coding based on texture
partition and direction,” Journal of Real-Time Image Processing, vol. 17, no. 2, pp. 275–292, Apr. 2020, doi: 10.1007/s11554-
018-0766-z.
[5] J.-K. Lee, N. Kim, S. Cho, and J.-W. Kang, “Deep video prediction network-based inter-frame coding in HEVC,” IEEE Access,
vol. 8, pp. 95906–95917, 2020, doi: 10.1109/ACCESS.2020.2993566.
[6] D. Xu, “Commutative encryption and data hiding in HEVC video compression,” IEEE Access, vol. 7, pp. 66028–66041, 2019, doi:
10.1109/ACCESS.2019.2916484.
[7] X. Sun, H. Ma, W. Zuo, and M. Liu, “Perceptual-based HEVC intra coding optimization using deep convolution networks,” IEEE
Access, vol. 7, pp. 56308–56316, 2019, doi: 10.1109/ACCESS.2019.2910245.
[8] H. Huang, I. Schiopu, and A. Munteanu, “Frame-wise CNN-based filtering for intra-frame quality enhancement of HEVC videos,”
IEEE Transactions on Circuits and Systems for Video Technology, vol. 31, no. 6, pp. 2100–2113, Jun. 2021, doi:
10.1109/TCSVT.2020.3018230.
[9] A. Mercat, A. Makinen, J. Sainio, A. Lemmetti, M. Viitanen, and J. Vanne, “Comparative rate-distortion-complexity analysis of
VVC and HEVC video codecs,” IEEE Access, vol. 9, pp. 67813–67828, 2021, doi: 10.1109/ACCESS.2021.3077116.
[10] M. Saldanha, G. Sanchez, C. Marcon, and L. Agostini, “Fast 3D-HEVC depth map encoding using machine learning,” IEEE
Transactions on Circuits and Systems for Video Technology, vol. 30, no. 3, pp. 850–861, Mar. 2020, doi:
10.1109/TCSVT.2019.2898122.
[11] X. Sun, X. Yang, S. Wang, and M. Liu, “Content-aware rate control scheme for HEVC based on static and dynamic saliency
detection,” Neurocomputing, vol. 411, pp. 393–405, Oct. 2020, doi: 10.1016/j.neucom.2020.06.003.
[12] J. Lin, D. Liu, H. Yang, H. Li, and F. Wu, “Convolutional neural network-based block up-sampling for HEVC,” IEEE Transactions
on Circuits and Systems for Video Technology, vol. 29, no. 12, pp. 3701–3715, Dec. 2019, doi: 10.1109/TCSVT.2018.2884203.
[13] M. Zhou et al., “SSIM-based global optimization for CTU-level rate control in HEVC,” IEEE Transactions on Multimedia, vol. 21,
no. 8, pp. 1921–1933, Aug. 2019, doi: 10.1109/TMM.2019.2895281.
[14] Y. Zhang, T. Shen, X. Ji, Y. Zhang, R. Xiong, and Q. Dai, “Residual highway convolutional neural networks for in-loop filtering
in HEVC,” IEEE Transactions on Image Processing, vol. 27, no. 8, pp. 3827–3841, Aug. 2018, doi: 10.1109/TIP.2018.2815841.
[15] W. Lin et al., “Partition-aware adaptive switching neural networks for post-processing in HEVC,” IEEE Transactions on
Multimedia, vol. 22, no. 11, pp. 2749–2763, Nov. 2020, doi: 10.1109/TMM.2019.2962310.
[16] R. Duvar, O. Akbulut, and O. Urhan, “Fast inter mode decision exploiting intra-block similarity in HEVC,” Signal Processing:
Image Communication, vol. 78, pp. 503–510, Oct. 2019, doi: 10.1016/j.image.2019.08.010.
[17] Y.-F. Cen, W.-L. Wang, and X.-W. Yao, “A fast CU depth decision mechanism for HEVC,” Information Processing Letters,
vol. 115, no. 9, pp. 719–724, Sep. 2015, doi: 10.1016/j.ipl.2015.04.001.
[18] C. Jiang and S. Nooshabadi, “Multi-level complexity reduction for HEVC multiview coding,” Journal of Real-Time Image
Processing, vol. 17, no. 2, pp. 197–213, Apr. 2020, doi: 10.1007/s11554-018-0757-0.
[19] S. Bouaafia, R. Khemiri, F. E. Sayadi, and M. Atri, “Fast CU partition-based machine learning approach for reducing HEVC
complexity,” Journal of Real-Time Image Processing, vol. 17, no. 1, pp. 185–196, Feb. 2020, doi: 10.1007/s11554-019-00936-0.
[20] Y. Ma, Z. Liu, X. Wang, and S. Cao, “Fast intra coding based on CU size decision and direction mode decision for HEVC,”
Multimedia Tools and Applications, vol. 77, no. 12, pp. 14907–14929, Jun. 2018, doi: 10.1007/s11042-017-5074-2.
[21] S. Kuanar, K. R. Rao, M. Bilas, and J. Bredow, “Adaptive CU mode selection in HEVC intra prediction: A deeplearning approach,”
Circuits, Systems, and Signal Processing, vol. 38, no. 11, pp. 5081–5102, Nov. 2019, doi: 10.1007/s00034-019-01110-4.
[22] A. Hassan, M. Ghafoor, S. A. Tariq, T. Zia, and W. Ahmad, “High efficiency video coding (HEVC)–based surgical telementoring
system using shallow convolutional neural network,” Journal of Digital Imaging, vol. 32, no. 6, pp. 1027–1043, Dec. 2019, doi:
10.1007/s10278-019-00206-2.
[23] S. He, Z. Deng, and C. Shi, “Fast decision algorithm of CU size for HEVC intra-prediction based on a kernel fuzzy SVM classifier,”
Electronics, vol. 11, no. 17, Sep. 2022, doi: 10.3390/electronics11172791.
[24] W. Imen, M. Amna, B. Fatma, S. F. Ezahra, and N. Masmoudi, “Fast HEVC intra-CU decision partition algorithm with modified
LeNet-5 and AlexNet,” Signal, Image and Video Processing, vol. 16, no. 7, pp. 1811–1819, Oct. 2022, doi: 10.1007/s11760-022-
02139-w.
[25] F. A. Zeidabadi and M. Dehghani, “POA: Puzzle optimization algorithm,” International Journal of Intelligent Engineering and
Systems, vol. 15, no. 1, pp. 273–280, Feb. 2022, doi: 10.22266/ijies2022.0228.25.
[26] P. D. Kusuma and M. Kallista, “Stochastic komodo algorithm,” International Journal of Intelligent Engineering and Systems,
vol. 15, no. 4, pp. 156–166, Aug. 2022, doi: 10.22266/ijies2022.0831.15.
[27] N. Rana, M. S. A. Latiff, S. M. Abdulhamid, and H. Chiroma, “Whale optimization algorithm: a systematic review of contemporary
applications, modifications and developments,” Neural Computing and Applications, vol. 32, no. 20, pp. 16245–16277, Oct. 2020,
doi: 10.1007/s00521-020-04849-z.
[28] S. Chakraborty, A. K. Saha, R. Chakraborty, and M. Saha, “An enhanced whale optimization algorithm for large scale optimization
problems,” Knowledge-Based Systems, vol. 233, Dec. 2021, doi: 10.1016/j.knosys.2021.107543.
[29] Q.-V. Pham, S. Mirjalili, N. Kumar, M. Alazab, and W.-J. Hwang, “Whale optimization algorithm with applications to resource
allocation in wireless networks,” IEEE Transactions on Vehicular Technology, vol. 69, no. 4, pp. 4285–4297, Apr. 2020, doi:
10.1109/TVT.2020.2973294.
[30] Z. Yan, J. Zhang, J. Zeng, and J. Tang, “Nature-inspired approach: An enhanced whale optimization algorithm for global
optimization,” Mathematics and Computers in Simulation, vol. 185, pp. 17–46, Jul. 2021, doi: 10.1016/j.matcom.2020.12.008.
10. Int J Elec & Comp Eng ISSN: 2088-8708
Optimal coding unit decision for early termination in high … (Suhas Shankarnahalli Krishnegowda)
6387
BIOGRAPHIES OF AUTHORS
Suhas Shankarnahalli Krishnegowda is a research scholar in VTU Belgaum.
Received his BE (Electronics and Communication Engineering) from SEA College of
Engineering and Technology, Visvesvaraya Technological University, Belgaum, Karnataka,
India, in the year 2011 and completed his MTech (Digital Electronics) from Srinivas Institute
of Technology, Mangalore, affiliated to Visvesvaraya Technological University, Belgaum,
Karnataka, India, in the year of 2013 and since then he is actively involved in teaching and
research and has ten years of experience in teaching. He is persuing PhD (ECE) from VTU.
At present, he is working as Assistant Professor in South East Asian college of Engineering
and Technology, Bangalore Affiliated to Visveswaraya Technological University. His area of
interest is in the field of signal processing, bio medical signal processing, and communication
system. He can be contacted at email: suhashoodi@gmail.com.
Hosanna Princye Periapandi received his BE(E&I) from Sapthagiri College of
Engineering from Periyar University, Tamilnadu, India in the year 2002 and completed his
Masters in Engineering from Anna University in the year of 2004 and since then actively
involved in teaching and research and has thirteen years of experience in Teaching. She
obtained his PhD in the field of Information and Communication Engineering from Anna
University in the year of 2018. At Present, she is working as an Associate Professor in SEA
College of Engineering and Technology, Bangalore affiliated to Visveswaraya Technological
University, her area of interest is the field of medical image processing, signal processing and
VLSI. She can be contacted at email: hprincye@gmail.com.