We evaluate the performance of three state of the art video codecs on synthetic videos. The evaluation is
based on both subjective and objective quality metrics. The subjective quality of the compressed video
sequences is evaluated using the Double Stimulus Impairment Scale (DSIS) assessment metric while the
Peak Signal-to-Noise Ratio (PSNR) is used for the objective evaluation. An extensive number of
experiments are conducted to study the effect of frame rate and resolution on codecs’ performance for
synthetic videos. The evaluation results show that video codecs respond in different ways to frame rate and
frame resolution change. H.264 shows superior capabilities compared to other codecs. Mean Opinion
Score (MOS) results are shown for various bitrates, frame rates and frame resolutions.
A REAL-TIME H.264/AVC ENCODER&DECODER WITH VERTICAL MODE FOR INTRA FRAME AND ...csandit
The video coding standards are being developed to satisfy the requirements of applications for
various purposes, better picture quality, higher coding efficiency, and more error robustness.
The new international video coding standard H.264 /AVC aims at having significant
improvements in coding efficiency, and error robustness in comparison with the previous
standards such as MPEG-2, H261, H263,and H264. Video stream needs to be processed from
several steps in order to encode and decode the video such that it is compressed efficiently with
available limited resources of hardware and software. All advantages and disadvantages of
available algorithms should be known to implement a codec to accomplish final requirement.
The purpose of this project is to implement all basic building blocks of H.264 video encoder and
decoder. The significance of the project is the inclusion of all components required to encode
and decode a video in MatLab .
Video Compression Algorithm Based on Frame Difference Approaches ijsc
The huge usage of digital multimedia via communications, wireless communications, Internet, Intranet and cellular mobile leads to incurable growth of data flow through these Media. The researchers go deep in developing efficient techniques in these fields such as compression of data, image and video. Recently, video compression techniques and their applications in many areas (educational, agriculture, medical …) cause this field to be one of the most interested fields. Wavelet transform is an efficient method that can be used to perform an efficient compression technique. This work deals with the developing of an efficient video compression approach based on frames difference approaches that concentrated on the calculation of frame near distance (difference between frames). The
selection of the meaningful frame depends on many factors such as compression performance, frame details, frame size and near distance between frames. Three different approaches are applied for removing the lowest frame difference. In this paper, many videos are tested to insure the efficiency of this technique, in addition a good performance results has been obtained.
Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...Ijripublishers Ijri
Global interconnect planning becomes a challenge as semiconductor technology continuously scales. Because of the increasing wire resistance and higher capacitive coupling in smaller features, the delay of global interconnects becomes large compared with the delay of a logic gate, introducing a huge performance gap that needs to be resolved A novel equalized global link architecture and driver– receiver co design flow are proposed for high-speed and low-energy on-chip communication by utilizing a continuous-time linear equalizer (CTLE). The proposed global link is analyzed using a linear system method, and the formula of CTLE eye opening is derived to provide high-level design guidelines and insights.
Compared with the separate driver–receiver design flow, over 50% energy reduction is observed.
Analysis of Impact of Channel Error Rate on Average PSNR in Multimedia TrafficIOSR Journals
Abstract : The performance of the multimedia traffic in Ad-Hoc networks is highly impacted with the Signal to Noise Ratio. The Average PSNR (Peak Signal to Noise Ratio) is an important parameter for the evaluation of multimedia traffic in Ad-Hoc Networks. With the increase of bandwidth of the channels, it becomes necessary to take care of other network parameters like PSNR and ASNR( Average Signal to Noise Ratio) .Enhanced bandwidth with higher channel error rates demand a careful analysis of signal to noise ratio for optimum performance. In this paper, we have evaluated the effect of channel error rate on Average PSNR for the MPEG-4 traffic in Ad-hoc Networks. Keywords: MANETs, Evalvid, MPEG-4, Fragmentation, PSNR
A REAL-TIME H.264/AVC ENCODER&DECODER WITH VERTICAL MODE FOR INTRA FRAME AND ...csandit
The video coding standards are being developed to satisfy the requirements of applications for
various purposes, better picture quality, higher coding efficiency, and more error robustness.
The new international video coding standard H.264 /AVC aims at having significant
improvements in coding efficiency, and error robustness in comparison with the previous
standards such as MPEG-2, H261, H263,and H264. Video stream needs to be processed from
several steps in order to encode and decode the video such that it is compressed efficiently with
available limited resources of hardware and software. All advantages and disadvantages of
available algorithms should be known to implement a codec to accomplish final requirement.
The purpose of this project is to implement all basic building blocks of H.264 video encoder and
decoder. The significance of the project is the inclusion of all components required to encode
and decode a video in MatLab .
Video Compression Algorithm Based on Frame Difference Approaches ijsc
The huge usage of digital multimedia via communications, wireless communications, Internet, Intranet and cellular mobile leads to incurable growth of data flow through these Media. The researchers go deep in developing efficient techniques in these fields such as compression of data, image and video. Recently, video compression techniques and their applications in many areas (educational, agriculture, medical …) cause this field to be one of the most interested fields. Wavelet transform is an efficient method that can be used to perform an efficient compression technique. This work deals with the developing of an efficient video compression approach based on frames difference approaches that concentrated on the calculation of frame near distance (difference between frames). The
selection of the meaningful frame depends on many factors such as compression performance, frame details, frame size and near distance between frames. Three different approaches are applied for removing the lowest frame difference. In this paper, many videos are tested to insure the efficiency of this technique, in addition a good performance results has been obtained.
Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...Ijripublishers Ijri
Global interconnect planning becomes a challenge as semiconductor technology continuously scales. Because of the increasing wire resistance and higher capacitive coupling in smaller features, the delay of global interconnects becomes large compared with the delay of a logic gate, introducing a huge performance gap that needs to be resolved A novel equalized global link architecture and driver– receiver co design flow are proposed for high-speed and low-energy on-chip communication by utilizing a continuous-time linear equalizer (CTLE). The proposed global link is analyzed using a linear system method, and the formula of CTLE eye opening is derived to provide high-level design guidelines and insights.
Compared with the separate driver–receiver design flow, over 50% energy reduction is observed.
Analysis of Impact of Channel Error Rate on Average PSNR in Multimedia TrafficIOSR Journals
Abstract : The performance of the multimedia traffic in Ad-Hoc networks is highly impacted with the Signal to Noise Ratio. The Average PSNR (Peak Signal to Noise Ratio) is an important parameter for the evaluation of multimedia traffic in Ad-Hoc Networks. With the increase of bandwidth of the channels, it becomes necessary to take care of other network parameters like PSNR and ASNR( Average Signal to Noise Ratio) .Enhanced bandwidth with higher channel error rates demand a careful analysis of signal to noise ratio for optimum performance. In this paper, we have evaluated the effect of channel error rate on Average PSNR for the MPEG-4 traffic in Ad-hoc Networks. Keywords: MANETs, Evalvid, MPEG-4, Fragmentation, PSNR
Analyzing Video Streaming Quality by Using Various Error Correction Methods o...IJERA Editor
Transmission video over ad hoc networks has become one of the most important and interesting subjects of study for researchers and programmers because of the strong relationship between video applications and frequent users of various mobile devices, such as laptops, PDAs, and mobile phones in all aspects of life. However, many challenges, such as packet loss, congestion (i.e., impairments at the network layer), multipath fading (i.e., impairments at the physical layer) [1], and link failure, exist in transferring video over ad hoc networks; these challenges negatively affect the quality of the perceived video [2].This study has investigated video transfer over ad hoc networks. The main challenges of transferring video over ad hoc networks as well as types of errors that may occur during video transmission, various types of video mechanisms, error correction methods, and different Quality of Service (QoS) parameters that affect the quality of the received video are also investigated.
Design of Optimal Linear Phase FIR High Pass Filter using Improved Particle S...IDES Editor
This paper presents a novel approach for designing
a linear phase digital high pass FIR filter using Improved
Particle Swarm Optimization (IPSO) algorithm. Design of
FIR filter is a multi-modal optimization problem. The
conservative gradient based optimization techniques are not
efficient for digital filter design. Given the specifications for
the filters to be realized, IPSO algorithm generates a set of
optimal filter coefficients and tries to meet the ideal frequency
response characteristics. This paper presents the realization
of the optimal FIR high pass filter of filter order 20 as per
given problem statements. The simulation results have been
compared to those obtained from well accepted classical
algorithms like Park and McClellan algorithm (PM), and
evolutionary algorithms like genetic algorithm (GA) and
particle swarm optimization (PSO). The results rationalize
that the proposed optimal filter design approach using IPSO
outperforms PM, RGA, PSO in the accuracy of the designed
filter, as well as in the convergence speed and solution quality
A Hybrid DWT-SVD Method for Digital Video Watermarking Using Random Frame Sel...researchinventy
Digital watermarking refers to embedding watermarks in a multimedia documents and files in order to protect them from illegal copying and identifying manipulations. We proposed a technique is Hybrid DWTSVD method for digital video watermarking. This paper presents a robust and secure watermarking algorithm for video sequence. Proposed scheme divide the video in to frames and then blue channel is used for watermarking insertion. Discrete wavelet transform is used for watermark embedding. Watermark is inserted in to mid frequency component for better resistance to video manipulation operation. PSNR and MSE are computed for testing the proposed method
PRACTICAL APPROACHES TO TARGET DETECTION IN LONG RANGE AND LOW QUALITY INFRAR...sipij
It is challenging to detect vehicles in long range and low quality infrared videos using deep learning
techniques such as You Only Look Once (YOLO) mainly due to small target size. This is because small
targets do not have detailed texture information. This paper focuses on practical approaches for target
detection in infrared videos using deep learning techniques. We first investigated a newer version of You
Only Look Once (YOLO v4). We then proposed a practical and effective approach by training the YOLO
model using videos from longer ranges. Experimental results using real infrared videos ranging from 1000
m to 3500 m demonstrated huge performance improvements. In particular, the average detection
percentage over the six ranges of 1000 m to 3500 m improved from 54% when we used the 1500 m videos
for training to 95% if we used the 3000 m videos for training.
International Journal of Engineering Research and Development (IJERD)IJERD Editor
journal publishing, how to publish research paper, Call For research paper, international journal, publishing a paper, IJERD, journal of science and technology, how to get a research paper published, publishing a paper, publishing of journal, publishing of research paper, reserach and review articles, IJERD Journal, How to publish your research paper, publish research paper, open access engineering journal, Engineering journal, Mathemetics journal, Physics journal, Chemistry journal, Computer Engineering, Computer Science journal, how to submit your paper, peer reviw journal, indexed journal, reserach and review articles, engineering journal, www.ijerd.com, research journals,
yahoo journals, bing journals, International Journal of Engineering Research and Development, google journals, hard copy of journal
The proposed scheme embedded the watermark during the differential pulse code modulation process and extracted through decoding the entropy details. This technique utilize the moving picture expert groups standard (MPEG-2) in which discrete cosine transform coefficients are adjusted from selected instantaneous decoder refresh frames for watermarking purpose. The subsets of frames as candidate I-frames are chosen to achieve better perceptibility and robustness. A secret key based cryptographic technique is used to select the candidate frames. Three more keys are required to extract the watermark whereas one of the key is used to stop the extraction process and the remaining two are used to display the scrambled watermark. The toughness is evaluated by testing spatial and temporal synchronization attacks. High sturdiness is achieved against video specific attacks frequently occurs in the real world. Even a single frame can accommodate thousand of watermark bits which reflect that high watermark capacity can be obtained.
Robust Video Watermarking Scheme Based on Intra-Coding Process in MPEG-2 Style IJECEIAES
The proposed scheme implemented a semi blind digital watermarking method for video exploiting MPEG-2 standard. The watermark is inserted into selected high frequency coefficients of plain types of discrete cosine transform blocks instead of edge and texture blocks during intra coding process. The selection is essential because the error in such type of blocks is less sensitive to human eyes as compared to other categories of blocks. Therefore, the perceptibility of watermarked video does not degraded sharply. Visual quality is also maintained as motion vectors used for generating the motion compensated images are untouched during the entire watermarking process. Experimental results revealed that the scheme is not only robust to re-compression attack, spatial synchronization attacks like cropping, rotation but also strong to temporal synchronization attacks like frame inserting, deleting, swapping and averaging. The superiority of the anticipated method is obtaining the best sturdiness results contrast to the recently delivered schemes.
Audio Steganography Coding Using the Discreet Wavelet TransformsCSCJournals
The performance of audio steganography compression system using discreet wavelet transform (DWT) is investigated. Audio steganography coding is the technology of transforming stego-speech into efficiently encoded version that can be decoded in the receiver side to produce a close representation of the initial signal (non compressed). Experimental results prove the efficiency of the used compression technique since the compressed stego-speech are perceptually intelligible and indistinguishable from the equivalent initial signal, while being able to recover the initial stego-speech with slight degradation in the quality .
Dynamic Threshold in Clip Analysis and RetrievalCSCJournals
Key frame extraction can be helpful in video summarization, analysis, indexing, browsing, and retrieval. Clip analysis of key frame sequences is an open research issues. The paper deals with identification and extraction of key frames using dynamic threshold followed by video retrieval. The number of key frames to be extracted for each shot depends on the activity details of the shot. This system uses the statistics of comparison between the successive frames within a level extracted on the basis of color histograms and dynamic threshold. Two program interfaces are linked for clip analysis and video indexing and retrieval using entropy. The results using proposed system on few video sequences are tested and the extracted key frames and retrieved results are shown.
H.264/AVCis currently the most widely adopted video coding standard due to its high compression capability and flexibility. However, compressed videos are highly vulnerable to channel errors which may result in severe quality degradation of a video. This paper presentsa concealment aware Unequal Error Protection (UEP) scheme for H.264 video compression using Reed Solomon (RS) codes. The proposed UEP technique assigns a code rate to each Macroblock (MB) based on the type of concealment and a Concealment Dependent Index (CDI). Two interleaving techniques, namely Frame Level Interleaving (FLI) and Group Level Interleaving (GLI) have also been employed. Finally, prioritised concealment is applied in cases where error correction is beyond the capability of the RS decoder. Simulation results have demonstrated that the proposed framework provides an average gain of 2.96 dB over a scheme that used Equal Error Protection (EEP).
Quality Assessment for Recognition and Task-based multimedia applications (QART)Mikołaj Leszczuk
Users of video to perform tasks require sufficient video quality to recognize the information needed for their application. Therefore, the fundamental measure of video quality in these applications is the success rate of these tasks (such as recognition), which is referred to as visual intelligibility or acuity. One of the major causes of reduction of visual intelligibility is loss of data, through various forms of compression. Additionally, the characteristics of the scene being captured have a direct effect on visual intelligibility and on the performance of a compression operation-specifically, the size of the target of interest, the lighting conditions, and the temporal complexity of the scene. The QART project is performing a series of tests to study the effects and interactions of compression and scene characteristics. An additional goal is to test existing or develop new objective measurements that will predict the results of the subjective tests of visual intelligibility.
PERFORMANCE EVALUATION OF H.265/MPEG-HEVC, VP9 AND H.264/MPEGAVC VIDEO CODINGijma
This study evaluates the performance of the three latest video codecs H.265/MPEG-HEVC, H.264/MPEGAVC
and VP9. The evaluation is based on both subjective and objective quality metrics. The assessment
metric Double Stimulus Impairment Scale (DSIS) is used to evaluate the subjective quality of the
compressed video sequences. The Peak Signal-to-Noise Ratio (PSNR) metricis used for the objective
evaluation. Moreover, this work studies the effect of frame rate and resolution on the encoders’
performance. The extensive number of experiments are conducted with similar encoding configurations for
the three studied encoders. The evaluation results show that H.265/MPEG-HEVC provides superior bitrate
saving capabilities compared to H.264 and VP9. However, VP9 shows lower encoding time than
H.265/MPEG-HEVC but higher encoding time compared to H.264.
SUBJECTIVE QUALITY EVALUATION OF H.264 AND H.265 ENCODED VIDEO SEQUENCES STRE...ijma
Streaming of high-quality video contents as part of multimedia communication, has become essential
nowadays. Video delivered over the network suffers from different kind of impairments, which degrades its
quality. Such network impairments effect differs among different types of codec used. This paper present the
effects of network degradation factors such as packet loss and jitter over H.264 and H.265 encoded video
sequences. In addition to the codec used, we also focused on the different level of temporal and spatial
aspect within the videos. Among different basic test methods, double stimulus impairments scale was used
to complete the experiment as subjective measures of assessment metric from user’s perspective.The result
illustrates that differently encoded video sequences react differently to the network impairments and are
very sensitive to a transmission error. Similarly, it also shows that user’s experience is affected according
to the motion level of video.
Subjective Quality Evaluation of H.264 and H.265 Encoded Video Sequences Stre...ijma
Streaming of high-quality video contents as part of multimedia communication, has become essential nowadays. Video delivered over the network suffers from different kind of impairments, which degrades its quality. Such network impairments effect differs among different types of codec used. This paper present the effects of network degradation factors such as packet loss and jitter over H.264 and H.265 encoded video sequences. In addition to the codec used, we also focused on the different level of temporal and spatial aspect within the videos. Among different basic test methods, double stimulus impairments scale was used to complete the experiment as subjective measures of assessment metric from user’s perspective.The result illustrates that differently encoded video sequences react differently to the network impairments and are very sensitive to a transmission error. Similarly, it also shows that user’s experience is affected according to the motion level of video.
Analyzing Video Streaming Quality by Using Various Error Correction Methods o...IJERA Editor
Transmission video over ad hoc networks has become one of the most important and interesting subjects of study for researchers and programmers because of the strong relationship between video applications and frequent users of various mobile devices, such as laptops, PDAs, and mobile phones in all aspects of life. However, many challenges, such as packet loss, congestion (i.e., impairments at the network layer), multipath fading (i.e., impairments at the physical layer) [1], and link failure, exist in transferring video over ad hoc networks; these challenges negatively affect the quality of the perceived video [2].This study has investigated video transfer over ad hoc networks. The main challenges of transferring video over ad hoc networks as well as types of errors that may occur during video transmission, various types of video mechanisms, error correction methods, and different Quality of Service (QoS) parameters that affect the quality of the received video are also investigated.
Design of Optimal Linear Phase FIR High Pass Filter using Improved Particle S...IDES Editor
This paper presents a novel approach for designing
a linear phase digital high pass FIR filter using Improved
Particle Swarm Optimization (IPSO) algorithm. Design of
FIR filter is a multi-modal optimization problem. The
conservative gradient based optimization techniques are not
efficient for digital filter design. Given the specifications for
the filters to be realized, IPSO algorithm generates a set of
optimal filter coefficients and tries to meet the ideal frequency
response characteristics. This paper presents the realization
of the optimal FIR high pass filter of filter order 20 as per
given problem statements. The simulation results have been
compared to those obtained from well accepted classical
algorithms like Park and McClellan algorithm (PM), and
evolutionary algorithms like genetic algorithm (GA) and
particle swarm optimization (PSO). The results rationalize
that the proposed optimal filter design approach using IPSO
outperforms PM, RGA, PSO in the accuracy of the designed
filter, as well as in the convergence speed and solution quality
A Hybrid DWT-SVD Method for Digital Video Watermarking Using Random Frame Sel...researchinventy
Digital watermarking refers to embedding watermarks in a multimedia documents and files in order to protect them from illegal copying and identifying manipulations. We proposed a technique is Hybrid DWTSVD method for digital video watermarking. This paper presents a robust and secure watermarking algorithm for video sequence. Proposed scheme divide the video in to frames and then blue channel is used for watermarking insertion. Discrete wavelet transform is used for watermark embedding. Watermark is inserted in to mid frequency component for better resistance to video manipulation operation. PSNR and MSE are computed for testing the proposed method
PRACTICAL APPROACHES TO TARGET DETECTION IN LONG RANGE AND LOW QUALITY INFRAR...sipij
It is challenging to detect vehicles in long range and low quality infrared videos using deep learning
techniques such as You Only Look Once (YOLO) mainly due to small target size. This is because small
targets do not have detailed texture information. This paper focuses on practical approaches for target
detection in infrared videos using deep learning techniques. We first investigated a newer version of You
Only Look Once (YOLO v4). We then proposed a practical and effective approach by training the YOLO
model using videos from longer ranges. Experimental results using real infrared videos ranging from 1000
m to 3500 m demonstrated huge performance improvements. In particular, the average detection
percentage over the six ranges of 1000 m to 3500 m improved from 54% when we used the 1500 m videos
for training to 95% if we used the 3000 m videos for training.
International Journal of Engineering Research and Development (IJERD)IJERD Editor
journal publishing, how to publish research paper, Call For research paper, international journal, publishing a paper, IJERD, journal of science and technology, how to get a research paper published, publishing a paper, publishing of journal, publishing of research paper, reserach and review articles, IJERD Journal, How to publish your research paper, publish research paper, open access engineering journal, Engineering journal, Mathemetics journal, Physics journal, Chemistry journal, Computer Engineering, Computer Science journal, how to submit your paper, peer reviw journal, indexed journal, reserach and review articles, engineering journal, www.ijerd.com, research journals,
yahoo journals, bing journals, International Journal of Engineering Research and Development, google journals, hard copy of journal
The proposed scheme embedded the watermark during the differential pulse code modulation process and extracted through decoding the entropy details. This technique utilize the moving picture expert groups standard (MPEG-2) in which discrete cosine transform coefficients are adjusted from selected instantaneous decoder refresh frames for watermarking purpose. The subsets of frames as candidate I-frames are chosen to achieve better perceptibility and robustness. A secret key based cryptographic technique is used to select the candidate frames. Three more keys are required to extract the watermark whereas one of the key is used to stop the extraction process and the remaining two are used to display the scrambled watermark. The toughness is evaluated by testing spatial and temporal synchronization attacks. High sturdiness is achieved against video specific attacks frequently occurs in the real world. Even a single frame can accommodate thousand of watermark bits which reflect that high watermark capacity can be obtained.
Robust Video Watermarking Scheme Based on Intra-Coding Process in MPEG-2 Style IJECEIAES
The proposed scheme implemented a semi blind digital watermarking method for video exploiting MPEG-2 standard. The watermark is inserted into selected high frequency coefficients of plain types of discrete cosine transform blocks instead of edge and texture blocks during intra coding process. The selection is essential because the error in such type of blocks is less sensitive to human eyes as compared to other categories of blocks. Therefore, the perceptibility of watermarked video does not degraded sharply. Visual quality is also maintained as motion vectors used for generating the motion compensated images are untouched during the entire watermarking process. Experimental results revealed that the scheme is not only robust to re-compression attack, spatial synchronization attacks like cropping, rotation but also strong to temporal synchronization attacks like frame inserting, deleting, swapping and averaging. The superiority of the anticipated method is obtaining the best sturdiness results contrast to the recently delivered schemes.
Audio Steganography Coding Using the Discreet Wavelet TransformsCSCJournals
The performance of audio steganography compression system using discreet wavelet transform (DWT) is investigated. Audio steganography coding is the technology of transforming stego-speech into efficiently encoded version that can be decoded in the receiver side to produce a close representation of the initial signal (non compressed). Experimental results prove the efficiency of the used compression technique since the compressed stego-speech are perceptually intelligible and indistinguishable from the equivalent initial signal, while being able to recover the initial stego-speech with slight degradation in the quality .
Dynamic Threshold in Clip Analysis and RetrievalCSCJournals
Key frame extraction can be helpful in video summarization, analysis, indexing, browsing, and retrieval. Clip analysis of key frame sequences is an open research issues. The paper deals with identification and extraction of key frames using dynamic threshold followed by video retrieval. The number of key frames to be extracted for each shot depends on the activity details of the shot. This system uses the statistics of comparison between the successive frames within a level extracted on the basis of color histograms and dynamic threshold. Two program interfaces are linked for clip analysis and video indexing and retrieval using entropy. The results using proposed system on few video sequences are tested and the extracted key frames and retrieved results are shown.
H.264/AVCis currently the most widely adopted video coding standard due to its high compression capability and flexibility. However, compressed videos are highly vulnerable to channel errors which may result in severe quality degradation of a video. This paper presentsa concealment aware Unequal Error Protection (UEP) scheme for H.264 video compression using Reed Solomon (RS) codes. The proposed UEP technique assigns a code rate to each Macroblock (MB) based on the type of concealment and a Concealment Dependent Index (CDI). Two interleaving techniques, namely Frame Level Interleaving (FLI) and Group Level Interleaving (GLI) have also been employed. Finally, prioritised concealment is applied in cases where error correction is beyond the capability of the RS decoder. Simulation results have demonstrated that the proposed framework provides an average gain of 2.96 dB over a scheme that used Equal Error Protection (EEP).
Quality Assessment for Recognition and Task-based multimedia applications (QART)Mikołaj Leszczuk
Users of video to perform tasks require sufficient video quality to recognize the information needed for their application. Therefore, the fundamental measure of video quality in these applications is the success rate of these tasks (such as recognition), which is referred to as visual intelligibility or acuity. One of the major causes of reduction of visual intelligibility is loss of data, through various forms of compression. Additionally, the characteristics of the scene being captured have a direct effect on visual intelligibility and on the performance of a compression operation-specifically, the size of the target of interest, the lighting conditions, and the temporal complexity of the scene. The QART project is performing a series of tests to study the effects and interactions of compression and scene characteristics. An additional goal is to test existing or develop new objective measurements that will predict the results of the subjective tests of visual intelligibility.
PERFORMANCE EVALUATION OF H.265/MPEG-HEVC, VP9 AND H.264/MPEGAVC VIDEO CODINGijma
This study evaluates the performance of the three latest video codecs H.265/MPEG-HEVC, H.264/MPEGAVC
and VP9. The evaluation is based on both subjective and objective quality metrics. The assessment
metric Double Stimulus Impairment Scale (DSIS) is used to evaluate the subjective quality of the
compressed video sequences. The Peak Signal-to-Noise Ratio (PSNR) metricis used for the objective
evaluation. Moreover, this work studies the effect of frame rate and resolution on the encoders’
performance. The extensive number of experiments are conducted with similar encoding configurations for
the three studied encoders. The evaluation results show that H.265/MPEG-HEVC provides superior bitrate
saving capabilities compared to H.264 and VP9. However, VP9 shows lower encoding time than
H.265/MPEG-HEVC but higher encoding time compared to H.264.
SUBJECTIVE QUALITY EVALUATION OF H.264 AND H.265 ENCODED VIDEO SEQUENCES STRE...ijma
Streaming of high-quality video contents as part of multimedia communication, has become essential
nowadays. Video delivered over the network suffers from different kind of impairments, which degrades its
quality. Such network impairments effect differs among different types of codec used. This paper present the
effects of network degradation factors such as packet loss and jitter over H.264 and H.265 encoded video
sequences. In addition to the codec used, we also focused on the different level of temporal and spatial
aspect within the videos. Among different basic test methods, double stimulus impairments scale was used
to complete the experiment as subjective measures of assessment metric from user’s perspective.The result
illustrates that differently encoded video sequences react differently to the network impairments and are
very sensitive to a transmission error. Similarly, it also shows that user’s experience is affected according
to the motion level of video.
Subjective Quality Evaluation of H.264 and H.265 Encoded Video Sequences Stre...ijma
Streaming of high-quality video contents as part of multimedia communication, has become essential nowadays. Video delivered over the network suffers from different kind of impairments, which degrades its quality. Such network impairments effect differs among different types of codec used. This paper present the effects of network degradation factors such as packet loss and jitter over H.264 and H.265 encoded video sequences. In addition to the codec used, we also focused on the different level of temporal and spatial aspect within the videos. Among different basic test methods, double stimulus impairments scale was used to complete the experiment as subjective measures of assessment metric from user’s perspective.The result illustrates that differently encoded video sequences react differently to the network impairments and are very sensitive to a transmission error. Similarly, it also shows that user’s experience is affected according to the motion level of video.
COMPARISON OF CINEPAK, INTEL, MICROSOFT VIDEO AND INDEO CODEC FOR VIDEO COMPR...ijma
The file size and picture quality are factors to be considered for streaming, storage and transmitting videos
over networks. This work compares Cinepak, Intel, Microsoft Video and Indeo Codec for video
compression. The peak signal to noise ratio is used to compare the quality of such video compressed using
AVI codecs. The most widely used objective measurement by developers of video processing systems is
Peak Signal-to-Noise Ratio (PSNR). Peak Signal to Noise Ration is measured on a logarithmic scale and
depends on the mean squared error (MSE) between an original and an impaired image or video, relative to
(2n-1)2.
COMPARISON OF CINEPAK, INTEL, MICROSOFT VIDEO AND INDEO CODEC FOR VIDEO COMPR...ijma
The file size and picture quality are factors to be considered for streaming, storage and transmitting videos
over networks. This work compares Cinepak, Intel, Microsoft Video and Indeo Codec for video
compression. The peak signal to noise ratio is used to compare the quality of such video compressed using
AVI codecs. The most widely used objective measurement by developers of video processing systems is
Peak Signal-to-Noise Ratio (PSNR). Peak Signal to Noise Ration is measured on a logarithmic scale and
depends on the mean squared error (MSE) between an original and an impaired image or video, relative to
(2n-1)2.
Previous research done regarding assessing of video quality has been mainly by the use of subjective
methods, and there is still no standard method for objective assessments. Although it has been considered
that compression might not be significant in future as storage and transmission capabilities improve, but at
low bandwidths compression makes communication possible.
COMPARISON OF CINEPAK, INTEL, MICROSOFT VIDEO AND INDEO CODEC FOR VIDEO COMPR...ijma
The file size and picture quality are factors to be considered for streaming, storage and transmitting videos
over networks. This work compares Cinepak, Intel, Microsoft Video and Indeo Codec for video
compression. The peak signal to noise ratio is used to compare the quality of such video compressed using
AVI codecs. The most widely used objective measurement by developers of video processing systems is
Peak Signal-to-Noise Ratio (PSNR). Peak Signal to Noise Ration is measured on a logarithmic scale and
depends on the mean squared error (MSE) between an original and an impaired image or video, relative to
(2n-1)2.
Previous research done regarding assessing of video quality has been mainly by the use of subjective
methods, and there is still no standard method for objective assessments. Although it has been considered
that compression might not be significant in future as storage and transmission capabilities improve, but at
low bandwidths compression makes communication possible
Comparison of Cinepak, Intel, Microsoft Video and Indeo Codec for Video Compr...ijma
The file size and picture quality are factors to be considered for streaming, storage and transmitting videos
over networks. This work compares Cinepak, Intel, Microsoft Video and Indeo Codec for video
compression. The peak signal to noise ratio is used to compare the quality of such video compressed using
AVI codecs. The most widely used objective measurement by developers of video processing systems is
Peak Signal-to-Noise Ratio (PSNR). Peak Signal to Noise Ration is measured on a logarithmic scale and
depends on the mean squared error (MSE) between an original and an impaired image or video, relative to
(2n-1)2.
Previous research done regarding assessing of video quality has been mainly by the use of subjective
methods, and there is still no standard method for objective assessments. Although it has been considered
that compression might not be significant in future as storage and transmission capabilities improve, but at
low bandwidths compression makes communication possible.
Machine learning-based energy consumption modeling and comparing of H.264 and...IJECEIAES
Advancement of the prediction models used in a variety of fields is a result of the contribution of machine learning approaches. Utilizing such modeling in feature engineering is exceptionally imperative and required. In this research, we show how to utilize machine learning to save time in research experiments, where we save more than five thousand hours of measuring the energy consumption of encoding recordings. Since measuring the energy consumption has got to be done by humans and since we require more than eleven thousand experiments to cover all the combinations of video sequences, video bit rate, and video encoding settings, we utilize machine learning to model the energy consumption utilizing linear regression. VP8 codec has been offered by Google as a free video encoder in an effort to replace the popular H.264 video encoder standard. This research model energy consumption and describes the major differences between H.264/AVC and VP8 encoders based on of energy consumption and performance through experiments that are machine learning-based modeling. Twentynine uncompressed video segments from a standard data-set are used, with several sizes, details, and dynamics, where the frame sizes ranging from QCIF(176x144) to 2160p(3840x2160). For fairness in comparison analysis, we use seven settings in VP8 encoder and fifteen types of tuning in H.264/AVC. The settings cover various video qualities. The performance metrics include video qualities, encoding time, and encoding energy consumption.
Energy-efficient Adaptive Video Streaming with Latency-Aware Dynamic Resoluti...Vignesh V Menon
Presented at Mile High Video (MHV'24)
Abstract: Traditional per-title encoding schemes aim to optimize encoding resolutions to deliver the highest perceptual quality for each representation. However, keeping the encoding time within an acceptable
threshold for a smooth user experience is important to reduce the carbon footprint and energy consumption on encoding servers in video streaming applications. Toward this realization, we introduce
an encoding latency-aware dynamic resolution encoding scheme (LADRE) for adaptive video streaming applications. LADRE determines the encoding resolution for each target bitrate by utilizing a random forest-based prediction model for every video segment based on spatiotemporal features and the acceptable target latency.
Experimental results show that LADRE achieves an overall average quality improvement of 0.58 dB PSNR and 0.43 dB XPSNR while maintaining the same bitrate, compared to the HTTP Live Streaming (HLS) bitrate ladder encoding of 200 s segments using the VVenC
encoder, when the encoding latency for each representation is set to remain below the 200 s threshold. This is accompanied by a 84.17 % reduction in overall encoding energy consumption.
Empirical Evaluation of Decomposition Strategy for Wavelet Video CompressionCSCJournals
Abstract The wavelet transform has become the most interesting new algorithm for video compression. Yet there are many parameters within a wavelet analysis and synthesis which govern the quality of a decoded video. In this paper different wavelet decomposition strategies and their implications for the decoded video are discussed. A pool of color video sequences has been wavelet-transformed at different settings of the wavelet filter bank and quantization threshold and with decomposition of dyadic and packet wavelet transformation strategies. The empirical evaluation of the decomposition strategy is based on three benchmarks: a first judgment regards the perceived quality of the decoded video. The compression rate is a second crucial factor, and finally the best parameter setting with regards to the Peak Signal to Noise Ratio (PSNR). The investigation proposes dyadic decomposition as the chosen decomposition strategy.
Review of video over IP testing tools including: video syntax analyzer, pixel based measurement indexes like PSNR and SSIM and the tools to measure them, IP based video quality testing.
The surveillance systems are expected to record the videos in 24/7 and obviously it requires a huge storage space. Even though the hard disks are cheaper today, the number of CCTV cameras is also vertically increasing in order to boost up security. The video compression techniques is the only better option to reduce required the storage space; however, the existing video compression techniques are not adequate at all for the modern digital surveillance system monitoring as they require huge video streams. In this paper, a novel video compression technique is presented with a critical analysis of the experimental results.
With the advancement in internet technology, everyone has access to the internet. After google, YouTube is the second largest search engine and approximately 1 billion hours are consumed by people to watch video contents on YouTube. Editing the video and processing is not very easy. Network also plays an important role. With an unsteady network it can cause video to buffer which can reduce the streaming experience of users. Many people don’t even have a good computer which can handle the editing of large video files as editing and processing the video utilizes hardware, software and both. Many video editing software are available on the internet. Either it can be paid or open source software. One of the most popular open source software available on the internet is FFmpeg Fast Forward Moving Picture Expert Group . FFmpeg with other various software together can be used for video forensic to find traces in videos. It becomes very difficult to find traces from videos that are highly compressed or the video has low resolution. In earlier times, fetching data from camera of robots and encoding the data with software generates an issue. JNI,NDK, FFmpeg, researching about these video annotations a video player was created to examine video of sports so that user can see the how player evaluates the action practically with efficiently. Demand of multimedia increase as times goes on. Today in this global pandemic, everyone has move to digitalization. From studies to working everything has been digitalized. In this paper we are going to study about FFmpeg, how it benefits user with its features. Combining this highly popular multimedia framework with other software can create some useful technologies. Well, FFmpeg is mostly known for its memory efficiency and time efficiency. From processing image to editing videos everything can be acquired from FFmpeg. H. Sumesh Singha | Dr. Bhuvana J "A Study on FFmpeg Multimedia Framework" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-5 | Issue-4 , June 2021, URL: https://www.ijtsrd.compapers/ijtsrd42362.pdf Paper URL: https://www.ijtsrd.comcomputer-science/other/42362/a-study-on-ffmpeg-multimedia-framework/h-sumesh-singha
QoS Constrained H.264/SVC video streaming over Multicast Ad Hoc NetworksIJERA Editor
Support for QoS enabled multimedia transmission over multicast ad hoc network is necessary these days.
Researchers have developed various encoding/decoding schemes which can efficiently deliver the multimedia
contents over wireless networks. In case of ad hoc networks, performance of routing protocol depends upon
different factors i.e. traffic type being used for wireless transmission, dynamic network behavior, bandwidth and
computational power of nodes etc. It is essential to investigate the performance of multicast routing protocol
using various data types because they may consume huge network resources thus results in degradation of
transmission quality. In case of multicast group communication, Audio/Video data stream can cause extra
overhead on network performance and it is quite difficult to maintain Quality of Services for such type of data.
H.264 offers a rich codec library for Scalable Video Coding, to transfer SVC video traffic efficiently over
wireless networks. In this paper, we will analyze the performance of MAODV and PUMA routing protocols
using H.264/SVC video streaming traffic under the various QoS constraints such as Throughput, PDR, Delay,
Routing Load and Jitter etc.
VIDEO QUALITY ASSESSMENT USING LAPLACIAN MODELING OF MOTION VECTOR DISTRIBUTI...sipij
Video/Image quality assessment (VQA/IQA) is fundamental in various fields of video/image processing.
VQA reflects the quality of a video as most people commonly perceive. This paper proposes a reducedreference
mobile VQA, in which one-dimensional (1-D) motion vector (MV) distributions are used as
features of videos. This paper focuses on reduction of data size using Laplacian modeling of MV
distributions because network resource is restricted in the case of mobile video. The proposed method is
more efficient than the conventional methods in view of the computation time, because the proposed quality
metric decodes MVs directly from video stream in the parsing process rather than reconstructing the
distorted video at a receiver. Moreover, in view of data size, the proposed method is efficient because a
sender transmits only 28 parameters. We adopt the Laplacian distribution for modeling 1-D MV
histograms. 1-D MV histograms accumulated over the whole video sequences are used, which is different
from the conventional methods that assess each image frame independently. For testing the similarity
between MV histogram of reference and distorted videos and for minimizing the fitting error in Laplacian
modeling process, we use the chi-square method. To show the effectiveness of our proposed method, we
compare the proposed method with the conventional methods with coded video clips, which are coded
under varying bit rate, image size, and frame rate by H.263 and H.264/AVC. Experimental results show
that the proposed method gives the performance comparable with the conventional methods, especially, the
proposed method requires much lower transmission data.
Similar to Comparative study of compression techniques for synthetic videos (20)
Search and Society: Reimagining Information Access for Radical FuturesBhaskar Mitra
The field of Information retrieval (IR) is currently undergoing a transformative shift, at least partly due to the emerging applications of generative AI to information access. In this talk, we will deliberate on the sociotechnical implications of generative AI for information access. We will argue that there is both a critical necessity and an exciting opportunity for the IR community to re-center our research agendas on societal needs while dismantling the artificial separation between the work on fairness, accountability, transparency, and ethics in IR and the rest of IR research. Instead of adopting a reactionary strategy of trying to mitigate potential social harms from emerging technologies, the community should aim to proactively set the research agenda for the kinds of systems we should build inspired by diverse explicitly stated sociotechnical imaginaries. The sociotechnical imaginaries that underpin the design and development of information access technologies needs to be explicitly articulated, and we need to develop theories of change in context of these diverse perspectives. Our guiding future imaginaries must be informed by other academic fields, such as democratic theory and critical theory, and should be co-developed with social science scholars, legal scholars, civil rights and social justice activists, and artists, among others.
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Comparative study of compression techniques for synthetic videos
1. The International Journal of Multimedia & Its Applications (IJMA) Vol.6, No.2, April 2014
DOI : 10.5121/ijma.2014.6201 1
COMPARATIVE STUDY OF COMPRESSION
TECHNIQUES FOR SYNTHETIC VIDEOS
Ayman Abdalla, Ahmad Mazhar, Mosa Salah, Sahar Khalaf
Faculty of Science & I.T., Al-Zaytoonah University, Amman, Jordan
ABSTRACT
We evaluate the performance of three state of the art video codecs on synthetic videos. The evaluation is
based on both subjective and objective quality metrics. The subjective quality of the compressed video
sequences is evaluated using the Double Stimulus Impairment Scale (DSIS) assessment metric while the
Peak Signal-to-Noise Ratio (PSNR) is used for the objective evaluation. An extensive number of
experiments are conducted to study the effect of frame rate and resolution on codecs’ performance for
synthetic videos. The evaluation results show that video codecs respond in different ways to frame rate and
frame resolution change. H.264 shows superior capabilities compared to other codecs. Mean Opinion
Score (MOS) results are shown for various bitrates, frame rates and frame resolutions.
KEYWORDS
CIF, DSIS, H.264, MOS, MPEG, QCIF, VP-8
1. INTRODUCTION
Synthetic videos are used by a wide variety of applications such as virtual reality tours, cartoons
and games. Virtual reality tours require good compression techniques since they are often used in
real time and require environment details to be shown. Online gaming has grown to become one
of the largest entertainment sectors. More and more gamers upload videos of their games to video
sharing websites or stream them live via online platforms such as Twitch TV. Therefore, video
compression techniques are required to reduce the bitrate of gamers’ video streams without
compromising the quality. This study will investigate and compare the performance of the most
popular standard video codecs on game and virtual reality videos. The compression of synthetic
videos is a new research area. There are no specific studies on video codecs performance for
synthetic videos.
In [1], a comparative study between H.264 and Motion JPEG2000 for high definition video
coding was conducted. In [2], the performance of H.264, MPEG-4, H.263 and MPEG-2 were
studied. The codecs were compared using (Peak Signal-to-Noise Ratio) PSNR and subjective
measurements. In [3], an evaluation study of H.264 performance was conducted. The trade-off
between coding efficiency and error resilience for network applications was considered, in
addition to the end-to-end delay. In [4], an evaluation of perceptual visual quality under various
settings and requirements was conducted. The subjective assessment tests were analyzed to study
the influence of the different dimensions on the subjective evaluation. The considered dimensions
are: encoder type, video content, bitrate, frame size and frame rate, where the only codecs used in
the study were H.264 and H.263. In [5], a methodology was devised to evaluate the video
2. The International Journal of Multimedia & Its Applications (IJMA) Vol.6, No.2, April 2014
2
perceived quality when watching high motion videos. The video sequences were chosen from
some football matches with different physical quality metrics. CIF and QCIF frame resolutions
were used to study the relationship between frame rate and quantization and their impact on the
perceived quality. In [6], a new objective quality metric for video sequences was proposed. The
proposed metric was based on estimating the MOS of compressed sequences using MSE, spatial
video content and skin information. The idea focused on the factors that attract viewer attention.
In [7], a review of video quality measurement techniques was performed. Both subjective and
objective quality metrics were discussed, in addition to a hybrid metric. In [8], an evaluation
study of present video compression techniques effects on the perceptual quality of video coding
was presented. In [9], a study of the rating scale in subjective testing methodology was presented.
The study focused on the importance of the rating scale elements and their effect on the
subjective evaluation results. In [10], a wide comparative study between H.264 and MPEG-4 was
conducted. Most technical differences between the two codecs were presented. The results
showed the superiority of codec H.264 over MPEG-4.
Three types of video codecs are selected for the study in this paper; H.264, MPEG-4 and VP8.
The sequences used to perform this comparison study were captured from the “World of
Warcraft” game. This game is considered one of the most popular online games. It contains a
wide range of features and visual specifications. The sequences were captured using a software
tool called FRAPS [11]. The goal of this paper is to study the most possible frame rates and
resolutions over the common bitrate values, and show the effects of this variation on video codecs
performance for synthetic sequences. We also discuss the degree of codec sensitivity to this
variation. The study addressed subjective and objective measurements for a comprehensive study.
2. METHODOLOGY OF THE STUDY
Small frame resolutions with QCIF and CIF sizes were selected for making comparisons in this
study. Three different frame rates; 15 frames per second (fps), 25 fps and 30 fps, were selected.
Videos were compressed at 256, 512 and 1024 kbps. These bitrates were selected to suit the
average internet speeds available in most countries. According to [12-14], the selected bitrates
match the lowest available internet speed in the UK and in Jordan where the study was conducted.
The compression experiments are applied over diverse types of captured video sequences in order
to provide different scene contents. The study considered scenes that include sequences with
high-texture high-motion, high-texture low-motion, low-texture high-motion and low-texture
low-motion contents. Compression was performed using the FFmpeg software [15]. The process
used in this study to evaluate the codecs performance includes both subjective and objective
metrics. Peak signal to noise ratio (PSNR) was used as the objective metric since it is widely used
by the video processing community. It is easy to implement and fast to compute [7]. The DSIS
variant II, which is recommended by the ITU [16], was used as the subjective quality metric.
Four ten-second test video sequences were viewed by fifteen viewers, where different spatial and
temporal resolutions were considered. The original video sequence is displayed to the viewer
first; and then the compressed video sequence is displayed. The viewers cannot score before this
process is repeated. After that, the viewers give their impression on the compression quality,
compared to the original sequences, using a five-level quality scale. All viewers are B.Sc.
students with good communication skills. The experiments were conducted in a computer lab
using 19” Dell LCD monitors. The lab’s windows were covered with gray curtains and white
lights were used. All students used the same lab and monitor specifications. For the subjective
evaluation, all video sequences were displayed at the same resolution and frame rate. The frame
size 4CIF and frame rate 30 fps were chosen for all tests. The lower resolutions were upsampled
to 4CIF and lower frame rates were repeated to become 30 fps. The process of upsampling and
3. The International Journal of Multimedia & Its Applications (IJMA) Vol.6, No.2, April 2014
3
frame repetition was conducted using the H.264/AVC 6-tap half-sample interpolation filter and
frame repeat [4].
3. OBJECTIVE EXPERIMENTAL RESULTS
The comparative study with results analysis is presented in this section. The effect of frame rates
and bitrates with CIF and QCIF resolutions are also illustrated. As shown in Table 1, H.264
outperformed MPEG-4 and VP-8 at 1024 kbps CIF resolution. H.264 PSNR was more than 2
decibels (dB) higher than the PSNR of the other two codecs and the PSNR difference decreased
as the bitrate decreased. H.264 still performed better than other codecs at 512 kbps, and the
difference decreased as the frame rate decreased at the same bitrate. MPEG-4 performed slightly
better with 256 kbps than with the other frame rates. At QCIF, H.264 significantly outperformed
other codecs at higher bitrates. PSNR was around 4 dB better than VP-8 and 6 dB than MPEG-4
at 512 kbps.
Table 1: HTHM PSNR
Bit
Rate
(kbps)
Frame
Rate
(fps)
PSNR (dB)
H.264 MPEG-4 VP-8
QCIF CIF QCIF CIF QCIF CIF
256
15 41.95 34.01 41.69 35.39 39.42 32.17
25 41.47 33.63 42.71 35.05 39.28 33.97
30 41.32 33.64 43.37 35.02 39.11 33.84
512
15 50.1 37.76 43.31 37.55 46.34 37.38
25 49.5 37.42 43.5 37.09 45.85 37.26
30 49.38 37.52 43.6 37.06 45.65 37.15
1024
15 64.89 43.33 43.32 40.68 57.47 41.17
25 64.15 43.05 43.54 40.35 57.56 40.88
30 63.38 43.27 43.61 40.13 57.63 40.61
The results in Table 2 show the average PSNR for the HTLM sequences of the three codecs.
H.264 shows, on average, a 3 dB better performance than MPEG-4 and VP-8 at 1024 kbps. The
difference between H.264 and VP-8 decreased at 512 kbps where it was less than 1 dB on
average. The VP-8 codec showed a slightly better performance than H.264 at 256 kbps. However,
VP-8 outperformed other codecs at lower bitrates. The VP-8 codec showed a slightly better
performance than H.264 at the low bitrate.
Table 2: HTLM PSNR
Bit
Rate
(kbps)
Frame
Rate
(fps)
PSNR (dB)
H.264 MPEG-4 VP-8
QCIF CIF QCIF CIF QCIF CIF
256
15 52.07 42.6 43.64 39.31 47.18 43.01
25 49.73 41.56 43.85 39.65 45.54 42.18
30 48.86 40.95 43.9 39.63 45.11 41.11
512
15 61.53 47.96 43.64 42.52 56.39 46.38
25 58.27 46.76 43.85 42.44 52.18 45.95
30 56.96 46.09 43.9 42.37 51.23 45.42
1024
15 62.83 55.5 43.65 44.6 56.58 52.67
25 62.31 53.26 43.86 44.77 56.33 50.06
30 62.05 52.33 43.91 44.79 56.19 49.16
4. The International Journal of Multimedia & Its Applications (IJMA) Vol.6, No.2, April 2014
4
The QCIF results in Table 2 show that H.264 significantly outperformed other codecs, as it gave
over 6 dB more than MPEG-4 and over 4 dB better than VP-8 at 256 kbps. The average PSNR
values for LTHM video sequences are shown in Table 3. All codecs showed better PSNR when
increasing the bitrate, where they decreased when increasing the frame rate. H.264 showed better
PSNR than other codecs at higher bitrates. The QCIF results show that MPEG-4 capability is
close to H.264 at a lower bitrate, where the difference was less than 1 dB on average. Conversely,
VP-8 gave the lowest PSNR at 256 kbps. At higher bitrates, H.264 showed an increase in PSNR
with an increase in the difference from other codecs.
Table 3: LTHM PSNR
Bit
Rate
(kbps)
Frame
Rate
(fps)
PSNR (dB)
H.264 MPEG-4 VP-8
QCIF CIF QCIF CIF QCIF CIF
256
15 42.2 37.57 41.2 37.04 39.96 38.42
25 41.96 37.07 41.07 36.87 39.85 37.96
30 41.9 36.85 41.58 37.38 39.65 37.63
512
15 46.96 41.49 43.81 40.12 44.47 41.63
25 46.87 41.17 43.91 39.45 44.16 41.35
30 46.85 41.05 43.96 39.48 43.89 41.19
1024
15 52.66 45.56 43.82 43.76 44.09 44.76
25 52.78 45.37 43.92 42.91 43.88 44.47
30 52.8 45.31 43.96 42.94 43.75 44.29
Table 4 shows the average PSNR of LTLM. As shown in the table, the H.264 codec at CIF
resolution outperformed other codecs at the higher bitrates. H.264 performed on average 2 dB
better than VP-8 and was approximately 14 dB better than MPEG-4 at 1024 kbps. For QCIF
resolution, H.264 showed better performance than other codecs for all sequences. It was about 13
dB better than MPEG-4 and about 1 dB better, on average, compared to VP-8 at 256 kbps.
However, this difference clearly decreased as the frame rate increased.
Table 4: LTLM PSNR
Bit
Rate
(kbps)
Frame
Rate
(fps)
PSNR (dB)
H.264 MPEG-4 VP-8
QCIF CIF QCIF CIF QCIF CIF
256
15 60.82 47.95 44.08 42.5 57.89 48.7
25 57.32 44.36 44.15 40.44 55.73 44.51
30 54.72 42.84 44.172 39.88 54.19 43.18
512
15 63.54 55.55 44.08 44.37 61.42 53.64
25 62.84 50.78 44.15 44.42 60.68 49.47
30 61.87 48.93 44.172 43.42 60.17 48.21
1024
15 64 62.44 44.08 44.38 61.78 60.07
25 64.06 59.66 44.15 44.43 60.58 57.81
30 63.83 57.42 44.173 44.44 58.62 55.36
4. SUBJECTIVE EXPERIMENTAL RESULTS
The average DSIS of HTHM and CIF resolution is shown in Table 5. The results show that the
users preferred the sequences compressed by the H.264 codec to those compressed using MPEG-
4 and VP-8. The table also shows the average DSIS for this type of sequences. The results show
that H.264 gave better performance than other codecs, while the performance of VP-8 came in
second. However, results illustrate that all codecs achieved around 3 out of 5 on average in this
5. The International Journal of Multimedia & Its Applications (IJMA) Vol.6, No.2, April 2014
5
subjective evaluation. The reason for this result is the complexity of this type of sequences and
the compression difficulty that the codecs encountered.
The subjective evaluation results of HTLM are shown in Table 6. At CIF resolution, the H.264
codec produced the highest satisfaction with all bitrates. VP-8 was better than MPEG-4 with all
different bitrates. The difference between codecs increased as the bitrate increased and H.264
started getting better results. However, the QCIF frame size showed that VP-8 and H.264 had
similar evaluation results at 256 kbps, where both codecs were better than MPEG-4. When
increasing the bitrate, H.264 started to perform better than VP-8. Results show that the difference
increased as the bitrate increased.
Table 5: HTHM MOS Results
Codec
Bit Rate
256 512 1024
CIF QCIF CIF QCIF CIF QCIF
H.264 4.2 3 4.33 3.33 4.46 3.53
MPEG-4 3.33 2.4 3.66 2.4 3.8 3
VP-8 3.73 2.6 3.93 3.06 4.26 3.46
Table 6: HTLM MOS Results
Codec
Bit Rate
256 512 1024
CIF QCIF CIF QCIF CIF QCIF
H.264 3.73 3.13 4.13 3.26 4.46 3.66
MPEG-4 3.2 2.8 3.73 2.93 3.66 3
VP-8 3.66 3.13 4 3.2 4.2 3.4
Table 7: LTHM MOS Results
Codec
Bit Rate
256 512 1024
CIF QCIF CIF QCIF CIF QCIF
H.264 4.33 3.33 4.2 3.4 4.4 3.66
MPEG-4 3.93 3.06 4.13 3.06 4.06 3.33
VP-8 4.13 3.33 4.13 3.33 4.33 3.46
Table 8: LTLM MOS Results
Codec
Bit Rate
256 512 1024
CIF QCIF CIF QCIF CIF QCIF
H.264 4.2 3.33 4.33 3.53 4.66 4
MPEG-4 3.53 3.13 4.13 2.93 4.06 3.13
VP-8 4.13 3.33 4.26 3.46 4.6 3.73
Table 7 illustrates the average DSIS of LTHM sequences. All codecs got high averages in the CIF
evaluation results. However, it is clear that the H.264 codec got better subjective evaluation than
other codecs at all bitrates. Results of H.264 and VP-8 for QCIF were similar at a lower bitrate
and close at higher rate. The difference slightly increased in favor of H.264 as the bitrate rate
increased. Table 8 demonstrates the LTLM average DSIS. H.264 and VP-8 DSIS values for CIF
6. The International Journal of Multimedia & Its Applications (IJMA) Vol.6, No.2, April 2014
6
were almost equal with a little preference for H.264. DSIS of both codecs were better than
MPEG-4 at all bitrates rates and frame rates. At 256 kbps QCIF resolution, both H.264 and VP-8
showed comparable results, whereas both codecs were slightly better than MPEG-4. At higher
bitrates rates, H.264 started performing better than VP-8 with a small difference at 512 kbps and
increased at 1024 kbps.
5. STATISTICAL STUDY OF SUBJECTIVE RESULTS
The PSNR and MOS results of codec comparison were shown in the preceding section. The
descriptive statistical study in the section showed these simple observations: First, H.264 codec
outperformed both MPEG-4 and VP-8. Second, the type of video sequence and frame rate
affected MOS, where the higher the motion and texture video sequences are the higher the MOS
becomes.
For completeness of studying the influence of different dimensions on MOS, this section
considers a wider scope of subjective video quality assessment analysis. The evaluating of
perceptual visual quality under different requirements has been performed including five
distinctive dimensions: encoder type, video type (content), bitrate, frame rate and frame size.
The obtained subjective results have been statistically analyzed. The influence of the different
dimensions on MOS has been illustrated. The Analysis of Variance (ANOVA) is used to conduct
this evaluation. ANOVA is a statistical technical method that can be used to compare the means
of two or more groups under certain assumptions. There are several types of ANOVA found in
the literature for the purpose of comparison such as the one-way ANOVA, two-way ANOVA,
factorial analysis and other types. Refer to [17] for details of ANOVA. To test the effect of video
type (VT), Codec Type (CT), frame per second (FpS) and bitrate (BR) on MOS, three-way
ANOVA was adopted in this analysis.
The analysis focuses on capturing any difference in MOS mean under the above potential factors.
The model that will be tested is as in the following equation:
MOS = BR + FpS + VT + ε
In the above model, it should be noted that the error is assumed normally and identically
distributed with mean 0 and variance σ2
. Before applying the analysis, three assumptions for
ANOVA should be validated:
1. Normality assumption: We resort to large sample theory to justify the normality of our
sample since the sample size is over 30.
2. Independency: The independency assumption is satisfied since each DSIS result is not
affected by the others.
3. Homoscedasticity (equality of variances): Levene's Test of Equality of Error Variances is
conducted and the results are shown in Table 9. Results show that no major violation of this
assumption is observed.
In order to perform the statistical study of the three-way ANOVA, a set of treatments is
determined to study the influence of different dimensions on MOS where the variables are Codec
Type (CT), Video Type (VT) and Bitrate (BR).
Table 9: Test of Equality of Error Variances
F df1 df2 Sig.
0.902 11 36 0.548
7. The International Journal of Multimedia & Its Applications (IJMA) Vol.6, No.2, April 2014
7
Table 10: Three-Way ANOVA on MOS of QCIF
Dimensions Sum of Squares Degrees of Freedom Mean Square F statistic p-value
CT 1.819 2 0.909 15.497 0.0000
VT 1.809 3 0.603 10.276 0.0002
BR 0.548 2 0.274 4.671 0.018
The analysis results of QCIF are illustrated in Table 10. As shown in the table, p-values are small
(p ≤ 0.05). The smallness of p-values indicates that the influence of the dimensions is substantial
and all have significant effect on MOS. In addition, the magnitudes of these dimensions are
indicators of the strength of the influence on the MOS. The CT affects MOS the most followed by
VT. The least impact is of BR.
For codecs performance evaluation, the pairwise comparison results are shown in Table 11 and
Table 12. The results show that the (p-values) are significant for CT and substantially affect MOS
between H.264 and MPEG-4, and highly affected it between VP-8 and MPEG-4. Furthermore,
positive mean difference value where (I is H.264 or VP-8) means that the advantage of the
significance is for the codec represented by (I). On the contrary, negative value indicates that the
advantage is for the codec that is represented by (J).
MOS was not significantly affected when switching between VP-8 and H.264, but MOS was
noticeably affected and viewers could easily sense that some changes happened when
compression was used. This implies that MOS is severely affected by CT when comparing H.264
or VP-8 with MPEG-4 (p-values < 0.05). When comparing H.264 to VP-8, the experiments
showed that the results are insignificant (p-values > 0.05). However, mean difference results
showed that H.264 insignificantly outperforms VP-8 as the positive value was obtained when (I =
H.264) but with (p-value > 0.05). The detailed pairwise comparison results of QCIF statistical
study are shown in Table 12.
Table 11: Pairwise of Video Codecs of CIF
(I) CT (J) CT
Mean Difference
(I-J)
p-value
H264
MPEG-4 0.532 0.000
VP-8 0.145 0.154
MPEG-4
H264 -0.532 0.000
VP-8 -0.387 0.001
VP-8
H264 -0.145 0.154
MPEG-4 0.387 0.001
Table 12: Pairwise of Video Codecs of QCIF
(I) CT (J) CT
Mean Difference
(I-J)
p-value
H264
MPEG-4 0.496 0.000
VP-8 0.173 0.007
MPEG-4
H264 -0.496 0.000
VP-8 -0.323 0.000
VP-8
H264 -0.173 0.007
MPEG-4 0.323 0.000
8. The International Journal of Multimedia & Its Applications (IJMA) Vol.6, No.2, April 2014
8
Table 13: Three-Way ANOVA on MOS of CIF
Dimensions Sum of Squares Degrees of Freedom Mean Square F statistic p-value
CT 1.520 2 0.760 35.067 0.000
VT 0.625 3 0.208 9.610 0.000
BR 0.805 2 0.402 18.572 0.000
The results of the three-way ANOVA of CIF format are illustrated in Table 13. As shown in the
table, all dimensions are significant, where p-values are almost zero. It can be noticed in the sixth
column that (p ≤ 0.05). This indicates that the influence of the dimensions is substantially
significant for CT, VT and BR on MOS. In addition, the magnitudes of BR are less than the BR
magnitude of QCIF. Therefore, this is considered as an indicator of the smaller strength of
influence of BR on MOS compared to the most BR influence in QCIF. In CIF format, the effect
of CT, VT and BR on MOS is comparable.
The Pairwise comparison results of CIF are shown in Table 11. The results show that (p-values)
are significant of CT and are substantially affects the MOS between H.264 and MPEG-4, MOS is
also highly affected between VP-8 and MPEG-4. Mean Difference has either positive or negative
values, where (I is H.264 and J is VP-8). Where positive results mean that the advantage of the
significance is for the codec represented by (I), and negative value indicates that the advantage is
for the codec that is represented by (J), result implies that MOS is severely affected by CT when
comparing H.264 or VP-8 with MPEG-4.
When comparing H.264 to VP-8, the results showed a different case than in QCIF. It is clear that
(p ≤ 0.05) which means it is significant and MOS is severely affected by changing from one
codec to the other. In addition, Mean Difference results show that H.264 severely outperformed
VP-8 since the positive value was obtained when (I = H.264). As a conclusion, we can claim that
H.264 outperforms both codecs at CIF and QCIF resolutions with considerable significance of
QCIF.
6. CONCLUSIONS
The effects of frame rate and resolution on the most popular video codecs performance and
perceptual quality were evaluated. The study was conducted using PSNR, the most popular
objective metric, and extensive subjective viewing tests using DSIS II. The frame rates and
resolutions were tested at low bitrates currently available in typical uplink streaming. The results
led to new interesting perceptions of the online video gaming compression techniques. First, the
H.264 video coding technique showed high performance on synthetic video coding. Second,
perceptual quality was affected significantly by VT and CT more than the effect by BR at the
QCIF resolution. However, it was insignificantly affected at higher resolutions such as CIF.
Third, the higher frame sizes with lower frame rates were more acceptable by observers,
especially for high texture sequences. Fourth, VP-8 showed comparable compression capability
which makes it a high competitor with H.264 and MPEG-4. Fifth, H.264 outperformed other
tested codecs for synthetic video compression. The results of our research can provide common
rules for online gaming compression over relatively low bitrates, as well as guidelines for
compressing other synthetic videos such as virtual reality video segments.
9. The International Journal of Multimedia & Its Applications (IJMA) Vol.6, No.2, April 2014
9
REFERENCES
[1] Deng, C., W. Lin, B.-S. Lee, C. T. Lau & M. Paul, (2010) “Comparison between H.264/AVC and
Motion jpeg2000 for super-high definition video coding,” in the 17th IEEE International Conference
on Image Processing (ICIP), pp. 2037-2040.
[2] Wiegand, T., H. Schwarz, A. Joch, F. Kossentini & G. J. Sullivan, (2003) “Rate-constrained coder
control and comparison of video coding standards,” IEEE Transactions on Circuits and Systems for
Video Technology, Vol. 13, pp. 688-703.
[3] Brown, M., D. Bushmitch, K. Kerpez, D. Waring & Y. Wang, (2009) “Low-bit rate video codec
parameter evaluation and optimization,” in Military Communications Conference, MILCOM 2009,
IEEE, pp. 1-20.
[4] Zhai, G., J. Cai, W. Lin, X. Yang & W. Zhang, (2008) “Cross-Dimensional Perceptual Quality
Assessment for Low Bit-Rate Videos,” IEEE Transactions on Multimedia, Vol. 10, pp. 1316-1324.
[5] McCarthy, J. D., M. A. Sasse & D. Miras, (2004) “Sharp or smooth?: comparing the effects of
quantization vs. frame rate for streamed video,” in Conference on Human Factors in Computing
Systems, Vienna, Austria.
[6] Bhat, A., S. Kannangara, Y. Zhao & I. Richardson, (2012) “A Full Reference Quality Metric for
Compressed Video Based on Mean Squared Error and Video Content,” IEEE Transactions on
Circuits and Systems for Video Technology, Vol. 22.
[7] Winkler, S. & P. Mohandas, (2008) “The Evolution of Video Quality Measurement: From PSNR to
Hybrid Metrics,” IEEE Transactions on Broadcasting, Vol. 54, pp. 660-668.
[8] Seshadrinathan, K., R. Soundararajan, A. C. Bovik & L. K. Cormack, (2010) “Study of Subjective
and Objective Quality Assessment of Video,” IEEE Transactions on Image Processing, Vol. 19, pp.
1427-1441.
[9] Huynh-Thu, Q., M.-N. Garcia, F. Speranza, P. Corriveau & A. Raake, (2011) “Study of Rating Scales
for Subjective Quality Assessment of High-Definition Video,” IEEE Transactions on Broadcasting,
Vol. 57, pp. 1-14.
[10] Dembla, D., B. Patel, A. Kumar & Y. Bhomia, (2013) “Comparison of H.264 and MPEG-4 Codec
Based on PSNR – Peak Signal to Noise Ratio Algorithm”, International Journal of Advanced
Research in Computer Science and Software Engineering, Vol. 3, Issue 3, pp. 365-370.
[11] http://www.fraps.com, Accessed 5/12/2013.
[12] http://www.netindex.com, Accessed 5/12/2013.
[13] http://media.ofcom.org.uk, Accessed 5/12/2013.
[14] http://www.ispreview.co.uk, Accessed 5/12/2013.
[15] http://www.ffmpeg.org, Accessed 5/12/2013.
[16] International Telecommunication Union, (2002) “Methodology for the Subjective Assessment of the
Quality of Television Pictures,” Recommendation ITU-R BT. 500-11 ITU.
[17] Snedecor, G.W. & W.G. Cochran (1989) Statistical Methods, 8th ed., Iowa State University, USA.
10. The International Journal of Multimedia & Its Applications (IJMA) Vol.6, No.2, April 2014
10
AUTHORS
Dr. Ayman M. Abdalla has been a member of the Faculty of Science and Information
Technology at Al-Zaytoonah University since 2001, where he held different positions
including the Chair of the Department of Multimedia Systems. He received his Ph.D. in
computer science from the University of Central Florida, FL, USA; and his Master’s and
Bachelor’s degrees in computer science from Montclair State University, NJ, USA. He
has experience in research and teaching in the United States and Jordan in addition to
working in software development in a company in the United States.
Dr. Ahmad A. Mazhar has been a member of the Faculty of Science and Information
Technology at Al-Zaytoonah University since 2008. He received his Ph.D. in 2013 from
De Montfort University, UK; his Master’s degree in computer science from Al- Balqa'
Applied University, Salt, Jordan; and his Bachelor’s degree in computer science from
Al-Zaytoonah University, Amman, Jordan.
Mr. Mosa Salah has been a member of the Faculty of Science and Information
Technology at Al-Zaytoonah University since 2008. He received his Master’s degree in
computer science from the Arab Academy for Finance and Banking, Amman, Jordan;
and his Bachelor’s degree in computer science from Al-Zaytoonah University, Amman,
Jordan.
Mrs. Sahar A. Khalaf received her Bachelor’s degree in computer graphics from Princess Sumaya
University for Technology, Amman, Jordan, in 2010 and started working as a lab assistant at Al-Zaytoonah
University in 2011. Currently she is an associate researcher working on a major funded project at Al-
Zaytoonah University.