SlideShare a Scribd company logo
1 of 42
1/42 
Methods for Objective and Subjective Video Quality Assessment and for Speech Enhancement 
Muhammad Shahid Dec 2, 2014 PhD Degree Defense
2/42 
Disposition 
•Introduction 
•Part 1 
oOn classification and review of No-Reference (NR) visual quality assessment 
•Part 2 
oOn NR and Reduced-Reference (RR) methods of video quality assessment 
•Part 3 
oOn subjective methods of video quality assessment 
•Part 4 
oOn speech enhancement in modulation domain
3/42 
Introduction 
Two key areas in multimedia services 
•Video Quality Assessment (VQA) 
•Speech Enhancement 
•Video Quality 
oWhat? Perceptual quality of a video 
oWho? Subjective matter 
oWhy? We want it better 
oHow? Assessment: Subjective and Objective 
 Both are covered in the thesis 
•Speech Enhancement 
oNoisy environments; remove noise or boost speech 
oEvaluate established technique, in a different domain
4/42 
Introduction Quality 
Aristotle categorized every object of human apprehension into: 
•Substance 
•Quantity 
•Quality (from Latin – qualitas) 
•Reflection 
•Place 
•Time 
•Position 
•State 
•Action 
•Affection Quality (Meriam-Webster): how good or bad something is
5/42 
Introduction Motivation 
•Videos: Expected to be 67% of global mobile data traffic in 2017
6/42 
Introduction Motivation 
•Video Quality: Why do we bother? 
•We: 
•Consumers, Service providers, Content providers etc. 
•Bother: Low quality Dissatisfaction  Churn 
•Is that all? 
Decrease quality (resources) without loosing user? 
Charging more for better quality? 
Provide various quality options?  larger consumer base! 
• Many reasons for Video Quality Assessment
7/42 
Introduction Video Quality 
System Influence Factors that can degrade video quality 
In order to avoid/minimize any degradation, its impact on quality has to be measured!
8/42 
Introduction Video Quality Assessment 
•How to measure (assess)? 
•Compared to (b) ‘original’ MSE of (a) = 42 MSE of (c) = 25 
(a) (b) (c)
9/42 
Introduction Video Quality Assessment 
•Simple pixel comparisons are not adequate 
•Humans should be the assessors 
•Subjective assessment (mean opinion score: MOS) 
Laboratory based, standardized by ITU-T 
Crowdsourcing based, loosely controlled 
BUT: impracticable for many practical applications 
•Objective assessment, computational models that mimic subjective assessment 
’Original’ available  Full-Reference (FR) 
Features of ’original’ available  Reduced-Reference (RR) 
No access to ’original’  No-Reference (NR) 
•Real-time, online, practical application scenarios  Preferably NR or RR
10/30 
Disposition 
•Introduction 
•Part 1 
•On classification and review of NR visual quality assessment 
•Part 2 
•On NR and RR methods of video quality assessment 
•Part 3 
•On Subjective methods of video quality assessment 
•Part 4 
•On Speech enhancement in modulation domain
11/30 
Part 1 Classification and review of NR visual quality assessment
12/30 
•QP: Quantization Parameter 
•DCT: Discrete Cosine Transform 
Part 1 Classification and Review of NR visual quality assessment
13/30 
•Over 170 references reviewed 
•Pixel-based methods in majority; designed for images 
•Many image based techniques have been adapted for videos 
•Joint impact of different artifacts / methods of ’global’ quality assessment: Scarce 
•NR VQA gaining interest 
•P.NAMS and P.NBAMS standardized by ITU-T 
•Bitstream based approaches more popular (computationally less complex but still offer competitive performance) 
Part 1 Conclusions and Observations
14/42 
Disposition 
•Introduction 
•Part 1 
•On classification and review of NR visual quality assessment 
•Part 2 
•NR and RR methods of video quality assessment 
•ANN based NR method 
•LS-SVM based NR method 
•LASSO based NR and RR methods 
•Part 3 
•On subjective methods of video quality assessment 
•Part 4 
•On speech enhancement in modulation domain
15/42 
Part 2 
A general framework of NR video quality prediction/estimation
16/42 
Part 2 Artificial Neural Network (ANN) based NR method 
•ANN used in image processing, found useful in different applications 
•Bitstream-based video features 
•P16x16, P4x4, and P8x8 are partitioning sizes of blocks in percentage 
•Avg = Average 
•Perceptual Evaluation of Video Quality (PEVQ) 
• Peak Signal to Noise Ratio (PSNR) 
• Structural SIMilarity (SSIM)
17/42 
Part 2 Artificial Neural Network (ANN) based NR method 
•Two layer ANN with Levenberg-Marquardt backpropagation 
•H.264/AVC encoded test stimuli, QCIF resolution 
•7 SRCs, 6 bitrates, 4 frame-rates = 168 for training 
•5 clips from 1 SRC, 6 bitrates, 4 frame-rates = 120 for testing
18/42 
Part 2 Artificial Neural Network (ANN) based NR method Results 
•Competitive performance was observed as compared to linear regression 
•Possible improvements/extensions: MOS prediction, better regression technique…
19/42 
Part 2 Least Squares-Support Vector Machine (LS-SVM) based NR method 
•SVM is a popular machine-learning technique for regression 
•LS-SVM is computationally simpler than SVM 
•Quadratic programming  a set of linear equations 
•Test-stimuli chosen based on Spatial and Temporal perceptual Information (SI and TI)
20/42 
Part 2 LS-SVM based NR method 
•6 SRCs, 2 resolutions, each @ 2 frame-rates and 5 bitrates = 120 
•20 sec videos, last 10s considered 
•Randomly chosen 80 for training and rest for testing 
•17 Bitstream-based features representing the impact of coding distortions and content characteristics
21/42 
Part 2 LS-SVM based NR method 
•VQEG recommended performance statistics used 
•LS-SVM performed slightly better or similar to ANN 
•Much better than linear regression 
•Was it useful to have more features? 
•Which of the features are more significant? 
•Impact on performance if RR features are added?
22/42 
Part 2 LASSO based NR and RR methods 
•Least Absolute Shrinkage and Selection Operator (LASSO) regression not used for VQA before 
•Offers linear solution for regression besides co-linearity removal and dimensionality reduction 
•Ridge regression used for baseline performance 
•In LASSO: 
• The task is to minimize the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant 
• For a given non-negative λ (tuning parameter) value, it solves the following minimization problem
23/42 
Part 2 LASSO based NR and RR methods 
•144 H.264/AVC encoded videos, impaired with simulated effects of packet-loss rate of different values from École Polytechnique Fédérale de Lausanne (EPFL) 
•The selected features (51) represent motion and structural contents of a video, the energy of the video signal, the impact of the packet losses, and the impact of error concealment 
•Feature values were standardized (using zscore)
24/42 
Part 2 LASSO based NR and RR methods
25/42 
Part 2 LASSO based NR and RR methods 
•12-fold cross validation (CV) for training and testing 
•Nested 10-fold CV for determining optimal λ NR RR
26/42 
Part 2 LASSO based NR and RR methods 
Summary and Conclusions 
•A variety of video features investigated for quality estimation; LASSO uses far less features but offer a performance competitive to Ridge, VQM, PEVQ, and 5 reference methods 
•Reported the perceptual preference of block partitioning 
•Feature selection and quality estimation performed together offer promising results 
•NR approach is competitive to RR, in our case 
•Future work – Evaluate for HEVC coded videos
27/42 
Disposition 
•Introduction 
•Part 1 
•On classification and review of NR visual quality assessment 
•Part 2 
•On NR and RR methods of video quality assessment 
•RR methods 
•Part 3 
oSubjective VQA 
Low-resolution videos 
Temporal, spatial, and quantization variations based videos 
Adaptive streaming videos, crowdsourcing based 
•Part 4 
•On speech enhancement in modulation domain
28/42 
Part 3 Subjective VQA of Low Resolution Videos 
•SRC videos selected on the basis of spatio-temporal perceptual information variety (SI & TI values) 
•H.264/AVC was followed 
•6 SRCs, 2 resolutions, each @ 2 frame-rates and 5 bitrates = 120 test-stimuli
29/42 
Part 3 Subjective VQA of Low Resolution Videos 
•Subjective assessment of VQA performed in an ITU standards compliant lab 
•21 subjects participated, MOS computed of 18 subjects 
•Obtained results conform to previously reported trends 
•Bitstreams and MOS published online, used in the study of Chapter 4 of thesis 
CIF QCIF
30/42 
Part 3 Temporal, spatial, and quantization variations 
?
31/44
32/42 
Part 3 Temporal, spatial, and quantization variations Results 
•MOS Vs Bitrate values were plotted for all SRCs 
•For low TI SRCs (Elisa, City), frame-resolution significant 
•For high TI SRCs (Soccer, Ice) similar trend but to less extent 
•ANOVA : Perceptual preference in the order of frame-resolution, bits per pixel, and frame-rate
33/42 
Part 3 Adaptive streaming videos, ”crowdsourcing” based VQA 
•7 HD (1280x720) videos encoded at {5,3,1,0.6} Mbps 
•Subjective assessment of Acreo lab
34/42 
Part 3 Adaptive streaming videos, ”crowdsourcing” based VQA 
•215 workers participated, 6 removed 
•Larger subject diversity than typically in lab- based VQA
35/42 
Part 3 Adaptive streaming videos crowdsourcing based Results 
•Promising correlation with lab-based tests 
•Crowdsourcing potentially an alternative? 
•Verified already reported trends 
•Constant (less) quality preferred over freezing events
36/42 
Disposition 
•Introduction 
•Part 1 
• On classification and review of NR visual quality assessment 
•Part 2 
• On NR and RR methods of video quality assessment 
•Part 3 
• On subjective methods of video quality assessment 
•Part 4 
•Speech Enhancement 
•Spectral center-of-gravity based demodulation 
•Convex optimization based demodulation
37/42 
Part 4 Spectral center-of-gravity based demodulation 
•Speech enhancement done by Adaptive Gain Equalizer (AGE) 
•AGE boosts the speech signal, leaving noise unchanged 
•Speech signal decomposed into modulator and carrier for modulation-frequency domain processing 
•Modulation frequency domain processing employed in many applications 
•Spectral center-of-gravity based demodulation preferred
38/42 
Part 4 Spectral center-of-gravity based demodulation 
•Procedure includes: 
•Filter bank used to get sub-bands 
•Demodulation of each sub-band 
•Processing of modulators 
•Re-modulation of sub-bands 
•Signal synthesis 
Gain function of AGE 
•AGE performed well in modulation domain 
•Max SNRI of 9 dB obtained
39/42 
Part 4 Convex optimization based demodulation 
•Traditional methods of demodulation may not provide unique modulator-carrier pair 
•Convex optimization proven useful: SNRI, Spectral Distortion, PESQ MOS, and spectrogram analysis indicate its superiority of performance
40/42 
•A detailed review of recent publications in NR visual quality assessment  can be instrumental for research, a handbook for experts as well as for young researchers 
•Examined different techniques of regression proposed different methods of NR and RR VQA based on a variety of video features 
•Lab-based and crowdsourcing based experiments  contributed to subjective VQA 
•Evaluation of AGE in modulation domain  investigated the usefulness of modulation frequency domain 
Main Contributions
41/42 
Thank you! muhammad.shahid@ieee.org
42/42 
In physical science a first essential step in the direction of learning any subject is to find principles of numerical reckoning and practicable methods for measuring some quality connected with it. I often say that when you can measure what you are speaking about, and express it in numbers, you know something about it; but when you cannot measure it, when you cannot express it in numbers, your knowledge is of a meagre and unsatisfactory kind; it may be the beginning of knowledge, but you have scarcely in your thoughts advanced to the stage of science, whatever the manner may be. — Sir William Thomson (Lord Kelvin), 1889 *[PLA, vol. 1, "Electrical Units of Measurement", 1883-05-03]

More Related Content

What's hot

LPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A ReviewLPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A Reviewijiert bestjournal
 
Audio Noise Removal – The State of the Art
Audio Noise Removal – The State of the ArtAudio Noise Removal – The State of the Art
Audio Noise Removal – The State of the Artijceronline
 
ECET 350 Entire Course NEW
ECET 350 Entire Course NEWECET 350 Entire Course NEW
ECET 350 Entire Course NEWshyamuopfive
 
FIR Filter Design using Particle Swarm Optimization with Constriction Factor ...
FIR Filter Design using Particle Swarm Optimization with Constriction Factor ...FIR Filter Design using Particle Swarm Optimization with Constriction Factor ...
FIR Filter Design using Particle Swarm Optimization with Constriction Factor ...IDES Editor
 
Design of Low Pass Digital FIR Filter Using Cuckoo Search Algorithm
Design of Low Pass Digital FIR Filter Using Cuckoo Search AlgorithmDesign of Low Pass Digital FIR Filter Using Cuckoo Search Algorithm
Design of Low Pass Digital FIR Filter Using Cuckoo Search AlgorithmIJERA Editor
 
Design of Optimal Linear Phase FIR High Pass Filter using Improved Particle S...
Design of Optimal Linear Phase FIR High Pass Filter using Improved Particle S...Design of Optimal Linear Phase FIR High Pass Filter using Improved Particle S...
Design of Optimal Linear Phase FIR High Pass Filter using Improved Particle S...IDES Editor
 
Interactive voice conversion for augmented speech production
Interactive voice conversion for augmented speech productionInteractive voice conversion for augmented speech production
Interactive voice conversion for augmented speech productionNU_I_TODALAB
 
Speech Compression using LPC
Speech Compression using LPCSpeech Compression using LPC
Speech Compression using LPCDisha Modi
 
Performance enhancement of dct based speaker recognition using wavelet de noi...
Performance enhancement of dct based speaker recognition using wavelet de noi...Performance enhancement of dct based speaker recognition using wavelet de noi...
Performance enhancement of dct based speaker recognition using wavelet de noi...eSAT Journals
 
Gsm rf-optimization
Gsm rf-optimizationGsm rf-optimization
Gsm rf-optimizationkarimfeel
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderIJTET Journal
 
Ladder for mixed signal test engineers
Ladder for mixed signal test engineersLadder for mixed signal test engineers
Ladder for mixed signal test engineersFangXuIEEE
 
Voice over IP (VoIP) Speech Quality Measurement with Open-Source Software Com...
Voice over IP (VoIP) Speech Quality Measurement with Open-Source Software Com...Voice over IP (VoIP) Speech Quality Measurement with Open-Source Software Com...
Voice over IP (VoIP) Speech Quality Measurement with Open-Source Software Com...Sebastian Schumann
 
IRJET- Segmentation in Digital Signal Processing
IRJET-  	  Segmentation in Digital Signal ProcessingIRJET-  	  Segmentation in Digital Signal Processing
IRJET- Segmentation in Digital Signal ProcessingIRJET Journal
 
Empirical Evaluation of Decomposition Strategy for Wavelet Video Compression
Empirical Evaluation of Decomposition Strategy for Wavelet Video CompressionEmpirical Evaluation of Decomposition Strategy for Wavelet Video Compression
Empirical Evaluation of Decomposition Strategy for Wavelet Video CompressionCSCJournals
 

What's hot (18)

LPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A ReviewLPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A Review
 
Audio Noise Removal – The State of the Art
Audio Noise Removal – The State of the ArtAudio Noise Removal – The State of the Art
Audio Noise Removal – The State of the Art
 
ECET 350 Entire Course NEW
ECET 350 Entire Course NEWECET 350 Entire Course NEW
ECET 350 Entire Course NEW
 
FIR Filter Design using Particle Swarm Optimization with Constriction Factor ...
FIR Filter Design using Particle Swarm Optimization with Constriction Factor ...FIR Filter Design using Particle Swarm Optimization with Constriction Factor ...
FIR Filter Design using Particle Swarm Optimization with Constriction Factor ...
 
Design of Low Pass Digital FIR Filter Using Cuckoo Search Algorithm
Design of Low Pass Digital FIR Filter Using Cuckoo Search AlgorithmDesign of Low Pass Digital FIR Filter Using Cuckoo Search Algorithm
Design of Low Pass Digital FIR Filter Using Cuckoo Search Algorithm
 
Design of Optimal Linear Phase FIR High Pass Filter using Improved Particle S...
Design of Optimal Linear Phase FIR High Pass Filter using Improved Particle S...Design of Optimal Linear Phase FIR High Pass Filter using Improved Particle S...
Design of Optimal Linear Phase FIR High Pass Filter using Improved Particle S...
 
Interactive voice conversion for augmented speech production
Interactive voice conversion for augmented speech productionInteractive voice conversion for augmented speech production
Interactive voice conversion for augmented speech production
 
Nd2421622165
Nd2421622165Nd2421622165
Nd2421622165
 
Speech technology basics
Speech technology   basicsSpeech technology   basics
Speech technology basics
 
Speech Compression using LPC
Speech Compression using LPCSpeech Compression using LPC
Speech Compression using LPC
 
Performance enhancement of dct based speaker recognition using wavelet de noi...
Performance enhancement of dct based speaker recognition using wavelet de noi...Performance enhancement of dct based speaker recognition using wavelet de noi...
Performance enhancement of dct based speaker recognition using wavelet de noi...
 
Gsm rf-optimization
Gsm rf-optimizationGsm rf-optimization
Gsm rf-optimization
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using Vocoder
 
Ladder for mixed signal test engineers
Ladder for mixed signal test engineersLadder for mixed signal test engineers
Ladder for mixed signal test engineers
 
Linear Predictive Coding
Linear Predictive CodingLinear Predictive Coding
Linear Predictive Coding
 
Voice over IP (VoIP) Speech Quality Measurement with Open-Source Software Com...
Voice over IP (VoIP) Speech Quality Measurement with Open-Source Software Com...Voice over IP (VoIP) Speech Quality Measurement with Open-Source Software Com...
Voice over IP (VoIP) Speech Quality Measurement with Open-Source Software Com...
 
IRJET- Segmentation in Digital Signal Processing
IRJET-  	  Segmentation in Digital Signal ProcessingIRJET-  	  Segmentation in Digital Signal Processing
IRJET- Segmentation in Digital Signal Processing
 
Empirical Evaluation of Decomposition Strategy for Wavelet Video Compression
Empirical Evaluation of Decomposition Strategy for Wavelet Video CompressionEmpirical Evaluation of Decomposition Strategy for Wavelet Video Compression
Empirical Evaluation of Decomposition Strategy for Wavelet Video Compression
 

Viewers also liked (16)

MS_Learning_Transcript (1).PDF
MS_Learning_Transcript (1).PDFMS_Learning_Transcript (1).PDF
MS_Learning_Transcript (1).PDF
 
495492
495492495492
495492
 
Ilha
IlhaIlha
Ilha
 
MS_Learning_Transcript.PDF
MS_Learning_Transcript.PDFMS_Learning_Transcript.PDF
MS_Learning_Transcript.PDF
 
Ogryzlo
OgryzloOgryzlo
Ogryzlo
 
04_TIGRE_Carpinteria-TABLOIDE
04_TIGRE_Carpinteria-TABLOIDE04_TIGRE_Carpinteria-TABLOIDE
04_TIGRE_Carpinteria-TABLOIDE
 
【UDC2015】北海道ブロック
【UDC2015】北海道ブロック【UDC2015】北海道ブロック
【UDC2015】北海道ブロック
 
Amor a la filosofia y el espiritu crítico
Amor a la filosofia y el espiritu críticoAmor a la filosofia y el espiritu crítico
Amor a la filosofia y el espiritu crítico
 
496112
496112496112
496112
 
Arquivo
ArquivoArquivo
Arquivo
 
MS_Learning_Transcript (1).PDF
MS_Learning_Transcript (1).PDFMS_Learning_Transcript (1).PDF
MS_Learning_Transcript (1).PDF
 
Moraliza brasil
Moraliza brasilMoraliza brasil
Moraliza brasil
 
ELEIÇÕES 2012 - ASTORGA: SUZIE DA APMI 55555
ELEIÇÕES 2012 - ASTORGA: SUZIE DA APMI 55555ELEIÇÕES 2012 - ASTORGA: SUZIE DA APMI 55555
ELEIÇÕES 2012 - ASTORGA: SUZIE DA APMI 55555
 
Vagas
VagasVagas
Vagas
 
ArtikelBoligMagasinet
ArtikelBoligMagasinetArtikelBoligMagasinet
ArtikelBoligMagasinet
 
Jpeg 2000
Jpeg 2000Jpeg 2000
Jpeg 2000
 

Similar to Shahid presentation

Advanced Mechanisms for Delivering High-Quality Digital Content
Advanced Mechanisms for Delivering High-Quality Digital ContentAdvanced Mechanisms for Delivering High-Quality Digital Content
Advanced Mechanisms for Delivering High-Quality Digital ContentMikołaj Leszczuk
 
Selected Aspects of the New Recommendation on Subjective Methods of Assessing...
Selected Aspects of the New Recommendation on Subjective Methods of Assessing...Selected Aspects of the New Recommendation on Subjective Methods of Assessing...
Selected Aspects of the New Recommendation on Subjective Methods of Assessing...Mikolaj Leszczuk
 
Video Quality Measurements
Video Quality MeasurementsVideo Quality Measurements
Video Quality MeasurementsYoss Cohen
 
Versatile Video Coding: Compression Tools for UHD and 360° Video
Versatile Video Coding: Compression Tools for UHD and 360° VideoVersatile Video Coding: Compression Tools for UHD and 360° Video
Versatile Video Coding: Compression Tools for UHD and 360° VideoMathias Wien
 
Quality Assessment for Recognition and Task-based multimedia applications (QART)
Quality Assessment for Recognition and Task-based multimedia applications (QART)Quality Assessment for Recognition and Task-based multimedia applications (QART)
Quality Assessment for Recognition and Task-based multimedia applications (QART)Mikołaj Leszczuk
 
PCS 2019 Panel on Emerging Video Coding Standards: Overview on the Emerging V...
PCS 2019 Panel on Emerging Video Coding Standards: Overview on the Emerging V...PCS 2019 Panel on Emerging Video Coding Standards: Overview on the Emerging V...
PCS 2019 Panel on Emerging Video Coding Standards: Overview on the Emerging V...Mathias Wien
 
Video Quality Evaluation Methodology
Video Quality Evaluation MethodologyVideo Quality Evaluation Methodology
Video Quality Evaluation MethodologyRajath K
 
Study of no reference video quality metrics for hevc compression
Study of no reference video quality metrics for hevc compressionStudy of no reference video quality metrics for hevc compression
Study of no reference video quality metrics for hevc compressionMikolaj Leszczuk
 
Video capacity of WLANs with a multiuser perceptual quality constraint
Video capacity of WLANs with a multiuser perceptual quality constraintVideo capacity of WLANs with a multiuser perceptual quality constraint
Video capacity of WLANs with a multiuser perceptual quality constraintShivaditya Jatar
 
Quality of Multimedia Experience: Past, Present and Future
Quality of Multimedia Experience: Past, Present and FutureQuality of Multimedia Experience: Past, Present and Future
Quality of Multimedia Experience: Past, Present and FutureTouradj Ebrahimi
 
CA-SUM Video Summarization
CA-SUM Video SummarizationCA-SUM Video Summarization
CA-SUM Video SummarizationVasileiosMezaris
 
Siamese-rPPG Network: Remote Photoplethysmography Signal Estimation from Face...
Siamese-rPPG Network: Remote Photoplethysmography Signal Estimation from Face...Siamese-rPPG Network: Remote Photoplethysmography Signal Estimation from Face...
Siamese-rPPG Network: Remote Photoplethysmography Signal Estimation from Face...ssuserbd51ec
 
Comparative study of compression techniques for synthetic videos
Comparative study of compression techniques for synthetic videosComparative study of compression techniques for synthetic videos
Comparative study of compression techniques for synthetic videosijma
 
Comparative Study of Compression Techniques for Synthetic Videos
Comparative Study of Compression Techniques for Synthetic VideosComparative Study of Compression Techniques for Synthetic Videos
Comparative Study of Compression Techniques for Synthetic Videosijma
 
mpeg4copy-120428133000-phpapp01.ppt
mpeg4copy-120428133000-phpapp01.pptmpeg4copy-120428133000-phpapp01.ppt
mpeg4copy-120428133000-phpapp01.pptPawachMetharattanara
 
FutureComm 2010: Video Quality Analysis and Measurement
FutureComm 2010: Video Quality Analysis and MeasurementFutureComm 2010: Video Quality Analysis and Measurement
FutureComm 2010: Video Quality Analysis and MeasurementRADVISION Ltd.
 
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...ActiveEon
 
Real-time Non-Intrusive Speech Quality Estimation of VoIP Using Genetic Progr...
Real-time Non-Intrusive Speech Quality Estimation of VoIP Using Genetic Progr...Real-time Non-Intrusive Speech Quality Estimation of VoIP Using Genetic Progr...
Real-time Non-Intrusive Speech Quality Estimation of VoIP Using Genetic Progr...adil raja
 
EPIQ'21: Days of Future Past: An Optimization-based Adaptive Bitrate Algorith...
EPIQ'21: Days of Future Past: An Optimization-based Adaptive Bitrate Algorith...EPIQ'21: Days of Future Past: An Optimization-based Adaptive Bitrate Algorith...
EPIQ'21: Days of Future Past: An Optimization-based Adaptive Bitrate Algorith...Minh Nguyen
 

Similar to Shahid presentation (20)

Advanced Mechanisms for Delivering High-Quality Digital Content
Advanced Mechanisms for Delivering High-Quality Digital ContentAdvanced Mechanisms for Delivering High-Quality Digital Content
Advanced Mechanisms for Delivering High-Quality Digital Content
 
Selected Aspects of the New Recommendation on Subjective Methods of Assessing...
Selected Aspects of the New Recommendation on Subjective Methods of Assessing...Selected Aspects of the New Recommendation on Subjective Methods of Assessing...
Selected Aspects of the New Recommendation on Subjective Methods of Assessing...
 
Video Quality Measurements
Video Quality MeasurementsVideo Quality Measurements
Video Quality Measurements
 
Versatile Video Coding: Compression Tools for UHD and 360° Video
Versatile Video Coding: Compression Tools for UHD and 360° VideoVersatile Video Coding: Compression Tools for UHD and 360° Video
Versatile Video Coding: Compression Tools for UHD and 360° Video
 
Quality Assessment for Recognition and Task-based multimedia applications (QART)
Quality Assessment for Recognition and Task-based multimedia applications (QART)Quality Assessment for Recognition and Task-based multimedia applications (QART)
Quality Assessment for Recognition and Task-based multimedia applications (QART)
 
PCS 2019 Panel on Emerging Video Coding Standards: Overview on the Emerging V...
PCS 2019 Panel on Emerging Video Coding Standards: Overview on the Emerging V...PCS 2019 Panel on Emerging Video Coding Standards: Overview on the Emerging V...
PCS 2019 Panel on Emerging Video Coding Standards: Overview on the Emerging V...
 
Video Quality Evaluation Methodology
Video Quality Evaluation MethodologyVideo Quality Evaluation Methodology
Video Quality Evaluation Methodology
 
Study of no reference video quality metrics for hevc compression
Study of no reference video quality metrics for hevc compressionStudy of no reference video quality metrics for hevc compression
Study of no reference video quality metrics for hevc compression
 
Video capacity of WLANs with a multiuser perceptual quality constraint
Video capacity of WLANs with a multiuser perceptual quality constraintVideo capacity of WLANs with a multiuser perceptual quality constraint
Video capacity of WLANs with a multiuser perceptual quality constraint
 
Quality of Multimedia Experience: Past, Present and Future
Quality of Multimedia Experience: Past, Present and FutureQuality of Multimedia Experience: Past, Present and Future
Quality of Multimedia Experience: Past, Present and Future
 
CA-SUM Video Summarization
CA-SUM Video SummarizationCA-SUM Video Summarization
CA-SUM Video Summarization
 
NMSL_2017summer
NMSL_2017summerNMSL_2017summer
NMSL_2017summer
 
Siamese-rPPG Network: Remote Photoplethysmography Signal Estimation from Face...
Siamese-rPPG Network: Remote Photoplethysmography Signal Estimation from Face...Siamese-rPPG Network: Remote Photoplethysmography Signal Estimation from Face...
Siamese-rPPG Network: Remote Photoplethysmography Signal Estimation from Face...
 
Comparative study of compression techniques for synthetic videos
Comparative study of compression techniques for synthetic videosComparative study of compression techniques for synthetic videos
Comparative study of compression techniques for synthetic videos
 
Comparative Study of Compression Techniques for Synthetic Videos
Comparative Study of Compression Techniques for Synthetic VideosComparative Study of Compression Techniques for Synthetic Videos
Comparative Study of Compression Techniques for Synthetic Videos
 
mpeg4copy-120428133000-phpapp01.ppt
mpeg4copy-120428133000-phpapp01.pptmpeg4copy-120428133000-phpapp01.ppt
mpeg4copy-120428133000-phpapp01.ppt
 
FutureComm 2010: Video Quality Analysis and Measurement
FutureComm 2010: Video Quality Analysis and MeasurementFutureComm 2010: Video Quality Analysis and Measurement
FutureComm 2010: Video Quality Analysis and Measurement
 
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
 
Real-time Non-Intrusive Speech Quality Estimation of VoIP Using Genetic Progr...
Real-time Non-Intrusive Speech Quality Estimation of VoIP Using Genetic Progr...Real-time Non-Intrusive Speech Quality Estimation of VoIP Using Genetic Progr...
Real-time Non-Intrusive Speech Quality Estimation of VoIP Using Genetic Progr...
 
EPIQ'21: Days of Future Past: An Optimization-based Adaptive Bitrate Algorith...
EPIQ'21: Days of Future Past: An Optimization-based Adaptive Bitrate Algorith...EPIQ'21: Days of Future Past: An Optimization-based Adaptive Bitrate Algorith...
EPIQ'21: Days of Future Past: An Optimization-based Adaptive Bitrate Algorith...
 

Recently uploaded

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 

Recently uploaded (20)

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 

Shahid presentation

  • 1. 1/42 Methods for Objective and Subjective Video Quality Assessment and for Speech Enhancement Muhammad Shahid Dec 2, 2014 PhD Degree Defense
  • 2. 2/42 Disposition •Introduction •Part 1 oOn classification and review of No-Reference (NR) visual quality assessment •Part 2 oOn NR and Reduced-Reference (RR) methods of video quality assessment •Part 3 oOn subjective methods of video quality assessment •Part 4 oOn speech enhancement in modulation domain
  • 3. 3/42 Introduction Two key areas in multimedia services •Video Quality Assessment (VQA) •Speech Enhancement •Video Quality oWhat? Perceptual quality of a video oWho? Subjective matter oWhy? We want it better oHow? Assessment: Subjective and Objective  Both are covered in the thesis •Speech Enhancement oNoisy environments; remove noise or boost speech oEvaluate established technique, in a different domain
  • 4. 4/42 Introduction Quality Aristotle categorized every object of human apprehension into: •Substance •Quantity •Quality (from Latin – qualitas) •Reflection •Place •Time •Position •State •Action •Affection Quality (Meriam-Webster): how good or bad something is
  • 5. 5/42 Introduction Motivation •Videos: Expected to be 67% of global mobile data traffic in 2017
  • 6. 6/42 Introduction Motivation •Video Quality: Why do we bother? •We: •Consumers, Service providers, Content providers etc. •Bother: Low quality Dissatisfaction  Churn •Is that all? Decrease quality (resources) without loosing user? Charging more for better quality? Provide various quality options?  larger consumer base! • Many reasons for Video Quality Assessment
  • 7. 7/42 Introduction Video Quality System Influence Factors that can degrade video quality In order to avoid/minimize any degradation, its impact on quality has to be measured!
  • 8. 8/42 Introduction Video Quality Assessment •How to measure (assess)? •Compared to (b) ‘original’ MSE of (a) = 42 MSE of (c) = 25 (a) (b) (c)
  • 9. 9/42 Introduction Video Quality Assessment •Simple pixel comparisons are not adequate •Humans should be the assessors •Subjective assessment (mean opinion score: MOS) Laboratory based, standardized by ITU-T Crowdsourcing based, loosely controlled BUT: impracticable for many practical applications •Objective assessment, computational models that mimic subjective assessment ’Original’ available  Full-Reference (FR) Features of ’original’ available  Reduced-Reference (RR) No access to ’original’  No-Reference (NR) •Real-time, online, practical application scenarios  Preferably NR or RR
  • 10. 10/30 Disposition •Introduction •Part 1 •On classification and review of NR visual quality assessment •Part 2 •On NR and RR methods of video quality assessment •Part 3 •On Subjective methods of video quality assessment •Part 4 •On Speech enhancement in modulation domain
  • 11. 11/30 Part 1 Classification and review of NR visual quality assessment
  • 12. 12/30 •QP: Quantization Parameter •DCT: Discrete Cosine Transform Part 1 Classification and Review of NR visual quality assessment
  • 13. 13/30 •Over 170 references reviewed •Pixel-based methods in majority; designed for images •Many image based techniques have been adapted for videos •Joint impact of different artifacts / methods of ’global’ quality assessment: Scarce •NR VQA gaining interest •P.NAMS and P.NBAMS standardized by ITU-T •Bitstream based approaches more popular (computationally less complex but still offer competitive performance) Part 1 Conclusions and Observations
  • 14. 14/42 Disposition •Introduction •Part 1 •On classification and review of NR visual quality assessment •Part 2 •NR and RR methods of video quality assessment •ANN based NR method •LS-SVM based NR method •LASSO based NR and RR methods •Part 3 •On subjective methods of video quality assessment •Part 4 •On speech enhancement in modulation domain
  • 15. 15/42 Part 2 A general framework of NR video quality prediction/estimation
  • 16. 16/42 Part 2 Artificial Neural Network (ANN) based NR method •ANN used in image processing, found useful in different applications •Bitstream-based video features •P16x16, P4x4, and P8x8 are partitioning sizes of blocks in percentage •Avg = Average •Perceptual Evaluation of Video Quality (PEVQ) • Peak Signal to Noise Ratio (PSNR) • Structural SIMilarity (SSIM)
  • 17. 17/42 Part 2 Artificial Neural Network (ANN) based NR method •Two layer ANN with Levenberg-Marquardt backpropagation •H.264/AVC encoded test stimuli, QCIF resolution •7 SRCs, 6 bitrates, 4 frame-rates = 168 for training •5 clips from 1 SRC, 6 bitrates, 4 frame-rates = 120 for testing
  • 18. 18/42 Part 2 Artificial Neural Network (ANN) based NR method Results •Competitive performance was observed as compared to linear regression •Possible improvements/extensions: MOS prediction, better regression technique…
  • 19. 19/42 Part 2 Least Squares-Support Vector Machine (LS-SVM) based NR method •SVM is a popular machine-learning technique for regression •LS-SVM is computationally simpler than SVM •Quadratic programming  a set of linear equations •Test-stimuli chosen based on Spatial and Temporal perceptual Information (SI and TI)
  • 20. 20/42 Part 2 LS-SVM based NR method •6 SRCs, 2 resolutions, each @ 2 frame-rates and 5 bitrates = 120 •20 sec videos, last 10s considered •Randomly chosen 80 for training and rest for testing •17 Bitstream-based features representing the impact of coding distortions and content characteristics
  • 21. 21/42 Part 2 LS-SVM based NR method •VQEG recommended performance statistics used •LS-SVM performed slightly better or similar to ANN •Much better than linear regression •Was it useful to have more features? •Which of the features are more significant? •Impact on performance if RR features are added?
  • 22. 22/42 Part 2 LASSO based NR and RR methods •Least Absolute Shrinkage and Selection Operator (LASSO) regression not used for VQA before •Offers linear solution for regression besides co-linearity removal and dimensionality reduction •Ridge regression used for baseline performance •In LASSO: • The task is to minimize the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant • For a given non-negative λ (tuning parameter) value, it solves the following minimization problem
  • 23. 23/42 Part 2 LASSO based NR and RR methods •144 H.264/AVC encoded videos, impaired with simulated effects of packet-loss rate of different values from École Polytechnique Fédérale de Lausanne (EPFL) •The selected features (51) represent motion and structural contents of a video, the energy of the video signal, the impact of the packet losses, and the impact of error concealment •Feature values were standardized (using zscore)
  • 24. 24/42 Part 2 LASSO based NR and RR methods
  • 25. 25/42 Part 2 LASSO based NR and RR methods •12-fold cross validation (CV) for training and testing •Nested 10-fold CV for determining optimal λ NR RR
  • 26. 26/42 Part 2 LASSO based NR and RR methods Summary and Conclusions •A variety of video features investigated for quality estimation; LASSO uses far less features but offer a performance competitive to Ridge, VQM, PEVQ, and 5 reference methods •Reported the perceptual preference of block partitioning •Feature selection and quality estimation performed together offer promising results •NR approach is competitive to RR, in our case •Future work – Evaluate for HEVC coded videos
  • 27. 27/42 Disposition •Introduction •Part 1 •On classification and review of NR visual quality assessment •Part 2 •On NR and RR methods of video quality assessment •RR methods •Part 3 oSubjective VQA Low-resolution videos Temporal, spatial, and quantization variations based videos Adaptive streaming videos, crowdsourcing based •Part 4 •On speech enhancement in modulation domain
  • 28. 28/42 Part 3 Subjective VQA of Low Resolution Videos •SRC videos selected on the basis of spatio-temporal perceptual information variety (SI & TI values) •H.264/AVC was followed •6 SRCs, 2 resolutions, each @ 2 frame-rates and 5 bitrates = 120 test-stimuli
  • 29. 29/42 Part 3 Subjective VQA of Low Resolution Videos •Subjective assessment of VQA performed in an ITU standards compliant lab •21 subjects participated, MOS computed of 18 subjects •Obtained results conform to previously reported trends •Bitstreams and MOS published online, used in the study of Chapter 4 of thesis CIF QCIF
  • 30. 30/42 Part 3 Temporal, spatial, and quantization variations ?
  • 31. 31/44
  • 32. 32/42 Part 3 Temporal, spatial, and quantization variations Results •MOS Vs Bitrate values were plotted for all SRCs •For low TI SRCs (Elisa, City), frame-resolution significant •For high TI SRCs (Soccer, Ice) similar trend but to less extent •ANOVA : Perceptual preference in the order of frame-resolution, bits per pixel, and frame-rate
  • 33. 33/42 Part 3 Adaptive streaming videos, ”crowdsourcing” based VQA •7 HD (1280x720) videos encoded at {5,3,1,0.6} Mbps •Subjective assessment of Acreo lab
  • 34. 34/42 Part 3 Adaptive streaming videos, ”crowdsourcing” based VQA •215 workers participated, 6 removed •Larger subject diversity than typically in lab- based VQA
  • 35. 35/42 Part 3 Adaptive streaming videos crowdsourcing based Results •Promising correlation with lab-based tests •Crowdsourcing potentially an alternative? •Verified already reported trends •Constant (less) quality preferred over freezing events
  • 36. 36/42 Disposition •Introduction •Part 1 • On classification and review of NR visual quality assessment •Part 2 • On NR and RR methods of video quality assessment •Part 3 • On subjective methods of video quality assessment •Part 4 •Speech Enhancement •Spectral center-of-gravity based demodulation •Convex optimization based demodulation
  • 37. 37/42 Part 4 Spectral center-of-gravity based demodulation •Speech enhancement done by Adaptive Gain Equalizer (AGE) •AGE boosts the speech signal, leaving noise unchanged •Speech signal decomposed into modulator and carrier for modulation-frequency domain processing •Modulation frequency domain processing employed in many applications •Spectral center-of-gravity based demodulation preferred
  • 38. 38/42 Part 4 Spectral center-of-gravity based demodulation •Procedure includes: •Filter bank used to get sub-bands •Demodulation of each sub-band •Processing of modulators •Re-modulation of sub-bands •Signal synthesis Gain function of AGE •AGE performed well in modulation domain •Max SNRI of 9 dB obtained
  • 39. 39/42 Part 4 Convex optimization based demodulation •Traditional methods of demodulation may not provide unique modulator-carrier pair •Convex optimization proven useful: SNRI, Spectral Distortion, PESQ MOS, and spectrogram analysis indicate its superiority of performance
  • 40. 40/42 •A detailed review of recent publications in NR visual quality assessment  can be instrumental for research, a handbook for experts as well as for young researchers •Examined different techniques of regression proposed different methods of NR and RR VQA based on a variety of video features •Lab-based and crowdsourcing based experiments  contributed to subjective VQA •Evaluation of AGE in modulation domain  investigated the usefulness of modulation frequency domain Main Contributions
  • 41. 41/42 Thank you! muhammad.shahid@ieee.org
  • 42. 42/42 In physical science a first essential step in the direction of learning any subject is to find principles of numerical reckoning and practicable methods for measuring some quality connected with it. I often say that when you can measure what you are speaking about, and express it in numbers, you know something about it; but when you cannot measure it, when you cannot express it in numbers, your knowledge is of a meagre and unsatisfactory kind; it may be the beginning of knowledge, but you have scarcely in your thoughts advanced to the stage of science, whatever the manner may be. — Sir William Thomson (Lord Kelvin), 1889 *[PLA, vol. 1, "Electrical Units of Measurement", 1883-05-03]