SlideShare a Scribd company logo
1 of 15
Download to read offline
Perceptually-aware Per-title Encoding for Adaptive Video
Streaming
Vignesh V Menon1, Hadi Amirpour1, Mohammad Ghanbari1,2, and Christian Timmerer1
1
Christian Doppler Laboratory ATHENA, Alpen-Adria-Universität, Klagenfurt, Austria
2
School of Computer Science and Electronic Engineering, University of Essex, UK
19 July 2022
Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 1
Outline
1 Introduction
2 PPTE
3 Results
4 Conclusion
Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 2
Introduction
Per-title Encoding
In HAS, each video is encoded at a fixed set of bitrate-resolution pairs, referred to as bitrate
ladder.
The “one-size-fits-all” can be optimized per title to increase the Quality of Experience
(QoE) or decrease the bitrate of the representations as introduced for VoD services.1
0.2 0.5 1.2 4.5 16.8
Bitrate (in Mbps)
30
40
50
60
70
80
90
VMAF
Dolls-540p
Dolls-1080p
Park-540p
Park-1080p
Figure: Rate-Distortion (RD) curves using VMAF as the quality metric of Dolls and Park sequences of
MCML dataset encoded at 540p and 1080p resolutions.
1
J. De Cock et al. “Complexity-based consistent-quality encoding in the cloud”. In: 2016 IEEE International Conference on Image Processing (ICIP). 2016,
pp. 1484–1488. doi: 10.1109/ICIP.2016.7532605.
Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 3
Introduction
Motivation for Percetually-aware Per-title Encoding
The selection of bitrate-resolution pairs (i.e., (rt, bt) where t ≥ 0) from the convex-hull is
a challenging task.
The increased number of selected bitrate-resolution pairs for the bitrate ladder may improve
QoE, but leads to an increase in storage and bandwidth requirements.2
Furthermore, the selected bitrate-resolution pairs from the convex-hull for the bitrate ladder
may not always be perceptually different in video quality.
0.2 0.5 1.2 4.5 16.8
Bitrate (in Mbps)
50
60
70
80
90
100
VMAF
360p
432p
540p
720p
1080p
1440p
2160p
Figure: The HLS bitrate ladder of Characters sequence of MCML dataset.
2
Tianchi Huang et al. “Deep Reinforced Bitrate Ladders for Adaptive Video Streaming”. In: NOSSDAV ’21. Istanbul, Turkey: Association for Computing
Machinery, 2021, 66–73. isbn: 9781450384353. doi: 10.1145/3458306.3458873. url: https://doi.org/10.1145/3458306.3458873.
Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 4
Introduction
Target of the paper
b0 b1 b2 b3 b4 b5 b6
Bitrate
v0
v1=v0 + vJ(v0)
v2=v1 + vJ(v1)
v3=v2 + vJ(v2)
v4=v3 + vJ(v3)
v5=v4 + vJ(v4)
v6=v5 + vJ(v5)
VMAF
vmax
r1
r0
r2
r3
r4
r5
r6
Figure: The ideal bitrate ladder envisioned in this paper. The blue line denotes the corresponding
rate-distortion curve, while the red dotted line denotes VMAF=vmax . When the VMAF value is greater
than vmax , the video stream is deemed to be perceptually lossless.
Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 5
PPTE
Perceptually-aware Per-Title Encoding (PPTE)
Input Title
Feature Extraction
Bitrate Ladder Prediction
Resolutions (R)
Average JND (vJ)
Bitrate Range
{bmin, bmax }
Maximum
VMAF {vmax }
Per-title Encoding
Segments
(E,h) pairs
(r, b) pairs
Figure: PPTE architecture.
Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 6
PPTE Phase 1: Feature Extraction
PPTE
Phase 1: Feature Extraction
Compute texture energy per block
A DCT-based energy function is used to determine the block-wise feature of each frame
defined as:
Hk =
w−1
X
i=0
w−1
X
j=0
e|( ij
wh
)2−1|
|DCT(i, j)| (1)
where wxw is the size of the block, and DCT(i, j) is the (i, j)th DCT component when
i + j > 0, and 0 otherwise.
The energy values of blocks in a frame is averaged to determine the energy per frame.3
E =
C−1
X
k=0
Hp,k
C · w2
(2)
3
Michael King et al. “A New Energy Function for Segmentation and Compression”. In: 2007 IEEE International Conference on Multimedia and Expo. 2007,
pp. 1647–1650. doi: 10.1109/ICME.2007.4284983.
Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 7
PPTE Phase 1: Feature Extraction
PPTE
Phase 1: Feature Extraction
hp: SAD of the block level energy values of frame p to that of the previous frame p − 1.
hp =
C−1
X
k=0
| Hp,k, Hp−1,k |
C · w2
(3)
where C denotes the number of blocks in frame p.
Latency
Speed of feature extraction = 370fps for UHD video with 8 CPU threads and x86 SIMD
optimization
Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 8
PPTE Phase 1: Feature Extraction
PPTE
Phase 2: Bitrate ladder Prediction
Step 1: b0 = bmin
vr,b0 = A0,r log
q
h
E · b2
0

+ A1,r
v0 = max(vr,b0 )
r0 = arg maxr∈R(vr,b0 )
(r0, b0) is the first point of the bitrate ladder
A0,r and A1,r
Parameters trained using linear regression
Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 9
PPTE Phase 1: Feature Extraction
PPTE
Phase 2: Bitrate ladder Prediction
Step 2: t = 1
for t ≥ 1 do
vt = vt−1 + vJ(vt−1)
br,vt =
r
q
E
h e
vt −A1,r
A0,r

bt = min(br,vt )
rt = arg minr∈R(br,vt )
if bt  bmax or vt  vmax then
End of the algorithm
else
(rt, bt) is the (t + 1)th
point of the bitrate ladder.
t = t + 1
Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 10
Results
Results
0.2 0.5 1.2 4.5 16.8
Bitrate (in Mbps)
30
40
50
60
70
80
VMAF
HLS Ladder
Proposed Scheme
(a) IntoTree
0.2 0.5 1.2 4.5 16.8
Bitrate (in Mbps)
20
40
60
80
VMAF
HLS Ladder
Proposed Scheme
(b) DaylightRoad2
0.2 0.5 1.2 4.5 16.8
Bitrate (in Mbps)
30
40
50
60
70
80
90
VMAF
HLS Ladder
Proposed Scheme
(c) TreeShade
Figure: Comparison of RD curves for encoding the IntoTree, DaylightRoad2, and TreeShade sequences
using the HLS bitrate ladder and PPTE.
Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 11
Results
Results
40 50 60 70
E
10
20
30
40
50
h
10
15
20
25
30
35
40
45
50
|
S|
Figure: ∆S results for various values of E and
h.
40 50 60 70
E
10
20
30
40
50
h
5
10
15
20
25
30
35
40
|BDR
V
|
Figure: Bjøntegaard delta rate w.r.t VMAF
(BDRV ) results for various values of E and h.
∆S = 1 −
P
bopt
P
bref
(4)
where bref and bopt represent the sum of bitrates of all representations in the fixed bitrate
ladder and the optimized bitrate ladder, respectively.
Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 12
Results
Results
Table: Results of PPTE against HLS bitrate ladder.
Dataset Video SI TI E h BDRV BDRP ∆S Avg. JND
JVET4 DaylightRoad2 40.51 16.21 54.78 20.35 -23.84% -10.88% -40.32% 6.99
JVET FoodMarket4 38.26 17.68 60.61 22.67 -19.22% -6.21% -28.13% 6.72
MCML5 Characters 50.43 29.85 42.66 21.06 -74.60% -71.70% -53.69% 3.82
MCML Crowd 33.76 10.13 56.74 15.89 -30.12% -15.63% -31.06% 7.85
MCML Lake 42.04 11.84 47.89 21.11 -38.00% -0.37% -44.83% 5.03
MCML Park 22.63 8.17 40.55 9.22 -10.47% -10.50% -15.35% 6.28
SJTU6 Fountains 43.37 11.42 63.30 26.83 -32.73% -2.18% -29.65% 5.80
SJTU RushHour 29.14 16.21 56.12 25.11 -20.50% -7.34% -42.73% 6.92
SJTU TrafficFlow 33.57 13.8 56.64 28.00 -53.34% -42.89% -44.83% 5.95
SJTU TreeShade 52.88 5.29 60.24 11.31 -48.38% -39.02% -31.06% 6.74
VGEG7 IntoTree 324.41 12.09 45.77 30.94 -26.23% -7.08% -40.32% 4.92
VGEG OldTownCross 29.66 11.62 50.31 27.64 -33.77% -25.07% -28.13% 5.86
VGEG ParkJoy 62.78 27.00 76.32 41.10 -15.68% -2.39% -18.16% 5.19
Average -27.02% -16.47% -30.69% 5.85
*These sequences were used for training.
4
Jill Boyce et al. JVET-J1010: JVET common test conditions and software reference configurations. July 2018.
5
Manri Cheon and Jong-Seok Lee. “Subjective and Objective Quality Assessment of Compressed 4K UHD Videos for Immersive Experience”. In: IEEE
Transactions on Circuits and Systems for Video Technology 28.7 (2018), pp. 1467–1480. doi: 10.1109/TCSVT.2017.2683504.
6
L. Song et al. “The SJTU 4K Video Sequence Dataset”. In: Fifth International Workshop on Quality of Multimedia Experience (QoMEX2013) (July 2013).
7
European Broadcasting Union (EBU). “The SVT High Definition Multi Format Test Set”. In: Feb. 2006. url:
https://tech.ebu.ch/docs/hdtv/svt-multiformat-conditions-v10.pdf.
Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 13
Conclusion
Conclusion
This paper proposed a perceptually-aware online per-title encoding (PPTE) scheme for live
streaming applications.
PPTE includes an algorithm that predicts the optimal resolution-bitrate pairs for every video
segment based on JND in visual quality perception.
Live streaming using PPTE requires 16.47% fewer bits to maintain the same PSNR and
27.02% fewer bits to maintain the same VMAF compared to the reference HLS bitrate
ladder.
The improvement in the compression efficiency is achieved with an average storage reduc-
tion of 30.69% compared to the reference HLS bitrate ladder.
Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 14
Q  A
Q  A
Thank you for your attention!
Vignesh V Menon (vignesh.menon@aau.at)
Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 15

More Related Content

Similar to Perceptually-aware Per-title Encoding for Adaptive Video Streaming

ETPS: Efficient Two-pass Encoding Scheme for Adaptive Live Streaming
ETPS: Efficient Two-pass Encoding Scheme for Adaptive Live StreamingETPS: Efficient Two-pass Encoding Scheme for Adaptive Live Streaming
ETPS: Efficient Two-pass Encoding Scheme for Adaptive Live StreamingAlpen-Adria-Universität
 
LiveVBR presentation at VQEG NORM.pdf
LiveVBR presentation at VQEG NORM.pdfLiveVBR presentation at VQEG NORM.pdf
LiveVBR presentation at VQEG NORM.pdfVignesh V Menon
 
Efficient bitrate ladder construction for live video streaming
Efficient bitrate ladder construction for live video streamingEfficient bitrate ladder construction for live video streaming
Efficient bitrate ladder construction for live video streamingMinh Nguyen
 
Improving Per-title Encoding for HTTP Adaptive Streaming by Utilizing Video S...
Improving Per-title Encoding for HTTP Adaptive Streaming by Utilizing Video S...Improving Per-title Encoding for HTTP Adaptive Streaming by Utilizing Video S...
Improving Per-title Encoding for HTTP Adaptive Streaming by Utilizing Video S...Alpen-Adria-Universität
 
Green_VCA_presentation.pdf
Green_VCA_presentation.pdfGreen_VCA_presentation.pdf
Green_VCA_presentation.pdfVignesh V Menon
 
VCIP_MCBE_presentation.pdf
VCIP_MCBE_presentation.pdfVCIP_MCBE_presentation.pdf
VCIP_MCBE_presentation.pdfVignesh V Menon
 
Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...
Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...
Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...Alpen-Adria-Universität
 
CODA_presentation.pdf
CODA_presentation.pdfCODA_presentation.pdf
CODA_presentation.pdfJunZhao68
 
Energy-efficient Adaptive Video Streaming with Latency-Aware Dynamic Resoluti...
Energy-efficient Adaptive Video Streaming with Latency-Aware Dynamic Resoluti...Energy-efficient Adaptive Video Streaming with Latency-Aware Dynamic Resoluti...
Energy-efficient Adaptive Video Streaming with Latency-Aware Dynamic Resoluti...Vignesh V Menon
 
IEEE MMSP'21: INCEPT: Intra CU Depth Prediction for HEVC
IEEE MMSP'21: INCEPT: Intra CU Depth Prediction for HEVCIEEE MMSP'21: INCEPT: Intra CU Depth Prediction for HEVC
IEEE MMSP'21: INCEPT: Intra CU Depth Prediction for HEVCVignesh V Menon
 
INCEPT: Intra CU Depth Prediction for HEVC
INCEPT: Intra CU Depth Prediction for HEVCINCEPT: Intra CU Depth Prediction for HEVC
INCEPT: Intra CU Depth Prediction for HEVCAlpen-Adria-Universität
 
MiPSO: Multi-Period Per-Scene Optimization For HTTP Adaptive Streaming
MiPSO: Multi-Period Per-Scene Optimization For HTTP Adaptive StreamingMiPSO: Multi-Period Per-Scene Optimization For HTTP Adaptive Streaming
MiPSO: Multi-Period Per-Scene Optimization For HTTP Adaptive StreamingAlpen-Adria-Universität
 
Online Bitrate ladder prediction for Adaptive VVC Streaming
Online Bitrate ladder prediction for Adaptive VVC StreamingOnline Bitrate ladder prediction for Adaptive VVC Streaming
Online Bitrate ladder prediction for Adaptive VVC StreamingVignesh V Menon
 
Video streaming using light-weight transcoding and in-network intelligence
Video streaming using light-weight transcoding and in-network intelligenceVideo streaming using light-weight transcoding and in-network intelligence
Video streaming using light-weight transcoding and in-network intelligenceMinh Nguyen
 
Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...
Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...
Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...Ijripublishers Ijri
 
Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...
Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...
Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...Ijripublishers Ijri
 
1 state of-the-art and trends in scalable video
1 state of-the-art and trends in scalable video1 state of-the-art and trends in scalable video
1 state of-the-art and trends in scalable videoYogananda Patnaik
 
Tutorial High Efficiency Video Coding Coding - Tools and Specification.pdf
Tutorial High Efficiency Video Coding Coding - Tools and Specification.pdfTutorial High Efficiency Video Coding Coding - Tools and Specification.pdf
Tutorial High Efficiency Video Coding Coding - Tools and Specification.pdfssuserc5a4dd
 

Similar to Perceptually-aware Per-title Encoding for Adaptive Video Streaming (20)

ETPS: Efficient Two-pass Encoding Scheme for Adaptive Live Streaming
ETPS: Efficient Two-pass Encoding Scheme for Adaptive Live StreamingETPS: Efficient Two-pass Encoding Scheme for Adaptive Live Streaming
ETPS: Efficient Two-pass Encoding Scheme for Adaptive Live Streaming
 
LiveVBR presentation at VQEG NORM.pdf
LiveVBR presentation at VQEG NORM.pdfLiveVBR presentation at VQEG NORM.pdf
LiveVBR presentation at VQEG NORM.pdf
 
Efficient bitrate ladder construction for live video streaming
Efficient bitrate ladder construction for live video streamingEfficient bitrate ladder construction for live video streaming
Efficient bitrate ladder construction for live video streaming
 
Improving Per-title Encoding for HTTP Adaptive Streaming by Utilizing Video S...
Improving Per-title Encoding for HTTP Adaptive Streaming by Utilizing Video S...Improving Per-title Encoding for HTTP Adaptive Streaming by Utilizing Video S...
Improving Per-title Encoding for HTTP Adaptive Streaming by Utilizing Video S...
 
Green_VCA_presentation.pdf
Green_VCA_presentation.pdfGreen_VCA_presentation.pdf
Green_VCA_presentation.pdf
 
VCIP_MCBE_presentation.pdf
VCIP_MCBE_presentation.pdfVCIP_MCBE_presentation.pdf
VCIP_MCBE_presentation.pdf
 
Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...
Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...
Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...
 
JASLA_presentation.pdf
JASLA_presentation.pdfJASLA_presentation.pdf
JASLA_presentation.pdf
 
CODA_presentation.pdf
CODA_presentation.pdfCODA_presentation.pdf
CODA_presentation.pdf
 
Energy-efficient Adaptive Video Streaming with Latency-Aware Dynamic Resoluti...
Energy-efficient Adaptive Video Streaming with Latency-Aware Dynamic Resoluti...Energy-efficient Adaptive Video Streaming with Latency-Aware Dynamic Resoluti...
Energy-efficient Adaptive Video Streaming with Latency-Aware Dynamic Resoluti...
 
IEEE MMSP'21: INCEPT: Intra CU Depth Prediction for HEVC
IEEE MMSP'21: INCEPT: Intra CU Depth Prediction for HEVCIEEE MMSP'21: INCEPT: Intra CU Depth Prediction for HEVC
IEEE MMSP'21: INCEPT: Intra CU Depth Prediction for HEVC
 
INCEPT: Intra CU Depth Prediction for HEVC
INCEPT: Intra CU Depth Prediction for HEVCINCEPT: Intra CU Depth Prediction for HEVC
INCEPT: Intra CU Depth Prediction for HEVC
 
MiPSO: Multi-Period Per-Scene Optimization For HTTP Adaptive Streaming
MiPSO: Multi-Period Per-Scene Optimization For HTTP Adaptive StreamingMiPSO: Multi-Period Per-Scene Optimization For HTTP Adaptive Streaming
MiPSO: Multi-Period Per-Scene Optimization For HTTP Adaptive Streaming
 
Online Bitrate ladder prediction for Adaptive VVC Streaming
Online Bitrate ladder prediction for Adaptive VVC StreamingOnline Bitrate ladder prediction for Adaptive VVC Streaming
Online Bitrate ladder prediction for Adaptive VVC Streaming
 
CAPS_Presentation.pdf
CAPS_Presentation.pdfCAPS_Presentation.pdf
CAPS_Presentation.pdf
 
Video streaming using light-weight transcoding and in-network intelligence
Video streaming using light-weight transcoding and in-network intelligenceVideo streaming using light-weight transcoding and in-network intelligence
Video streaming using light-weight transcoding and in-network intelligence
 
Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...
Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...
Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...
 
Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...
Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...
Jiri ece-01-03 adaptive temporal averaging and frame prediction based surveil...
 
1 state of-the-art and trends in scalable video
1 state of-the-art and trends in scalable video1 state of-the-art and trends in scalable video
1 state of-the-art and trends in scalable video
 
Tutorial High Efficiency Video Coding Coding - Tools and Specification.pdf
Tutorial High Efficiency Video Coding Coding - Tools and Specification.pdfTutorial High Efficiency Video Coding Coding - Tools and Specification.pdf
Tutorial High Efficiency Video Coding Coding - Tools and Specification.pdf
 

More from Alpen-Adria-Universität

VEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instances
VEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instancesVEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instances
VEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instancesAlpen-Adria-Universität
 
GREEM: An Open-Source Energy Measurement Tool for Video Processing
GREEM: An Open-Source Energy Measurement Tool for Video ProcessingGREEM: An Open-Source Energy Measurement Tool for Video Processing
GREEM: An Open-Source Energy Measurement Tool for Video ProcessingAlpen-Adria-Universität
 
VEEP: Video Encoding Energy and CO₂ Emission Prediction
VEEP: Video Encoding Energy and CO₂ Emission PredictionVEEP: Video Encoding Energy and CO₂ Emission Prediction
VEEP: Video Encoding Energy and CO₂ Emission PredictionAlpen-Adria-Universität
 
Content-adaptive Video Coding for HTTP Adaptive Streaming
Content-adaptive Video Coding for HTTP Adaptive StreamingContent-adaptive Video Coding for HTTP Adaptive Streaming
Content-adaptive Video Coding for HTTP Adaptive StreamingAlpen-Adria-Universität
 
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...Alpen-Adria-Universität
 
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Vid...
Empowerment of Atypical Viewers  via Low-Effort Personalized Modeling  of Vid...Empowerment of Atypical Viewers  via Low-Effort Personalized Modeling  of Vid...
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Vid...Alpen-Adria-Universität
 
Optimizing Video Streaming for Sustainability and Quality: The Role of Prese...
Optimizing Video Streaming  for Sustainability and Quality: The Role of Prese...Optimizing Video Streaming  for Sustainability and Quality: The Role of Prese...
Optimizing Video Streaming for Sustainability and Quality: The Role of Prese...Alpen-Adria-Universität
 
Machine Learning Based Resource Utilization Prediction in the Computing Conti...
Machine Learning Based Resource Utilization Prediction in the Computing Conti...Machine Learning Based Resource Utilization Prediction in the Computing Conti...
Machine Learning Based Resource Utilization Prediction in the Computing Conti...Alpen-Adria-Universität
 
Evaluation of Quality of Experience of ABR Schemes in Gaming Stream
Evaluation of Quality of Experience of ABR Schemes in Gaming StreamEvaluation of Quality of Experience of ABR Schemes in Gaming Stream
Evaluation of Quality of Experience of ABR Schemes in Gaming StreamAlpen-Adria-Universität
 
Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...
Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...
Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...Alpen-Adria-Universität
 
Multi-access Edge Computing for Adaptive Video Streaming
Multi-access Edge Computing for Adaptive Video StreamingMulti-access Edge Computing for Adaptive Video Streaming
Multi-access Edge Computing for Adaptive Video StreamingAlpen-Adria-Universität
 
Policy-Driven Dynamic HTTP Adaptive Streaming Player Environment
Policy-Driven Dynamic HTTP Adaptive Streaming Player EnvironmentPolicy-Driven Dynamic HTTP Adaptive Streaming Player Environment
Policy-Driven Dynamic HTTP Adaptive Streaming Player EnvironmentAlpen-Adria-Universität
 
VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...
VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...
VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...Alpen-Adria-Universität
 
Energy Consumption in Video Streaming: Components, Measurements, and Strategies
Energy Consumption in Video Streaming: Components, Measurements, and StrategiesEnergy Consumption in Video Streaming: Components, Measurements, and Strategies
Energy Consumption in Video Streaming: Components, Measurements, and StrategiesAlpen-Adria-Universität
 
Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...
Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...
Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...Alpen-Adria-Universität
 
Video Coding Enhancements for HTTP Adaptive Streaming Using Machine Learning
Video Coding Enhancements for HTTP Adaptive Streaming Using Machine LearningVideo Coding Enhancements for HTTP Adaptive Streaming Using Machine Learning
Video Coding Enhancements for HTTP Adaptive Streaming Using Machine LearningAlpen-Adria-Universität
 
Optimizing QoE and Latency of Live Video Streaming Using Edge Computing a...
Optimizing  QoE and Latency of  Live Video Streaming Using  Edge Computing  a...Optimizing  QoE and Latency of  Live Video Streaming Using  Edge Computing  a...
Optimizing QoE and Latency of Live Video Streaming Using Edge Computing a...Alpen-Adria-Universität
 
SARENA: SFC-Enabled Architecture for Adaptive Video Streaming Applications
SARENA: SFC-Enabled Architecture for Adaptive Video Streaming ApplicationsSARENA: SFC-Enabled Architecture for Adaptive Video Streaming Applications
SARENA: SFC-Enabled Architecture for Adaptive Video Streaming ApplicationsAlpen-Adria-Universität
 
Immersive Video Delivery: From Omnidirectional Video to Holography
Immersive Video Delivery: From Omnidirectional Video to HolographyImmersive Video Delivery: From Omnidirectional Video to Holography
Immersive Video Delivery: From Omnidirectional Video to HolographyAlpen-Adria-Universität
 
LLL-CAdViSE: Live Low-Latency Cloud-based Adaptive Video Streaming Evaluation...
LLL-CAdViSE: Live Low-Latency Cloud-based Adaptive Video Streaming Evaluation...LLL-CAdViSE: Live Low-Latency Cloud-based Adaptive Video Streaming Evaluation...
LLL-CAdViSE: Live Low-Latency Cloud-based Adaptive Video Streaming Evaluation...Alpen-Adria-Universität
 

More from Alpen-Adria-Universität (20)

VEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instances
VEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instancesVEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instances
VEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instances
 
GREEM: An Open-Source Energy Measurement Tool for Video Processing
GREEM: An Open-Source Energy Measurement Tool for Video ProcessingGREEM: An Open-Source Energy Measurement Tool for Video Processing
GREEM: An Open-Source Energy Measurement Tool for Video Processing
 
VEEP: Video Encoding Energy and CO₂ Emission Prediction
VEEP: Video Encoding Energy and CO₂ Emission PredictionVEEP: Video Encoding Energy and CO₂ Emission Prediction
VEEP: Video Encoding Energy and CO₂ Emission Prediction
 
Content-adaptive Video Coding for HTTP Adaptive Streaming
Content-adaptive Video Coding for HTTP Adaptive StreamingContent-adaptive Video Coding for HTTP Adaptive Streaming
Content-adaptive Video Coding for HTTP Adaptive Streaming
 
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...
 
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Vid...
Empowerment of Atypical Viewers  via Low-Effort Personalized Modeling  of Vid...Empowerment of Atypical Viewers  via Low-Effort Personalized Modeling  of Vid...
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Vid...
 
Optimizing Video Streaming for Sustainability and Quality: The Role of Prese...
Optimizing Video Streaming  for Sustainability and Quality: The Role of Prese...Optimizing Video Streaming  for Sustainability and Quality: The Role of Prese...
Optimizing Video Streaming for Sustainability and Quality: The Role of Prese...
 
Machine Learning Based Resource Utilization Prediction in the Computing Conti...
Machine Learning Based Resource Utilization Prediction in the Computing Conti...Machine Learning Based Resource Utilization Prediction in the Computing Conti...
Machine Learning Based Resource Utilization Prediction in the Computing Conti...
 
Evaluation of Quality of Experience of ABR Schemes in Gaming Stream
Evaluation of Quality of Experience of ABR Schemes in Gaming StreamEvaluation of Quality of Experience of ABR Schemes in Gaming Stream
Evaluation of Quality of Experience of ABR Schemes in Gaming Stream
 
Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...
Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...
Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...
 
Multi-access Edge Computing for Adaptive Video Streaming
Multi-access Edge Computing for Adaptive Video StreamingMulti-access Edge Computing for Adaptive Video Streaming
Multi-access Edge Computing for Adaptive Video Streaming
 
Policy-Driven Dynamic HTTP Adaptive Streaming Player Environment
Policy-Driven Dynamic HTTP Adaptive Streaming Player EnvironmentPolicy-Driven Dynamic HTTP Adaptive Streaming Player Environment
Policy-Driven Dynamic HTTP Adaptive Streaming Player Environment
 
VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...
VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...
VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...
 
Energy Consumption in Video Streaming: Components, Measurements, and Strategies
Energy Consumption in Video Streaming: Components, Measurements, and StrategiesEnergy Consumption in Video Streaming: Components, Measurements, and Strategies
Energy Consumption in Video Streaming: Components, Measurements, and Strategies
 
Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...
Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...
Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...
 
Video Coding Enhancements for HTTP Adaptive Streaming Using Machine Learning
Video Coding Enhancements for HTTP Adaptive Streaming Using Machine LearningVideo Coding Enhancements for HTTP Adaptive Streaming Using Machine Learning
Video Coding Enhancements for HTTP Adaptive Streaming Using Machine Learning
 
Optimizing QoE and Latency of Live Video Streaming Using Edge Computing a...
Optimizing  QoE and Latency of  Live Video Streaming Using  Edge Computing  a...Optimizing  QoE and Latency of  Live Video Streaming Using  Edge Computing  a...
Optimizing QoE and Latency of Live Video Streaming Using Edge Computing a...
 
SARENA: SFC-Enabled Architecture for Adaptive Video Streaming Applications
SARENA: SFC-Enabled Architecture for Adaptive Video Streaming ApplicationsSARENA: SFC-Enabled Architecture for Adaptive Video Streaming Applications
SARENA: SFC-Enabled Architecture for Adaptive Video Streaming Applications
 
Immersive Video Delivery: From Omnidirectional Video to Holography
Immersive Video Delivery: From Omnidirectional Video to HolographyImmersive Video Delivery: From Omnidirectional Video to Holography
Immersive Video Delivery: From Omnidirectional Video to Holography
 
LLL-CAdViSE: Live Low-Latency Cloud-based Adaptive Video Streaming Evaluation...
LLL-CAdViSE: Live Low-Latency Cloud-based Adaptive Video Streaming Evaluation...LLL-CAdViSE: Live Low-Latency Cloud-based Adaptive Video Streaming Evaluation...
LLL-CAdViSE: Live Low-Latency Cloud-based Adaptive Video Streaming Evaluation...
 

Recently uploaded

The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 

Recently uploaded (20)

The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 

Perceptually-aware Per-title Encoding for Adaptive Video Streaming

  • 1. Perceptually-aware Per-title Encoding for Adaptive Video Streaming Vignesh V Menon1, Hadi Amirpour1, Mohammad Ghanbari1,2, and Christian Timmerer1 1 Christian Doppler Laboratory ATHENA, Alpen-Adria-Universität, Klagenfurt, Austria 2 School of Computer Science and Electronic Engineering, University of Essex, UK 19 July 2022 Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 1
  • 2. Outline 1 Introduction 2 PPTE 3 Results 4 Conclusion Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 2
  • 3. Introduction Per-title Encoding In HAS, each video is encoded at a fixed set of bitrate-resolution pairs, referred to as bitrate ladder. The “one-size-fits-all” can be optimized per title to increase the Quality of Experience (QoE) or decrease the bitrate of the representations as introduced for VoD services.1 0.2 0.5 1.2 4.5 16.8 Bitrate (in Mbps) 30 40 50 60 70 80 90 VMAF Dolls-540p Dolls-1080p Park-540p Park-1080p Figure: Rate-Distortion (RD) curves using VMAF as the quality metric of Dolls and Park sequences of MCML dataset encoded at 540p and 1080p resolutions. 1 J. De Cock et al. “Complexity-based consistent-quality encoding in the cloud”. In: 2016 IEEE International Conference on Image Processing (ICIP). 2016, pp. 1484–1488. doi: 10.1109/ICIP.2016.7532605. Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 3
  • 4. Introduction Motivation for Percetually-aware Per-title Encoding The selection of bitrate-resolution pairs (i.e., (rt, bt) where t ≥ 0) from the convex-hull is a challenging task. The increased number of selected bitrate-resolution pairs for the bitrate ladder may improve QoE, but leads to an increase in storage and bandwidth requirements.2 Furthermore, the selected bitrate-resolution pairs from the convex-hull for the bitrate ladder may not always be perceptually different in video quality. 0.2 0.5 1.2 4.5 16.8 Bitrate (in Mbps) 50 60 70 80 90 100 VMAF 360p 432p 540p 720p 1080p 1440p 2160p Figure: The HLS bitrate ladder of Characters sequence of MCML dataset. 2 Tianchi Huang et al. “Deep Reinforced Bitrate Ladders for Adaptive Video Streaming”. In: NOSSDAV ’21. Istanbul, Turkey: Association for Computing Machinery, 2021, 66–73. isbn: 9781450384353. doi: 10.1145/3458306.3458873. url: https://doi.org/10.1145/3458306.3458873. Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 4
  • 5. Introduction Target of the paper b0 b1 b2 b3 b4 b5 b6 Bitrate v0 v1=v0 + vJ(v0) v2=v1 + vJ(v1) v3=v2 + vJ(v2) v4=v3 + vJ(v3) v5=v4 + vJ(v4) v6=v5 + vJ(v5) VMAF vmax r1 r0 r2 r3 r4 r5 r6 Figure: The ideal bitrate ladder envisioned in this paper. The blue line denotes the corresponding rate-distortion curve, while the red dotted line denotes VMAF=vmax . When the VMAF value is greater than vmax , the video stream is deemed to be perceptually lossless. Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 5
  • 6. PPTE Perceptually-aware Per-Title Encoding (PPTE) Input Title Feature Extraction Bitrate Ladder Prediction Resolutions (R) Average JND (vJ) Bitrate Range {bmin, bmax } Maximum VMAF {vmax } Per-title Encoding Segments (E,h) pairs (r, b) pairs Figure: PPTE architecture. Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 6
  • 7. PPTE Phase 1: Feature Extraction PPTE Phase 1: Feature Extraction Compute texture energy per block A DCT-based energy function is used to determine the block-wise feature of each frame defined as: Hk = w−1 X i=0 w−1 X j=0 e|( ij wh )2−1| |DCT(i, j)| (1) where wxw is the size of the block, and DCT(i, j) is the (i, j)th DCT component when i + j > 0, and 0 otherwise. The energy values of blocks in a frame is averaged to determine the energy per frame.3 E = C−1 X k=0 Hp,k C · w2 (2) 3 Michael King et al. “A New Energy Function for Segmentation and Compression”. In: 2007 IEEE International Conference on Multimedia and Expo. 2007, pp. 1647–1650. doi: 10.1109/ICME.2007.4284983. Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 7
  • 8. PPTE Phase 1: Feature Extraction PPTE Phase 1: Feature Extraction hp: SAD of the block level energy values of frame p to that of the previous frame p − 1. hp = C−1 X k=0 | Hp,k, Hp−1,k | C · w2 (3) where C denotes the number of blocks in frame p. Latency Speed of feature extraction = 370fps for UHD video with 8 CPU threads and x86 SIMD optimization Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 8
  • 9. PPTE Phase 1: Feature Extraction PPTE Phase 2: Bitrate ladder Prediction Step 1: b0 = bmin vr,b0 = A0,r log q h E · b2 0 + A1,r v0 = max(vr,b0 ) r0 = arg maxr∈R(vr,b0 ) (r0, b0) is the first point of the bitrate ladder A0,r and A1,r Parameters trained using linear regression Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 9
  • 10. PPTE Phase 1: Feature Extraction PPTE Phase 2: Bitrate ladder Prediction Step 2: t = 1 for t ≥ 1 do vt = vt−1 + vJ(vt−1) br,vt = r q E h e vt −A1,r A0,r bt = min(br,vt ) rt = arg minr∈R(br,vt ) if bt bmax or vt vmax then End of the algorithm else (rt, bt) is the (t + 1)th point of the bitrate ladder. t = t + 1 Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 10
  • 11. Results Results 0.2 0.5 1.2 4.5 16.8 Bitrate (in Mbps) 30 40 50 60 70 80 VMAF HLS Ladder Proposed Scheme (a) IntoTree 0.2 0.5 1.2 4.5 16.8 Bitrate (in Mbps) 20 40 60 80 VMAF HLS Ladder Proposed Scheme (b) DaylightRoad2 0.2 0.5 1.2 4.5 16.8 Bitrate (in Mbps) 30 40 50 60 70 80 90 VMAF HLS Ladder Proposed Scheme (c) TreeShade Figure: Comparison of RD curves for encoding the IntoTree, DaylightRoad2, and TreeShade sequences using the HLS bitrate ladder and PPTE. Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 11
  • 12. Results Results 40 50 60 70 E 10 20 30 40 50 h 10 15 20 25 30 35 40 45 50 | S| Figure: ∆S results for various values of E and h. 40 50 60 70 E 10 20 30 40 50 h 5 10 15 20 25 30 35 40 |BDR V | Figure: Bjøntegaard delta rate w.r.t VMAF (BDRV ) results for various values of E and h. ∆S = 1 − P bopt P bref (4) where bref and bopt represent the sum of bitrates of all representations in the fixed bitrate ladder and the optimized bitrate ladder, respectively. Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 12
  • 13. Results Results Table: Results of PPTE against HLS bitrate ladder. Dataset Video SI TI E h BDRV BDRP ∆S Avg. JND JVET4 DaylightRoad2 40.51 16.21 54.78 20.35 -23.84% -10.88% -40.32% 6.99 JVET FoodMarket4 38.26 17.68 60.61 22.67 -19.22% -6.21% -28.13% 6.72 MCML5 Characters 50.43 29.85 42.66 21.06 -74.60% -71.70% -53.69% 3.82 MCML Crowd 33.76 10.13 56.74 15.89 -30.12% -15.63% -31.06% 7.85 MCML Lake 42.04 11.84 47.89 21.11 -38.00% -0.37% -44.83% 5.03 MCML Park 22.63 8.17 40.55 9.22 -10.47% -10.50% -15.35% 6.28 SJTU6 Fountains 43.37 11.42 63.30 26.83 -32.73% -2.18% -29.65% 5.80 SJTU RushHour 29.14 16.21 56.12 25.11 -20.50% -7.34% -42.73% 6.92 SJTU TrafficFlow 33.57 13.8 56.64 28.00 -53.34% -42.89% -44.83% 5.95 SJTU TreeShade 52.88 5.29 60.24 11.31 -48.38% -39.02% -31.06% 6.74 VGEG7 IntoTree 324.41 12.09 45.77 30.94 -26.23% -7.08% -40.32% 4.92 VGEG OldTownCross 29.66 11.62 50.31 27.64 -33.77% -25.07% -28.13% 5.86 VGEG ParkJoy 62.78 27.00 76.32 41.10 -15.68% -2.39% -18.16% 5.19 Average -27.02% -16.47% -30.69% 5.85 *These sequences were used for training. 4 Jill Boyce et al. JVET-J1010: JVET common test conditions and software reference configurations. July 2018. 5 Manri Cheon and Jong-Seok Lee. “Subjective and Objective Quality Assessment of Compressed 4K UHD Videos for Immersive Experience”. In: IEEE Transactions on Circuits and Systems for Video Technology 28.7 (2018), pp. 1467–1480. doi: 10.1109/TCSVT.2017.2683504. 6 L. Song et al. “The SJTU 4K Video Sequence Dataset”. In: Fifth International Workshop on Quality of Multimedia Experience (QoMEX2013) (July 2013). 7 European Broadcasting Union (EBU). “The SVT High Definition Multi Format Test Set”. In: Feb. 2006. url: https://tech.ebu.ch/docs/hdtv/svt-multiformat-conditions-v10.pdf. Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 13
  • 14. Conclusion Conclusion This paper proposed a perceptually-aware online per-title encoding (PPTE) scheme for live streaming applications. PPTE includes an algorithm that predicts the optimal resolution-bitrate pairs for every video segment based on JND in visual quality perception. Live streaming using PPTE requires 16.47% fewer bits to maintain the same PSNR and 27.02% fewer bits to maintain the same VMAF compared to the reference HLS bitrate ladder. The improvement in the compression efficiency is achieved with an average storage reduc- tion of 30.69% compared to the reference HLS bitrate ladder. Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 14
  • 15. Q A Q A Thank you for your attention! Vignesh V Menon (vignesh.menon@aau.at) Vignesh V Menon Perceptually-aware Per-title Encoding for Adaptive Video Streaming 15