SlideShare a Scribd company logo
1 of 21
Download to read offline
18-796


 Multimedia Communications:
Coding, Systems, and Networking


                      Prof. Tsuhan Chen
                     tsuhan@ece.cmu.edu




     MPEG-4




                                          1
MPEG-4
• Originally
   – A standard for very low bit rate coding of limited
     complexity audio-visual material


• In July 94, the scope was extended to
   – Functionalities not supported by other standards
      • Content-based interactivity
      • Universal access
      • High compression
   – Coding of general material for a wide bit rate range
   – Flexibility and extensibility
                                                    18-796/Spring 1999/Chen




        Content-Based Interactivity
• A scene is composed of audio-visual objects
  – Not just pixels or moving blocks
• Objects can be of different nature
  –   Text or images
  –   Rectangular or arbitrary shape
  –   2D or 3D objects
  –   Natural of synthetic
• Different coding schemes applied to different
  objects
• Compositor puts objects back in a scene
                                                    18-796/Spring 1999/Chen




                                                                              2
[MPEG-4 N1909,
“Overview of the
MPEG-4 Version
1 Standard”]


                                                             18-796/Spring 1999/Chen




                            Applications
 Human-machine interface                                     Content creation
 GUI, Virtual environment                                    Digital TV, HDTV
 Vision, Graphics                                            VCD, DVD


                   Computer                        TV/Film

                                 MPEG-4



                             Telecommunication

                     Wireless, Internet, ISDN, POTS, Cable
                                                             18-796/Spring 1999/Chen




                                                                                       3
Parts of MPEG-4
• Part 1: Systems
• Part 2: Video
• Part 3: Audio
• Part 4: Conformance testing
• Part 5: Reference software
• Part 6: Delivery multimedia integration framework
• Others
  –   Synthetic and Natural Hybrid Coding (SNHC)
  –   Requirements and applications
  –   Implementation Study
  –   Intellectual property rights (IPR)
                                                       18-796/Spring 1999/Chen




               MPEG-4 Activities
• Competitive phase
  – Proposals and evaluations


• Collaborative phase
  – Verification model and core experiments
      1. Define Verification Model (VM-n)
      2. Define core experiments for improving VM-n
      3. Perform core experiments. Compare with VM-n
      4. n++, go to Step 1




                                                       18-796/Spring 1999/Chen




                                                                                 4
MPEG-4 Time Table
• July 93    Started work
• Nov 95     Subjective tests and tool evaluation
• Jan 96     Define verification model (VM) and core
  experiments (CE)
• Mar/July/Sept/Nov 96, Feb/Apr/Jul 97
             Update VM and define a new set of CEs
• Oct 97     Committee Draft (CD)
• July 98    Final CD (FCD)
• Nov 98     Draft international standard (DIS)
• Jan 99     International standard (IS)


                                                18-796/Spring 1999/Chen




                  MPEG-4 Video
• General functionalities
   – Coding efficiency
      • For 5 kbit/s ~ 5 Mbit/s
      • From small images to TV resolution
      • Progressive/interlaced
   – Error resilience and robustness
   – Spatial and temporal scalabilities
• Content-based functionalities
   – Shape coding and sprites
   – Content-based scalabilities
   – Error resilience and robustness

                                                18-796/Spring 1999/Chen




                                                                          5
MPEG-4 Video (cont.)
• Tools
  –   Motion/texture coding derived from H.263
  –   Coding of video object plane (VOP): I, B, P
  –   Binary and gray-scale shape coding
  –   Scalabilities: temporal/spatial
  –   Static sprites
  –   Interlaced prediction
  –   12 bit video
  –   Computational graceful degradation (CGD)



                                                    18-796/Spring 1999/Chen




         Video Object Plane (VOP)
                                                      VOP1




                                                      VOP2




                                                      VOP3




                                                    18-796/Spring 1999/Chen




                                                                              6
Structure of VOP Encoder

                               VOP 0
                               Coding
                               VOP 1
               VOP             Coding
Input                                        MUX              Bitstream
             Definition        VOP 2
                               Coding




         – Note: Segmentation is outside the scope of MPEG-4
                                                       18-796/Spring 1999/Chen




             Structure of VOP Decoder

                                VOP 0
                               Decoding

                                VOP 1
                               Decoding
 Bitstream          DEMUX                      Composition         Output
                                VOP 2
                               Decoding




                                                       18-796/Spring 1999/Chen




                                                                                 7
Coding of VOP
• Motion compensation and DCT
   – Similar to H.263



• Polygon matching for motion estimation

                                         Transparent
                                         Pixels

  VOP
                                Pixels for polygon matching


                                                18-796/Spring 1999/Chen




                    VOP Padding
• Applied to the reference VOP prior to motion
  estimation/compensation




        Padded Previous Frame   Current Frame
                                                18-796/Spring 1999/Chen




                                                                          8
Shape-Adaptive DCT for Texture Coding




                    Shape Coding
• Binary shape
   – Context-based arithmetic encoding (CAE)
• Gray scale alpha plane
   – Motion compensated DCT
      • Similar to texture coding
                                               Gray-Level
                                                 Alpha


                                     Support                   Tex ture




                                      Binary                Tex ture Coder
                                    Shape Coder

                                                               18-796/Spring 1999/Chen




                                                                                         9
Binary Shape Coding
• Context-based arithmetic encoding (CAE)
  – A binary shape is treated as a binary image
  – Apply CAE to each binary alpha block (BAB)

• The “context”
                                   Current BAB    Motion Compensated BAB

          c9 c8 c7                c3 c2 c1                 c8
       c6 c5 c4 c3 c2             c0   ?             c7 c6 c5
       c1 c0   ?                                           c4

          Intra                                  Inter
                                                         18-796/Spring 1999/Chen




                        Feathering
• Feathering and translucency coding
   –   No effects
   –   Linear feathering
   –   Constant alpha
   –   Linear feathering and constant alpha
   –   Feathering filter
   –   Feathering filter and constant alpha




                                                         18-796/Spring 1999/Chen




                                                                                   10
VOP Encoder
                +                                           motion
                                                            texture    video
                    _     DCT        Q                                multiplex
                                                            coding
                                                  -1
                                               Q


                                              IDCT
                                                       +
                                              +
                         S     pred. 1
                         w
                         i                    Frame
                               pred. 2        Store
                         t
                         c
                         h     pred. 3


                         Motion
                        estimation


                                     Shape
                                     coding
                                                                      18-796/Spring 1999/Chen




                               VOP Decoder
                 video_object_layer_shape

   Coded Bit Stream
       (Shape)                             Shape
                                          Decoding                      Previous
                                                                      Reconstructed
                                                                          VOP

   Coded Bit Stream
      (Motion)                                              Motion
                              Motion                       Compen-
                             Decoding                       sation


                                                                          VOP
Demultiplexer                                                            Recon-
                                                                        struction
     Coded
    Bit Stream                   Texture Decoding
    (Texture)



                                                                      18-796/Spring 1999/Chen




                                                                                                11
Profiles of MPEG-4
   bitrate
               High bitrate
                   tools
                (interlace)
                                           Content-based
                                           functionalities
                                         (shape, scalability)

                    VLBV
                    core


                                                           functionalities

                                                                              18-796/Spring 1999/Chen




             Visual Object Profiles
• Not final yet
             Intra Coding Mode (I)                    4-12 bit pixel depth
             Inter Prediction Mode (P)
             AC/DC Prediction
             Reversible VLC
             Slice Resychronization
             Data Partitioning
             Binary Shape
             P-VOP based temporal scalability
                                        Simple                          12 bit


             H.263/MPEG-2 Quantization Table
             Bidirectional Prediction Mode (B)        Binary shape
             Advanced Prediction Mode                 Texture coding
             Static Sprites
             Interlaced tools                            Scalable Wavelet
             Temporal Scalability
                                                      Note that the binary
             • object-based
                                                      shape tool is the same
             • frame-based                            for all Object Profiles it
             Spatial Scalability (frame-based)
                                                      is used in.
                                               Core
                                                                              18-796/Spring 1999/Chen




                                                                                                        12
Core vs. Generic MPEG-4




                                                   18-796/Spring 1999/Chen




Synthetic & Natural Hybrid Coding (SNHC)
• Efficient representation and composition of
  synthetically and naturally generated audiovisual
  data
• To be integrated into MPEG-4 Video and Audio
   – Not a separate part of MPEG-4
• Applications
   – Virtual environment, conferencing, education,
     entertainment, media production, and real-time, interactive
     and broadcast media experiences



                                                   18-796/Spring 1999/Chen




                                                                             13
SNHC Target technologies
• Video
  –   Face animation
  –   2D/3D mesh compression
  –   Wavelet-based still texture coding
  –   View dependent scalability
• Audio
  – Text-to-speech synthesis, structured audio,
    environmental auralization, 3D audio, etc.




                                                                 18-796/Spring 1999/Chen




                    Face Animation
• Face animation
  – 2D/3D polygon mesh for face rendering
  – Facial Definition Parameter (FDP) Set
       • Controls shape, texture, gender, age, etc.
  – Facial Animation Parameter (FAP) Set
       • 68 parameters to produce animation and to create expressions




                                                                    Demo

                                                                 18-796/Spring 1999/Chen




                                                                                           14
Examples of FAP and FDP
c l a s s Fa c e De f i n i t i on Pa r a ms {
    p ub l i c :
         3Dme s h s h a p e ;
         3DPoi n t f e a t u r e Poi n t [ 46 ] ;
         I ma g e t e x t u r e ;
         i nt age ;
         i n t ge n de r ;
         ...;
}

c l a s s Fa c e An i ma t i on Pa r a ms {
    p ub l i c :
        i n t mov e _h _l _e y e b a l l ;
        i n t mov e _h _r _e y e b a l l ;
        i n t mov e _v _l _e y e b a l l ;
        i n t mov e _v _r _e y e b a l l ;
        i n t e n l a r g e _ l _ pu pi l ;
        i n t e n l a r g e _ r _ pu pi l ;
        ...;
}
                                                                                           18-796/Spring 1999/Chen




                              Example FAPs

 #   FAP name              FAP description                    units   Uni-    Pos      Grp     FDP      Quant
                                                                      or      motion           subgrp   step
                                                                      Bidir                    num      size
 1   viseme                Set of values determining the      na      na      na       1       na       1
                           mixture of two visemes for this
                           frame (e.g. pbm, fv, th)
 2   expression            A set of values determining the    na      na      na       1       na       1
                           mixture of two facial expression
 3   open_jaw              Vertical jaw displacement (does    MNS     U       down     2       1        8
                           not affect mouth opening)
 4   lower_t_midlip        Vertical top middle inner lip      MNS     B       down     2       2        5
                           displacement
 5   raise_b_midlip        Vertical bottom middle inner       MNS     B       up       2       3        5
                           lip displacement
 6   stretch_l_cornerlip   Horizontal displacement of left    MW      B       left     2       4        5
                           inner lip corner




                                                                                           18-796/Spring 1999/Chen




                                                                                                                     15
Facial Definition Parameters
                                                                                                                   11.5
                                                      11.5

                                                      11.4                                                         11.4
                                                                                                                                        11.1
                               11.2                   11.1           11.3
                                                                                                                       11.2
                                          4.4 4.2 4.1 4.3                                                                                    4.4
                                                                                                                                4.6
               10.2            4.6                                     4.5          10.1                       10.2                               9.6

             10.4           10.10                                      10.9                                                     10.10
                                                                                    10.3                      10.4
                                      5.2                        5.1                                                                9.14            9.12
                            10.8                                       10.7                                                     10.8
                                                                                                                                                        9.3
                    10.6                                                      10.5                  Y                  10.6           9.4
                                                                                                                                              9.2
                        2.14                                                2.13           2.14                X

                        Y                                                                    7.1                   Z             2.10
                                                        2.10
                               X            2.12 2.1 2.11                                                                             2.1
                                                                                                                          2.12
                    Z

                               Right Eye                                     Left Eye
                                                                                                                                Nose
                                         3.14                                      3.13
                                                      3.6                                    3.5
                                                                              3.1                                         9.6               9.7
                                    3.2
                    3.12                               3.8       3.11                             3.7
                                   3.4                                        3.3
                                         3.10                                      3.9                                           9.12
                                                                                                                9.14                              9.13
                                                             Teeth
                                                                                                                                 9.3
                                                               9.8                                                        9.2               9.1

                                                  9.10                      9.11                                9.4              9.15             9.5

                                                                9.9


                                                  Mouth                                                              Tongue
                                   8.6          8.9        8.10 8.5
                                                      8.1
                                            2.7          .2.2 2.6
                8.4         2.5                                              2.4 8.3
                                                                                                  6.4          6.2
                                            2.9       2.3      2.8
                                     8.8                             8.7                                                        6.3
                                                       8.2
                                                                                                        6.1
                    Feature points affected by FAPs
                    Other feature points                                                                                                                      18-796/Spring 1999/Chen




Facial Animation Parameter Units (FAPUs)
• Parameters normalized to FAPUs


                                             IRISD0

                                                                              ES0




                                                                                     ENS0




                                                                                     MNS0




                                                                                             MW0




                                                                                                                                                              18-796/Spring 1999/Chen




                                                                                                                                                                                        16
Visemes for Lip Synch
         viseme_select          phonemes             example
         0                      none                 na
         1                      p, b, m              put, bed, m ill
         2                      f, v                 far, voice
         3                      T,D                  th ink, that
         4                      t, d                 tip, doll
         5                      k, g                 call, gas
         6                      tS, dZ, S            ch air, join, sh e
         7                      s, z                 sir, zeal
         8                      n, l                 lot, n ot
         9                      r                    red
         10                     A:                   car
         11                     e                    bed
         12                     I                    tip
         13                     Q                    top
         14                     U                    book

                                                                               18-796/Spring 1999/Chen




              2D Mesh Compression




• Decoding process
                          dxn
                          dyn
                                           Mesh
                                          Geometry               xn
                                          Decoding               yn
                                                                                Decoded
                                                                 tm             Mesh
      Coded    Variable
      Data      Length
               Decoding
                                           Mesh                        Mesh
                                           Motion                      Data
                                          Decoding                    Memory
                          exn
                          eyn
                                                                               18-796/Spring 1999/Chen




                                                                                                         17
3D Mesh Compression
• Progressive representation
   – Streaming of 3D objects
   – Both spatially and temporally




• Indexing and retrieval of 3D meshes
   – Multiresolution databases
   – Related to MPEG-7                                 Demo




  Wavelets for Scalable Texture Coding
         Lowpass                         Lowpass
          H1 (z)    2   CODEC 1      2    F1 (z)

          H2 (z)    2   CODEC 2      2    F2 (z)
         Highpass                        Highpass




• Decompose the signal in the frequency domain
• Critical downsampling maintains the number of samples in
  the subbands
• For 2D case, separable filters are often used. Decompose
  into four bands: LL, LH, HL, HH
• Decompose the LL band iteratively
                                                    18-796/Spring 1999/Chen




                                                                              18
Wavelets for Scalable Texture Coding (cont.)

                    Lowest Band                  Inverse
                                  Prediction
                                               Quantization
 Coded Data                                                                  Output
              Arithmetic                                           Inverse
              Decoding                                              DWT
                                  ZeroTree       Inverse
                    Other Bands   Decoding     Quantization




                                                     “Zero Tree”
                                                              18-796/Spring 1999/Chen




         View Dependent Scalability




                                                                                        19
MPEG-4 Video Test Sequences
• Class A: Low spatial detail and low amount of
  movement
• Class B: Medium spatial detail and low amount of
  movement or vice versa
• Class C: High spatial detail and medium amount
  of movement or vice versa
• Class D: Stereoscopic
• Class E: Hybrid natural and synthetic
• Class F: 12-bit video sequences

                                                                                     18-796/Spring 1999/Chen




                      Test Sequences
          Sequence Name      Class     Input Format     YUV     Alpha    Segment
                                                        files    files    Mask
                                                                         Available
         Mother & daughter    A      ITU-R 601 (60Hz)     1       0         0
               Akiyo          A      ITU-R 601 (60Hz)   2+1       1         2
           Hall Monitor       A      ITU-R 601 (60Hz)     1       0         3
          Container Ship      A      ITU-R 601 (60Hz)     1       0         6
                Sean          A      ITU-R 601 (60Hz)     1       0         3
             Foreman          B      ITU-R 601 (50Hz)     1       0         0
                News          B      ITU-R 601 (60Hz)   4+1       3         4
           Silent Voice       B      ITU-R 601 (50Hz)     1       0         0
            Coastguard        B      ITU-R 601 (60Hz)     1       0         4
                Bus           C      ITU-R 601 (60Hz)     1       0         0
           Table Tennis       C      ITU-R 601 (50Hz)     1       0         0
               Stefan         C      ITU-R 601 (60Hz)     1       0         2
         Mobile & Calendar    C      ITU-R 601 (60Hz)     1       0         0
            Basketball        C      ITU-R 601 (50Hz)     1       0         0
             Football         C      ITU-R 602 (60Hz)     1       0         0
           Cheerleaders       C      ITU-R 601 (60Hz)     1       0         0
               Tunnel         D      ITU-R 601 (50Hz)    2x1      0         0
             Fun Fair         D      ITU-R 601 (50Hz)    2x1      0         0
             Children         E      ITU-R 601 (60Hz)   3+1       2         3
               Bream          E      ITU-R 601 (60Hz)   3+1       2         3
             Weather          E      ITU-R 601 (60Hz)   2+1       1         2
            Destruction       E      ITU-R 601 (60Hz)   11+1     10         0
                 Ti1          F       176x144 (15Hz)      1       0         0
             Man1sw           F       272x136 (15Hz)      1       0         0
             Hum2sw           F       272x136 (15Hz)      1       0         0
             Veh2sw           F       272x136 (15Hz)      1       0         0
              labview         F       176x144 (60Hz)      1       0         0
              hallway         F       176x144 (60Hz)      1       0         0




                                                                                                               20
MPEG-4 Version 2
• One year following Version 1
• Adds new profiles with new functionalities
• Video
   –   Scalable transmission of arbitrary-shaped objects
   –   Tools for additional efficiency improvements
   –   Tools for improved error robustness
   –   Coding of multiple views
   –   Body animation
   –   Coding of 3D meshes and scalabilities


                                                     18-796/Spring 1999/Chen




                      References
   – MPEG Home Page http://drogo.cselt.it/mpeg/
   – MPEG Video Home Page http://wwwam.hhi.de/mpeg-
     video/
   – MPEG4-SNHC http://www.es.com/mpeg4-snhc/
   – T. Sikora, “MPEG digital video coding standards,”
     IEEE Signal Processing Magazine, Sept. 1997, pp. 82-
     100
   – T. Sikora, “The MPEG-4 Video Standard Verification
     Model,” IEEE Trans. CSVT, Vol.7, No.1, Feb.1997




                                                     18-796/Spring 1999/Chen




                                                                               21

More Related Content

What's hot

video_compression_2004
video_compression_2004video_compression_2004
video_compression_2004aniruddh Tyagi
 
MPEG Compression Standards
MPEG Compression StandardsMPEG Compression Standards
MPEG Compression StandardsAjay
 
Video Compression Basics
Video Compression BasicsVideo Compression Basics
Video Compression BasicsSanjiv Malik
 
HEVC VIDEO CODEC By Vinayagam Mariappan
HEVC VIDEO CODEC By Vinayagam MariappanHEVC VIDEO CODEC By Vinayagam Mariappan
HEVC VIDEO CODEC By Vinayagam MariappanVinayagam Mariappan
 
video compression
video compressionvideo compression
video compressionaniruddh Tyagi
 
Multimedia presentation video compression
Multimedia presentation video compressionMultimedia presentation video compression
Multimedia presentation video compressionLaLit DuBey
 
Introduction to Video Compression Techniques - Anurag Jain
Introduction to Video Compression Techniques - Anurag JainIntroduction to Video Compression Techniques - Anurag Jain
Introduction to Video Compression Techniques - Anurag JainVideoguy
 
Getting the most out of H.264
Getting the most out of H.264Getting the most out of H.264
Getting the most out of H.264Iain Richardson
 
Video Compression Basics - MPEG2
Video Compression Basics - MPEG2Video Compression Basics - MPEG2
Video Compression Basics - MPEG2VijayKumarArya
 
simple video compression
simple video compression simple video compression
simple video compression LaLit DuBey
 
MPEG 4
MPEG 4MPEG 4
MPEG 4tvutech
 
Video Compression Basics by sahil jain
Video Compression Basics by sahil jainVideo Compression Basics by sahil jain
Video Compression Basics by sahil jainSahil Jain
 
Video Compression Techniques
Video Compression TechniquesVideo Compression Techniques
Video Compression Techniquescnssources
 
MPEG-1 Part 2 Video Encoding
MPEG-1 Part 2 Video EncodingMPEG-1 Part 2 Video Encoding
MPEG-1 Part 2 Video EncodingChristian Kehl
 
Compression presentation 415 (1)
Compression presentation 415 (1)Compression presentation 415 (1)
Compression presentation 415 (1)Godo Dodo
 

What's hot (20)

video_compression_2004
video_compression_2004video_compression_2004
video_compression_2004
 
Hw2
Hw2Hw2
Hw2
 
MPEG Compression Standards
MPEG Compression StandardsMPEG Compression Standards
MPEG Compression Standards
 
Video Compression Basics
Video Compression BasicsVideo Compression Basics
Video Compression Basics
 
HEVC VIDEO CODEC By Vinayagam Mariappan
HEVC VIDEO CODEC By Vinayagam MariappanHEVC VIDEO CODEC By Vinayagam Mariappan
HEVC VIDEO CODEC By Vinayagam Mariappan
 
video compression
video compressionvideo compression
video compression
 
Multimedia presentation video compression
Multimedia presentation video compressionMultimedia presentation video compression
Multimedia presentation video compression
 
85 videocompress
85 videocompress85 videocompress
85 videocompress
 
Introduction to Video Compression Techniques - Anurag Jain
Introduction to Video Compression Techniques - Anurag JainIntroduction to Video Compression Techniques - Anurag Jain
Introduction to Video Compression Techniques - Anurag Jain
 
Getting the most out of H.264
Getting the most out of H.264Getting the most out of H.264
Getting the most out of H.264
 
Video Compression Basics - MPEG2
Video Compression Basics - MPEG2Video Compression Basics - MPEG2
Video Compression Basics - MPEG2
 
simple video compression
simple video compression simple video compression
simple video compression
 
ISDD Video Compression
ISDD Video CompressionISDD Video Compression
ISDD Video Compression
 
MPEG 4
MPEG 4MPEG 4
MPEG 4
 
Video Compression Basics by sahil jain
Video Compression Basics by sahil jainVideo Compression Basics by sahil jain
Video Compression Basics by sahil jain
 
Video Compression Techniques
Video Compression TechniquesVideo Compression Techniques
Video Compression Techniques
 
MPEG-1 Part 2 Video Encoding
MPEG-1 Part 2 Video EncodingMPEG-1 Part 2 Video Encoding
MPEG-1 Part 2 Video Encoding
 
Compression presentation 415 (1)
Compression presentation 415 (1)Compression presentation 415 (1)
Compression presentation 415 (1)
 
Mpeg 2
Mpeg 2Mpeg 2
Mpeg 2
 
Jpeg and mpeg ppt
Jpeg and mpeg pptJpeg and mpeg ppt
Jpeg and mpeg ppt
 

Similar to mpeg4

Tutorial MPEG 3D Graphics
Tutorial MPEG 3D GraphicsTutorial MPEG 3D Graphics
Tutorial MPEG 3D GraphicsMarius Preda PhD
 
MPEG Augmented Reality Tutorial
MPEG Augmented Reality TutorialMPEG Augmented Reality Tutorial
MPEG Augmented Reality TutorialMarius Preda PhD
 
MPEG Technologies and roadmap for Augmented Reality
MPEG Technologies and roadmap for Augmented RealityMPEG Technologies and roadmap for Augmented Reality
MPEG Technologies and roadmap for Augmented RealityMarius Preda PhD
 
MPEG-4 Developments
MPEG-4 DevelopmentsMPEG-4 Developments
MPEG-4 DevelopmentsMartin Uren
 
Report
ReportReport
ReportVideoguy
 
Generic Video Adaptation Framework Towards Content – and Context Awareness in...
Generic Video Adaptation Framework Towards Content – and Context Awareness in...Generic Video Adaptation Framework Towards Content – and Context Awareness in...
Generic Video Adaptation Framework Towards Content – and Context Awareness in...Alpen-Adria-Universität
 
Mmclass6
Mmclass6Mmclass6
Mmclass6Hassan Dar
 
/conferences/spr2004/presentations/eubanks/eubanks_mpeg4.ppt
/conferences/spr2004/presentations/eubanks/eubanks_mpeg4.ppt/conferences/spr2004/presentations/eubanks/eubanks_mpeg4.ppt
/conferences/spr2004/presentations/eubanks/eubanks_mpeg4.pptVideoguy
 
Emerging H.264 Standard: Overview and TMS320DM642- Based ...
Emerging H.264 Standard: Overview and TMS320DM642- Based ...Emerging H.264 Standard: Overview and TMS320DM642- Based ...
Emerging H.264 Standard: Overview and TMS320DM642- Based ...Videoguy
 
Bigger Hard Drive Jamie Lean
Bigger Hard Drive Jamie LeanBigger Hard Drive Jamie Lean
Bigger Hard Drive Jamie LeanFuture Perfect 2012
 
Mpeg4copy 120428133000-phpapp01
Mpeg4copy 120428133000-phpapp01Mpeg4copy 120428133000-phpapp01
Mpeg4copy 120428133000-phpapp01netzwelt12345
 
How to manage the wide diversity of file formats for ingest, archiving and de...
How to manage the wide diversity of file formats for ingest, archiving and de...How to manage the wide diversity of file formats for ingest, archiving and de...
How to manage the wide diversity of file formats for ingest, archiving and de...FIAT/IFTA
 
Video Coding Standard
Video Coding StandardVideo Coding Standard
Video Coding StandardVideoguy
 

Similar to mpeg4 (20)

Tutorial MPEG 3D Graphics
Tutorial MPEG 3D GraphicsTutorial MPEG 3D Graphics
Tutorial MPEG 3D Graphics
 
MPEG Augmented Reality Tutorial
MPEG Augmented Reality TutorialMPEG Augmented Reality Tutorial
MPEG Augmented Reality Tutorial
 
MPEG Technologies and roadmap for Augmented Reality
MPEG Technologies and roadmap for Augmented RealityMPEG Technologies and roadmap for Augmented Reality
MPEG Technologies and roadmap for Augmented Reality
 
MPEG-4 Developments
MPEG-4 DevelopmentsMPEG-4 Developments
MPEG-4 Developments
 
Report
ReportReport
Report
 
Generic Video Adaptation Framework Towards Content – and Context Awareness in...
Generic Video Adaptation Framework Towards Content – and Context Awareness in...Generic Video Adaptation Framework Towards Content – and Context Awareness in...
Generic Video Adaptation Framework Towards Content – and Context Awareness in...
 
Mmclass6
Mmclass6Mmclass6
Mmclass6
 
/conferences/spr2004/presentations/eubanks/eubanks_mpeg4.ppt
/conferences/spr2004/presentations/eubanks/eubanks_mpeg4.ppt/conferences/spr2004/presentations/eubanks/eubanks_mpeg4.ppt
/conferences/spr2004/presentations/eubanks/eubanks_mpeg4.ppt
 
The MPEG Extensible Middleware API
The MPEG Extensible Middleware APIThe MPEG Extensible Middleware API
The MPEG Extensible Middleware API
 
Emerging H.264 Standard: Overview and TMS320DM642- Based ...
Emerging H.264 Standard: Overview and TMS320DM642- Based ...Emerging H.264 Standard: Overview and TMS320DM642- Based ...
Emerging H.264 Standard: Overview and TMS320DM642- Based ...
 
3DgraphicsAndAR
3DgraphicsAndAR3DgraphicsAndAR
3DgraphicsAndAR
 
Tcd 2100 ds-120220
Tcd 2100 ds-120220Tcd 2100 ds-120220
Tcd 2100 ds-120220
 
Tcd 2100 ds-120220
Tcd 2100 ds-120220Tcd 2100 ds-120220
Tcd 2100 ds-120220
 
m4-out-20027
m4-out-20027m4-out-20027
m4-out-20027
 
m4-out-20027
m4-out-20027m4-out-20027
m4-out-20027
 
m4-out-20027
m4-out-20027m4-out-20027
m4-out-20027
 
Bigger Hard Drive Jamie Lean
Bigger Hard Drive Jamie LeanBigger Hard Drive Jamie Lean
Bigger Hard Drive Jamie Lean
 
Mpeg4copy 120428133000-phpapp01
Mpeg4copy 120428133000-phpapp01Mpeg4copy 120428133000-phpapp01
Mpeg4copy 120428133000-phpapp01
 
How to manage the wide diversity of file formats for ingest, archiving and de...
How to manage the wide diversity of file formats for ingest, archiving and de...How to manage the wide diversity of file formats for ingest, archiving and de...
How to manage the wide diversity of file formats for ingest, archiving and de...
 
Video Coding Standard
Video Coding StandardVideo Coding Standard
Video Coding Standard
 

More from aniruddh Tyagi

whitepaper_mpeg-if_understanding_mpeg4
whitepaper_mpeg-if_understanding_mpeg4whitepaper_mpeg-if_understanding_mpeg4
whitepaper_mpeg-if_understanding_mpeg4aniruddh Tyagi
 
BUC BLOCK UP CONVERTER
BUC BLOCK UP CONVERTERBUC BLOCK UP CONVERTER
BUC BLOCK UP CONVERTERaniruddh Tyagi
 
digital_set_top_box2
digital_set_top_box2digital_set_top_box2
digital_set_top_box2aniruddh Tyagi
 
Discrete cosine transform
Discrete cosine transformDiscrete cosine transform
Discrete cosine transformaniruddh Tyagi
 
EBU_DVB_S2 READY TO LIFT OFF
EBU_DVB_S2 READY TO LIFT OFFEBU_DVB_S2 READY TO LIFT OFF
EBU_DVB_S2 READY TO LIFT OFFaniruddh Tyagi
 
ADVANCED DVB-C,DVB-S STB DEMOD
ADVANCED DVB-C,DVB-S STB DEMODADVANCED DVB-C,DVB-S STB DEMOD
ADVANCED DVB-C,DVB-S STB DEMODaniruddh Tyagi
 
haffman coding DCT transform
haffman coding DCT transformhaffman coding DCT transform
haffman coding DCT transformaniruddh Tyagi
 
quantization_PCM
quantization_PCMquantization_PCM
quantization_PCManiruddh Tyagi
 
ECMG & EMMG protocol
ECMG & EMMG protocolECMG & EMMG protocol
ECMG & EMMG protocolaniruddh Tyagi
 
fundamentals_satellite_communication_part_1
fundamentals_satellite_communication_part_1fundamentals_satellite_communication_part_1
fundamentals_satellite_communication_part_1aniruddh Tyagi
 
art_sklar7_reed-solomon
art_sklar7_reed-solomonart_sklar7_reed-solomon
art_sklar7_reed-solomonaniruddh Tyagi
 

More from aniruddh Tyagi (20)

whitepaper_mpeg-if_understanding_mpeg4
whitepaper_mpeg-if_understanding_mpeg4whitepaper_mpeg-if_understanding_mpeg4
whitepaper_mpeg-if_understanding_mpeg4
 
BUC BLOCK UP CONVERTER
BUC BLOCK UP CONVERTERBUC BLOCK UP CONVERTER
BUC BLOCK UP CONVERTER
 
digital_set_top_box2
digital_set_top_box2digital_set_top_box2
digital_set_top_box2
 
Discrete cosine transform
Discrete cosine transformDiscrete cosine transform
Discrete cosine transform
 
DCT
DCTDCT
DCT
 
EBU_DVB_S2 READY TO LIFT OFF
EBU_DVB_S2 READY TO LIFT OFFEBU_DVB_S2 READY TO LIFT OFF
EBU_DVB_S2 READY TO LIFT OFF
 
ADVANCED DVB-C,DVB-S STB DEMOD
ADVANCED DVB-C,DVB-S STB DEMODADVANCED DVB-C,DVB-S STB DEMOD
ADVANCED DVB-C,DVB-S STB DEMOD
 
DVB_Arch
DVB_ArchDVB_Arch
DVB_Arch
 
haffman coding DCT transform
haffman coding DCT transformhaffman coding DCT transform
haffman coding DCT transform
 
Classification
ClassificationClassification
Classification
 
tyagi 's doc
tyagi 's doctyagi 's doc
tyagi 's doc
 
quantization_PCM
quantization_PCMquantization_PCM
quantization_PCM
 
ECMG & EMMG protocol
ECMG & EMMG protocolECMG & EMMG protocol
ECMG & EMMG protocol
 
7015567A
7015567A7015567A
7015567A
 
Basic of BISS
Basic of BISSBasic of BISS
Basic of BISS
 
euler theorm
euler theormeuler theorm
euler theorm
 
fundamentals_satellite_communication_part_1
fundamentals_satellite_communication_part_1fundamentals_satellite_communication_part_1
fundamentals_satellite_communication_part_1
 
quantization
quantizationquantization
quantization
 
art_sklar7_reed-solomon
art_sklar7_reed-solomonart_sklar7_reed-solomon
art_sklar7_reed-solomon
 
DVBSimulcrypt2
DVBSimulcrypt2DVBSimulcrypt2
DVBSimulcrypt2
 

Recently uploaded

Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 

mpeg4

  • 1. 18-796 Multimedia Communications: Coding, Systems, and Networking Prof. Tsuhan Chen tsuhan@ece.cmu.edu MPEG-4 1
  • 2. MPEG-4 • Originally – A standard for very low bit rate coding of limited complexity audio-visual material • In July 94, the scope was extended to – Functionalities not supported by other standards • Content-based interactivity • Universal access • High compression – Coding of general material for a wide bit rate range – Flexibility and extensibility 18-796/Spring 1999/Chen Content-Based Interactivity • A scene is composed of audio-visual objects – Not just pixels or moving blocks • Objects can be of different nature – Text or images – Rectangular or arbitrary shape – 2D or 3D objects – Natural of synthetic • Different coding schemes applied to different objects • Compositor puts objects back in a scene 18-796/Spring 1999/Chen 2
  • 3. [MPEG-4 N1909, “Overview of the MPEG-4 Version 1 Standard”] 18-796/Spring 1999/Chen Applications Human-machine interface Content creation GUI, Virtual environment Digital TV, HDTV Vision, Graphics VCD, DVD Computer TV/Film MPEG-4 Telecommunication Wireless, Internet, ISDN, POTS, Cable 18-796/Spring 1999/Chen 3
  • 4. Parts of MPEG-4 • Part 1: Systems • Part 2: Video • Part 3: Audio • Part 4: Conformance testing • Part 5: Reference software • Part 6: Delivery multimedia integration framework • Others – Synthetic and Natural Hybrid Coding (SNHC) – Requirements and applications – Implementation Study – Intellectual property rights (IPR) 18-796/Spring 1999/Chen MPEG-4 Activities • Competitive phase – Proposals and evaluations • Collaborative phase – Verification model and core experiments 1. Define Verification Model (VM-n) 2. Define core experiments for improving VM-n 3. Perform core experiments. Compare with VM-n 4. n++, go to Step 1 18-796/Spring 1999/Chen 4
  • 5. MPEG-4 Time Table • July 93 Started work • Nov 95 Subjective tests and tool evaluation • Jan 96 Define verification model (VM) and core experiments (CE) • Mar/July/Sept/Nov 96, Feb/Apr/Jul 97 Update VM and define a new set of CEs • Oct 97 Committee Draft (CD) • July 98 Final CD (FCD) • Nov 98 Draft international standard (DIS) • Jan 99 International standard (IS) 18-796/Spring 1999/Chen MPEG-4 Video • General functionalities – Coding efficiency • For 5 kbit/s ~ 5 Mbit/s • From small images to TV resolution • Progressive/interlaced – Error resilience and robustness – Spatial and temporal scalabilities • Content-based functionalities – Shape coding and sprites – Content-based scalabilities – Error resilience and robustness 18-796/Spring 1999/Chen 5
  • 6. MPEG-4 Video (cont.) • Tools – Motion/texture coding derived from H.263 – Coding of video object plane (VOP): I, B, P – Binary and gray-scale shape coding – Scalabilities: temporal/spatial – Static sprites – Interlaced prediction – 12 bit video – Computational graceful degradation (CGD) 18-796/Spring 1999/Chen Video Object Plane (VOP) VOP1 VOP2 VOP3 18-796/Spring 1999/Chen 6
  • 7. Structure of VOP Encoder VOP 0 Coding VOP 1 VOP Coding Input MUX Bitstream Definition VOP 2 Coding – Note: Segmentation is outside the scope of MPEG-4 18-796/Spring 1999/Chen Structure of VOP Decoder VOP 0 Decoding VOP 1 Decoding Bitstream DEMUX Composition Output VOP 2 Decoding 18-796/Spring 1999/Chen 7
  • 8. Coding of VOP • Motion compensation and DCT – Similar to H.263 • Polygon matching for motion estimation Transparent Pixels VOP Pixels for polygon matching 18-796/Spring 1999/Chen VOP Padding • Applied to the reference VOP prior to motion estimation/compensation Padded Previous Frame Current Frame 18-796/Spring 1999/Chen 8
  • 9. Shape-Adaptive DCT for Texture Coding Shape Coding • Binary shape – Context-based arithmetic encoding (CAE) • Gray scale alpha plane – Motion compensated DCT • Similar to texture coding Gray-Level Alpha Support Tex ture Binary Tex ture Coder Shape Coder 18-796/Spring 1999/Chen 9
  • 10. Binary Shape Coding • Context-based arithmetic encoding (CAE) – A binary shape is treated as a binary image – Apply CAE to each binary alpha block (BAB) • The “context” Current BAB Motion Compensated BAB c9 c8 c7 c3 c2 c1 c8 c6 c5 c4 c3 c2 c0 ? c7 c6 c5 c1 c0 ? c4 Intra Inter 18-796/Spring 1999/Chen Feathering • Feathering and translucency coding – No effects – Linear feathering – Constant alpha – Linear feathering and constant alpha – Feathering filter – Feathering filter and constant alpha 18-796/Spring 1999/Chen 10
  • 11. VOP Encoder + motion texture video _ DCT Q multiplex coding -1 Q IDCT + + S pred. 1 w i Frame pred. 2 Store t c h pred. 3 Motion estimation Shape coding 18-796/Spring 1999/Chen VOP Decoder video_object_layer_shape Coded Bit Stream (Shape) Shape Decoding Previous Reconstructed VOP Coded Bit Stream (Motion) Motion Motion Compen- Decoding sation VOP Demultiplexer Recon- struction Coded Bit Stream Texture Decoding (Texture) 18-796/Spring 1999/Chen 11
  • 12. Profiles of MPEG-4 bitrate High bitrate tools (interlace) Content-based functionalities (shape, scalability) VLBV core functionalities 18-796/Spring 1999/Chen Visual Object Profiles • Not final yet Intra Coding Mode (I) 4-12 bit pixel depth Inter Prediction Mode (P) AC/DC Prediction Reversible VLC Slice Resychronization Data Partitioning Binary Shape P-VOP based temporal scalability Simple 12 bit H.263/MPEG-2 Quantization Table Bidirectional Prediction Mode (B) Binary shape Advanced Prediction Mode Texture coding Static Sprites Interlaced tools Scalable Wavelet Temporal Scalability Note that the binary • object-based shape tool is the same • frame-based for all Object Profiles it Spatial Scalability (frame-based) is used in. Core 18-796/Spring 1999/Chen 12
  • 13. Core vs. Generic MPEG-4 18-796/Spring 1999/Chen Synthetic & Natural Hybrid Coding (SNHC) • Efficient representation and composition of synthetically and naturally generated audiovisual data • To be integrated into MPEG-4 Video and Audio – Not a separate part of MPEG-4 • Applications – Virtual environment, conferencing, education, entertainment, media production, and real-time, interactive and broadcast media experiences 18-796/Spring 1999/Chen 13
  • 14. SNHC Target technologies • Video – Face animation – 2D/3D mesh compression – Wavelet-based still texture coding – View dependent scalability • Audio – Text-to-speech synthesis, structured audio, environmental auralization, 3D audio, etc. 18-796/Spring 1999/Chen Face Animation • Face animation – 2D/3D polygon mesh for face rendering – Facial Definition Parameter (FDP) Set • Controls shape, texture, gender, age, etc. – Facial Animation Parameter (FAP) Set • 68 parameters to produce animation and to create expressions Demo 18-796/Spring 1999/Chen 14
  • 15. Examples of FAP and FDP c l a s s Fa c e De f i n i t i on Pa r a ms { p ub l i c : 3Dme s h s h a p e ; 3DPoi n t f e a t u r e Poi n t [ 46 ] ; I ma g e t e x t u r e ; i nt age ; i n t ge n de r ; ...; } c l a s s Fa c e An i ma t i on Pa r a ms { p ub l i c : i n t mov e _h _l _e y e b a l l ; i n t mov e _h _r _e y e b a l l ; i n t mov e _v _l _e y e b a l l ; i n t mov e _v _r _e y e b a l l ; i n t e n l a r g e _ l _ pu pi l ; i n t e n l a r g e _ r _ pu pi l ; ...; } 18-796/Spring 1999/Chen Example FAPs # FAP name FAP description units Uni- Pos Grp FDP Quant or motion subgrp step Bidir num size 1 viseme Set of values determining the na na na 1 na 1 mixture of two visemes for this frame (e.g. pbm, fv, th) 2 expression A set of values determining the na na na 1 na 1 mixture of two facial expression 3 open_jaw Vertical jaw displacement (does MNS U down 2 1 8 not affect mouth opening) 4 lower_t_midlip Vertical top middle inner lip MNS B down 2 2 5 displacement 5 raise_b_midlip Vertical bottom middle inner MNS B up 2 3 5 lip displacement 6 stretch_l_cornerlip Horizontal displacement of left MW B left 2 4 5 inner lip corner 18-796/Spring 1999/Chen 15
  • 16. Facial Definition Parameters 11.5 11.5 11.4 11.4 11.1 11.2 11.1 11.3 11.2 4.4 4.2 4.1 4.3 4.4 4.6 10.2 4.6 4.5 10.1 10.2 9.6 10.4 10.10 10.9 10.10 10.3 10.4 5.2 5.1 9.14 9.12 10.8 10.7 10.8 9.3 10.6 10.5 Y 10.6 9.4 9.2 2.14 2.13 2.14 X Y 7.1 Z 2.10 2.10 X 2.12 2.1 2.11 2.1 2.12 Z Right Eye Left Eye Nose 3.14 3.13 3.6 3.5 3.1 9.6 9.7 3.2 3.12 3.8 3.11 3.7 3.4 3.3 3.10 3.9 9.12 9.14 9.13 Teeth 9.3 9.8 9.2 9.1 9.10 9.11 9.4 9.15 9.5 9.9 Mouth Tongue 8.6 8.9 8.10 8.5 8.1 2.7 .2.2 2.6 8.4 2.5 2.4 8.3 6.4 6.2 2.9 2.3 2.8 8.8 8.7 6.3 8.2 6.1 Feature points affected by FAPs Other feature points 18-796/Spring 1999/Chen Facial Animation Parameter Units (FAPUs) • Parameters normalized to FAPUs IRISD0 ES0 ENS0 MNS0 MW0 18-796/Spring 1999/Chen 16
  • 17. Visemes for Lip Synch viseme_select phonemes example 0 none na 1 p, b, m put, bed, m ill 2 f, v far, voice 3 T,D th ink, that 4 t, d tip, doll 5 k, g call, gas 6 tS, dZ, S ch air, join, sh e 7 s, z sir, zeal 8 n, l lot, n ot 9 r red 10 A: car 11 e bed 12 I tip 13 Q top 14 U book 18-796/Spring 1999/Chen 2D Mesh Compression • Decoding process dxn dyn Mesh Geometry xn Decoding yn Decoded tm Mesh Coded Variable Data Length Decoding Mesh Mesh Motion Data Decoding Memory exn eyn 18-796/Spring 1999/Chen 17
  • 18. 3D Mesh Compression • Progressive representation – Streaming of 3D objects – Both spatially and temporally • Indexing and retrieval of 3D meshes – Multiresolution databases – Related to MPEG-7 Demo Wavelets for Scalable Texture Coding Lowpass Lowpass H1 (z) 2 CODEC 1 2 F1 (z) H2 (z) 2 CODEC 2 2 F2 (z) Highpass Highpass • Decompose the signal in the frequency domain • Critical downsampling maintains the number of samples in the subbands • For 2D case, separable filters are often used. Decompose into four bands: LL, LH, HL, HH • Decompose the LL band iteratively 18-796/Spring 1999/Chen 18
  • 19. Wavelets for Scalable Texture Coding (cont.) Lowest Band Inverse Prediction Quantization Coded Data Output Arithmetic Inverse Decoding DWT ZeroTree Inverse Other Bands Decoding Quantization “Zero Tree” 18-796/Spring 1999/Chen View Dependent Scalability 19
  • 20. MPEG-4 Video Test Sequences • Class A: Low spatial detail and low amount of movement • Class B: Medium spatial detail and low amount of movement or vice versa • Class C: High spatial detail and medium amount of movement or vice versa • Class D: Stereoscopic • Class E: Hybrid natural and synthetic • Class F: 12-bit video sequences 18-796/Spring 1999/Chen Test Sequences Sequence Name Class Input Format YUV Alpha Segment files files Mask Available Mother & daughter A ITU-R 601 (60Hz) 1 0 0 Akiyo A ITU-R 601 (60Hz) 2+1 1 2 Hall Monitor A ITU-R 601 (60Hz) 1 0 3 Container Ship A ITU-R 601 (60Hz) 1 0 6 Sean A ITU-R 601 (60Hz) 1 0 3 Foreman B ITU-R 601 (50Hz) 1 0 0 News B ITU-R 601 (60Hz) 4+1 3 4 Silent Voice B ITU-R 601 (50Hz) 1 0 0 Coastguard B ITU-R 601 (60Hz) 1 0 4 Bus C ITU-R 601 (60Hz) 1 0 0 Table Tennis C ITU-R 601 (50Hz) 1 0 0 Stefan C ITU-R 601 (60Hz) 1 0 2 Mobile & Calendar C ITU-R 601 (60Hz) 1 0 0 Basketball C ITU-R 601 (50Hz) 1 0 0 Football C ITU-R 602 (60Hz) 1 0 0 Cheerleaders C ITU-R 601 (60Hz) 1 0 0 Tunnel D ITU-R 601 (50Hz) 2x1 0 0 Fun Fair D ITU-R 601 (50Hz) 2x1 0 0 Children E ITU-R 601 (60Hz) 3+1 2 3 Bream E ITU-R 601 (60Hz) 3+1 2 3 Weather E ITU-R 601 (60Hz) 2+1 1 2 Destruction E ITU-R 601 (60Hz) 11+1 10 0 Ti1 F 176x144 (15Hz) 1 0 0 Man1sw F 272x136 (15Hz) 1 0 0 Hum2sw F 272x136 (15Hz) 1 0 0 Veh2sw F 272x136 (15Hz) 1 0 0 labview F 176x144 (60Hz) 1 0 0 hallway F 176x144 (60Hz) 1 0 0 20
  • 21. MPEG-4 Version 2 • One year following Version 1 • Adds new profiles with new functionalities • Video – Scalable transmission of arbitrary-shaped objects – Tools for additional efficiency improvements – Tools for improved error robustness – Coding of multiple views – Body animation – Coding of 3D meshes and scalabilities 18-796/Spring 1999/Chen References – MPEG Home Page http://drogo.cselt.it/mpeg/ – MPEG Video Home Page http://wwwam.hhi.de/mpeg- video/ – MPEG4-SNHC http://www.es.com/mpeg4-snhc/ – T. Sikora, “MPEG digital video coding standards,” IEEE Signal Processing Magazine, Sept. 1997, pp. 82- 100 – T. Sikora, “The MPEG-4 Video Standard Verification Model,” IEEE Trans. CSVT, Vol.7, No.1, Feb.1997 18-796/Spring 1999/Chen 21