MPEG 4 VIDEO



    Presented by
   Arun S Amingad.
Topics to Cover:
  • Facial Animation Parameters (FAP)
  • Facial Definition Parameters (FDP)
  • Face Model
  • Coding of FAPs
  • Integration of Face Animation and
    Text-to-Speech (TTS) synthesis
  • Binary Format for Scenes (BIFS) for Facial
    Animation
• What are Facial Animation Parameters (FAPs)?
          They are based on the minimal perceptible
  actions of the human face, such as expressions
  and emotions, and are closely related to
  muscle actions.
• What are Facial Definition Parameters (FDPs)?
          They allow the user to configure the 3D
  facial model to be used at the receiver, either
  by adapting a previously available model or by
  sending a new model.
A Face model with its feature points
Face Model:
• Every MPEG-4 terminal that is able to decode FAP
  streams has to provide a face model for
  animation.
• This model is proprietary to the decoder.
• The encoder does not know what the face model
  looks like.
• Using an FDP node, MPEG-4 allows the encoder to
  completely specify the face model to animate.
• The FDP node can also be used to calibrate the
  proprietary model of the decoder.
Feature Points representation
• The encoder may choose to specify the location of all
  or some feature points.
• Once the feature points are specified, the decoder can
  adapt its own proprietary face model so that the
  model conforms to the feature point positions.
• Face model adaptation also allows for the
  downloading of texture maps for the face.
• To specify how the texture map is mapped
  onto the face model, the encoder sends texture
  coordinates for each feature point.
• Each texture coordinate gives the location of one
  feature point on the texture map.
• The feature point locations are encoder specific.
• The process of adapting the feature point locations of
  the face model according to encoder specifications is
  referred to as Face Model Calibration.
• It is also called Face Model Adaptation.
Simplified scene graph for a head model.
Root Group
  Head Transform X
    Head Transform Y
      Head Transform Z
        Face
        Hair
        Tongue
        Teeth
        Left Eye Transform X
          Left Eye Transform Y
            Left Eye
        Right Eye Transform X
          Right Eye Transform Y
            Right Eye
• The root group node is a collection of objects.
• For objects to move together as a group, they
  need to be in the same transform group.
• When transform nodes are nested, their
  transforms have a cumulative effect.
• A transform node defines geometric 3D
  transformations such as scaling and rotation.
• An IndexedFaceSet is used to define the geometry
  and the surface attributes (color and texture) of
  an object.
• The rotations for the left eye and right eye are
  also embedded in this scene graph.
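The cumulative effect of nested transform nodes can be sketched in a few lines. This is only an illustration, not BIFS syntax: the helper names and the angles (head yaw of 60°, eye yaw of 30°) are assumptions made for the example.

```python
import math

def rot_y(deg):
    """3x3 rotation matrix about the Y axis (angle in degrees)."""
    a = math.radians(deg)
    c, s = math.cos(a), math.sin(a)
    return [[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]]

def matmul(a, b):
    """Product of two 3x3 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def apply(m, v):
    """Apply a 3x3 matrix to a 3-vector."""
    return [sum(m[i][k] * v[k] for k in range(3)) for i in range(3)]

# Hypothetical angles: the head transform yaws 60 degrees and the
# eye transform, nested inside it, yaws a further 30 degrees.
head = rot_y(60)
eye = rot_y(30)

# A point in eye coordinates is transformed by the eye node first,
# then by every enclosing transform up to the root.
eye_to_root = matmul(head, eye)
gaze = apply(eye_to_root, [0.0, 0.0, 1.0])  # eye initially looks along +Z
```

With these angles the eye ends up rotated by 90° in total, so the gaze vector points along +X even though neither node specifies 90° on its own.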
Coding of Facial Animation
         Parameters (FAPs):
• Tools used for coding:
    1) Arithmetic coding (low delay)
    2) DCT coding (high delay)
• 1) Using arithmetic coding:
- Allows for low-delay FAP coding.
- Coding efficiency is lower.
• 2) Using DCT:
- Introduces a larger delay.
- Achieves higher coding efficiency.
Low Delay encoder for FAP
• The first set of FAP values, FAP(0), is coded without
  prediction (at time instant zero).
• The value of a FAP at time instant k, i.e. FAP(k), is
  predicted using the previously encoded value FAP(k-1).
• The prediction error e' is the difference between
  FAP(k) and this prediction.
• e' is quantized using the step size QP multiplied by a
  quantization parameter FAP_QUANT.
• 0 < FAP_QUANT < 31
• The quantized prediction error e' is arithmetically
  encoded using a separate adaptive probability model
  for each FAP.
• FAP_QUANT > 15 is usually not used because the quality
  of the animation is reduced.
• At the decoder, the received data is arithmetically
  decoded, dequantized and added to the previously
  decoded value.
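The prediction and quantization loop described above can be sketched as follows. This is a simplified illustration under stated assumptions: the arithmetic coder is omitted, FAP(0) is treated as a prediction from zero, and the function names and the sample values for QP and FAP_QUANT are made up for the example.

```python
def encode_fap(values, qp, fap_quant):
    """Quantize the prediction error of one FAP track.

    Returns the integer levels that would be passed to the arithmetic
    coder (entropy coding is omitted in this sketch).
    """
    step = qp * fap_quant
    levels = []
    prev = 0  # assumption: FAP(0) is handled as a prediction from zero
    for v in values:
        level = round((v - prev) / step)  # quantized prediction error
        levels.append(level)
        prev += level * step  # track the value the decoder will reconstruct
    return levels

def decode_fap(levels, qp, fap_quant):
    """Dequantize each level and add it to the previously decoded value."""
    step = qp * fap_quant
    decoded, prev = [], 0
    for level in levels:
        prev += level * step
        decoded.append(prev)
    return decoded

# Hypothetical FAP track and quantizer settings (step size = 1 * 4 = 4).
track = [0, 9, 24, 31, 28]
levels = encode_fap(track, qp=1, fap_quant=4)
decoded = decode_fap(levels, qp=1, fap_quant=4)
```

Because the encoder predicts from the reconstructed value rather than from the original one, quantization errors do not accumulate: every decoded value stays within half a step of the input.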
DCT:
• Applied to 16 consecutive values of one FAP.
• Hence, it introduces a significant delay in the coding and
  decoding processes.
• After computing the DCT of 16 consecutive values of one
  FAP, the DC and AC coefficients are coded separately.
• DC coefficients are coded using prediction.
• AC coefficients are coded directly.
• AC and DC coefficients are quantized separately.
• The quantized coefficients are encoded with one VLC word
  defining the number of zero coefficients prior to the next
  non-zero coefficient and another VLC word for the amplitude of
  this non-zero coefficient.
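A rough sketch of the DCT path is given below. It assumes an orthonormal DCT-II, made-up step sizes, prediction of the DC level from the previous segment's DC level, and run/amplitude pairs formed over the AC levels only; the actual VLC tables are not reproduced, and all function names are hypothetical.

```python
import math

def dct16(x):
    """Orthonormal DCT-II of one segment (16 values in the FAP case)."""
    n = len(x)
    out = []
    for k in range(n):
        s = sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
        scale = math.sqrt(1 / n) if k == 0 else math.sqrt(2 / n)
        out.append(scale * s)
    return out

def run_level_pairs(levels):
    """(zero_run, amplitude) for each non-zero level; trailing zeros are dropped."""
    pairs, run = [], 0
    for q in levels:
        if q == 0:
            run += 1
        else:
            pairs.append((run, q))
            run = 0
    return pairs

def encode_segment(segment, prev_dc_level, dc_step, ac_step):
    """Return (DC prediction error, AC run/amplitude pairs, DC level)."""
    coeffs = dct16(segment)
    dc_level = round(coeffs[0] / dc_step)                 # DC quantized on its own
    ac_levels = [round(c / ac_step) for c in coeffs[1:]]  # AC quantized separately
    return dc_level - prev_dc_level, run_level_pairs(ac_levels), dc_level

# Hypothetical segment: a FAP held constant for 16 frames.
dc_error, ac_pairs, dc_level = encode_segment([8] * 16, prev_dc_level=5,
                                              dc_step=4, ac_step=2)
example_pairs = run_level_pairs([0, 0, 3, 0, -1, 0, 0])
```

A constant segment produces no AC pairs at all, which is where the efficiency gain comes from; the cost is that the encoder has to wait for all 16 values before it can emit anything.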
FAP encoder using DCT
Integration of a TTS synthesizer into an MPEG-4 face animation system
Integration of Face Animation and
          Text-to-Speech (TTS) synthesis
• Synchronization of a FAP stream with a TTS
  synthesizer using the TTSI (TTS interface) is
  only possible if the encoder sends timing
  information.
• This is because a conventional TTS system is an
  asynchronous source.
• DECODER: Decodes the text and passes it to the
  proprietary speech synthesizer.
• SYNTHESIZER: Creates speech samples that are
  handed to the compositor.
• COMPOSITOR: Provides audio or video output to
  the user.
• The second output interface of the synthesizer
  sends the phonemes of the synthesized speech, as
  well as the start time and duration of
  each phoneme, to the FAP converter.
• The converter translates the phonemes and
  timing information into FAPs that the face
  renderer can use to animate the face
  model.
• Bookmarks in the TTS text are used to
  animate facial expressions.
• When the TTS finds a bookmark in the
  text, it sends this bookmark to the FAP
  converter.
• The FAP converter transforms the phonemes into
  visemes and the timing information into FAPs.
• A bookmark defines the start point and
  duration of the transition to a FAP amplitude.
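The phoneme-to-viseme step of the FAP converter might look like the sketch below. Everything here is illustrative: the phoneme symbols, the viseme indices in the table, the use of 0 for unknown phonemes and the merging of adjacent identical visemes are assumptions for the example and should not be read as the normative MPEG-4 mapping.

```python
from dataclasses import dataclass

# Illustrative table only; indices are placeholders, not normative values.
PHONEME_TO_VISEME = {"p": 1, "b": 1, "m": 1, "f": 2, "v": 2, "a": 10}

@dataclass
class VisemeEvent:
    viseme: int
    start_ms: int
    duration_ms: int

def phonemes_to_visemes(phonemes):
    """Convert (symbol, start_ms, duration_ms) tuples into viseme events.

    The timing comes straight from the synthesizer, so the resulting
    events stay aligned with the synthesized speech.
    """
    events = []
    for symbol, start, duration in phonemes:
        viseme = PHONEME_TO_VISEME.get(symbol, 0)  # assumption: 0 = no viseme
        last = events[-1] if events else None
        if (last is not None and last.viseme == viseme
                and last.start_ms + last.duration_ms == start):
            last.duration_ms += duration  # merge back-to-back identical visemes
        else:
            events.append(VisemeEvent(viseme, start, duration))
    return events

# Hypothetical synthesizer output: symbol, start time, duration (ms).
events = phonemes_to_visemes([("m", 0, 80), ("a", 80, 120),
                              ("p", 200, 60), ("b", 260, 40)])
```

In this example "p" and "b" share a mouth shape, so they collapse into a single 100 ms event starting at 200 ms.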
Integration with MPEG-4 Systems (BIFS):
• To use face animation in MPEG-4 Systems, a BIFS scene
  graph has to be transmitted.
• The minimum scene graph should contain a Face node and a FAP
  node.
• The FAP node may contain the high-level FAPs, such as
  visemes and expressions.
• This scene graph enables the encoder to animate the
  proprietary face model of the decoder.
• To download a face model to the decoder, an
  FDP node is required.
• An FDP node is further divided into its children, viz. Face
  Definition Table (Fdef), Face Definition Mesh (FDM) and Face
  Definition Transform (FDT).
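The two requirements above (a Face node with a FAP node for animation, and an FDP node for downloading a model) can be expressed as a small check over a toy scene graph. The Node class and the function names are hypothetical and do not correspond to any real BIFS API.

```python
from dataclasses import dataclass, field

@dataclass
class Node:
    """Toy scene-graph node; 'kind' stands in for the BIFS node type."""
    kind: str
    children: list = field(default_factory=list)

def find(node, kind):
    """Depth-first search for the first node of the given kind."""
    if node.kind == kind:
        return node
    for child in node.children:
        hit = find(child, kind)
        if hit is not None:
            return hit
    return None

def can_animate(scene):
    """True if the scene holds a Face node that contains a FAP node."""
    face = find(scene, "Face")
    return face is not None and find(face, "FAP") is not None

def can_download_model(scene):
    """True if the scene also carries an FDP node."""
    return find(scene, "FDP") is not None

# Minimum scene: enough to animate the decoder's proprietary model.
minimal = Node("Group", [Node("Face", [Node("FAP")])])

# Scene that additionally carries an FDP node for a downloaded model.
full = Node("Group", [Node("Face", [Node("FAP"), Node("FDP")])])
```

Here `minimal` satisfies `can_animate` but not `can_download_model`, while `full` satisfies both.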
Nodes of the BIFS scene that are used to describe and animate a face
