MaryTTS UnitSelection


Munzey

MaryTTS Unit Selection Synthesis Overview


Overview
● Unit Selection Synthesis
● MaryTTS implementation/structure
● How can we improve it?

TTS – Unit Selection (U.S.)
[Diagram: Input text → NLP tasks → Unit Selection → Speech, with Unit Selection drawing on the Speech Database]

TTS – Unit Selection (U.S.)
● Input text is processed and split into tokens.
● The best possible candidates for each token are then selected from the speech database.
● The desired target utterance is then created by determining the best chain of these candidate units and concatenating them together. The best chain is found with the Viterbi algorithm, using join and target costs.
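
The search described above can be sketched as a small dynamic programme. This is a minimal illustration in Python, not MaryTTS code: the function name `viterbi_select`, the toy cost functions and the numeric "units" are all assumptions made for the example. It assumes every target has at least one candidate.

```python
from typing import Callable, List, Sequence, Tuple


def viterbi_select(
    targets: Sequence[float],
    candidates: Sequence[Sequence[float]],
    target_cost: Callable[[float, float], float],
    join_cost: Callable[[float, float], float],
) -> Tuple[List[float], float]:
    """Return the cheapest chain of candidates (one per target) and its cost."""
    if not targets:
        return [], 0.0
    # best[i][j] = (cost of the cheapest chain ending in candidates[i][j],
    #               index of the predecessor in candidates[i - 1])
    best = [[(target_cost(targets[0], u), -1) for u in candidates[0]]]
    for i in range(1, len(targets)):
        row = []
        for u in candidates[i]:
            tc = target_cost(targets[i], u)
            row.append(min(
                (best[i - 1][k][0] + join_cost(prev, u) + tc, k)
                for k, prev in enumerate(candidates[i - 1])
            ))
        best.append(row)
    # Pick the cheapest final unit, then follow the back-pointers.
    j = min(range(len(best[-1])), key=lambda k: best[-1][k][0])
    total = best[-1][j][0]
    path = []
    for i in range(len(targets) - 1, -1, -1):
        path.append(candidates[i][j])
        j = best[i][j][1]
    path.reverse()
    return path, total


# Toy data (hypothetical): units are plain numbers, e.g. pitch values.
targets = [10, 20, 30]
candidates = [[8, 11], [15, 26], [29, 40]]
path, total = viterbi_select(
    targets,
    candidates,
    target_cost=lambda t, u: abs(t - u),        # distance from the target
    join_cost=lambda a, b: 0.5 * abs(a - b),    # penalty for a rough join
)
print(path, total)
```

The target cost says how well a unit matches what was asked for, and the join cost says how well two neighbouring units fit together. Keeping only the cheapest chain per candidate at each step avoids enumerating every possible chain.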

MaryTTS implementation
● MaryTTS is a “modularised” system
● U.S. packages are currently embedded within
the marytts-runtime module.

MaryTTS implementation
● These packages are divided into the following groups:
- Base classes (marytts.unitselection)
- Data (marytts.unitselection.data)
- Selection & Cost Functions (marytts.unitselection.select)
- Viterbi (marytts.unitselection.select.viterbi)
- Weighting Functions (marytts.unitselection.weightingfunctions)
- Target Features (marytts.features)
- Voice Properties (marytts.server)

MaryTTS Implementation
[Figure: UML schema of the unit selection packages in MaryTTS. Only the unit selection part is modelled; concatenation is excluded.]

MaryTTS U.S. step-by-step:
● A request is made to the MaryTTS server to output audio from some input using a unit selection voice.
● The Synthesis module is called to process the input data → calls voice.synthesize()
● In this case, the unit selection voice (a subclass of the Voice class) calls the unit selection synthesizer.
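
The call chain above is ordinary polymorphic dispatch. The sketch below mirrors it in Python; the class names follow the slide, but the bodies, constructor arguments and the `synthesis_module` function are hypothetical stand-ins, not the MaryTTS Java API.

```python
class Voice:
    """Generic voice; each subclass decides how audio is produced."""

    def __init__(self, name):
        self.name = name

    def synthesize(self, tokens):
        raise NotImplementedError


class UnitSelectionSynthesizer:
    def synthesize(self, tokens, voice):
        # Placeholder result: a real synthesizer would select and
        # concatenate units here and return audio.
        return f"<audio voice={voice.name} tokens={len(tokens)}>"


class UnitSelectionVoice(Voice):
    """A voice that hands its tokens to a unit selection synthesizer."""

    def __init__(self, name, synthesizer):
        super().__init__(name)
        self.synthesizer = synthesizer

    def synthesize(self, tokens):
        return self.synthesizer.synthesize(tokens, self)


def synthesis_module(tokens, voice):
    # The module only knows it has *a* voice; the subclass picks the path.
    return voice.synthesize(tokens)


voice = UnitSelectionVoice("demo-voice", UnitSelectionSynthesizer())
result = synthesis_module(["hello", "world"], voice)
print(result)
```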

MaryTTS U.S. step-by-step:
UnitSelectionSynthesizer.synthesize(tokens, UnitSelVoice):
→ Processes the tokens into audio by calling on the voice's database, unit selector and concatenator.
– UnitSelectionVoice loads these objects by reading properties from the voice's .config file using MaryProperties.
– The UnitDatabase class holds the target and join cost functions, and provides access to the speech database, for example to retrieve candidate units for each target.
– The unit selector selects the units.
– The unit concatenator concatenates these units into a single audio stream.
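
How the three collaborators fit together can be sketched as follows. This is an illustrative Python outline under stated assumptions: the method names, the `Unit` type and the "first candidate" selection policy are invented for the example, and loading the objects from a .config file is left out.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Unit:
    name: str
    audio: bytes  # raw samples for this unit (toy data here)


class UnitDatabase:
    """Hypothetical store mapping a token to its candidate units."""

    def __init__(self, units_by_token: Dict[str, List[Unit]]):
        self._units = units_by_token

    def candidates(self, token: str) -> List[Unit]:
        return self._units[token]


class UnitSelector:
    def __init__(self, database: UnitDatabase):
        self.database = database  # the selector keeps a reference to the database

    def select_units(self, tokens: List[str]) -> List[Unit]:
        # Placeholder policy: take the first candidate for every token.
        # A real selector would search for the cheapest chain instead.
        return [self.database.candidates(t)[0] for t in tokens]


class UnitConcatenator:
    def concatenate(self, units: List[Unit]) -> bytes:
        return b"".join(u.audio for u in units)


@dataclass
class UnitSelVoice:
    database: UnitDatabase
    selector: UnitSelector
    concatenator: UnitConcatenator


def synthesize(tokens: List[str], voice: UnitSelVoice) -> bytes:
    units = voice.selector.select_units(tokens)
    return voice.concatenator.concatenate(units)


db = UnitDatabase({
    "h": [Unit("h_1", b"\x01")],
    "i": [Unit("i_1", b"\x02\x03"), Unit("i_2", b"\x04")],
})
voice = UnitSelVoice(db, UnitSelector(db), UnitConcatenator())
audio = synthesize(["h", "i"], voice)
print(audio)
```

Keeping selection and concatenation behind separate objects means either one can be swapped without touching the synthesizer.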

Unit Selector
● Contains a reference to the voice's database
● .selectUnits(tokens, voice):
- tokens are converted to targets
- a target feature vector is computed for each target
- the Viterbi algorithm is applied to find the best path
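
The three steps can be outlined in Python as below. Everything here is an assumption for illustration: the `Target` type, the toy context features and the helper names are not taken from MaryTTS. The path search is passed in as a callable so the sketch stays focused on the first two steps; the stand-in used in the example only compares features per target and ignores join costs, so it is not a Viterbi search.

```python
from dataclasses import dataclass, field


@dataclass
class Target:
    name: str
    features: dict = field(default_factory=dict)


def tokens_to_targets(tokens):
    # Step 1: one target per token.
    return [Target(name=t) for t in tokens]


def compute_features(targets):
    # Step 2: attach a toy feature vector (left and right neighbour).
    for i, tgt in enumerate(targets):
        tgt.features = {
            "prev": targets[i - 1].name if i > 0 else "_",
            "next": targets[i + 1].name if i + 1 < len(targets) else "_",
        }
    return targets


def select_units(tokens, candidates_for, find_best_path):
    targets = compute_features(tokens_to_targets(tokens))
    lattice = [candidates_for(t) for t in targets]
    # Step 3: hand targets and candidates to the path search.
    return find_best_path(targets, lattice)


# Hypothetical candidate units, described by the context they were recorded in.
DB = {
    "a": [{"id": "a1", "prev": "_", "next": "b"},
          {"id": "a2", "prev": "b", "next": "_"}],
    "b": [{"id": "b1", "prev": "_", "next": "a"},
          {"id": "b2", "prev": "a", "next": "_"}],
}


def mismatches(target, unit):
    return sum(unit[k] != target.features[k] for k in ("prev", "next"))


def cheapest_per_target(targets, lattice):
    # Stand-in search: best feature match per target, no join cost.
    return [min(cands, key=lambda u: mismatches(t, u))["id"]
            for t, cands in zip(targets, lattice)]


chosen = select_units(["a", "b"], lambda t: DB[t.name], cheapest_per_target)
print(chosen)
```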

How can we improve the system?
● Restructuring of codebase?

