Speech Recognition: Advanced Topics
1. SPEECH RECOGNITION: ADVANCED TOPICS
True, their voice-print machine was unfortunately a crude
one. It could discriminate among only a few frequencies, and
it indicated amplitude by indecipherable blots. But it had
never been intended for such vitally important work.
--- Aleksandr I. Solzhenitsyn, The First Circle, p. 505
2. INTRODUCTION
• The keju civil service examinations of Imperial China lasted almost 1300 years, from the year 606 until they were abolished in 1905. The exams were used to select high-ranking officials for China.
• This examination process can be viewed as a "keju algorithm": an incremental, progressive process for winnowing the pool down to the best individual candidates.
• The same idea applies to speech recognition, where searching everywhere with our most expensive models is prohibitively costly.
• The authors therefore introduce the `Multipass Decoding Algorithm`, in which efficient but dumber decoding algorithms produce shortlists of potential candidates for later, more expensive passes to examine.
3. TOPICS OF THIS CHAPTER
In this chapter the authors introduce several methodologies:
1. Multipass decoding algorithm
Dumber decoding algorithms produce a shortlist of probabilistic paths, within which the potential candidates are found and rescored.
2. Context-dependent acoustic models
A smarter way to build acoustic models for large-vocabulary speech recognition.
3. Discriminative training
4. Modeling variation
4. MULTI-PASS DECODING
Multipass decoding divides the search into two stages. In the first stage, fast but less accurate knowledge sources perform an efficient search that reduces the search space; in the second stage, sophisticated but slower decoding algorithms rescore the surviving hypotheses. The first pass usually outputs one of two representations:
1. N-Best Lists
2. Word Lattices
5. N-BEST LISTS
• The N-best algorithm is a modification of the `Viterbi algorithm` that returns the N best sentence hypotheses for a given speech input.
6. LIMITATIONS OF N-BEST LISTS
1. One problem with an N-best list is that when N is large, listing all the sentences is
extremely inefficient.
2. Another problem is that N-best lists don’t give quite as much information as we
might want for a second-pass decoder.
7. WORD LATTICE
The output of the first-pass decoder is usually a more sophisticated representation called a word lattice (Murveit et al., 1993; Aubert and Ney, 1995). A word lattice is a directed graph that efficiently represents much more information about possible word sequences. The directed graph has two parts:
• Nodes
• Arcs
In one convention, the nodes in the graph are words and the arcs are transitions between words; in the other, the arcs represent word hypotheses and the nodes are points in time.
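As a concrete illustration of the second convention, the toy lattice below (a minimal sketch; all words, time points, and log probabilities are invented) stores each word hypothesis as an arc between two time nodes and enumerates every word sequence the lattice encodes:

```python
# A toy word lattice, using the convention in which arcs carry word
# hypotheses and nodes are points in time (e.g. frame indices).
# Each arc: (start_node, end_node, word, log_probability).
# All words, times, and scores here are invented for illustration.
arcs = [
    (0, 10, "I",      -2.1),
    (0, 10, "eye",    -4.5),
    (10, 25, "move",  -3.0),
    (10, 25, "moved", -3.4),
    (25, 40, "on",    -1.2),
]

def successors(lattice, node):
    """Return the arcs leaving a given time node."""
    return [a for a in lattice if a[0] == node]

def paths(lattice, node, end):
    """Enumerate every (word sequence, log score) path from node to end."""
    if node == end:
        yield [], 0.0
    for (_, nxt, word, lp) in successors(lattice, node):
        for words, score in paths(lattice, nxt, end):
            yield [word] + words, lp + score

for words, logp in paths(arcs, 0, 40):
    print(" ".join(words), round(logp, 1))
```

Each path's score is simply the sum of its arc log probabilities; a real lattice would carry separate acoustic and language-model scores on each arc, and a compact graph like this can encode far more hypotheses than an N-best list of the same size.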
9. GOAL OF N-BEST LISTS AND WORD LATTICES
The goal of both N-best lists and word lattices is to let a second pass rescore the probabilistic candidates, possibly replacing the first pass's 1-best utterance with a different utterance.
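The rescoring step can be sketched as follows, assuming the first pass hands back (sentence, acoustic log probability) pairs and a hypothetical slower language model `second_pass_lm` supplies new priors; all sentences and scores are invented:

```python
# Second-pass rescoring of an N-best list (a sketch, not the book's code).
# The first pass returns (sentence, acoustic_log_prob) pairs.
nbest = [
    ("wreck a nice beach", -120.0),   # first pass's 1-best
    ("recognize speech",   -121.5),
    ("wreck an ice beach", -123.0),
]

def second_pass_lm(sentence):
    """Stand-in for a slower, smarter language model (hypothetical scores)."""
    scores = {
        "recognize speech":   -8.0,
        "wreck a nice beach": -15.0,
        "wreck an ice beach": -18.0,
    }
    return scores[sentence]

lm_weight = 1.0  # language-model scaling factor

def rescore(nbest):
    """Combine acoustic and new LM log scores; return the new 1-best."""
    rescored = [(s, am + lm_weight * second_pass_lm(s)) for s, am in nbest]
    return max(rescored, key=lambda p: p[1])

best, score = rescore(nbest)
print(best, score)
```

Here the second-pass LM strongly prefers "recognize speech", so the rescored 1-best differs from the first pass's top hypothesis, which is exactly the replacement behavior described above.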
10. A∗ (‘STACK’) DECODING
The A∗ decoding algorithm allows us to use the complete forward probability, avoiding the
Viterbi approximation
A∗ decoding also allows us to use any arbitrary language model.
Thus A∗ is a one-pass alternative to multi-pass decoding
The A* decoding algorithm is a best first search of the tree that implicitly defines the
sequence of allowable word in the language. This algorithm has tow parts,
1. The root: start node on the left or START point of the path
2. Leaf: Difference path of the probabilistic candidate, each Leaf define the on sentence ot
the language and create the path
Priority Queue
The A∗ decoder must thus find the path (word sequence) from the root to a leaf which has the highest probability, where the path probability is defined as the product of its language model probability (prior) and its acoustic match to the data. The partial paths under consideration are kept in a priority queue ordered by their scores.
Fast match
A fast match is used to select the likely next words. A fast match is one of a class of
heuristics designed to efficiently winnow down the number of possible following words,
often by computing some approximation to the forward probability.
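Putting the pieces together, the following sketch (a toy example, not the textbook's algorithm; the word tree, the scores, and the `fast_match` table are all invented) runs a best-first search over a tiny word tree using a priority queue, with `fast_match` proposing the likely next words at each step:

```python
import heapq

# Hypothetical per-word scores: log LM prior + log acoustic match, combined.
word_scores = {
    "if": -1.0, "in": -2.5,
    "music": -1.5, "muscle": -3.0,
    "be": -0.5, "been": -2.0,
}

def fast_match(prefix):
    """Crude stand-in for a fast match: propose likely next words.

    A real fast match would approximate the forward probability; here we
    just look up an invented layer of candidates by prefix length.
    """
    layers = [["if", "in"], ["music", "muscle"], ["be", "been"]]
    if len(prefix) < len(layers):
        return layers[len(prefix)]
    return []  # no continuations: the prefix is a leaf (complete sentence)

def astar_decode():
    # Priority queue of partial paths; heapq is a min-heap, so scores are
    # negated to pop the highest-probability partial path first.
    queue = [(0.0, [])]
    while queue:
        neg_score, prefix = heapq.heappop(queue)
        next_words = fast_match(prefix)
        if not next_words:          # reached a leaf: best complete path
            return prefix, -neg_score
        for w in next_words:
            heapq.heappush(queue, (neg_score - word_scores[w], prefix + [w]))

words, logp = astar_decode()
print(" ".join(words), logp)
```

Because `heapq` is a min-heap, scores are negated so the most promising partial path is always extended next; a real decoder would also add an estimate of the score for the unseen remainder of the utterance (the A∗ heuristic), without which this degenerates to a uniform-cost search.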