Acoustic Model Selection for Recognition
of Regional Accented Speech
By Maryam Najafian
Supervisor Prof. Martin Russell
University of Birmingham
4th December 2015
Email: m.najafian@pgr.bham.ac.uk
Motivation
1/9
Objectives
2/9
• This research is concerned with automatic speech recognition (ASR) for accented
speech using a range of different AID systems for GMM-HMM and DNN-HMM
based acoustic model selection
• Trained on the SI training set (92 speakers, 7861 utterances) of the WSJCAM0
corpus of read British English speech
• Tested/adapted on ABI Corpus, 14 different accents (285 speakers)
Baseline AID System Design
Phonotactic
Accuracy : 80.65 %
I-vector
Accuracy :76.76%
ACCDIST-SVM
Accuracy: 95 %
3/9
ACCDIST Accent ID feature space
4/9
GMM-HMM: Unsupervised Adaptation
5/9
6/9
GMM-HMM: Speaker versus Accent Adaptation
Supervised speaker versus accent adaptation
Unsupervised speaker versus accent adaptation
DNN-HMM versus GMM-HMM
7/9
DNN-HMM: Extra Training Material (ETM) &
Extra Pre-Training Material (EPM)
8/9
9/9
Summary and publications
Thank you for listening
Email: m.najafian@pgr.bham.ac.uk

presentation_Viva