S1170136 week8

•

0 likes•98 views

s1170136_

Speaker verification
using Oguma histogram

S117036 Ami Inoamta

Supervisor:M.Sugiyama

Outline
● Background
● Oguma histogram
● Speaker verification algorithm
● Speech analysis condition
● Learn histogram , Verification histogram
● Result of speaker verification

Background
● Diffusion of smartphone.
● Security is more important.
→Speaker verification system

● VQ : most popular. But calculation amount is large.
● Oguma histogram : calculation amount is small.
→smartphone application.

● First, test this performance in PC terminal.

Oguma histogram
● Do not use VQ.
● Directly make histogram
from feature vectors.
1. Set out threshold in each
dimension of feature
vectors.
2. Compare
→Space division
3.Set out Region ID .

Speech analysis condition
Database TIMIT
Dialect region New England
Head count 10 person
Learning , verification 5 sentence , 5 sentence
Recording format wav
Sampling rate 16 kHz
Dimension number of MFCC 16
Filter bank channel 24
Window size 16 ms
Frame shift 8 ms, 16 ms
Dimension number of 16, 32, 64, 128, 256
histogram

Result of speaker verification
Female: Male:

Speaker verification result (frame shift 8ms)

Viewers also liked

media studies house styleshadiorr

Syllabus etica y responsabilidad social presencialMilber Fuentes

Las relaciones entre_la_etica_y_la_politicaMilber Fuentes

Syllabus etica y formción ciudadana presencialMilber Fuentes

Zygmunt bauman-trabajo-consumismo-y-nuevos-pobres-libro-completoMilber Fuentes

Razon y dominacion_la_legitimidad_en_webMilber Fuentes

Viewers also liked (6)

media studies house style

Syllabus etica y responsabilidad social presencial

Las relaciones entre_la_etica_y_la_politica

Syllabus etica y formción ciudadana presencial

Zygmunt bauman-trabajo-consumismo-y-nuevos-pobres-libro-completo

Razon y dominacion_la_legitimidad_en_web

Similar to S1170136 week8

Development of voice password based speaker verification systemniranjan kumar

HMM based Automatic Arabic Sign Language Translator usingعمر أمين

Information and data security pseudorandom number generation and stream cipherMazin Alwaaly

Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...Ahmed Ayman

Dcase2016 oral presentation - Experiments on DCASE 2016: Acoustic Scene Class...Ankit Shah

Google and SRI talk September 2016Hagai Aronowitz

FAUST: Fast Per-Scene Encoding Using Entropy-Based Scene Detection and Machin...Alpen-Adria-Universität

lesson 2 digital data acquisition and data processingMathew John

ASR_finalBidhan Barai

TinyML - 4 speech recognition 艾鍗科技

Fast and Reliable Estimation Schemes in RFID Systems.pptnovrain1

Pin pointpresentationLevan Huan

IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline

Text independent speaker recognition systemDeepesh Lekhak

Speaker recognition.Nimmagadda Ushakiran

STEGANOGRAPHY BASED ASYMMETRIC KEY CRYPTOSYSTEM USING TRELLIS CODED GENETIC A...ijesajournal

Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...ijcisjournal

Encrypted Traffic MiningHenry Huang

The Case for a Signal Oriented Data Stream Management SystemReza Rahimi

Similar to S1170136 week8 (20)

Development of voice password based speaker verification system

HMM based Automatic Arabic Sign Language Translator using

Information and data security pseudorandom number generation and stream cipher

Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...

Dcase2016 oral presentation - Experiments on DCASE 2016: Acoustic Scene Class...

Google and SRI talk September 2016

FAUST: Fast Per-Scene Encoding Using Entropy-Based Scene Detection and Machin...

lesson 2 digital data acquisition and data processing

ASR_final

TinyML - 4 speech recognition

Fast and Reliable Estimation Schemes in RFID Systems.ppt

Pin pointpresentation

IJCER (www.ijceronline.com) International Journal of computational Engineerin...

Text independent speaker recognition system

Speaker recognition.

STEGANOGRAPHY BASED ASYMMETRIC KEY CRYPTOSYSTEM USING TRELLIS CODED GENETIC A...

Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...

Encrypted Traffic Mining

The Case for a Signal Oriented Data Stream Management System

S1170136 week8

1. Speaker verification using Oguma histogram S117036 Ami Inoamta Supervisor:M.Sugiyama

2. Outline ● Background ● Oguma histogram ● Speaker verification algorithm ● Speech analysis condition ● Learn histogram , Verification histogram ● Result of speaker verification

3. Background ● Diffusion of smartphone. ● Security is more important. →Speaker verification system ● VQ : most popular. But calculation amount is large. ● Oguma histogram : calculation amount is small. →smartphone application. ● First, test this performance in PC terminal.

4. Oguma histogram ● Do not use VQ. ● Directly make histogram from feature vectors. 1. Set out threshold in each dimension of feature vectors. 2. Compare →Space division 3.Set out Region ID .

5. Speaker verification algorithm

6. Speech analysis condition Database TIMIT Dialect region New England Head count 10 person Learning , verification 5 sentence , 5 sentence Recording format wav Sampling rate 16 kHz Dimension number of MFCC 16 Filter bank channel 24 Window size 16 ms Frame shift 8 ms, 16 ms Dimension number of 16, 32, 64, 128, 256 histogram

7. Learn histogram,Verification histogram

8. Result of speaker verification Female: Male: Speaker verification result (frame shift 8ms)

S1170136 week8

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (6)

Similar to S1170136 week8

Similar to S1170136 week8 (20)

S1170136 week8