SlideShare a Scribd company logo
Real Time Voice Actuation System 
Pragya Agrawal, Dominic Calabrese, David Martel, Nathan Sawicki 
Project Description 
Thegoalofourprojectistodesignandbuildareal-timespeechrecognitionsystem.ThisprojectpresentsmanyhardwareandsoftwarechallengesforimplementationinanembeddedenvironmentandisaperfectprojectforEECS452.Usingasourcefiltermodelofspeechandasupportvectormachineclassifieronaprerecordedlibraryofvocalcommands,ourteamwasabletorecognizethecommands“one” “two”“three”and“four”withhighaccuracy.Thefinishedsystemexecutesinreal-timeandhasGPIO-basedactuationtodemonstratefunctionalvoicerecognition. 
Hardware 
TMS320C5515 eZdsp™ USB Stick Development Tool[1] 
VocalinputandfeatureextractionishandledonthisfixedpointDSPchip 
Highspeedautocorrelationfunctionidealforfeatureextraction 
Raspberry Pi Model B+[3] 
700MHzprocessor, BroadComSoCwithfloatingpointunit 
40pinGPIO 
Idealforrunningclassificationalgorithminrealtime 
SparkfunBluetooth Modem –Bluesmirf[2] 
PairstheC5515andtheRaspberryPi 
PairswithotherBluesmirfmoduleswithrelativeease 
115200baudratecapableofreal-timetransmissionandreceiving 
Raspberry Pi : Classification & Actuation 
C5515 : Detection & Feature Extraction 
Implementsthethresholddemonstratedinthefigureontherighttodetectcommands 
DSPlibraryprovidessimplefunctionsusefulforreal- timeDSPonC5515 
Computesthe16-bitautocorrelationforspeechcommandsandtransmitsthisdataoverbluetoothUART 
Support Vector Machine Learning 
SupportVectorMachine(SVM)isasupervisedlearningalgorithmusedforanalyzingdataforbinaryclassificationandregressionanalysis. 
WeutilizeMulti-classSupportVectorMachine,anextensionofSVMforrealtimeclassificationofspokenwordsintooneofthefourclasses:“ONE”,“TWO”, ”THREE”,”FOUR” 
Ouralgorithmusesone-against-onemethodtoconstruct(k*(k-1)/2)classifiers(k=numberofclasses),oneSVMforeachpairofclasses.EachSVMistrainedondatafromtwoclasses,todistinguishthesamplesofoneclassfromanother.Classificationofanunknownpatternisdoneaccordingtomaximumvoting,whereeachSVMvotesforoneclass. 
LIBSVMtool,anintegratedsoftwareformulti-classsupportvectorclassificationisused.Itemploysradialbasiskernelfunctionforclassification. 
Source-Filter Model of Speech 
Wordcharacterizationshouldbeindependentofvolume,pitch,anddurationoftheword 
Simplifyspeechproductionmodeltobeing: 
1. 
Source-vibrationofvocalchords 
2. 
Filter–vocaltract(i.e.positioningoftongue,mouth,etc.) 
Filtermostaffectssoundofwordpositionvocaltractdifferentlytosaydifferentwords 
Accuratelymodelingthefilterprovidesabasisforwordrecognition[4] 
Broad sweeps of spectrum (formants) result from the filter configuration. Rapidly varying peaks come from source resonances 
All-Pole Filter Coefficients 
Firstnfiltercoefficientscanberoughlycalculatedusingthefirstntimeshiftsoftheautocorrelationofasignal 
Autocorrelationiscomputedbyconvolvingasignalwithtimeshiftedversionofitself 
Levinson-DurbinrecursionalgorithmallowsforquickcomputationifthematrixisToeplitzSymmetric 
Wanttocapturespectralenvelope,sowant~10filtercoefficients[5] 
Too many coefficients leads to over-fitting of curve 
References 
[1]http://www.spectrumdigital.com/product_info.php?cPath=31&products_id=238 
[2]https://www.sparkfun.com/products/12577 
[3]http://www.adafruit.com/product/1914 
[4]Dutoit,T.,Moreau,N.,Kroon,P.,Howisspeechprocessedinacellphoneconversation?,2009 
[5]Rabiner,L.,Schafer,R.,IntroductiontoDigitalSpeechProcessing,2007 
Bluesmirf : Bluetooth Communication 
Bluetoothimplementationwasagoalfortheteam.Usingpre-configuredBluesmirfdevices,thec5515isabletotransmitUARTdatatotheRaspberryPi. 
Transmit‘$$$’overUARTtosendBluesmirfintocommandmode 
ProgramtheBluetoothaddressofanotherBluesmirfmoduleintomemory.Transmita‘C’commandtopairthedevices 
DevelopedpreliminaryalgorithmusingMATLABanddesktopcomputerstomimicfunctionofC5515andRaspberryPi 
UsedMATLABCoderToolboxtoconvertcorealgorithmintoCcode 
ImplementedwrapperaroundalgorithmthathandlesUARTcommunication,GPIOtogglingandOpenVGlibraryimagegenerationanddataplotting 
ImplementedLibSVMforclassification 
EECS 452, Digital Signal Processing Design Lab, Fall 2014

More Related Content

What's hot

Waveform Generation Using TMS320C6745 DSP
Waveform Generation Using TMS320C6745 DSPWaveform Generation Using TMS320C6745 DSP
Waveform Generation Using TMS320C6745 DSP
Pantech ProLabs India Pvt Ltd
 
RAPIDS Overview
RAPIDS OverviewRAPIDS Overview
RAPIDS Overview
NVIDIA Japan
 
Advances in GPU Computing
Advances in GPU ComputingAdvances in GPU Computing
Advances in GPU Computing
Frédéric Parienté
 
GPU and Deep learning best practices
GPU and Deep learning best practicesGPU and Deep learning best practices
GPU and Deep learning best practices
Lior Sidi
 
Distributed deep learning optimizations - AI WithTheBest
Distributed deep learning optimizations - AI WithTheBestDistributed deep learning optimizations - AI WithTheBest
Distributed deep learning optimizations - AI WithTheBest
geetachauhan
 
The Revolution of Deep Learning
The Revolution of Deep LearningThe Revolution of Deep Learning
The Revolution of Deep Learning
Frédéric Parienté
 
TensorFlow for HPC?
TensorFlow for HPC?TensorFlow for HPC?
TensorFlow for HPC?
inside-BigData.com
 
Ai Forum at Computex 2017 - Keynote Slides by Jensen Huang
Ai Forum at Computex 2017 - Keynote Slides by Jensen HuangAi Forum at Computex 2017 - Keynote Slides by Jensen Huang
Ai Forum at Computex 2017 - Keynote Slides by Jensen Huang
NVIDIA Taiwan
 
Introduction to Deep Learning (NVIDIA)
Introduction to Deep Learning (NVIDIA)Introduction to Deep Learning (NVIDIA)
Introduction to Deep Learning (NVIDIA)
Rakuten Group, Inc.
 
Black-box Optimization of DNN-based Source Enhancement for Increasing Objecti...
Black-box Optimization of DNN-based Source Enhancement for Increasing Objecti...Black-box Optimization of DNN-based Source Enhancement for Increasing Objecti...
Black-box Optimization of DNN-based Source Enhancement for Increasing Objecti...
Yuma Koizumi
 
Breaking New Frontiers in Robotics and Edge Computing with AI
Breaking New Frontiers in Robotics and Edge Computing with AIBreaking New Frontiers in Robotics and Edge Computing with AI
Breaking New Frontiers in Robotics and Edge Computing with AI
Dustin Franklin
 
GTC Taiwan 2017 企業端深度學習與人工智慧應用
GTC Taiwan 2017 企業端深度學習與人工智慧應用GTC Taiwan 2017 企業端深度學習與人工智慧應用
GTC Taiwan 2017 企業端深度學習與人工智慧應用
NVIDIA Taiwan
 

What's hot (12)

Waveform Generation Using TMS320C6745 DSP
Waveform Generation Using TMS320C6745 DSPWaveform Generation Using TMS320C6745 DSP
Waveform Generation Using TMS320C6745 DSP
 
RAPIDS Overview
RAPIDS OverviewRAPIDS Overview
RAPIDS Overview
 
Advances in GPU Computing
Advances in GPU ComputingAdvances in GPU Computing
Advances in GPU Computing
 
GPU and Deep learning best practices
GPU and Deep learning best practicesGPU and Deep learning best practices
GPU and Deep learning best practices
 
Distributed deep learning optimizations - AI WithTheBest
Distributed deep learning optimizations - AI WithTheBestDistributed deep learning optimizations - AI WithTheBest
Distributed deep learning optimizations - AI WithTheBest
 
The Revolution of Deep Learning
The Revolution of Deep LearningThe Revolution of Deep Learning
The Revolution of Deep Learning
 
TensorFlow for HPC?
TensorFlow for HPC?TensorFlow for HPC?
TensorFlow for HPC?
 
Ai Forum at Computex 2017 - Keynote Slides by Jensen Huang
Ai Forum at Computex 2017 - Keynote Slides by Jensen HuangAi Forum at Computex 2017 - Keynote Slides by Jensen Huang
Ai Forum at Computex 2017 - Keynote Slides by Jensen Huang
 
Introduction to Deep Learning (NVIDIA)
Introduction to Deep Learning (NVIDIA)Introduction to Deep Learning (NVIDIA)
Introduction to Deep Learning (NVIDIA)
 
Black-box Optimization of DNN-based Source Enhancement for Increasing Objecti...
Black-box Optimization of DNN-based Source Enhancement for Increasing Objecti...Black-box Optimization of DNN-based Source Enhancement for Increasing Objecti...
Black-box Optimization of DNN-based Source Enhancement for Increasing Objecti...
 
Breaking New Frontiers in Robotics and Edge Computing with AI
Breaking New Frontiers in Robotics and Edge Computing with AIBreaking New Frontiers in Robotics and Edge Computing with AI
Breaking New Frontiers in Robotics and Edge Computing with AI
 
GTC Taiwan 2017 企業端深度學習與人工智慧應用
GTC Taiwan 2017 企業端深度學習與人工智慧應用GTC Taiwan 2017 企業端深度學習與人工智慧應用
GTC Taiwan 2017 企業端深度學習與人工智慧應用
 

Similar to Team Jarvis Poster

Better, Faster, Cheaper: Pick Any Three
. Building Open-Source Tools for Augm...
Better, Faster, Cheaper: Pick Any Three
. Building Open-Source Tools for Augm...Better, Faster, Cheaper: Pick Any Three
. Building Open-Source Tools for Augm...
Better, Faster, Cheaper: Pick Any Three
. Building Open-Source Tools for Augm...
philip_lamb
 
Captchadecoding
CaptchadecodingCaptchadecoding
Captchadecoding
popat_sagar
 
Hacking for salone: drone races
Hacking for salone: drone racesHacking for salone: drone races
Hacking for salone: drone races
Emanuele Di Saverio
 
From printed circuit boards to exploits
From printed circuit boards to exploitsFrom printed circuit boards to exploits
From printed circuit boards to exploits
virtualabs
 
아마존의 딥러닝 기술 활용 사례
아마존의 딥러닝 기술 활용 사례아마존의 딥러닝 기술 활용 사례
아마존의 딥러닝 기술 활용 사례
NAVER Engineering
 
Hacking for Salone: Drone Races - Di Saverio; Lippolis - Codemotion Milan 2016
Hacking for Salone: Drone Races - Di Saverio; Lippolis - Codemotion Milan 2016Hacking for Salone: Drone Races - Di Saverio; Lippolis - Codemotion Milan 2016
Hacking for Salone: Drone Races - Di Saverio; Lippolis - Codemotion Milan 2016
Codemotion
 
Harnessing AI for the Benefit of All.
Harnessing AI for the Benefit of All.Harnessing AI for the Benefit of All.
Harnessing AI for the Benefit of All.
Alison B. Lowndes
 
Tutorial 38 3D Print Coding
Tutorial 38 3D Print CodingTutorial 38 3D Print Coding
Tutorial 38 3D Print Coding
Max Kleiner
 
AI, A New Computing Model
AI, A New Computing ModelAI, A New Computing Model
AI, A New Computing Model
NVIDIA Taiwan
 
Real-Time Signal Processing: Implementation and Application
Real-Time Signal Processing:  Implementation and ApplicationReal-Time Signal Processing:  Implementation and Application
Real-Time Signal Processing: Implementation and Application
sathish sak
 
Introduction to PowerAI - The Enterprise AI Platform
Introduction to PowerAI - The Enterprise AI PlatformIntroduction to PowerAI - The Enterprise AI Platform
Introduction to PowerAI - The Enterprise AI Platform
Indrajit Poddar
 
Scientech trainer kit catalog
Scientech trainer kit catalogScientech trainer kit catalog
Scientech trainer kit catalog
ABHAYTAVPSC
 
Luigi Presentation at OSCON 2013
Luigi Presentation at OSCON 2013Luigi Presentation at OSCON 2013
Luigi Presentation at OSCON 2013
Erik Bernhardsson
 
Enabling Artificial Intelligence - Alison B. Lowndes
Enabling Artificial Intelligence - Alison B. LowndesEnabling Artificial Intelligence - Alison B. Lowndes
Enabling Artificial Intelligence - Alison B. Lowndes
WithTheBest
 
SoftKinetic
SoftKinetic SoftKinetic
SoftKinetic
imec
 
JETSON : AI at the EDGE
JETSON : AI at the EDGEJETSON : AI at the EDGE
JETSON : AI at the EDGE
Skolkovo Robotics Center
 
資工人為什麼需要學習數位電路?
資工人為什麼需要學習數位電路?資工人為什麼需要學習數位電路?
資工人為什麼需要學習數位電路?
Murphy Chen
 
Possibilities of generative models
Possibilities of generative modelsPossibilities of generative models
Possibilities of generative models
Alison B. Lowndes
 
Microsoft & Machine Learning / Artificial Intelligence
Microsoft & Machine Learning / Artificial IntelligenceMicrosoft & Machine Learning / Artificial Intelligence
Microsoft & Machine Learning / Artificial Intelligence
İbrahim KIVANÇ
 
Puppet Camp Dallas 2014: How Puppet Ops Rolls
Puppet Camp Dallas 2014: How Puppet Ops RollsPuppet Camp Dallas 2014: How Puppet Ops Rolls
Puppet Camp Dallas 2014: How Puppet Ops Rolls
Puppet
 

Similar to Team Jarvis Poster (20)

Better, Faster, Cheaper: Pick Any Three
. Building Open-Source Tools for Augm...
Better, Faster, Cheaper: Pick Any Three
. Building Open-Source Tools for Augm...Better, Faster, Cheaper: Pick Any Three
. Building Open-Source Tools for Augm...
Better, Faster, Cheaper: Pick Any Three
. Building Open-Source Tools for Augm...
 
Captchadecoding
CaptchadecodingCaptchadecoding
Captchadecoding
 
Hacking for salone: drone races
Hacking for salone: drone racesHacking for salone: drone races
Hacking for salone: drone races
 
From printed circuit boards to exploits
From printed circuit boards to exploitsFrom printed circuit boards to exploits
From printed circuit boards to exploits
 
아마존의 딥러닝 기술 활용 사례
아마존의 딥러닝 기술 활용 사례아마존의 딥러닝 기술 활용 사례
아마존의 딥러닝 기술 활용 사례
 
Hacking for Salone: Drone Races - Di Saverio; Lippolis - Codemotion Milan 2016
Hacking for Salone: Drone Races - Di Saverio; Lippolis - Codemotion Milan 2016Hacking for Salone: Drone Races - Di Saverio; Lippolis - Codemotion Milan 2016
Hacking for Salone: Drone Races - Di Saverio; Lippolis - Codemotion Milan 2016
 
Harnessing AI for the Benefit of All.
Harnessing AI for the Benefit of All.Harnessing AI for the Benefit of All.
Harnessing AI for the Benefit of All.
 
Tutorial 38 3D Print Coding
Tutorial 38 3D Print CodingTutorial 38 3D Print Coding
Tutorial 38 3D Print Coding
 
AI, A New Computing Model
AI, A New Computing ModelAI, A New Computing Model
AI, A New Computing Model
 
Real-Time Signal Processing: Implementation and Application
Real-Time Signal Processing:  Implementation and ApplicationReal-Time Signal Processing:  Implementation and Application
Real-Time Signal Processing: Implementation and Application
 
Introduction to PowerAI - The Enterprise AI Platform
Introduction to PowerAI - The Enterprise AI PlatformIntroduction to PowerAI - The Enterprise AI Platform
Introduction to PowerAI - The Enterprise AI Platform
 
Scientech trainer kit catalog
Scientech trainer kit catalogScientech trainer kit catalog
Scientech trainer kit catalog
 
Luigi Presentation at OSCON 2013
Luigi Presentation at OSCON 2013Luigi Presentation at OSCON 2013
Luigi Presentation at OSCON 2013
 
Enabling Artificial Intelligence - Alison B. Lowndes
Enabling Artificial Intelligence - Alison B. LowndesEnabling Artificial Intelligence - Alison B. Lowndes
Enabling Artificial Intelligence - Alison B. Lowndes
 
SoftKinetic
SoftKinetic SoftKinetic
SoftKinetic
 
JETSON : AI at the EDGE
JETSON : AI at the EDGEJETSON : AI at the EDGE
JETSON : AI at the EDGE
 
資工人為什麼需要學習數位電路?
資工人為什麼需要學習數位電路?資工人為什麼需要學習數位電路?
資工人為什麼需要學習數位電路?
 
Possibilities of generative models
Possibilities of generative modelsPossibilities of generative models
Possibilities of generative models
 
Microsoft & Machine Learning / Artificial Intelligence
Microsoft & Machine Learning / Artificial IntelligenceMicrosoft & Machine Learning / Artificial Intelligence
Microsoft & Machine Learning / Artificial Intelligence
 
Puppet Camp Dallas 2014: How Puppet Ops Rolls
Puppet Camp Dallas 2014: How Puppet Ops RollsPuppet Camp Dallas 2014: How Puppet Ops Rolls
Puppet Camp Dallas 2014: How Puppet Ops Rolls
 

Team Jarvis Poster

  • 1. Real Time Voice Actuation System Pragya Agrawal, Dominic Calabrese, David Martel, Nathan Sawicki Project Description Thegoalofourprojectistodesignandbuildareal-timespeechrecognitionsystem.ThisprojectpresentsmanyhardwareandsoftwarechallengesforimplementationinanembeddedenvironmentandisaperfectprojectforEECS452.Usingasourcefiltermodelofspeechandasupportvectormachineclassifieronaprerecordedlibraryofvocalcommands,ourteamwasabletorecognizethecommands“one” “two”“three”and“four”withhighaccuracy.Thefinishedsystemexecutesinreal-timeandhasGPIO-basedactuationtodemonstratefunctionalvoicerecognition. Hardware TMS320C5515 eZdsp™ USB Stick Development Tool[1] VocalinputandfeatureextractionishandledonthisfixedpointDSPchip Highspeedautocorrelationfunctionidealforfeatureextraction Raspberry Pi Model B+[3] 700MHzprocessor, BroadComSoCwithfloatingpointunit 40pinGPIO Idealforrunningclassificationalgorithminrealtime SparkfunBluetooth Modem –Bluesmirf[2] PairstheC5515andtheRaspberryPi PairswithotherBluesmirfmoduleswithrelativeease 115200baudratecapableofreal-timetransmissionandreceiving Raspberry Pi : Classification & Actuation C5515 : Detection & Feature Extraction Implementsthethresholddemonstratedinthefigureontherighttodetectcommands DSPlibraryprovidessimplefunctionsusefulforreal- timeDSPonC5515 Computesthe16-bitautocorrelationforspeechcommandsandtransmitsthisdataoverbluetoothUART Support Vector Machine Learning SupportVectorMachine(SVM)isasupervisedlearningalgorithmusedforanalyzingdataforbinaryclassificationandregressionanalysis. WeutilizeMulti-classSupportVectorMachine,anextensionofSVMforrealtimeclassificationofspokenwordsintooneofthefourclasses:“ONE”,“TWO”, ”THREE”,”FOUR” Ouralgorithmusesone-against-onemethodtoconstruct(k*(k-1)/2)classifiers(k=numberofclasses),oneSVMforeachpairofclasses.EachSVMistrainedondatafromtwoclasses,todistinguishthesamplesofoneclassfromanother.Classificationofanunknownpatternisdoneaccordingtomaximumvoting,whereeachSVMvotesforoneclass. LIBSVMtool,anintegratedsoftwareformulti-classsupportvectorclassificationisused.Itemploysradialbasiskernelfunctionforclassification. Source-Filter Model of Speech Wordcharacterizationshouldbeindependentofvolume,pitch,anddurationoftheword Simplifyspeechproductionmodeltobeing: 1. Source-vibrationofvocalchords 2. Filter–vocaltract(i.e.positioningoftongue,mouth,etc.) Filtermostaffectssoundofwordpositionvocaltractdifferentlytosaydifferentwords Accuratelymodelingthefilterprovidesabasisforwordrecognition[4] Broad sweeps of spectrum (formants) result from the filter configuration. Rapidly varying peaks come from source resonances All-Pole Filter Coefficients Firstnfiltercoefficientscanberoughlycalculatedusingthefirstntimeshiftsoftheautocorrelationofasignal Autocorrelationiscomputedbyconvolvingasignalwithtimeshiftedversionofitself Levinson-DurbinrecursionalgorithmallowsforquickcomputationifthematrixisToeplitzSymmetric Wanttocapturespectralenvelope,sowant~10filtercoefficients[5] Too many coefficients leads to over-fitting of curve References [1]http://www.spectrumdigital.com/product_info.php?cPath=31&products_id=238 [2]https://www.sparkfun.com/products/12577 [3]http://www.adafruit.com/product/1914 [4]Dutoit,T.,Moreau,N.,Kroon,P.,Howisspeechprocessedinacellphoneconversation?,2009 [5]Rabiner,L.,Schafer,R.,IntroductiontoDigitalSpeechProcessing,2007 Bluesmirf : Bluetooth Communication Bluetoothimplementationwasagoalfortheteam.Usingpre-configuredBluesmirfdevices,thec5515isabletotransmitUARTdatatotheRaspberryPi. Transmit‘$$$’overUARTtosendBluesmirfintocommandmode ProgramtheBluetoothaddressofanotherBluesmirfmoduleintomemory.Transmita‘C’commandtopairthedevices DevelopedpreliminaryalgorithmusingMATLABanddesktopcomputerstomimicfunctionofC5515andRaspberryPi UsedMATLABCoderToolboxtoconvertcorealgorithmintoCcode ImplementedwrapperaroundalgorithmthathandlesUARTcommunication,GPIOtogglingandOpenVGlibraryimagegenerationanddataplotting ImplementedLibSVMforclassification EECS 452, Digital Signal Processing Design Lab, Fall 2014