Brain signal seminar
1. Scientific Reports - Nature
Speech synthesis from ECoG using
densely connected 3D convolutional neural
networks
-
Miguel Angrick, Christian Herff, Emily Mugler, Matthew C. Tate,
Marc W. Slutzky, Dean J. Krusienski and Tanja Schultz
16 April 2019
NAFIZ ISHTIAQUE AHMED
LITERATURE REVIEW
UOU – UNIVERSITY OF ULSAN
2. Introduction
Simple linear models cannot capture the relationship between neural
activity and continuous spoken speech.
The article shows that deep neural networks can be used to map ECoG
to generate speech.
4. Introduction
ECoG signals, which provide the necessary temporal and spatial
resolution, could offer a fast and natural means of communication
for people with neurological diseases.
However, simple linear models are not sufficient to capture the
relation between neural activity and continuous spoken speech.
Thus, deep neural networks can be used to map ECoG from speech
production areas onto an intermediate representation of speech.
5. Introduction
Brainstem stroke can result in a loss of the ability to speak.
A BCI based on ECoG is particularly well suited for decoding speech
processes from invasively measured brain activity.
Densely connected convolutional neural networks are applied to ECoG
data, reconstructing high-quality audio from neural signals during
speech production.
6. Experiment
ECoG was recorded from six native English-speaking participants. All
subjects had normal speech and language function and normal hearing.
ECoG was recorded with a medium-density, 64-channel, 8
× 8 electrode grid.
Participants read between 244 and 372 single words shown
to them on a screen.
7. Architecture of the decoding approach
ECoG features for each time window are fed into a DenseNet regression
model to reconstruct the logarithmic mel-scaled spectrogram. WaveNet
is then used to reconstruct an audio waveform from the spectrogram.
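The defining idea of DenseNet is dense connectivity: every layer receives the concatenation of all preceding feature maps as input. A minimal sketch of that connectivity pattern in plain NumPy (not the paper's implementation; layer count, growth rate, and the use of simple linear layers with ReLU are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(0)

def dense_block(x, num_layers, growth_rate):
    """DenseNet-style block: each layer sees the concatenation of all
    previous feature maps (toy linear layers + ReLU for illustration)."""
    features = [x]
    for _ in range(num_layers):
        inp = np.concatenate(features, axis=-1)      # dense connectivity
        w = rng.standard_normal((inp.shape[-1], growth_rate)) * 0.1
        out = np.maximum(inp @ w, 0.0)               # ReLU activation
        features.append(out)
    return np.concatenate(features, axis=-1)

# Toy "ECoG feature" vector for one time window (e.g. 64 channels)
x = rng.standard_normal((1, 64))
y = dense_block(x, num_layers=4, growth_rate=12)
print(y.shape)  # (1, 64 + 4*12) = (1, 112)
```

In the actual model, 3D convolutions over the electrode grid and time replace these toy linear layers, and a final regression head maps the concatenated features to the log mel-scaled spectrogram bins.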
8. Reconstruction performance
(a) Pearson correlation coefficients between original and reconstructed
spectrograms for each participant. Bars indicate the mean over all
logarithmic mel-scaled coefficients.
(b) Detailed performance across all spectral bins for participant 5.
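The evaluation above computes a Pearson correlation per spectral bin and then averages over bins. A small sketch of that metric on synthetic data (shapes and the noisy-copy "reconstruction" are illustrative assumptions, not the paper's data):

```python
import numpy as np

def pearson_per_bin(original, reconstructed):
    """Pearson r between original and reconstructed spectrograms,
    computed independently for each spectral bin (column)."""
    rs = []
    for b in range(original.shape[1]):
        o = original[:, b]
        r = reconstructed[:, b]
        num = np.sum((o - o.mean()) * (r - r.mean()))
        den = np.sqrt(np.sum((o - o.mean()) ** 2) * np.sum((r - r.mean()) ** 2))
        rs.append(num / den)
    return np.array(rs)

rng = np.random.default_rng(1)
orig = rng.standard_normal((200, 40))                        # 200 frames, 40 mel bins
recon = 0.7 * orig + 0.3 * rng.standard_normal((200, 40))    # noisy "reconstruction"
rs = pearson_per_bin(orig, recon)
mean_r = rs.mean()  # mean over all mel coefficients, as in the bar plot
```

Averaging per-bin correlations (rather than correlating the flattened spectrograms) matches the per-coefficient bars shown for each participant.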
9. Reconstruction example for visual inspection
(a) A time-aligned excerpt of the original and reconstructed
spectrograms for participant 5.
(b) The generated waveform representation of the same excerpt as in
the spectrogram comparison.
10. Discussion
It is evident that the model has learned a distinguishable
representation between silence and acoustic speech, and captures
many of the intricate dynamics of human speech.
The network transforms the measured brain activity into spectral
features of speech. Correlations up to r = 0.69 across all frequency
bands were achieved by this network.
This is the first time that high-quality audio of speech has been
reconstructed from neural recordings of speech production using
deep neural networks.