The document describes two algorithms for weakly supervised denoising of EEG data:
1. An ICA and multi-instance learning solution that uses ICA to decompose EEG signals into components, extracts SAX features from the components, and uses multi-instance learning to classify components as artifacts or not.
2. An asymmetric generative adversarial network solution that is proposed to improve the model by making it online, fully automated, and end-to-end.
The talk discusses challenges in using EEG data like noise and the need for artifact removal algorithms, and provides an overview of related work on artifact removal including ICA-based approaches.
Two Algorithms Weakly Supervised Denoising EEG Data
1. Two Algorithms for Weakly Supervised
Denoising of EEG Data
Tim Oates
Professor
University of Maryland Baltimore County
2. The Real Brains
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 2
Sunil Gandhi
Ph.D. student
Will graduate soon!
3. Agenda
• Introduction To Artifact Removal From EEG data
• ICA And Multi-instance Learning Solution
• Asymmetric Generative Adversarial Network Solution
• Future Work
• Questions
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 3
4. • Introduction To Artifact Removal From EEG data
• ICA And Multi-instance Learning Solution
• Asymmetric Generative Adversarial Network Solution
• Future Work
• Questions
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 4
5. Electroencephalography (EEG)
• Electroencephalography (EEG) is used to record electrical
activity in the brain
5
Image Source: https://en.wikipedia.org/wiki/Electroencephalography
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data
6. Electroencephalography (EEG) Data
• Electroencephalography (EEG) is used to record electrical
activity in the brain
– Seizure detection
6
Page, Adam, Chris Sagedy, Emily Smith, Nasrin Attaran, Tim Oates, and Tinoosh Mohsenin. "A flexible multichannel EEG
feature extractor and classifier for seizure detection." IEEE Transactions on Circuits and Systems II: Express Briefs 62, no. 2
(2015): 109-113.
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data
7. Electroencephalography (EEG) Data
• Electroencephalography (EEG) is used to record electrical
activity in the brain
– Seizure detection
– Determining cognitive load
7
Bashivan, Pouya, Irina Rish, Mohammed Yeasin, and Noel Codella. "Learning representations from EEG with deep recurrent-
convolutional neural networks." International Conference on Learning Representations 2016.
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data
8. Electroencephalography (EEG) Data
• Electroencephalography (EEG) is used to record electrical
activity in the brain
– Seizure detection
– Determining cognitive load
– Brain Computer interfaces (BCI)
8
Source: https://www.youtube.com/watch?v=rYwgnFeFqmc
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data
9. Electroencephalography (EEG) Data
• Electroencephalography (EEG) is used to record electrical
activity in the brain
– Seizure detection
– Determining cognitive load
– Brain Computer interfaces (BCI)
– Driver drowsiness estimation
9
Lance, Brent J., W. David Hairston, Greg Apker, Keith W. Whitaker, Geoff Slipher, Randy Mrozek, Scott E. Kerick, Jason Metcalfe,
Christopher Manteuffel, and Matthew Jaswa. 2012 year-end report on neurotechnologies for in-vehicle applications. No. ARL-SR-267.
ARMY RESEARCH LAB ABERDEEN PROVING GROUND MD, 2013.
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data
10. Challenges in using EEG data
• Compact comfortable headsets
• Noisy Data
• Online, fully automated algorithms
• Hardware implementations
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 10
11. Challenges in using EEG data
• Compact comfortable headsets
• Noisy Data
• Online, fully automated algorithms
• Hardware implementations
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 11
12. Challenges in using EEG data
• Compact comfortable headsets
• Noisy Data
• Online, fully automated algorithms
• Hardware implementations
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 12
13. EEG Artifacts
• Artifacts: Unwanted electrical activity arising from sources
other than the brain
• Types of Artifacts
– Biological Artifacts
• Ocular artifacts
• Muscular artifacts
• Cardiac artifacts
– External artifacts
• Electrode motion
• External device artifacts
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 13
14. EEG Artifacts
• Artifacts: Unwanted electrical activity arising from sources
other than the brain
• Types of Artifacts
– Biological Artifacts
• Ocular artifacts
• Muscular artifacts
• Cardiac artifacts
– External artifacts
• Electrode motion
• External device artifacts
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 14
Eye Blink Artifact
15. EEG Artifacts
• Artifacts: Unwanted electrical activity arising from sources other
than the brain
• Types of Artifacts
– Biological Artifacts
– External Artifacts
• Why is it important to remove them?
– Increase the chance of false alarms in seizure detection (Seneviratne et al.
2013)
– They can also alter the shape of neurological events causing unintentional
control of BCI systems (Vaughan et al. 2003).
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 15
16. EEG Artifacts
• Artifacts: Unwanted electrical activity arising from sources other
than the brain
• Types of Artifacts
– Biological Artifacts
– External Artifacts
• Why is it important to remove them?
• Can occurrence of artifacts be avoided?
– The occurrence of external artifacts can be reduced by proper placement
of electrodes, but it is impossible to avoid artifacts of biological origin.
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 16
17. Contributions
• Propose a system for artifact removal from EEG data using
weak supervisory information.
• Improve the model by proposing an online, fully automated,
end-to-end system for artifact removal trained using unpaired
training corpora.
• Creating a framework for evaluation of artifact removal
algorithms
• Improving artifact removal using additional annotations
• Generalizing AsymmetricGAN for denoising speech data
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 17
18. Contributions
• Propose a system for artifact removal from EEG data using
weak supervisory information.
• Improve the model by proposing an online, fully automated,
end-to-end system for artifact removal trained using unpaired
training corpora.
• Creating a framework for evaluation of artifact removal
algorithms
• Improving artifact removal using additional annotations
• Generalizing AsymmetricGAN for denoising speech data
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 18
19. • Introduction To Artifact Removal From EEG data
• ICA And Multi-instance Learning Solution
• Asymmetric Generative Adversarial Network Solution
• Future Work
• Questions
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 19
20. Related Work
• Artifact Rejection(Nolan, Whelan, and Reilly 2010)
– Train an artifact detector
– Remove the segments of data where artifacts are present
– Excessive loss of information
• Regression using reference channel (Croft and Barry 2000)
– Use a reference channel like Electrocardiography (ECG), Electrooculography
(EOG) or Electromyography (EMG).
– Fail if a reference signal is not available.
• Adaptive filtering (Sweeney, Ward, and McLoone 2012)
– Generate an artifact signal that is uncorrelated to the EEG signal but correlated
to the reference signal.
– Fail if a reference signal is not available.
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 20
21. Related Work
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 21
• Wavelet transforms (Islam et al. 2017) (Daly et al. 2015)
– Decompose each channel using DWT
– Threshold the low frequency component and reconstruct the signal
– Process each channel separately and could miss important clues for
removing the artifact.
23. Related Work
• Principal component analysis (PCA) (Berg and Scherg 1991)
– One of the initial approaches that used principal component analysis
(PCA) for decomposing the signal and removing artifacts from the
decomposed signal.
– Uses orthogonal transformation to convert a multi channel EEG signal
into a set of linearly uncorrelated variables.
– It has been demonstrated that PCA is unable to separate some
artifactual components from brain signals, especially when they have
similar amplitudes (Fitzgibbon et al. 2007b).
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 23
24. Independent component analysis (ICA)
• Most popular technique for artifact removal from EEG data
• ICA is used to recover independent source signals called
components and then the components corresponding to
artifacts are identified.
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 24
25. ICA Based Artifact Removal
• Shackman, Alexander J., et al. "Identifying robust and sensitive frequency bands for
interrogating neural oscillations." Neuroimage 51.4 (2010): 1319-1333.
• Fitzgibbon, S. P., et al. "Removal of EEG noise and artifact using blind source separation."
Journal of Clinical Neurophysiology 24.3 (2007): 232-243.
• Joyce, Carrie A., Irina F. Gorodnitsky, and Marta Kutas. "Automatic removal of eye
movement and blink artifacts from EEG data using blind component separation."
Psychophysiology 41.2 (2004): 313-325.
• Jung, Tzyy-Ping, et al. "Removal of eye activity artifacts from visual event-related potentials
in normal and clinical subjects." Clinical Neurophysiology 111.10 (2000): 1745-1758.
• Jung, Tzyy-Ping, et al. "Removing electroencephalographic artifacts by blind source
separation." Psychophysiology 37.02 (2000): 163-178.
• Keren, Alon S., Shlomit Yuval-Greenberg, and Leon Y. Deouell. "Saccadic spike potentials in
gamma-band EEG: characterization, detection and suppression." Neuroimage 49.3 (2010):
2248-2263.
253/2/2019 Artifact Removal From Electroencephalography (EEG) Data
26. ICA Based Artifact Removal
• Shackman, Alexander J., et al. "Identifying robust and sensitive frequency bands for
interrogating neural oscillations." Neuroimage 51.4 (2010): 1319-1333.
• Fitzgibbon, S. P., et al. "Removal of EEG noise and artifact using blind source separation."
Journal of Clinical Neurophysiology 24.3 (2007): 232-243.
• Joyce, Carrie A., Irina F. Gorodnitsky, and Marta Kutas. "Automatic removal of eye
movement and blink artifacts from EEG data using blind component separation."
Psychophysiology 41.2 (2004): 313-325.
• Jung, Tzyy-Ping, et al. "Removal of eye activity artifacts from visual event-related potentials
in normal and clinical subjects." Clinical Neurophysiology 111.10 (2000): 1745-1758.
• Jung, Tzyy-Ping, et al. "Removing electroencephalographic artifacts by blind source
separation." Psychophysiology 37.02 (2000): 163-178.
• Keren, Alon S., Shlomit Yuval-Greenberg, and Leon Y. Deouell. "Saccadic spike potentials in
gamma-band EEG: characterization, detection and suppression." Neuroimage 49.3 (2010):
2248-2263.
Artifact components need to be manually
identified or supervisory signal is needed for
training
263/2/2019 Artifact Removal From Electroencephalography (EEG) Data
27. ICA Based Artifact Removal
• Shackman, Alexander J., et al. "Identifying robust and sensitive frequency bands for
interrogating neural oscillations." Neuroimage 51.4 (2010): 1319-1333.
• Fitzgibbon, S. P., et al. "Removal of EEG noise and artifact using blind source separation."
Journal of Clinical Neurophysiology 24.3 (2007): 232-243.
• Joyce, Carrie A., Irina F. Gorodnitsky, and Marta Kutas. "Automatic removal of eye
movement and blink artifacts from EEG data using blind component separation."
Psychophysiology 41.2 (2004): 313-325.
• Jung, Tzyy-Ping, et al. "Removal of eye activity artifacts from visual event-related potentials
in normal and clinical subjects." Clinical Neurophysiology 111.10 (2000): 1745-1758.
• Jung, Tzyy-Ping, et al. "Removing electroencephalographic artifacts by blind source
separation." Psychophysiology 37.02 (2000): 163-178.
• Keren, Alon S., Shlomit Yuval-Greenberg, and Leon Y. Deouell. "Saccadic spike potentials in
gamma-band EEG: characterization, detection and suppression." Neuroimage 49.3 (2010):
2248-2263.
Key Idea: Identify components corresponding to
artifacts using weak supervisory signal
273/2/2019 Artifact Removal From Electroencephalography (EEG) Data
28. System Architecture
28
Mul$ Channel EEG Signal
Channel 1
Channel 2
Channel 64
.
...
..
ICA
Bag of components
.
..
SAX
Feature
Extrac0on .
..
0 1 0 1 0 0
Mul0-
Instance
Learning
(MIL)
0 0 1 1 0 0
0 0 1 1 1 1
Vector represen$ng
component
Sax string: "aaa" "abc" "aab" "aba" "acb" "acc"
Probability of noise
for each component
Ar$fact/ Not Ar$fact?
• Main Processing Blocks
⎻ Independent Component Analysis
⎻ SAX feature extraction
⎻ Multi Instance Learning
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data
32. Independent component analysis (ICA)
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 32
ICA
Clean EEG
Signal
Artifact
Mixing
System
Observed 1
Observed 2
Observed 3
Denoised EEG
Artifact
Unknown
33. System Architecture
33
Mul$ Channel EEG Signal
Channel 1
Channel 2
Channel 64
.
...
..
ICA
Bag of components
.
..
SAX
Feature
Extrac0on .
..
0 1 0 1 0 0
Mul0-
Instance
Learning
(MIL)
0 0 1 1 0 0
0 0 1 1 1 1
Vector represen$ng
component
Sax string: "aaa" "abc" "aab" "aba" "acb" "acc"
Probability of noise
for each component
Ar$fact/ Not Ar$fact?
• Main Processing Blocks
⎻ Independent Component Analysis
⎻ SAX feature extraction
⎻ Multi Instance Learning
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data
34. Slide by Eamonn Keogh(http://www.cs.ucr.edu/~eamonn/SAX.ppt)
34
How do we obtain SAX features?
0
00 20 40 60 80 100 120
bb
b
a
c
c
c
a
baabccbc
0
20
20 40 60 80 100 120
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data
35. System Architecture
35
Mul$ Channel EEG Signal
Channel 1
Channel 2
Channel 64
.
...
..
ICA
Bag of components
.
..
SAX
Feature
Extrac0on .
..
0 1 0 1 0 0
Mul0-
Instance
Learning
(MIL)
0 0 1 1 0 0
0 0 1 1 1 1
Vector represen$ng
component
Sax string: "aaa" "abc" "aab" "aba" "acb" "acc"
Probability of noise
for each component
Ar$fact/ Not Ar$fact?
• Main Processing Blocks
⎻ Independent Component Analysis
⎻ SAX feature extraction
⎻ Multi Instance Learning
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data
37. Multi-Instance Learning
• A bag is labelled positive if it has at least one positive instance
• Goal: Predict bag label and instance labels
Bag
Positive/Negative Bag?
373/2/2019 Artifact Removal From Electroencephalography (EEG) Data
38. Multi-Instance Learning
• A bag is labelled positive if it has at least one positive instance
• Goal: Predict bag label and instance label
Bag
Positive/Negative Instance?
Positive/Negative Instance?
Positive/Negative Instance?
Positive/Negative Instance?
383/2/2019 Artifact Removal From Electroencephalography (EEG) Data
39. System Architecture
39
Mul$ Channel EEG Signal
Channel 1
Channel 2
Channel 64
.
...
..
ICA
Bag of components
.
..
SAX
Feature
Extrac0on .
..
0 1 0 1 0 0
Mul0-
Instance
Learning
(MIL)
0 0 1 1 0 0
0 0 1 1 1 1
Vector represen$ng
component
Sax string: "aaa" "abc" "aab" "aba" "acb" "acc"
Probability of noise
for each component
Ar$fact/ Not Ar$fact?
• Main Processing Blocks
⎻ Independent Component Analysis
⎻ SAX feature extraction
⎻ Multi Instance Learning
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data
40. EEG Dataset Generated by the US Army Research
Laboratory
40
EEG when participant is
not performing any activity
• 17 subjects
• Artifacts
o Sync pulse
o Clench jaw
o Move jaw vertically
o Blink (no squinting!)
o Move eyes left, then back
to center
o Move eyes up, then back
to center
o Raise and lower eyebrows
o Rotate head side-to-side
o Shrug shoulders
o Rotate torso (hips) EEG when participant is
raising and lowering eyebrows
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data
41. Results
41
Algorithm Kernel Accuracy
Normalized Set Kernel Linear 95.2%
MISVM Linear 73.2%
miSVM Linear 70.0%
MISVM Quadratic Kernel 81.6%
miSVM Quadratic Kernel 66.0%
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data
43. Accuracy
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 43
Patient ID LOSO Accuracy Personalized Accuracy
1 72.54901961 98.04(3.39)
2 76.47058824 96.08(3.39)
3 76.47058824 92.16(6.79)
4 82.35294118 98.04(3.39)
5 84.31372549 98.04(3.39)
6 31.37254902 94.12(0)
7 94.11764706 100(0)
8 100 100(0)
9 98.03921569 100(0)
10 66.66666667 90.2(6.8)
11 98.03921569 100(0)
12 80.39215686 94.12(10.19)
13 88.23529412 100(0)
14 49.01960784 100(0)
15 66.66666667 90.2(12.25)
16 54.90196078 100(0)
17 90.19607843 100(0)
Average 77.0472895 97.12
Patient ID Accuracy
1 100(0)
2 89(7.3)
3 100(0)
4 99(2.8)
5 100((0)
6 97(5.5)
7 97(5.5)
Average 98(5.5)
Pooled 97(1.9)
Bootstrap 14(17.7)
LOSO 75
Lawhern, Vernon, W. David Hairston, Kaleb McDowell,
Marissa Westerfield, and Kay Robbins. "Detection and
classification of subject-generated artifacts in EEG
signals using autoregressive models." Journal of
neuroscience methods 208, no. 2 (2012): 181-189.
44. Summary
• We presented an EEG artifact removal system, using ICA, SAX
feature extraction, and Multi-Instance Learning algorithms. The
proposed system uses a weak supervisory signal to indicate
that some noise is occurring, but not what the source of the
noise is or how it is manifested in the EEG signal. We optimize
the hyper-parameters of the system to reduce the execution
time of the system while maintaining accuracy.
45
Jafari, Ali, Sunil Gandhi, Sri Harsha Konuru, W. David Hairston, Tim Oates, and Tinoosh Mohsenin. "An EEG artifact
identification embedded system using ICA and multi-instance learning." In Circuits and Systems (ISCAS), 2017 IEEE
International Symposium on, pp. 1-4. IEEE, 2017.
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data
45. Limitations
• Independent component analysis needs large number of
samples to converge. ICA has high computational complexity
and large memory requirements, making it unsuitable for real-
time applications.
• Each subsystem has its own hyperparameters and tuning them
jointly is a challenging task.
463/2/2019 Artifact Removal From Electroencephalography (EEG) Data
46. Limitations
• Independent component analysis needs large number of
samples to converge. ICA has high computational complexity
and large memory requirements, making it unsuitable for real-
time applications.
• Each subsystem has its own hyperparameters and tuning them
jointly is a challenging task. Also,
47
Can we learn an end to end system to generate
clean EEG from noisy EEG using deep neural
networks
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data
47. • Introduction To Artifact Removal From EEG data
• ICA And Multi-instance Learning Solution
• Asymmetric Generative Adversarial Network Solution
• Future Work
• Questions
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 48
48. Background
• Generative Adversarial network (GAN)
• CycleGAN: Unpaired Image-to-Image Translation using Cycle-
Consistent Adversarial Networks
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 49
49. Generative Adversarial Networks
• Generator network: try to fool the discriminator by generating real-looking images
• Discriminator network: try to distinguish between real and fake images
real or fake?
Discriminator
z
G(z)
D
Generator
G
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 50
56. CycleGAN Model
A B
G_B
G_A
Da Db
• Problems
– Network G_A does not have
access to predicted noise
signal to reconstruct A
– The network is penalized for
not adding noise that is the
same as predicted noise
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 57
58. AsymmetricGAN Model
A B
G_B
G_A
Da Db
We preserve the noise in the signal and use it for reconstruction of the original signal
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 59
A BN
G_B
G_NG_N
+
Da Db
60. AsymmetricGAN Model
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 61
G_N is a function that extracts noise if the input is a noisy signal A. It generates noise if the input is a clean signal B.
61. Generator and discriminator network architecture
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 62
62. Generator and discriminator network architecture
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 63
• Network is independent of the
relative ordering of the
channels
• Channel indices do not
correspond to the spatial
locations of the electrodes.
• Filter Size:
• 1D convolution with filter
size K
• 2D convolution with filter
size N ✕ k
Qi, Charles R., Hao Su, Kaichun Mo, and Leonidas J. Guibas.
"Pointnet: Deep learning on point sets for 3d classification and
segmentation." Proc. Computer Vision and Pattern
Recognition (CVPR), IEEE 1, no. 2 (2017): 4.
63. Evaluation
• Evaluation of artifact removal from EEG data is difficult as
ground truth signal does not exist.
• We just have set of clean EEG signals and noisy EEG signals. We
don’t have clean version of noisy EEG.
• Visualizing and understanding EEG data is a time consuming
task.
• To validate the model we create a simpler synthetic dataset.
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 64
64. Synthetic Dataset
• Clean signal is a linear combination of a sine and a square wave
• Noisy signal is linear combination of sine, square and sawtooth
waves
• The period of the sine and square waves is randomly selected
between 2 and 5. Sawtooth has fixed period of 6
• Number of samples in each time series is 1000
• Training set: 4000 Signals
• Validation set: 1000 Signals
• Test set: 100 Signals
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 65
65. Synthetic Dataset
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 66
(e) True Sources
(c) Signal cleaned by Asymmetric GAN
(a) Signal with Artifact (b) Clean Signal
(d) Noise Signal predicted by Asymmetric GAN
66. Synthetic Dataset
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 67
(e) True Sources
(c) Signal cleaned by Asymmetric GAN
(a) Signal with Artifact (b) Clean Signal
(d) Noise Signal predicted by Asymmetric GAN
The MSE between the ground truth clean signal and
denoised signal is 0.0406. The mean MSE error for entire
test set is 0.0387 and standard deviation is 0.0043.
67. EEG Dataset Generated by the US Army Research
Laboratory
• During collection of this dataset, participants were told to not move and look
straight at the computer screen for the collection of clean data.
• Despite the instruction to the patients, we noticed that there were artifacts even
in “clean” data. We also noticed that some “noisy” EEG did not contain the
corresponding artifact.
• Manually annotating all artifacts from all channels for all artifacts is a time
consuming task. So in this work, we focus on ocular artifacts in the Fp1 electrode
of the frontal region.
• Remove all patients that have more than two ocular artifacts in the clean data
and do not have artifacts in the region of eyebrow raising.
• In the resulting dataset, we have 4 patients with clean data and 10 patients with
noisy data.
• Each patient’s clean data has 4836 samples and noisy data has 420354 samples.
We use this manually annotated data in all experiments below.
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 68
70. Evaluation by detection
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 71
• We use artifact detection as a way of measuring the
performance of artifact removal.
• We first train artifact detection and artifact removal and use
the artifact detector to classify every window in EEG data
denoised by the artifact removal algorithm.
• The error is given by the percent of windows where an artifact
was detected in denoised EEG data.
71. Dataset Division
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 72
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
Clean Noisy Clean Noisy Clean Noisy Clean Noisy
Train Test Train Test
Artifact Detector Artifact Removal
72. Evaluation by detection
• Artifact Detector accuracy: 97.39%
• Denoised signal was classified as clean by artifact detector
72.37% times.
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 73
73. Summary
• We presented an online, fully automated, end-to-end system
for denoising EEG data. Our system for denoising is trained
using unpaired training corpora. It does not need any
information about the source of the noise or how it is
manifested in the EEG signal. We created a synthetic dataset
and used it to validate our network. We also used our system
to remove artifacts from existing EEG dataset
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 74
Sunil Gandhi, Tim Oates, Tinoosh Mohsenin, Dave Hairston. ”Denoising Time Series Data Using Asymmetric Generative
Adversarial Networks.” In Pacific-Asia Conference on Knowledge Discovery and Data Mining, 2018.
74. • Introduction To Artifact Removal From EEG data
• ICA And Multi-instance Learning Solution
• Asymmetric Generative Adversarial Network Solution
• Future Work
• Questions
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 75
75. Future Work
• Creating a framework for evaluation of artifact removal
algorithms
• Removing multiple artifacts using AsymmetricGAN
• Improving artifact removal using additional annotations
• Generalizing AsymmetricGAN for speech data
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 76
76. Evaluation of artifact removal algorithms
• Evaluating artifact removal algorithms is important before they
can be used in clinical contexts.
• But, it is challenging because of a lack of availability of ground
truth clean EEG signals.
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 77
77. Evaluation of artifact removal algorithms
• Evaluating artifact removal algorithms is important before they
can be used in clinical contexts.
• But, it is challenging because of a lack of availability of ground
truth clean EEG signals.
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 78
The performance evaluation of artifact removal methods found in the literature is
always problematic. It can be done either by visually by expert which is subjective
or by synthetic/semi-synthetic data (but uncertainty of reconstructed data whether
perfectly realistic or not). Since there is neither any ground truth data available nor
any universal or standard quantitative metric(s) used in the literature that can
capture both amount of artifact removal and distortion. … Therefore, it is not fair
to tell which performs best based on the study.
Islam, Md Kafiul, Amir Rastegarnia, and Zhi Yang. "Methods for artifact detection and removal from scalp EEG: A
review." Neurophysiologie Clinique/Clinical Neurophysiology 46, no. 4-5 (2016): 287-305.
78. Evaluation of artifact removal algorithms
• Qualitative Evaluation
• Evaluation using simulated data
• Evaluation using detection
• Correlating with the reference signal
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 79
79. Evaluation of artifact removal algorithms
• Qualitative Evaluation
– Subjective
– Does not give quantitative metric
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 80
80. Evaluation of artifact removal algorithms
• Qualitative Evaluation
• Evaluation using simulated data
– Allows usage of metrics like signal-to-noise ratio (SNR) and
normalized mean squared error to compare deviation between the
denoised signal and clean signal.
– Training and evaluation can be performed on large datasets
– But it is very difficult to replicate all the characteristics of EEG data
like synchronization among channels, time-locking to ERPs and
contamination by different types of artifacts in a realistic manner.
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 81
81. Evaluation of artifact removal algorithms
• Qualitative Evaluation
• Evaluation using simulated data
• Evaluation using detection
– Use artifact detector to check if artifacts are present in the denoised signal.
– Can be used to evaluate denoising of real EEG data
– Automated, not subjective and gives a quantitative measure to compare several
artifact removal algorithms.
– Usefulness of this metric is dependent on the performance of the artifact
detector.
– Does not give a measure of the sensitivity (whether the method removes the
artifacts) and specificity (whether it preserves EEG signals) of the artifact
removal algorithm.
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 82
82. Evaluation of artifact removal algorithms
• Qualitative Evaluation
• Evaluation using simulated data
• Evaluation using detection
• Correlating with the reference signal
– Cannot be performed if a reference signal for the corresponding
artifact is not available.
– Does not give a measure of the sensitivity (whether the method
removes the artifacts) and specificity (whether it preserves EEG
signals) of the artifact removal algorithm.
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 83
83. Evaluation of artifact removal algorithms
• Evaluation Methods
– Qualitative Evaluation
– Evaluation using simulated data
– Evaluation using detection
– Correlating with the reference signal
• Multiple evaluation methods with several variants of each of these methods exist in
literature.
• Each of these methods has their strengths and limitations. This makes performance
comparison of these methods hard.
• In future we plan to
– Standardize the evaluation mechanism
– Create a tool for evaluating artifact removal algorithms using all of the above approaches.
– Comparing our approach with existing methods using the tool
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 84
84. Future Work
• Creating a framework for evaluation of artifact removal
algorithms
• Removing multiple artifacts using AsymmetricGAN
• Improving artifact removal using additional annotations
• Generalizing AsymmetricGAN for speech data
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 85
85. Removing multiple artifacts using AsymmetricGAN
• Having separate algorithms for removal of different types of
artifacts makes the preprocessing step inefficient.
• In future, we plan to adapt the AsymmetricGAN architecture for
removing multiple types of artifacts.
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 86
G_B
G_NG_N
+
Da Db
MAA B
OA
86. Future Work
• Creating a framework for evaluation of artifact removal
algorithms
• Removing multiple artifacts using AsymmetricGAN
• Improving artifact removal using additional annotations
• Generalizing AsymmetricGAN for speech data
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 87
87. Improving artifact removal using additional
annotations
• Modify AsymmetricGAN to utilize
– artifact type
– location of artifact
– reference channel.
• We will study differences in performance because of each type
of additional data.
• This will give insight on which annotations are most helpful for
artifact removal algorithms, thus helping creation of future
datasets.
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 88
88. Future Work
• Creating a framework for evaluation of artifact removal
algorithms
• Removing multiple artifacts using AsymmetricGAN
• Improving artifact removal using additional annotations
• Generalizing AsymmetricGAN for speech data
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 89
89. Generalizing AsymmetricGAN for speech data
• Speech denoising produces noise-free speech signals from noisy recordings
• Applications
– Recognizing speech in environments with background noise like car driving.
– Reduce discomfort and increase understanding of the speech for people wearing
hearing aids
• Easier to evaluate
• Evaluate generalizability of asymmetricGAN across domains
• Datasets:
– Voice Bank corpus: Speech data for 400 sentences from 28 speakers each.
– Demand dataset: Real world background noise of 18 diverse environments
3/2/2019 Artifact Removal From Electroencephalography (EEG) Data 90
Wu, Dongrui, Vernon J. Lawhern, Stephen Gordon, Brent J. Lance, and Chin-Teng Lin. "Offline EEG-based driver drowsiness estimation using enhanced batch-mode active learning (EBMAL) for regression." In Systems, Man, and Cybernetics (SMC), 2016 IEEE International Conference on, pp. 000730-000736. IEEE, 2016.Harvard
Lance, Brent J., W. David Hairston, Greg Apker, Keith W. Whitaker, Geoff Slipher, Randy Mrozek, Scott E. Kerick, Jason Metcalfe, Christopher Manteuffel, and Matthew Jaswa. 2012 year-end report on neurotechnologies for in-vehicle applications. No. ARL-SR-267. ARMY RESEARCH LAB ABERDEEN PROVING GROUND MD, 2013.
Offline EEG-Based Driver Drowsiness Estimation Using Enhanced Batch-Mode Active Learning (EBMAL) for Regression
assess the state of human vehicle operators and integrate that state information into future vehicle system
Wu, Dongrui, Vernon J. Lawhern, Stephen Gordon, Brent J. Lance, and Chin-Teng Lin. "Offline EEG-based driver drowsiness estimation using enhanced batch-mode active learning (EBMAL) for regression." In Systems, Man, and Cybernetics (SMC), 2016 IEEE International Conference on, pp. 000730-000736. IEEE, 2016.Harvard
Lance, Brent J., W. David Hairston, Greg Apker, Keith W. Whitaker, Geoff Slipher, Randy Mrozek, Scott E. Kerick, Jason Metcalfe, Christopher Manteuffel, and Matthew Jaswa. 2012 year-end report on neurotechnologies for in-vehicle applications. No. ARL-SR-267. ARMY RESEARCH LAB ABERDEEN PROVING GROUND MD, 2013.
Offline EEG-Based Driver Drowsiness Estimation Using Enhanced Batch-Mode Active Learning (EBMAL) for Regression
assess the state of human vehicle operators and integrate that state information into future vehicle system
Page, Adam, Chris Sagedy, Emily Smith, Nasrin Attaran, Tim Oates, and Tinoosh Mohsenin. "A flexible multichannel EEG feature extractor and classifier for seizure detection." IEEE Transactions on Circuits and Systems II: Express Briefs 62, no. 2 (2015): 109-113.
Wu, Dongrui, Vernon J. Lawhern, Stephen Gordon, Brent J. Lance, and Chin-Teng Lin. "Offline EEG-based driver drowsiness estimation using enhanced batch-mode active learning (EBMAL) for regression." In Systems, Man, and Cybernetics (SMC), 2016 IEEE International Conference on, pp. 000730-000736. IEEE, 2016.Harvard
Lance, Brent J., W. David Hairston, Greg Apker, Keith W. Whitaker, Geoff Slipher, Randy Mrozek, Scott E. Kerick, Jason Metcalfe, Christopher Manteuffel, and Matthew Jaswa. 2012 year-end report on neurotechnologies for in-vehicle applications. No. ARL-SR-267. ARMY RESEARCH LAB ABERDEEN PROVING GROUND MD, 2013.
Offline EEG-Based Driver Drowsiness Estimation Using Enhanced Batch-Mode Active Learning (EBMAL) for Regression
assess the state of human vehicle operators and integrate that state information into future vehicle system
Wu, Dongrui, Vernon J. Lawhern, Stephen Gordon, Brent J. Lance, and Chin-Teng Lin. "Offline EEG-based driver drowsiness estimation using enhanced batch-mode active learning (EBMAL) for regression." In Systems, Man, and Cybernetics (SMC), 2016 IEEE International Conference on, pp. 000730-000736. IEEE, 2016.Harvard
assess the state of human vehicle operators and integrate that state information into future vehicle system
There are bunch of other applications that I won’t go into.
Some electrodes
There is a strong need for method for denoising the EEG data
We saw few slides back that it is important problem and large number of methods have been proposed
Users with relatively short data files, for whom segment rejection would lead to an unacceptably low remaining number of segments for analysis, may choose an optional post-segmentation step involving the interpolation of data for channels determined to be artifact-contaminated within each individual segment, as implemented by FASTER software (Nolanet al., 2010). Each channel in each segment is evaluated onthe four FASTER criteria (variance, median gradient, amplitude range, and deviation from mean amplitude), and the Z score(a measure of standard deviation from the mean) for each channel in that segment is generated for each of the fourmetrics. Any channels with one or more Z scores that are greater than 3 standard deviations from the mean for an individual segment are marked bad for that segment. These criteria may identify segments with residual high-amplitude artifacts(e.g., eye artifacts), electrode discontinuity (e.g., electrode has lost contact with the scalp temporarily), and muscle artifact. Subsequently, for each segment, the bad channels have their data interpolated with spherical splines, as in FASTER. This allows users to maintain the maximum number of available segments, while still maximizing artifact rejection within individual segments.
captures both frequency and location information
Artifacts should be decomposable in a wavelet basis.
Mother wavelet, the shrinkage rule, and the threshold are important to the design of the noise removal method (Daly et al. 2013).
This problem is widely studied and large number of
Annotation or reference channel
Annotation or reference channel
In this paper, we present a novel software-hardware system that uses a weak supervisory signal to indicate that some noise is occurring, but not what the source of the noise is or how it is manifested in the EEG signal.
The slide shows our proposed systtem architecture , First
First 64 EEG data are decomposed into independent components using ICA, and these components form bags that are labeled 1/0 where 1 indicates that there is a noise.
Each bag is then classified by a multi-instance learning algorithm to identify if it has noise component or not. We use Symoblic Aggregate Approximation (SAX) for featurizing each component.
We go through each of these subsystems in next slides
In this paper, we present a novel software-hardware system that uses a weak supervisory signal to indicate that some noise is occurring, but not what the source of the noise is or how it is manifested in the EEG signal.
The slide shows our proposed systtem architecture , First
First 64 EEG data are decomposed into independent components using ICA, and these components form bags that are labeled 1/0 where 1 indicates that there is a noise.
Each bag is then classified by a multi-instance learning algorithm to identify if it has noise component or not. We use Symoblic Aggregate Approximation (SAX) for featurizing each component.
We go through each of these subsystems in next slides
We featurize each component using SAX before using multi instance learning. This slide demonstrates how the SAX works on a simple time series signal. We partition the signal into different levels where each level manifests a letter. For example, this window of the signal is partitioned into three levels which represents the symbols of “baabccbc” word. We use bag of words generated from SAX to classify the time series.
For featurizing EEG, we convert time series to symbols In this case time series shown in first figure is converted to symbols “baabccbc”.
In this paper, we present a novel software-hardware system that uses a weak supervisory signal to indicate that some noise is occurring, but not what the source of the noise is or how it is manifested in the EEG signal.
The slide shows our proposed systtem architecture , First
First 64 EEG data are decomposed into independent components using ICA, and these components form bags that are labeled 1/0 where 1 indicates that there is a noise.
Each bag is then classified by a multi-instance learning algorithm to identify if it has noise component or not. We use Symoblic Aggregate Approximation (SAX) for featurizing each component.
We go through each of these subsystems in next slides
Our goal is given a new bag predict if it is positive or negative.
Multi instance learning algorithms also allows us to predict if the instance is positive or negative. In our case positive instance corresponds to noisy component in EEG data.
In this paper, we present a novel software-hardware system that uses a weak supervisory signal to indicate that some noise is occurring, but not what the source of the noise is or how it is manifested in the EEG signal.
The slide shows our proposed systtem architecture , First
First 64 EEG data are decomposed into independent components using ICA, and these components form bags that are labeled 1/0 where 1 indicates that there is a noise.
Each bag is then classified by a multi-instance learning algorithm to identify if it has noise component or not. We use Symoblic Aggregate Approximation (SAX) for featurizing each component.
We go through each of these subsystems in next slides
Although few reasons these numbers cannot be directly compared:
We do 3-Fold cross validation, they randomly generate 20 train test split
They are using 7 patients from same dataset. I am not sure which patients they are using
The goals are different they are doing classification, we are doing classification as well as artifact component removal.
Also, I know there are problems with evaluation, I will talk about evaluation at the end
Another approach we are working on to remove artifacts from noisy EEG data is using generative adversarial networks. This approach has been used recently in computer vision for converting images from one domain to another. For example, in this paper, unpaired image-to-image translation using cycle-consistent adversarial networks (CycleGAN) convert paintings of Monet to photos. They also convert photos of zebras to horses, winter to summer and reverse.
We want to use this deep neural network to remove artifacts from EEG data. This architecture is a natural fit for this problem as it does not require any paired data. I will go over how cycleGAN work in the next few slides.
For explaining cycleGAN, I will use example of converting a horse image to zebra image as it is easier to explain. In this example, X is images of all horses and Y is images of all zebras. G is convolutional autoencoder which converts horse image to corresponding zebra image and F is another convolutional autoencoder which converts zebra image to horse image. Dx and Dy are convolutional neural networks that discriminate between generated and real images. For training this neural network, we give a horse and zebra image as input and calculate the loss. After calculating the loss, we use standard gradient descent for backpropagation and train the network.
We describe how we calculate the loss function in next slide.
Another approach we are working on to remove artifacts from noisy EEG data is using generative adversarial networks. This approach has been used recently in computer vision for converting images from one domain to another. For example, in this paper, unpaired image-to-image translation using cycle-consistent adversarial networks (CycleGAN) convert paintings of Monet to photos. They also convert photos of zebras to horses, winter to summer and reverse.
We want to use this deep neural network to remove artifacts from EEG data. This architecture is a natural fit for this problem as it does not require any paired data. I will go over how cycleGAN work in the next few slides.
Give Example
Talk about how network cheats
Visual Analogy of the problem
Give Example
Talk about how network cheats
Give Example
Talk about how network cheats
Color the arrows
To train the asymmetricGAN and reduce model oscillation, we use the strategy of
(Shrivastava et al. 2016) of saving the history of generated time series. We update the discriminators using a history of generated signals rather than the ones produced by the latest generative networks. We keep a buffer that stores the 50 previously generated signals for training. In the next section, we show the effectiveness of our method on synthetic and EEG datasets.
Explain the architecture visually
Artifact removal from EEG data is a problem where ground truth does not exist be- cause we do not have a clean version of the noisy signal. This makes the evaluation of artifact removal methods difficult. Also, visualizing and understanding EEG data is a time consuming task. We solve this problem by creating a simpler synthetic dataset.
Talk about the fact that this is qualitative example how mse just gives consistency across the dataset
MSE(a,b) = 0.5
Talk about the fact that this is qualitative example how mse just gives consistency across the dataset
MSE(a,b) = 0.5
Give Example
More Examples in Thesis
Qualitative Evaluation
Subjective
Useful for validating the algorithm.
Most used
Evaluation using detection
Most used
Methods generally differ in how they model characteristics of artifact signals. (Daly et al. 2012) proposes to use mean power and its standard deviation, the maximum amplitude and its standard deviation, kurtosis, and skewness of the amplitude of resting state background rhythms to characterize the clean and noisy EEG signal. (Mognon et al. 2010) learns features such as kur- tosis, spatial average difference, and maximum epoch variance for modeling ocular artifacts. (Delorme, Sejnowski, and Makeig 2007b) learns to characterize a larger variety of artifacts using extreme values, linear trends, data improbability, kurtosis and spectral patterns. We use an artifact detector that uses a convolutional neural network to evaluate artifact removal algorithms in Chapter 4.
Most used
Methods generally differ in how they model characteristics of artifact signals. (Daly et al. 2012) proposes to use mean power and its standard deviation, the maximum amplitude and its standard deviation, kurtosis, and skewness of the amplitude of resting state background rhythms to characterize the clean and noisy EEG signal. (Mognon et al. 2010) learns features such as kur- tosis, spatial average difference, and maximum epoch variance for modeling ocular artifacts. (Delorme, Sejnowski, and Makeig 2007b) learns to characterize a larger variety of artifacts using extreme values, linear trends, data improbability, kurtosis and spectral patterns. We use an artifact detector that uses a convolutional neural network to evaluate artifact removal algorithms in Chapter 4.
Most used
Multiple evaluation methods for artifact removal algorithms have been proposed in the literature. Each of these methods has their strengths and limitations. Thus, it is important to use as many evaluation mechanisms as possible. But, most existing papers use one or two of these methods to evaluate their algorithm. This makes performance comparison of these methods hard. This problem is exacerbated because several variants of each of these methods exist in literature and they are tested on different datasets. Our goal in this thesis is to standardize the evaluation mechanism and create a tool for evaluating artifact removal algorithms using all of the above approaches. We also intend to use this tool for comparing our approach with existing methods discussed above.
where speech can be denoised before amplification.