Biologically Inspired Methods for Adversarially Robust Deep Learning
Muhammad Ahmed Shah
Robustness of DNNs to Input Perturbations
• DNNs are sensitive to perturbations that humans are invariant to
• This causes them to fail in counter-intuitive ways and raises questions
about their reliability
(Figure: an example image with predictions "goldfinch" and "crow".)
2
Adversarial Attacks
• Pernicious because they are imperceptible
• Can be realized in the physical world
• Need to defend against them
Goodfellow, Ian J., Jonathon Shlens, and Christian Szegedy. "Explaining and harnessing adversarial examples." arXiv preprint arXiv:1412.6572 (2014).
(Figure: the perturbation 𝛿 and the adversarial example 𝑥 + 𝜖𝛿 with budget ϵ.)
3
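Since the slide cites Goodfellow et al.'s 𝑥 + 𝜖𝛿 construction, the following is a minimal, hedged sketch of that kind of gradient-based attack (an FGSM-style step). It assumes a differentiable PyTorch classifier `model`, inputs scaled to [0, 1], and an illustrative 8/255 budget; none of these specifics come from the slides.

```python
# Minimal FGSM-style sketch of the x + εδ construction (Goodfellow et al., 2014).
# `model`, the [0, 1] input range, and eps = 8/255 are illustrative assumptions.
import torch
import torch.nn.functional as F

def fgsm_example(model, x, y, eps=8 / 255):
    """Return x + ε·sign(∇_x loss), clipped back to the valid pixel range."""
    x = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x), y)
    delta = torch.autograd.grad(loss, x)[0].sign()      # δ: direction that most increases the loss
    return (x + eps * delta).clamp(0.0, 1.0).detach()   # x + εδ, kept a valid image
```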
Various Types of Adversarial Attacks
• ℓ𝑝-bounded perturbations
• Unbounded perturbations
• Parametric perturbations
• Perceptual-distance-bounded perturbations
• Wasserstein-distance-bounded perturbations
• Adversarial patches
• Physically realisable perturbations
Ideally DNNs should be robust to all such perturbations.
But, often, they are not.
4
Defenses Against Adversarial Attacks
• Attack Neutralizers: Image Transforms, Attack Detectors. Do not make the DNN robust; only add computational load.
• Attack Aware Defenses: Adversarial Training, Certified Defenses, Neural Architecture Search, Pruning. Do not generalize.
• Attack Agnostic Defenses: Generalizable.
5
Attack Agnostic Defenses
• Structural Priors: design elements conducive to robustness.
• Biological Priors: biological principles related to robustness.
6
Differences Between Human and DNN Decision Functions Enable Adversarial Attacks
8
(Figure: cartoon of human vs. DNN decision boundaries for images of a cat, showing how a small perturbation crosses the DNN's boundary but not the human's.)
Hypothesis: Aligning DNNs with Human Perception Will Make Them More Robust
9
(Figure: adversarial accuracy vs. similarity to V1 responses, from Dapello+20.)
Overview
10
1. Foveation via Adaptive Blurring and Desaturation
2. Biologically Inspired Audio Features
3. Fixed Interneuron Covariability
4. Recurrence
Biological Priors
11
1. Foveation via Adaptive Blurring and Desaturation
2. Biologically Inspired Audio Features
3. Fixed Interneuron Covariability
4. Recurrence
R-Blur: Foveation via Adaptive Blurring and
Desaturation
12
Shah, M.A., Kashaf, A. and Raj, B., 2023. Training on Foveated Images Improves Robustness to Adversarial Attacks. NeurIPS (2023)
R-Blur: Overview
13
1. Select fixation point
2. Add Gaussian Noise: 𝑥 + 𝛿, 𝛿 ~ 𝒩(0, 𝜎)
3. Split into color and grey channels and apply adaptive blurring
4. Combine the color and grey channels with weights 𝜶𝑐 and 𝜶𝑟
Shah, M.A., Kashaf, A. and Raj, B., 2023. Training on Foveated Images Improves Robustness to Adversarial Attacks. NeurIPS (2023)
Computing Eccentricity
• Eccentricity ≡ Distance from
fixation point
• Generally measured radially, i.e. Euclidean distance
• Need to extract circular regions to blur – inefficient
• We use a different distance metric:
𝑒(𝑝𝑥, 𝑝𝑦) = max(|𝑝𝑥 − 𝑓𝑥|, |𝑝𝑦 − 𝑓𝑦|) / 𝑊
• Regions with same eccentricity are
squares – can be extracted by slicing
the image tensor
14
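A minimal sketch of this ℓ∞ ("square") eccentricity map, assuming a W×H image and a fixation point (f_x, f_y) in pixel coordinates; the function name and API are illustrative, not from the paper.

```python
# Sketch of the eccentricity map e(p_x, p_y) = max(|p_x - f_x|, |p_y - f_y|) / W.
import numpy as np

def eccentricity_map(height, width, fx, fy):
    """Per-pixel eccentricity under the max (ℓ∞) distance, normalized by the image width."""
    ys, xs = np.mgrid[0:height, 0:width]
    return np.maximum(np.abs(xs - fx), np.abs(ys - fy)) / width

# Pixels with equal eccentricity form nested squares around the fixation point,
# so each band can be extracted with plain tensor slicing.
```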
Estimating Visual Acuity
• Visual acuity := the ability to perceive
spatial details in the image
• The acuity of color vision decreases
exponentially with eccentricity.
• The acuity of grey vision is generally much lower, and is minimal at the fixation point.
• We approximate color and grey acuity as:
𝒟𝐶(𝑒; 𝜎𝐶) = max{Λ(𝑒; 0, 𝜎𝐶), 𝜍(𝑒; 0, 2.5𝜎𝐶)}
𝒟𝑅(𝑒; 𝜎𝑅, 𝑚) = 𝑚(1 − 𝒟𝐶(𝑒; 𝜎𝑅))
• Λ and 𝜍 are the PDFs of the Laplace and Cauchy distributions
• We set 𝜎𝐶 = 0.12, 𝜎𝑅 = 0.09 and 𝑚 = 0.12
15
(Figure: estimated acuity as a function of eccentricity.)
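A small sketch of the acuity estimates above, using SciPy's Laplace and Cauchy densities with the constants from the slide; only the shape of the curves is meant to matter, and the function names are illustrative.

```python
# D_C(e) = max{Λ(e; 0, σ_C), ς(e; 0, 2.5σ_C)} and D_R(e) = m(1 − D_C(e; σ_R)),
# with the slide's constants σ_C = 0.12, σ_R = 0.09, m = 0.12.
import numpy as np
from scipy.stats import laplace, cauchy

def color_acuity(e, sigma_c=0.12):
    return np.maximum(laplace.pdf(e, loc=0.0, scale=sigma_c),
                      cauchy.pdf(e, loc=0.0, scale=2.5 * sigma_c))

def grey_acuity(e, sigma_r=0.09, m=0.12):
    return m * (1.0 - color_acuity(e, sigma_c=sigma_r))
```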
Quantizing Visual Acuity
• The visual acuity at a pixel
determines the std. dev. of
Gaussian blur applied to it.
• #Kernels = # Unique acuity values
= # unique eccentricity values =
max{𝑊, 𝐻}
• To improve efficiency, we
quantize the estimated visual
acuity values
16
Applying Blur
• We compute the std. dev. of the Gaussian kernel at each pixel as 𝛽𝐷(𝑒(𝑝𝑥, 𝑝𝑦))
• 𝐷(𝑒(𝑝𝑥, 𝑝𝑦)) is the estimated acuity, and 𝛽 = 0.05
17
Desaturation via Combination
• The blurred grey and color images are combined in a pixelwise weighted combination
• The pixel weights are the normalized color and grey visual acuity values
18
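The sketch below strings the preceding slides together: quantize the acuity, blur with σ = β·D(e) (one kernel per quantized level, as stated on the slides), and combine the color and grey copies with normalized-acuity weights. It reuses the illustrative helpers sketched earlier (eccentricity_map, color_acuity, grey_acuity); the Gaussian-noise step, the exact quantization scheme, and the greyscale conversion are simplifying assumptions, not the paper's reference implementation.

```python
# Hedged sketch of R-Blur steps 3-4: quantized adaptive blur + acuity-weighted desaturation.
import numpy as np
from scipy.ndimage import gaussian_filter

def quantize(values, n_levels=16):
    """Snap a map of values to a few levels so only a few blur kernels are needed."""
    levels = np.linspace(values.min(), values.max(), n_levels)
    return levels[np.argmin(np.abs(values[..., None] - levels), axis=-1)]

def adaptive_blur(img, acuity, beta=0.05):
    """Blur each pixel with σ = β·D(e); one gaussian_filter call per unique (quantized) σ."""
    sigma_map = quantize(np.clip(beta * acuity, 0.0, None))   # clip keeps σ non-negative
    out = np.zeros_like(img, dtype=float)
    for sigma in np.unique(sigma_map):
        blurred = gaussian_filter(img, sigma=(sigma, sigma, 0)) if sigma > 0 else img
        out[sigma_map == sigma] = blurred[sigma_map == sigma]
    return out

def r_blur(img, fx, fy):
    """img: H×W×3 array; Gaussian-noise step omitted for brevity."""
    h, w, _ = img.shape
    e = eccentricity_map(h, w, fx, fy)
    d_c, d_r = color_acuity(e), grey_acuity(e)
    grey = img.mean(axis=-1, keepdims=True).repeat(3, axis=-1)   # crude greyscale copy
    color_b, grey_b = adaptive_blur(img, d_c), adaptive_blur(grey, d_r)
    w_c = d_c / (d_c + d_r + 1e-8)                               # normalized acuity weights
    return w_c[..., None] * color_b + (1 - w_c)[..., None] * grey_b
```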
Fixation Point Selection
19
(Figure: fixation point selection pipeline. DeepGaze takes the original image and a predefined initial fixation and proposes fixation points; a ResNet processes the image at each fixation and the outputs are averaged.)
Shah, M.A., Kashaf, A. and Raj, B., 2023. Training on Foveated Images Improves Robustness to Adversarial Attacks. NeurIPS (2023)
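A hedged sketch of the pipeline in the figure: a saliency model proposes fixation points, the classifier sees one R-Blurred view per fixation, and the logits are averaged. `saliency_model`, `classifier`, `top_k_fixations`, and `r_blur` are assumed stand-ins (tensor/array conversions omitted), not the paper's actual interfaces.

```python
# Sketch of multi-fixation classification with averaged outputs.
import torch

def classify_with_fixations(img, saliency_model, classifier, k=5):
    salience = saliency_model(img)               # per-pixel fixation salience (assumed interface)
    fixations = top_k_fixations(salience, k=k)   # hypothetical helper returning k (f_x, f_y) points
    logits = [classifier(r_blur(img, fx, fy)) for fx, fy in fixations]
    return torch.stack(logits).mean(dim=0)       # average the per-fixation predictions
```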
R-Blur Improves Adversarial Robustness of
ResNet
20
Shah, M.A., Kashaf, A. and Raj, B., 2023. Training on Foveated Images Improves Robustness to Adversarial Attacks. NeurIPS (2023)
(Figures: robustness to adversarial attacks on CIFAR-10, Ecoset, and Imagenet for Std. ResNet, 5 Rand. Affine Tfms, and R-Blur.)
21
R-Blur is Robust to Common Corruptions
Accuracy (%) by corruption severity (1–5):
Std. ResNet: 53, 41, 31, 21, 13
AT: 49, 40, 32, 24, 18
R-Blur: 52, 45, 40, 31, 23
Shah, M.A., Kashaf, A. and Raj, B., 2023. Training on Foveated Images Improves Robustness to Adversarial Attacks. NeurIPS (2023)
R-Blur Compares Favorably to Biological
Defenses
22
Shah, M.A., Kashaf, A. and Raj, B., 2023. Training on Foveated Images Improves Robustness to Adversarial Attacks. NeurIPS (2023)
Difference in accuracy (R-Blur - Other):
VOneNet (Dapello+20): Clean -8.2, Adv. +5.3, Com. Cor. +3.2
R-Warp (Vuyyuru+20): Clean -7.2, Adv. +15, Com. Cor. +6.5
Role of Number of Fixations
23
Shah, M.A., Kashaf, A. and Raj, B., 2023. Training on Foveated Images Improves Robustness to Adversarial Attacks. NeurIPS (2023)
Role of Fixation Location
24
Accuracy (%):
Std. ResNet: Clean 70, Adv. 0
R-Blur w. DeepGaze: Clean 61, Adv. 18
R-Blur w. Opt. Fix.: Clean 70, Adv. 20
Shah, M.A., Kashaf, A. and Raj, B., 2023. Training on Foveated Images Improves Robustness to Adversarial Attacks. NeurIPS (2023)
Key Takeaways
• R-Blur significantly improves the robustness of DNNs without being
trained on perturbed data.
• The robustness of R-Blur generalizes better than adversarial training
to different perturbation types.
• R-Blur shows the promise of biologically motivated approaches to
improving the robustness of DNNs.
25
Shah, M.A., Kashaf, A. and Raj, B., 2023. Training on Foveated Images Improves Robustness to Adversarial Attacks. NeurIPS (2023)
Biological Priors
26
1. Foveation via Adaptive Blurring and Desaturation
2. Biologically Inspired Audio Features
3. Fixed Interneuron Covariability
4. Recurrence
Audio Features For Automatic Speech
Recognition
• Early ASR approaches used hand-crafted features -- often inspired by biology, particularly the cochlea
• The simple and popular approach:
• Time-frequency analysis via FFT (sim. basilar membrane)
• Bank of band-pass Mel filters (sim. characteristic frequencies in cochlea)
• Non-linearity (sim. hair cell response)
27
(Pipeline: Waveform → STFT → Spectrogram → Mel filterbank → Log → Log Mel-Spectrogram)
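A standard log-Mel front end matching the pipeline above (STFT → Mel filterbank → log), written with torchaudio; the 16 kHz sample rate, 400-sample FFT, and 80 Mel filters are common illustrative defaults rather than the settings used in this work.

```python
# Log Mel-spectrogram front end (STFT + Mel filterbank + log non-linearity).
import torch
import torchaudio

mel = torchaudio.transforms.MelSpectrogram(
    sample_rate=16000, n_fft=400, hop_length=160, n_mels=80)

def log_mel_spectrogram(waveform: torch.Tensor) -> torch.Tensor:
    """waveform: (channels, samples) -> log-Mel features (channels, n_mels, frames)."""
    return torch.log(mel(waveform) + 1e-6)   # small offset avoids log(0)
```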
Bio-plausible Audio Features for ASR
• More bio-plausible feature
extraction methods exist but not
widely used
• Known to improve noise
robustness, but adversarial
robustness not evaluated
• We evaluate the performance of several existing biologically plausible audio features
• We also propose novel features
28
Features Evaluated So Far
Feature: salient characteristics
• Log Spectrogram: time-frequency representation + non-linearity
• Log Mel Spectrogram: triangular RFs with CFs on the Mel scale
• Cochleagram [Feather+23]: Gammatone RFs with CFs on the ERB scale + power-law non-linearity
• Gammatone Spectrogram: same as Cochleagram but computed by transforming the STFT
• Power Normalized Coefficients [Kim+10]: power-normalized Gammatone RFs with CFs on the ERB scale + temporal masking + noise suppression + power-law non-linearity
• Difference of Gammatones: lateral suppression by frequencies around the CF
• Frequency Masked Spectrogram: simulates simultaneous frequency masking
29
Lateral Suppression via Difference of
Gammatone Filters
• Lateral Suppression:
• the response at a given CF may be suppressed by the energy at adjacent
frequencies [Stern & Morgan 12]
• Enhances responses to spectral changes and reduces impact of noise
• Proposal: take a difference of Gammatone filterbank
31
Difference of Gammatone Filters
• Create 2 Gammatone
frequency response curves
with different widths, and
subtract.
• Normalize by sum of positive
values
𝐺𝑑[𝑘] = 𝐺1[𝑘] − 𝐺2[𝑘]
𝐺𝑑[𝑘, 𝑡] ← 𝐺𝑑[𝑘, 𝑡] / Σ𝑡 max{𝐺𝑑[𝑘, 𝑡], 0}
(Figure: amplitude vs. frequency for the two Gammatone responses and their difference.)
Difference of Gammatone Filterbank
(Figures: Power Normalized Gammatone filterbank responses and Normalized Difference of Gammatone filterbank responses; amplitude vs. frequency (FFT bin).)
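A hedged sketch of the construction on the previous slide: two gammatone-like magnitude responses per center frequency with different bandwidths, subtracted and normalized by the sum of positive values. The 4th-order magnitude approximation, the ERB constants, and the 1.5× width ratio are textbook/illustrative assumptions, not values taken from the slides.

```python
# Difference-of-Gammatone (DoG) filterbank sketch in the frequency domain.
import numpy as np

def erb(fc):
    return 24.7 * (4.37 * fc / 1000.0 + 1.0)   # equivalent rectangular bandwidth (Glasberg & Moore)

def gammatone_response(freqs, fc, bw, order=4):
    return (1.0 + ((freqs - fc) / bw) ** 2) ** (-order / 2.0)   # crude gammatone magnitude approximation

def dog_filterbank(freqs, center_freqs, width_ratio=1.5):
    filters = []
    for fc in center_freqs:
        g1 = gammatone_response(freqs, fc, erb(fc))
        g2 = gammatone_response(freqs, fc, width_ratio * erb(fc))   # wider companion filter
        g_d = g1 - g2
        filters.append(g_d / np.maximum(g_d, 0.0).sum())            # normalize by the positive mass
    return np.stack(filters)    # shape: (n_channels, n_fft_bins)
```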
Applying DoG Filterbank
• Convolve the DoG Filterbank over the STFT: 𝑆𝑥^𝐺 = 𝑆𝑥 ∗ 𝐺
• Half-wave Rectify: 𝑆𝑥^𝐺⁺[𝑘, 𝑡] = max{𝑆𝑥^𝐺[𝑘, 𝑡], 0}
• Non-linear Compression: 𝑆𝑥[𝑘, 𝑡] = (𝑆𝑥^𝐺⁺[𝑘, 𝑡])^0.3
34
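A minimal sketch of the three steps above, assuming `spec` is a magnitude STFT of shape (n_fft_bins, n_frames) and `filters` comes from the dog_filterbank sketch; applying the filterbank is written as a matrix product, which is the matrix form of the per-frequency convolution described on the slide.

```python
# Apply the DoG filterbank, half-wave rectify, then compress with the 0.3 power law.
import numpy as np

def apply_dog(spec, filters, power=0.3):
    response = filters @ spec                 # filterbank applied across the frequency axis
    response = np.maximum(response, 0.0)      # half-wave rectification
    return response ** power                  # power-law (0.3) compression
```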
Example: Concord Returned To Its Place
Amidst The Tents
Power Normalized Gammatone Normalized Difference of Gammatone
Time (Window)
Frequency
(FFT
bin)
Frequency
(FFT
bin) Time (Window)
Effect on Robustness
36
• Model: CNN + 16 layer conformer
• 65 non-adversarial audio transforms
• Untargeted gradient-based attack
• SNR-bounded PGD @ 10,20,30,40 dB
• NWER: (1/|Δ|) Σ_{𝛿∈Δ} WER_𝑓(𝛿) / WER_MFCC(𝛿)
Simultaneous Frequency Masking
• High power at a frequency can raise the threshold of hearing for
adjacent frequencies
• Frequencies below the threshold are inaudible, i.e. masked
• Exploited for MP3 compression – more compression in masked spectro-temporal regions
• Proposal: compute the hearing threshold and zero-out the masked regions
37
Frequency Masked Spectrogram
• Estimate the masking threshold
for each (FFT) frequency [Qin+19,
Lin+15]
• Zero-out regions of the
spectrogram where Power-
Spectral Density (PSD) falls below
the threshold
38
(Figures: spectrogram before and after masking; frequency (FFT bin) vs. time (window).)
Estimating the Masking Threshold [Qin+19]
1. Smoothed Normalized PSD
𝑝𝑥(𝑘) = 20·log10(|𝑠𝑥(𝑘)| / 𝑁)
𝑝̄𝑥(𝑘) = 96 − max𝑘{𝑝𝑥(𝑘)} + 𝑝𝑥(𝑘)
𝑝̄𝑥^𝑚(𝑘) = 10·log10(10^(𝑝̄𝑥(𝑘−1)/10) + 10^(𝑝̄𝑥(𝑘)/10) + 10^(𝑝̄𝑥(𝑘+1)/10))
2. Two-sided spreading function (masker 𝑖, maskee 𝑗)
𝑆𝐹(𝑖, 𝑗) = 27·Δ𝑏𝑖𝑗 if Δ𝑏𝑖𝑗 > 0, else 𝐺(𝑖)·Δ𝑏𝑖𝑗
Δ𝑏𝑖𝑗 = 𝑏𝑎𝑟𝑘(𝑓𝑗) − 𝑏𝑎𝑟𝑘(𝑓𝑖)
𝐺(𝑖) = −27 + 0.37·max{𝑝̄𝑥(𝑖) − 40, 0}
39
(Figure: spreading of a masker's threshold onto neighboring maskee frequencies.)
Estimating the Masking Threshold (cont.)
3. Pairwise Threshold
𝑇[𝑖, 𝑗] = 𝑝̄𝑥^𝑚(𝑖) + Δ𝑚(𝑖) + 𝑆𝐹(𝑖, 𝑗)
Δ𝑚(𝑖) = −6.025 − 0.275·𝑏𝑎𝑟𝑘(𝑖)
4. Global Threshold
𝜃𝑥(𝑘) = 10·log10(10^(𝐴𝑇𝐻(𝑘)/10) + Σ𝑖 10^(𝑇[𝑖,𝑘]/10))
40
(Figure: PSD and the estimated masking threshold over time.)
Applying Masking
5. Create a mask: 𝛼𝑥(𝑘) = 𝐼[𝑝̄𝑥^𝑚(𝑘) > 𝜃𝑥(𝑘)]
6. Apply the mask to the spectrogram: 𝑠𝑥(𝑘) = 𝑠𝑥(𝑘) ⊙ 𝛼𝑥(𝑘)
7. Apply non-linearity: 𝑠𝑥(𝑘)^0.3
41
(Figures: spectrogram before and after masking, over time.)
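A small sketch of steps 5–7 above. It assumes the smoothed PSD `p_m` and the global masking threshold `theta` (both in dB, one value per (bin, frame)) have already been estimated as on the preceding slides; those inputs and the array shapes are assumptions.

```python
# Mask the spectrogram where the PSD falls below the masking threshold, then compress.
import numpy as np

def frequency_masked_spectrogram(spec, p_m, theta, power=0.3):
    alpha = (p_m > theta).astype(spec.dtype)   # keep only bins that exceed the masking threshold
    return (spec * alpha) ** power             # zero-out masked bins, then power-law compression
```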
Robustness of All Features
• Model: CNN + 16 layer conformer
• 65 non-adversarial audio transforms
• Untargeted gradient-based attack
• SNR-bounded PGD @ 10,20,30,40 dB
• NWER: (1/|Δ|) Σ_{𝛿∈Δ} WER_𝑓(𝛿) / WER_MFCC(𝛿)
• Gammatone FB generally improves
robustness
• Best against Adv: Difference of
Gammatone
• Best against non-Adv: Gammatone
Spectrogram
42
Clean WER of All Features
• Model: CNN + 16 layer conformer
• All features have low WER
• Gammatone Spectrogram lowest WER
43
Key Takeaway and Future Work
• Certain biological phenomena (lateral suppression) improve robustness to adversarial attacks
• While others (temporal masking) do not
• Gammatone FB generally improves robustness
• The Gammatone Spectrogram has the lowest WER on clean data and non-adversarial perturbations
• Future work: simulate detailed cochlear models (e.g. CARFAC [Lyon 12], Seneff)
• Creating an efficient PyTorch implementation is taking time.
44
Biological Priors
45
1. Foveation via Adaptive Blurring and Desaturation
2. Biologically Inspired Audio Features
3. Fixed Interneuron Covariability
4. Recurrence
Fixed Inter-Neuron Covariability Induces
Adversarial Robustness
46
• Inter-neuron correlations in the brain are
rigid [Hennig+21]
• Inter-neuron correlations in DNNs are
flexible
• Change based on stimulus distribution
Shah, M.A. and Raj, B., Fixed Inter-Neuron Covariability Induces Adversarial Robustness, ICASSP (2024)
SCA Layer
47
Transform the activations so they respect the learned correlations:
1. 𝒖 ← 𝑓(𝒙)
2. For 𝑡: 1 → 𝑇 do
3.   𝒂𝒙 ← 𝜙(𝒖)
4.   𝐽 ← ‖𝒂𝒙 − 𝜙(𝑾𝒈𝒂𝒙 + 𝒃𝒈)‖² + 𝜆‖𝒙 − 𝑾ℎ𝒂𝒙 − 𝒃ℎ‖²
5.   𝒖 ← 𝒖 − 𝜂∇𝒂𝒙𝐽
6. End for
7. 𝒂𝒙 ← 𝜙(𝒖)
(Figure: the correlation map 𝑾𝒈, with its diagonal fixed to 0, applied to the activations 𝒂𝒙.)
Shah, M.A. and Raj, B., Fixed Inter-Neuron Covariability Induces Adversarial Robustness, ICASSP (2024)
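A rough PyTorch sketch of the SCA iteration above. W_g (with its diagonal masked to zero) predicts each unit from the other units, W_h reconstructs the input, and the activations are refined by descending J; the step u ← u − η∇J is our reading of the update on the slide. Layer sizes, T, η, λ, the ReLU choice of φ, and how the weights are trained are illustrative assumptions, not taken from the ICASSP paper.

```python
# Hedged sketch of an SCA-style layer: iteratively adjust activations to respect W_g and W_h.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SCALayer(nn.Module):
    def __init__(self, in_dim, hid_dim, T=8, eta=0.1, lam=1.0):
        super().__init__()
        self.f = nn.Linear(in_dim, hid_dim)      # feedforward map giving the initial u
        self.W_g = nn.Linear(hid_dim, hid_dim)   # inter-neuron prediction (diagonal forced to 0)
        self.W_h = nn.Linear(hid_dim, in_dim)    # reconstruction of the input
        self.T, self.eta, self.lam = T, eta, lam
        self.phi = nn.ReLU()

    def forward(self, x):
        u = self.f(x)
        mask = 1.0 - torch.eye(self.W_g.weight.shape[0], device=x.device)
        for _ in range(self.T):
            a = self.phi(u).detach().requires_grad_(True)
            with torch.enable_grad():
                pred = self.phi(F.linear(a, self.W_g.weight * mask, self.W_g.bias))
                J = ((a - pred) ** 2).sum() + self.lam * ((x - self.W_h(a)) ** 2).sum()
            grad = torch.autograd.grad(J, a)[0]
            u = u - self.eta * grad              # u <- u - η ∇_a J
        return self.phi(u)
```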
Result #1: SCA Layer Reduces Change in Inter-
Neuron Correlation
48
(Figures: change in inter-neuron correlation on FMNIST, MNIST, and Speech Commands.)
Shah, M.A. and Raj, B., Fixed Inter-Neuron Covariability Induces Adversarial Robustness, ICASSP (2024)
Results #2: SCA Layer Makes Models More
Robust
49
Shah, M.A. and Raj, B., Fixed Inter-Neuron Covariability Induces Adversarial Robustness, ICASSP (2024)
Difference in accuracy (SCA - MLP):
MNIST: Clean -0.5, Non-Adv Perturb +1.5, Adv Perturb +1.8
FMNIST: Clean +0.7, Non-Adv Perturb +0.5, Adv Perturb +4.6
Speech Commands: Clean -1.3, Non-Adv Perturb -4.2, Adv Perturb +3.3
Biological Priors
50
1. Foveation via Adaptive Blurring and Desaturation
2. Biologically Inspired Audio Features
3. Fixed Interneuron Covariability
4. Recurrence
Recurrent Connections in the Brain
• Recurrent circuits are widespread in the brain [Bullier+01, Briggs+20]
• Lateral connections between neurons in the
same region
• Feedback connections from higher cognitive
areas to lower areas
• May fill in missing information due to crowding or
occlusion [Spoerer+17, Boutin+21]
• Not represented in DNNs
51
Adding Recurrence to DNNs
52
(Figure: a convolutional network unrolled over time steps 𝑡 = 1 … 𝑇. Feedforward pathways are Conv-Pool stages followed by global pooling and a linear projection to the classification output; lateral recurrence connects each stage to itself across time steps, and feedback connections project from higher stages to lower ones via Conv+Upsample.)
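A rough PyTorch sketch of the unrolled architecture in the diagram: Conv-Pool feedforward stages, lateral recurrence within each stage, and Conv+Upsample feedback from the stage above, run for T time steps. Channel counts, T, and the two-stage depth are illustrative and much smaller than anything used in the experiments.

```python
# Hedged sketch of a recurrent CNN with lateral and feedback connections.
import torch
import torch.nn as nn
import torch.nn.functional as F

class RecurrentCNN(nn.Module):
    def __init__(self, channels=(3, 32, 64), n_classes=10, T=3):
        super().__init__()
        c0, c1, c2 = channels
        self.T = T
        self.ff1 = nn.Sequential(nn.Conv2d(c0, c1, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
        self.ff2 = nn.Sequential(nn.Conv2d(c1, c2, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
        self.lat1 = nn.Conv2d(c1, c1, 3, padding=1)   # lateral recurrence, stage 1
        self.lat2 = nn.Conv2d(c2, c2, 3, padding=1)   # lateral recurrence, stage 2
        self.fb21 = nn.Conv2d(c2, c1, 3, padding=1)   # feedback: stage 2 -> stage 1
        self.head = nn.Linear(c2, n_classes)

    def forward(self, x):
        h1 = self.ff1(x)
        h2 = self.ff2(h1)
        for _ in range(self.T - 1):
            fb = F.interpolate(self.fb21(h2), size=h1.shape[-2:])   # Conv + upsample feedback
            h1 = torch.relu(self.ff1(x) + self.lat1(h1) + fb)
            h2 = torch.relu(self.ff2(h1) + self.lat2(h2))
        return self.head(h2.mean(dim=(2, 3)))                       # global pooling + linear proj.
```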
Results
53
Adding recurrence improves accuracy on clean and
adversarially perturbed data
(Figure: CIFAR-10 accuracy vs. number of time steps, on clean and adversarially perturbed data.)
Reconstructing From Feedback Signal
• Without constraints degenerate solutions are possible
• Recurrent connections may learn identity functions
• Explicitly encourage models to fill in missing information
54
Reconstructing From Feedback Signal
ℒ(𝑥̂, 𝑥, 𝑦) = ‖𝑥̂ − 𝑥‖² + Xent(𝑥, 𝑦)
55
(Figure: the input with random occlusions passes through the Conv-Pool feedforward pathway to global pooling and a linear projection for classification; a Conv+Upsample feedback pathway with a tanh output produces the reconstruction 𝑥̂, a prediction of the occluded information, which is compared to the original input 𝑥.)
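A minimal sketch of the training objective on this slide: reconstruct the randomly occluded input from the feedback pathway while classifying it. It assumes a `model` that returns both logits and a tanh reconstruction x̂; the `recon_weight` knob is an added assumption, not part of the slide's loss.

```python
# L = ||x_hat - x||^2 + Xent(prediction, y), combining reconstruction and classification.
import torch.nn.functional as F

def recon_classification_loss(model, x_occluded, x_clean, y, recon_weight=1.0):
    logits, x_hat = model(x_occluded)          # x_hat is predicted from the feedback signal
    recon = F.mse_loss(x_hat, x_clean)         # encourage filling in the occluded information
    return recon_weight * recon + F.cross_entropy(logits, y)
```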
Results
56
Adding reconstruction significantly improves accuracy on
adversarially perturbed data,
but reduces accuracy on clean data
Key Takeaways and Future Work
• Scale up experiments to larger models and datasets
• Explore synergies with other works like R-Blur and bio-plausible audio features
57
Summary
58
• Foveation via Adaptive Blurring and Desaturation: improved robustness to adv & non-adv perturbations
• Biologically Inspired Audio Features: lateral suppression improved adv robustness
• Fixed Interneuron Covariability: improved robustness to adv & non-adv perturbations
• Recurrence: improved adv robustness; FB w/ recon yields better robustness
References
P. Benz, C. Zhang, and I. S. Kweon. Batch normalization increases adversarial vulnerability and
decreases adversarial transferability: A non-robust feature perspective. In Proceedings of the
IEEE/CVF International Conference on Computer Vision, pages 7818–7827, 2021.
A. Brandmeyer, R. Lyon, and R. Weiss. Cascade of asymmetric resonators with fast-acting compression cochlear model, 2015.
S. Bubeck and M. Sellke. A universal law of robustness via isoperimetry. Advances in Neural
Information Processing Systems, 34:28811–28822, 2021.
B. Choksi, M. Mozafari, C. Biggs O’May, B. Ador, A. Alamia, and R. VanRullen. Predify: Augmenting deep neural networks with brain-inspired predictive coding dynamics. Advances in Neural Information Processing Systems, 34:14069–14083, 2021.
J. Dapello, T. Marques, M. Schrimpf, F. Geiger, D. Cox, and J. J. DiCarlo. Simulating a primary visual
cortex at the front of cnns improves robustness to image perturbations. Advances in Neural
Information Processing Systems, 33:13073–13087, 2020.
References
J. M. Gant, A. Banburski, and A. Deza. Evaluating the adversarial robustness of a foveated texture
transform module in a cnn. In SVRHM 2021 Workshop@ NeurIPS, 2021.
H. Hermansky, N. Morgan, A. Bayya, and P. Kohn. Rasta-plp speech analysis. In Proc. IEEE Int’l Conf.
Acoustics, speech and signal processing, volume 1, pages 121–124, 1991.
Y. Huang, J. Gornet, S. Dai, Z. Yu, T. Nguyen, D. Tsao, and A. Anandkumar. Neural networks with recurrent
generative feedback. Advances in Neural Information Processing Systems, 33: 535–545, 2020.
J. Kubilius, M. Schrimpf, A. Nayebi, D. Bear, D. L. Yamins, and J. J. DiCarlo. Cornet: Modeling the neural
mechanisms of core object recognition. BioRxiv, page 408385, 2018.
A. Jonnalagadda, W. Y. Wang, B. Manjunath, and M. Eckstein. Foveater: Foveated transformer for image
classification, 2022.
R. Lyon. Computational models of neural auditory processing. In ICASSP’84. IEEE International Conference
on Acoustics, Speech, and Signal Processing, volume 9, pages 41–44. IEEE, 1984.
Lin, Y. and Abdulla, W. H. Principles of psychoacoustics. In Audio Watermark, pp. 15–49. Springer, 2015.
Appendix
61
All Components of R-Blur Improve Robustness
62
R-Blur vs. Gaussian Noise vs. Gaussian Blur
63
R-Blur vs. Gaussian Blur
64
Editor's Notes
  1. DNNs are known to be sensitive to minor perturbations that humans are invariant to. This shortcoming raises questions about their reliability in real-world scenarios where unexpected errors can have serious consequences.
  2. Perhaps the most concerning of these perturbations are adversarial attacks. This is the classic example of an adversarial attack, where the image of a panda is imperceptibly perturbed and the DNN now calls it a gibbon. The imperceptibility and semantic irrelevance of adversarial perturbations make them particularly pernicious, and jarring for human observers. [CLICK] It has been shown that such adversarial perturbations can even be realized in the real world and can be used to compromise DNNs deployed in the wild. For example, researchers showed that putting unusual but benign-looking patterns on clothing can cause DNNs to make errors like not detecting a person when one is present, or misidentifying them. Given the growing pervasiveness of AI in society, it is essential to develop models that are robust to adversarial perturbations, that is, models that can make accurate predictions even in the presence of adversarial perturbations.
  3. Over the years, various types of adversarial attacks have been developed, some of which are shown here. Different types of adversarial attacks generate different types of perturbations. [CLICK] Some of them seek to generate perturbations that are imperceptible. Since imperceptibility is not well defined, different attacks approximate it differently using various distance metrics. [CLICK] Meanwhile other attacks produce perturbations that may be perceptible but that human perception is invariant to. [CLICK] Ideally, we would want DNNs to be robust to all such types of attacks and perturbations, [CLICK] however this is often not the case.
  4. Several methods have been proposed to make DNNs robust to adversarial attacks. Most of these methods fall into two high-level categories: attack neutralizers and attack aware defenses. [CLICK] Attack neutralizers are modules that neutralize or reject adversarially perturbed inputs before they reach the underlying DNNs. Examples of such methods are randomized sequences of image transforms that render the adversarial attacks inert, and attack detectors which reject inputs that are considered to be adversarially perturbed. [CLICK] The shortcoming of these methods is that they do not actually make the DNN more robust, [CLICK] but rather they just increase the computational cost of generating successful perturbations. [CLICK] Attack aware defenses refer to techniques that introduce adversarial perturbations in the model selection process. The most popular of these, and in fact among all defenses, [CLICK] is adversarial training, which trains the DNN on adversarially perturbed data. Other approaches include neural architecture search and pruning based on accuracy on adversarial inputs, as well as certified defenses, which are techniques that are accompanied by theoretical guarantees of robustness under certain conditions. [CLICK] Since attack aware defenses rely on precisely defining and/or simulating adversarial attacks, they tend to overfit to the types of attacks used during model selection, and their robustness does not generalize to other types of attacks. [CLICK] In contrast, a relatively smaller body of work has explored attack agnostic defenses. These methods make the models more robust without introducing adversarial perturbations during training or model selection. As a result, these methods tend to [CLICK] generalize better and provide robustness against diverse adversarial attacks. For this reason, in our work we will explore attack agnostic defenses.
  5. We can think of Attack agnostic defenses as imposing priors on the model design and selection process in order to induce robustness. [CLICK] These are usually Structural priors that are related to design elements of DNNs, like architecture, activation functions, etc. that yield models that are naturally more robust to adversarial attacks without being trained on adversarially perturbed data. Recently a growing body of work has also explored [CLICK] biological priors. These are mechanisms and constraints found in biological perception that contribute to its robustness. Examples of biological priors are foveated vision, greater sensitivity to low frequency audio signals than high frequency ones, and stochastic neural activity. Researchers studying biological robustness priors usually computationalize biological mechanisms and constraints and integrate them into DNNs under the hypothesis that doing so will enhance robustness. In this talk, I will focus on my work on biological priors for robustness.
  6. [CLICK] In our work we consider two types of robustness priors, namely structural priors and biological priors. Structural priors are design elements of DNNs, like architecture, activation functions, etc. that yield models that are naturally more robust to adversarial attacks without being trained on adversarially perturbed data. On the other hand, biological priors are mechanisms and constraints found in biological perception that contribute to its robustness. In this talk, I will focus on our work on biological priors for robustness With regards to biological priors, we consider two subtypes of priors, namely [CLICK] sensory and cognitive. [CLICK] In the domain of vision we have simulated foveation using adaptive blurring and desaturation, and found it to improve robustness In the domain of audio [CLICK] we propose to explore biologically plausible audio features as sensory robustness priors and evaluate their impact on the accuracy and robustness of ASR models [CLICK] Moving to cognitive priors. We have simulated inflexible inter-neuron correlations and have shown that this leads to improved robustness to adversarial and non-adversarial perturbations We have also introduced lateral and feed-back recurrent connections in DNNs, similar to those observed in the brain. We observe that models with these recurrent connections become more robust. We propose to extend this line of research to the domain of audio by designing biologically-plausible DNN modifications that improve the robustness of speech recognition model We also propose to develop methods to align speech-processing DNNs with human salience judgements. To accurately track progress and compare with prior work, we propose to develop a benchmark to measure the robustness of speech recognition DNNs to adversarial and non-adversarial perturbations. [CLICK] We will start with our work on structural robustness priors
  7. Before diving in to the work I’ll first motivate why it makes sense to turn to biology for inspiration when designing robust DNNs. The key observation is that adversarial attacks are possible because the decision function of DNNs differs from the decision function of humans. This is illustrated by the following cartoon. [CLICK] Here the blue region contains all the images that humans would classify as a cat, and this yellow region contains the images humans would classify as dogs However, [CLICK] a DNN classifier trained on images of cats and dogs sampled from the blue and yellow regions may not accurately model the human’s decision function. For example, This decision boundary shown here is optimal for the training data, however it intersects the blue region, and therefore permits adversarial attacks [CLICK] because adversary can perturb an image of a cat such that it will still be classified as a cat by humans, but it will be classified as a dog by the DNN.
  8. Based on this observation it is reasonable to expect that [CLICK] aligning DNNs with Human Perception will make Them More Robust to perturbations, like adversarial attacks, that humans are invariant to. In order to align DNNs with human perception biological priors are introduced into DNNs. [CLICK] Indeed, there is some evidence from prior work that indicates a positive relationship between biological plausibility of DNNs and their robustness to adversarial attacks. This figure from Dapello et.al plots the accuracy of the DNNs on adversarially perturbed inputs against how well the neurons in the said DNNs approximate the responses of the neurons in V1. We can see from the figure that models that better approximate biological perception are also more adversarially robust.
  9. With this motivation in place, I will now provide an overview of the body of work that I will present in this talk. As I mentioned earlier, this talk will be about our work related to identifying and integrating biological robustness priors into DNNs. In our work, we have investigated biological priors related to the various levels of the perceptual system. [CLICK] Starting with the sensory organs. [CLICK]We study the role of the retina by simulating foveation using adaptive blurring and desaturation and found it to improve the robustness of image recognition DNNs to adversarial and non-adversarial perturbations. [CLICK] We study the role of biological auditory system by evaluating the robustness of several popular audio feature extraction methods that seek to simulate aspects of biological auditory processing. We also go on to propose novel feature extraction methods that simulate aspects of biological auditory processing that are not covered by the prior work. [CLICK] We then move up the perceptual system, into the brain where we investigate the role of local and global inter-neuron dynamics in enabling robust perception. [CLICK] Starting with local inter-neuron dynamics, we study the role of inflexible inter-neuron correlations in robust perception by developing a DNN layer that explicitly constrains the inter-neuron correlations. We observe that our proposed layer improves robustness to both adversarial and non-adversarial input perturbations. [CLICK] Going a level up the, we then study the role of global inter-neuron dynamics encoded by wide-spread recurrent connections in the brain. Specifically, we simulate lateral and feedback recurrent connections in DNNs and observe that doing so improves their adversarial robustness. With this high-level overview of our work in place, I’ll start getting into details of our work in each area. [CLICK]
10. Starting with our investigation of the role of retinal foveation and desaturation in robust visual sensing.
  11. To simulate foveation we develop an image filter, called R-Blur, which transforms the image on the left to the one on the right.
12. At a high level, R-Blur performs the following computations: [CLICK] Given an image, first a fixation point is selected, possibly using a DNN. [CLICK] Then Gaussian noise is added to the image to simulate the stochasticity of biological neurons. [CLICK] Then two copies of the image are made, one colored and the other greyscale, to simulate the two types of photoreceptors in the retina. The images are adaptively blurred: in the colored image, the regions close to the fixation point are blurred less than regions further away from it, while in the greyscale image the opposite is done. The numbers here indicate the std. dev. of the Gaussian kernel used to blur the region bounded within the boxes. [CLICK] The colored and greyscale images are then combined in a pixel-wise weighted combination, such that the weights for the colored image are higher near the fixation point and lower further away from it. With this high-level overview in place, I'll go into some details about each step.
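To make the four steps concrete, here is a minimal sketch of the overall R-Blur computation. It assumes helper functions (compute_eccentricity, estimate_acuity, adaptive_blur) that are sketched on the following slides; all names, signatures, and the noise level are our own illustrative choices, not the released implementation.

```python
import numpy as np

def r_blur(img, fixation, noise_std=0.1):
    """High-level R-Blur sketch: Gaussian noise -> color/grey split ->
    adaptive blurring -> pixel-wise combination weighted by estimated acuity.
    noise_std is a placeholder value."""
    noisy = img + np.random.normal(0.0, noise_std, img.shape)            # step 2: noise
    grey = noisy.mean(axis=-1, keepdims=True).repeat(3, axis=-1)          # step 3a: grey copy
    ecc = compute_eccentricity(img.shape[:2], fixation)                   # distance from fixation
    acuity_c, acuity_r = estimate_acuity(ecc)                             # color / grey acuity
    blur_c = adaptive_blur(noisy, acuity_c)   # less blur near fixation   # step 3b: blur
    blur_r = adaptive_blur(grey, acuity_r)    # more blur near fixation
    w = (acuity_c / (acuity_c + acuity_r))[..., None]                     # step 4: combine
    return w * blur_c + (1.0 - w) * blur_r
```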
13. I'll start with how we compute eccentricity, [CLICK] which is the distance from the fixation point. [CLICK] Generally, eccentricity is computed radially, as shown in the polar plot here. [CLICK] Eccentricity defined this way creates circular regions of equal eccentricity around the fixation point, as shown in the contour plot. During foveation we need to extract regions having the same eccentricity and blur them with the same intensity, and due to how arrays are organized in computer memory, extracting circular regions can be highly inefficient. [CLICK] Therefore, in the interest of efficiency, we use the ℓ∞ norm to quantify the distance to the fixation point. Doing so results in square equidistant regions, which can be extracted very efficiently via slicing.
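As an illustration, the ℓ∞ eccentricity map can be computed with plain array broadcasting; the normalization below (by the larger image dimension, so values fall roughly in [0, 1]) is our assumption.

```python
import numpy as np

def compute_eccentricity(shape, fixation):
    """L-infinity distance of every pixel from the fixation point. Regions of
    equal eccentricity are squares, so they can be extracted by tensor slicing."""
    h, w = shape
    fy, fx = fixation
    ys = np.arange(h)[:, None]
    xs = np.arange(w)[None, :]
    return np.maximum(np.abs(ys - fy), np.abs(xs - fx)) / max(h, w)
```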
14. We use the eccentricity of each location to estimate the visual acuity at that location. Visual acuity refers to the ability to perceive spatial details in the image. [CLICK] The acuity of color vision, arising from the cone photoreceptors, is maximal at the fixation point and decays exponentially away from it. [CLICK] Meanwhile, the acuity of monochromatic vision, arising from the rod photoreceptors, is generally low and is minimal at the fixation point. [CLICK] We use the equations shown here to approximate the color and monochrome visual acuity, and the resulting values are plotted in the figure on the right, where the x-axis represents eccentricity and the y-axis represents the estimated acuity. We are not particularly interested in the actual acuity values; rather, we want to model the shape of the curve relating acuity and eccentricity.
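A hedged sketch of these acuity curves: color acuity is modeled as decaying exponentially with eccentricity, and grey acuity as low overall and minimal at fixation. The exact functional forms and constants used in the paper may differ; sigma_c and m below are placeholder parameters.

```python
import numpy as np

def estimate_acuity(ecc, sigma_c=0.12, m=0.12):
    """Approximate color (cone) and grey (rod) acuity as functions of eccentricity."""
    acuity_color = np.exp(-np.abs(ecc) / sigma_c)  # maximal at fixation, exponential decay
    acuity_grey = m * (1.0 - acuity_color)         # low overall, minimal at the fixation point
    return acuity_color, acuity_grey
```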
15. [CLICK] Each unique acuity value requires the image to be blurred with a different intensity, [CLICK] and there can be as many unique acuity values as the longest dimension of the image. For reasonably large images, we would need to apply hundreds of different Gaussian blur kernels per image, which would be prohibitively expensive. [CLICK] To improve efficiency, we quantize the estimated acuity values, bringing the number of unique acuity values, and consequently the number of blurring kernels, down to a manageable number.
16. Since we have quantized the acuity values, we now have square regions of the image with the same acuity value, as shown in the figure to the right. We extract these regions and apply a Gaussian blur to each of them, with the intensity, i.e. the std. dev., of the blurring kernel determined by the estimated acuity. The std. dev. values for each region are indicated by the numbers in the callouts.
17. After blurring, the color and grey images are combined in a pixel-wise convex combination, where the weights of the combination are the estimated color and monochrome acuity values, normalized by their sum.
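Putting the last three steps together, a minimal sketch of quantization, blurring, and the convex combination might look like the following. For clarity it blurs full copies of the image per acuity level and masks them, rather than slicing out square regions as done for efficiency; the number of levels and the acuity-to-std.-dev. mapping are illustrative assumptions.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def adaptive_blur(img, acuity, n_levels=5, max_std=8.0):
    """Quantize the acuity map into a few levels and blur the image once per
    level; lower acuity -> stronger blur (mapping chosen for illustration)."""
    edges = np.linspace(acuity.min(), acuity.max(), n_levels + 1)[1:-1]
    levels = np.digitize(acuity, edges)               # 0 (lowest acuity) .. n_levels-1
    out = np.zeros_like(img, dtype=float)
    for lvl in range(n_levels):
        std = max_std * (1.0 - lvl / (n_levels - 1))  # strongest blur at lowest acuity
        blurred = gaussian_filter(img.astype(float), sigma=(std, std, 0)) if std > 0 else img.astype(float)
        out = np.where((levels == lvl)[..., None], blurred, out)
    return out

def combine(color_blurred, grey_blurred, acuity_c, acuity_r):
    """Pixel-wise convex combination weighted by the normalized acuities."""
    w = (acuity_c / (acuity_c + acuity_r))[..., None]
    return w * color_blurred + (1.0 - w) * grey_blurred
```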
18. To integrate R-Blur with a DNN classifier, such as a ResNet, we apply R-Blur to the original image before passing it to the DNN. Since humans consider several fixation points before reaching a decision, we use a fixation prediction model together with R-Blur to generate a scanpath of fixated images. [CLICK] Given the original image, we set the initial fixation point to an arbitrarily chosen predefined location in the top-left corner and apply R-Blur. [CLICK] We then pass the blurred image to a fixation prediction network, such as DeepGaze. The fixation prediction network is trained on human gaze-tracking data and predicts a heatmap representing, for each pixel location, the likelihood of humans fixating on it. We select the most likely location from this heatmap as the next fixation point in the saccade. We repeat this process to get a scanpath of several fixation points and foveated images. [CLICK] The foveated images are passed through the DNN and its outputs are averaged to obtain the final prediction.
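A rough sketch of the scanpath loop described here. fixation_model stands in for a DeepGaze-style predictor that returns a per-pixel fixation heatmap; the interface names and the number of fixations are our own.

```python
import numpy as np

def classify_with_scanpath(img, r_blur, fixation_model, classifier, n_fixations=5):
    """Generate a scanpath of foveated images and average the classifier outputs."""
    fixation = (0, 0)                       # arbitrary predefined initial fixation (top-left)
    outputs = []
    for _ in range(n_fixations):
        foveated = r_blur(img, fixation)
        outputs.append(classifier(foveated))
        heatmap = fixation_model(foveated)  # likelihood of a human fixating on each pixel
        fixation = np.unravel_index(np.argmax(heatmap), heatmap.shape)
    return np.mean(outputs, axis=0)         # averaged outputs -> final prediction
```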
19. We find that models augmented with R-Blur are significantly more robust to adversarial attacks than the vanilla ResNet. This trend is visible on toy datasets such as CIFAR-10, as well as on more complex datasets like ImageNet.
20. We also note that augmenting DNNs with R-Blur makes them more robust to non-adversarial corruptions as well. [CLICK] We observe that a ResNet augmented with R-Blur generally achieves higher accuracy than a vanilla ResNet, and even an adversarially trained (AT) ResNet, on the ImageNet-C benchmark. Interestingly, except at the highest corruption severity, AT performs on par with or worse than even the vanilla ResNet baseline.
21. [CLICK] Simulating foveation via R-Blur significantly improves the robustness of DNNs without training on perturbed data. [CLICK] The robustness conferred by R-Blur generalizes better than adversarial training to different perturbation types. [CLICK] R-Blur demonstrates the promise of biologically motivated approaches to improving the robustness of DNNs.
22. Having studied the role of the retina in robust perception, we extend our investigation to the auditory system, where we study the role of cochlear processing in robust speech recognition.
23. [CLICK] Early methods for automatic speech recognition used hand-crafted features that were inspired by the biological auditory system, and in particular by the cochlea. [CLICK] Many such features were proposed over the years, modeling the cochlea to varying degrees of detail. However, most of these approaches have fallen by the wayside in favor of the following relatively simple approach that simulates only three aspects of the cochlea. [CLICK] First, time-frequency analysis is performed via the FFT to simulate the basilar membrane, resulting in the spectrogram shown here. The x-axis of the spectrogram represents time, the y-axis represents frequency, and the intensity of the color represents the energy at a particular frequency at a given time. [CLICK] Then a bank of band-pass filters on the Mel scale is applied to simulate the characteristic frequencies at different locations in the cochlea. The triangular Mel filterbank is illustrated here. Each triangular curve represents the amplitude response of a particular cochlear location to the range of frequencies indicated on the x-axis. The center frequency of each filter is the characteristic frequency of the corresponding cochlear site, and the center frequencies and widths of the triangular filters are distributed according to the Mel scale. This filterbank is applied to the spectrogram to obtain the estimated response of each filter, or cochlear site. [CLICK] Finally, a log non-linearity is applied to simulate the range of hair cell responses, resulting in the final log-Mel spectrogram. This approach is still used in SOTA ASR models, including OpenAI's Whisper.
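For reference, the standard log-Mel pipeline described above can be computed with librosa roughly as follows; the frame and filterbank sizes shown (80 Mel bands, 25 ms windows at 16 kHz) are common choices rather than values prescribed by the slide.

```python
import numpy as np
import librosa

def log_mel_spectrogram(wav, sr=16000, n_fft=400, hop_length=160, n_mels=80):
    """FFT-based time-frequency analysis -> Mel filterbank -> log non-linearity."""
    mel = librosa.feature.melspectrogram(
        y=wav, sr=sr, n_fft=n_fft, hop_length=hop_length, n_mels=n_mels, power=2.0
    )
    return np.log(mel + 1e-10)  # log compression approximates the hair-cell response range

# Usage (hypothetical file): wav, sr = librosa.load("utterance.wav", sr=16000)
#                            feats = log_mel_spectrogram(wav, sr)
```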
24. Despite its success, the aforementioned simple approach does not account for several aspects of cochlear processing that other feature extraction algorithms from prior work have simulated. However, since these aspects of cochlear processing did not yield gains in clean accuracy, they are not widely used in modern speech recognition models. That said, prior work has shown that these more bio-plausible features are more robust to noise, but their impact on adversarial robustness remains to be evaluated. In this work, we train speech recognition models on several biologically plausible feature representations and evaluate their robustness to various forms of corruption, including adversarial attacks. We then go on to propose novel feature extraction methods that integrate aspects of biological audio processing that are linked to robust speech recognition.
25. We started our evaluation with several biologically motivated features that have open-source implementations in Python. These include … We intend to extend this list with more detailed cochlear models from the literature, but implementing them is an involved process. In addition to feature extraction methods from prior work, we have also designed and evaluated [CLICK] two novel approaches that seek to simulate two aspects of biological audition, namely [CLICK] lateral suppression and [CLICK] simultaneous frequency masking.
26. [CLICK] Lateral suppression is the phenomenon whereby the response at a given characteristic frequency may be suppressed by energy at adjacent frequencies. [CLICK] This has the effect of enhancing responses to spectral changes and reducing the impact of noise. One could model this by applying a center-surround Mexican hat filter to the spectrogram. However, that would assume that the spectral sizes of the excitatory and suppressive fields are constant at every spectro-temporal location. [CLICK] In reality, since the spectral resolution of auditory perception is higher at lower frequencies, the excitatory and suppressive fields at lower frequencies should be narrower than those at higher frequencies. The Gammatone filterbank models the size of the excitatory field at different frequencies, which we exploit next.
27. [CLICK] Building on this observation, [CLICK] with a slight modification we can create two Gammatone filterbanks, with the excitatory field of one being slightly wider than the other, and take the difference of the two to obtain a Mexican-hat-like filter whose center and surround widths increase at higher frequencies and decrease at lower frequencies.
28. Concretely, we instantiate two Gammatone filters, with one being 25% wider than the other. The figure to the right, which has frequency on the x-axis and amplitude on the y-axis, shows these two filters in blue and orange. We then subtract the wider filter from the narrower one, and normalize by the sum of the positive values to obtain the filter shown in green.
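A NumPy sketch of this construction, using a common closed-form approximation of the Gammatone magnitude response and ERB bandwidths; the function names and the exact normalization details are our assumptions, not the released code.

```python
import numpy as np

def erb(fc):
    """Equivalent rectangular bandwidth (Hz) at center frequency fc (Glasberg & Moore)."""
    return 24.7 * (4.37 * fc / 1000.0 + 1.0)

def gammatone_fb(center_freqs, freqs, width_scale=1.0, order=4):
    """Approximate Gammatone magnitude response: [1 + ((f - fc)/b)^2]^(-order/2)."""
    fc = np.asarray(center_freqs)[:, None]
    b = width_scale * erb(fc)
    return (1.0 + ((np.asarray(freqs)[None, :] - fc) / b) ** 2) ** (-order / 2)

def dog_filterbank(center_freqs, freqs, widen=1.25):
    """Difference-of-Gammatone: narrow filter minus a 25%-wider one, normalized
    by the sum of the positive (excitatory) part of each filter."""
    dog = gammatone_fb(center_freqs, freqs) - gammatone_fb(center_freqs, freqs, widen)
    return dog / np.clip(dog, 0.0, None).sum(axis=1, keepdims=True)

# Applying it to a (n_freq_bins, n_frames) power spectrogram `spec`:
# responses = dog_filterbank(center_freqs, freqs) @ spec
```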
29. Here we show the original power-normalized Gammatone filterbank alongside the Difference-of-Gammatone filterbank derived from it.
30. On the left we have the spectrogram after being filtered by the Gammatone filterbank, and on the right is the same spectrogram after being filtered with the Difference-of-Gammatone filterbank.
31. We find that the Difference-of-Gammatone (DoG) features significantly improve adversarial robustness.
32. Studies from neuroscience have shown that inter-neuron correlations in the brain are highly inflexible: it has been observed that neuronal activations do not violate their correlation patterns even when doing so negatively impacts task performance. In contrast, the inter-neuron correlations in DNNs are highly variable and can change drastically as the input is perturbed. We observe that this variability is linked with the adversarial robustness of DNNs. This can be seen in the figure on the right, which tracks the norm of the change in the inter-neuron correlation matrix as the size of the perturbation is increased. The change in the inter-neuron correlations of a model trained on natural data, shown in orange, is significantly greater than the change for an adversarially trained model, shown in blue. This indicates that adversarial robustness leads to more rigid inter-neuron correlations. In this work we show that the reverse is also true: constraining the inter-neuron correlations to be more rigid makes the model more adversarially robust.
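As an illustration, one way to produce such a curve is to compare the inter-neuron correlation matrices of a layer's activations on clean and perturbed inputs; the choice of the Frobenius norm here is our assumption about the figure.

```python
import numpy as np

def correlation_shift(acts_clean, acts_perturbed):
    """Norm of the change in the inter-neuron correlation matrix.
    acts_*: (n_samples, n_neurons) activations on clean vs. perturbed inputs."""
    c_clean = np.corrcoef(acts_clean, rowvar=False)
    c_pert = np.corrcoef(acts_perturbed, rowvar=False)
    return np.linalg.norm(c_pert - c_clean, ord="fro")
```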
33. To introduce this constraint into DNNs we develop the SCA layer. The SCA layer initially performs the same feedforward computation as a standard DNN layer, but then optimizes its activations such that the activation of each neuron becomes predictable from the activations of all the other neurons. This objective is encoded by the green term in Line 4. To preclude degenerate solutions, we add the blue regularization term, which encourages a_x to retain information about the input.
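The following is a speculative PyTorch sketch of the kind of layer described here: a feedforward map whose activations are refined by a few gradient steps on a self-prediction objective (the "green" term) plus an input-reconstruction regularizer (the "blue" term). The actual SCA parameterization, objective, and inner optimizer may differ from this illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SCALayerSketch(nn.Module):
    """Illustrative only: feedforward activations refined so each neuron is
    predictable from the others while staying informative about the input."""
    def __init__(self, d_in, d_out, steps=5, step_size=0.1, lam=1.0):
        super().__init__()
        self.ff = nn.Linear(d_in, d_out)                  # standard feedforward map
        self.pred = nn.Linear(d_out, d_out, bias=False)   # cross-neuron predictor
        self.dec = nn.Linear(d_out, d_in)                 # decoder for the info-retention term
        self.steps, self.step_size, self.lam = steps, step_size, lam

    def forward(self, x):
        a = self.ff(x)
        for _ in range(self.steps):
            # "green" term: predict each neuron from all the others
            # (zeroing the predictor's diagonal keeps the prediction cross-neuron)
            w = self.pred.weight - torch.diag(torch.diagonal(self.pred.weight))
            pred_loss = ((a - F.linear(a, w)) ** 2).mean()
            # "blue" term: activations should retain information about the input
            info_loss = ((self.dec(a) - x) ** 2).mean()
            loss = pred_loss + self.lam * info_loss
            (g,) = torch.autograd.grad(loss, a, create_graph=True)
            a = a - self.step_size * g                    # gradient step on the activations
        return a
```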
34. Experimental results on image and sound recognition tasks show that replacing the linear layers with the SCA layer reduces the change in the inter-neuron correlations induced by input perturbations.
35. Here we present the difference between the accuracy of DNNs with the SCA layer and vanilla MLPs on clean data, and on data perturbed by non-adversarial and adversarial perturbations. We observe that the SCA layer has minimal impact on clean accuracy, while generally enhancing the accuracy on perturbed data. Importantly, we note that models with the SCA layer are consistently more adversarially robust than the MLPs.
36. Recurrent connections are highly prevalent in the brain. Broadly speaking, there are two types of recurrent connections, namely lateral and feedback connections. Lateral connections are between neurons in the same region or at the same level of the perceptual hierarchy. Feedback connections, on the other hand, connect neurons at higher levels of the perceptual hierarchy to neurons at lower levels. Prior work has posited that these feedback pathways are related to robust perception in the presence of crowding or occlusions. In this work we seek to introduce both lateral and feedback connections into convolutional neural networks.
  37. Concretely we modify standard feedforward convolutional neural networks by adding [CLICK] lateral and [CLICK] feedback pathways. The lateral pathways add the output of each layer to its input, while the feedback pathways add the output of each layer to the input of the preceding layer. We implement both types of recurrent connections via convolution layers followed by upsampling. To resolve the recurrent loops we unroll [CLICK] the network for a fixed number of time steps.
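A minimal PyTorch sketch of such an unrolled recurrent CNN with two convolutional blocks; the layer widths, where the recurrent convolutions attach, and the use of F.interpolate for upsampling are our assumptions for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class RecurrentCNNSketch(nn.Module):
    """Two-block CNN with lateral and feedback recurrence, unrolled for T steps.
    With T=1 it reduces to a plain feedforward CNN."""
    def __init__(self, n_classes=10, T=4):
        super().__init__()
        self.T = T
        self.block1 = nn.Conv2d(3, 32, 3, stride=2, padding=1)
        self.block2 = nn.Conv2d(32, 64, 3, stride=2, padding=1)
        # lateral: a block's output fed back to its own input
        self.lat1 = nn.Conv2d(32, 3, 3, padding=1)
        self.lat2 = nn.Conv2d(64, 32, 3, padding=1)
        # feedback: block2's output fed back to block1's input
        self.fb21 = nn.Conv2d(64, 3, 3, padding=1)
        self.head = nn.Linear(64, n_classes)

    def forward(self, x):
        h1 = h2 = None
        for _ in range(self.T):
            inp1 = x
            if h1 is not None:  # lateral into block1 (upsampled to the input size)
                inp1 = inp1 + F.interpolate(self.lat1(h1), size=x.shape[-2:])
            if h2 is not None:  # feedback from block2 into block1's input
                inp1 = inp1 + F.interpolate(self.fb21(h2), size=x.shape[-2:])
            h1_new = F.relu(self.block1(inp1))
            inp2 = h1_new
            if h2 is not None:  # lateral into block2
                inp2 = inp2 + F.interpolate(self.lat2(h2), size=h1_new.shape[-2:])
            h2_new = F.relu(self.block2(inp2))
            h1, h2 = h1_new, h2_new
        return self.head(h2.mean(dim=(-2, -1)))  # global average pool + classifier
```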
38. Here we plot the accuracy of the recurrent models for various sizes of adversarial perturbations as the number of unrolled time steps is increased. When only one time step is used, the model is identical to a feedforward CNN. We find that adding recurrence indeed improves the accuracy of the DNN on clean and adversarially perturbed data.
39. It is possible, however, that the recurrent connections are not fully utilized in the absence of constraints that explicitly engage them. Since feedback connections are known to aid perception under occlusion, we explicitly train them to fill in occluded parts of the image.
40. To do this we adopt the following procedure: [CLICK] First, random occlusions are added to the original image. [CLICK] Then the occluded image is passed to the recurrent CNN. [CLICK] We consider the feedback signal originating from the first layer to carry the occluded information, [CLICK] so we add it to the original image to obtain the reconstructed image. [CLICK] The model is then trained to minimize the divergence between the original and reconstructed images, while also being trained to perform classification.
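A sketch of one training step under this reconstruction constraint. The occlude function and the assumption that the model returns both logits and an image-space reconstruction built from its first-layer feedback are our own interface choices, not the paper's.

```python
import torch
import torch.nn.functional as F

def train_step(model, x, y, optimizer, occlude, alpha=1.0):
    """One step of joint classification + reconstruction training (sketch)."""
    x_occ = occlude(x)                      # 1. add random occlusions
    logits, recon = model(x_occ)            # 2-4. forward pass; recon = image + first-layer feedback
    cls_loss = F.cross_entropy(logits, y)   # classification objective
    rec_loss = F.mse_loss(recon, x)         # 5. match the un-occluded image
    loss = cls_loss + alpha * rec_loss
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```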
41. Here we present the accuracy of the recurrent models, with and without the reconstruction constraint, against adversarial perturbations of various sizes. We observe that adding the reconstruction constraint significantly improves the robustness of the CNN to adversarial attacks. However, it does cause a slight degradation in accuracy on unperturbed data.