SlideShare a Scribd company logo
1 of 44
Download to read offline
Minimum Complexity Echo State
Networks
For  Genome and Sequence Analysis
DR. CHRISTOF TEUSCHER
PORTLAND STATE UNIVERSITY
DEPARTMENT OF ELECTRICAL & COMPUTER ENGINEERING (ECE)
www.teuscher-lab.com Portland State University
1
This work
was
supported
by the
National
Science
Foundation
under grant
no. 1518833
Chris Neighbor
2
www.teuscher-lab.com Portland State University
01
02
03
04
05
06
07
08
Introduction.
Neural Networks.
Reservoir Computing.
Oscillating Neural Networks.
Results.
Dengue Virus.
Chemical Reaction Networks.
Conclusion.
Introduction.
www.teuscher-lab.com Portland State University
4
What’s the problem?
❖ Increasing viral illnesses threatens global
human health and welfare. 
❖ Dengue virus infects 390 million people a year
with reports increasing 6-fold from 2010 to
2016. Around 9,000 deaths per year. [WHO]
❖ Developing tools which can directly detect the
virus can aid in early detection. The Coronavirus -- www.cdc.gov/coronavirus
 https:/
/www.who.int/news-room/fact-sheets/detail/dengue-
and-severe-dengue, 
5
What’s the solution?
DNA Computing. A tool implemented using DNA
molecules which has promising applications in
medical situations due to its biocompatibility and
potential for the direct sensing of biological
molecules.
www.teuscher-lab.com Portland State University
6
How is it currently done?
❖ Require viral isolation and
incubation for sensing and can still
have an only 40% detection rate.
[Tang]
❖ Other methods like Polymerase
Chain Reaction (PCR) requires
special equipment and trained
technicians for detection.
Current Practices DNA Computing
❖ Can perform sensing, computation,
and classification in a
self-contained system
biocompatible with the body.
❖ Has potential for early detection
before symptoms appear.
K. Tang et al, “Diagnosis of dengue: an update”, Expert Review of Anti-infective Therapy, 2012
7
What are we trying to do?
❖ Implement a machine learning architecture on DNA.
❖ Explore the use of dynamic randomized networks in order to
accomplish the task of pathogen classification. 
❖ With the aim that these kinds of randomized computing systems could
be more easily implemented in DNA. 
❖ Identify accuracy and complexity tradeoffs by exploring impact size has
on network performance. 
www.teuscher-lab.com Portland State University
Neural Networks.
www.teuscher-lab.com Portland State University
9
What is a “Neuron”?
www.teuscher-lab.com Portland State University
https:/
/en.wikipedia.org/wiki/Biological_neuron_model
Activation Functions: we
predominantly used tanh
10
What is a “Neural Network”?
❖ Feed Forward multi-layered perceptron
network, “Deep Learning”. 
❖ The input layer is passed forward
through the hidden layer then on to the
output layer.
❖ Each of the nodes within the hidden
layer are a perceptron. 
❖ Due to its multi-layered structure it is
capable of solving non-linear
separation problems.
www.teuscher-lab.com Portland State University
11
What is a “Recurrent Neural Network”?
❖ By adding a feedback into the
hidden layer, features across
time can be extracted.
❖ In this model, memory and
computation are linked together
in the network. 
❖ Other nodes can be used which
can have a separate memory
state.
www.teuscher-lab.com Portland State University
Reservoir Computing.
www.teuscher-lab.com Portland State University
13
What is “Reservoir Computing”?
❖ An input layer to the system which is randomly
initialized.
❖ A fixed dynamic hidden layer known as the
“reservoir” which extracts features.
❖ Liquid State Machines: spiking nodes 
❖ Echo State Networks: simple recurrent nodes
❖ Other dynamical systems and substrates, e.g.
oscillators, self-assembled nanowires, and random
chemical reaction network [Lukoševičius]
❖ A final linear readout layer which can be
optimized to match a target output.
❖ A main advantage of reservoir computing is the
low learning complexity because only the output
layer is trained.
Three components:
M. Lukoševičius, “A practical guide to applying echo state networks,” in Neural Networks: Tricks of the Trade, 2012
X[n + 1] =f(Win
U[n] +Wres
X[n])
Y [n] =g(X[n]Wout
)
14
Optimizing the Linear Readout Layer.
❖ Classification: Logistic Regression & Support Vector Machine.
❖ Output probability of each class using softmax layer. 
❖ Can use gradient descent to train the readout layer.
www.teuscher-lab.com Portland State University
Training the final
readout layer:
Ridge Regression
15
Why Reservoir Computing?
F. Wang, et al, “Implementing digital computing with DNA-based switching circuits,” Nature Communications , 2020
❖ Molecular circuits can be difficult to scale due to:
❖ Cross-talk (network pathway signal interference).
❖ Errors from manufacturing of DNA strands which require purification.
❖ Requirement of orthogonality of the strands. [Wang]
❖ The reservoir layer is unstructured allowing for easier biochemical
implementation.
❖ Fully or partially random networks of chemical circuits with a trained
linear output layer can be used for complex tasks such as pathogen
detection.
Bridging the Gap:
Reservoir Computing & Chemical Implementation.
16
Machine Learning Methods
G. Dion et al, “Reservoir computing with a single delay-coupled non-linear mechanical oscillator,” Journal of Applied Physics, 2018
H. Jaeger, “The echo state approach to analysing and training recurrent neural networks-with an erratum note,” GMD Technical Report, 2001
K. M. Cherry et al, “Scaling up molecular pattern recognition with DNA-based winner-take-all neural networks,” Nature, 2018
L.  Qian et al,  “Neural network  computation with DNA  strand displacement cascades,” Nature, 2011.
A. Hjelmfelt et al, “Chemical implementation of neural networks and turing machines,” Proceedings of the National Academy of Sciences, 1991.
R. Lopez et al, “A molecular multi-gene classifier for disease diagnostics,” Nature Chemistry, 2018.
A.  Goudarzi et al,  “DNA reservoir computing:   a novel molecular computing approach,” in International Workshop on DNA-Based Computers, 2013
W. Yahiro et al, “A reservoir computing approach for molecular computing,” in Artificial Life Conference Proceedings,  2018
D. Soloveichik et al, “DNA as a universal substrate for chemical kinetics, Proceedings of the National Academy of Sciences, 2010.
Construction
17
What is an
“Echo State Network”?
❖ From this variety of Neural Networks, we zoom
in on one particular architecture.
❖ Echo State Networks (ESN), the hidden layer
consists of randomized recurrent neural
networks with a trained output layer. The ESN
is a subset of reservoir computing.
www.teuscher-lab.com Portland State University
Echo State Network (ESN)
https:/
/www.asimovinstitute.org/neural-network-zoo/
18
ESN – Initialization Property.
❖ Initialize the weight matrix
from Uniform[-1,1].
❖ Divide by the absolute value
of the largest eigenvalue.
❖ Scale by the desired spectral
radius value <1.
www.teuscher-lab.com Portland State University
❖ Randomness allows for less
precise constraints for
implementation as the scale of
these networks is increased.
❖ Readout complexity increases
linearly as network size
increases. 
Echo State Property Initialization Process
19
ESN – Effect of Spectral Radius.
www.teuscher-lab.com Portland State University
Spectral radius = 0.95 Spectral radius = 1.1
Input: 𝛿[n], (1 at n=0 and 0 afterwards), Nodes=100
The spectral radius is the largest eigenvalue of the hidden weight matrix.
20
ESN – Effect of Sparsity.
Sparsity = 0.1 Fully Connected, Sparsity = 1.0
Input: 𝛿[n] N=100
Local networks
can form with
oscillations
between them.
[Lukoševičius]
M. Lukoševičius, “A practical guide to applying echo state networks,” in Neural Networks: Tricks of the Trade, pp. 659–686, Springer, 2012.
Oscillating
Neural Networks.
www.teuscher-lab.com Portland State University
22
Oscillating
Neural Network
❖ Direct conversions between
weights and reaction pathways.
❖ Chemical Reaction Concepts:
❖ Autocatalysis: Feedback to its
own reaction, 
❖ Inhibition: Negative feedback
to other reactions, 
❖ Activation: Positive feedback
to reactions
Based on three DNA inhibitor circuits
with a product and substrate reaction.
[Goudarzi, Yahiro]
A.  Goudarzi et al,,  “DNA reservoir computing:   a novel molecular computing approach,” in International Workshop on DNA-Based Computers, 2013
W. Yahiro et al, “A reservoir computing approach for molecular computing,” in Artificial Life Conference Proceedings, 2018
23
Why ONN?
❖ Examining impact of size and spectral radius on the
computational capabilities of an oscillating system, we can
translate these insights into chemical oscillator scale, rate of
reactions, and efflux rate.
❖ Implement in chemistry as it has a direct one to one
conversion capability.
❖ These ONNs can have qualitatively similar outputs to
chemical oscillators once equilibrium is shifted and
normalized.
www.teuscher-lab.com Portland State University
24
Tasks.
❖ NARMA-Task A
❖ NARMA-10
❖ MNIST
❖ Splice Junction Gene Sequences
❖ Dengue Virus Genome Serotypes
www.teuscher-lab.com Portland State University
*For others, please see Thesis.
Results.
www.teuscher-lab.com Portland State University
26
What is “NARMA”?
NARMA-10 - The Nonlinear
Autoregressive Moving Average has
long term non-linear dependencies
and is generated using the equation:
www.teuscher-lab.com Portland State University
𝛼 = 0.3
𝛽 = 0.05 
u is a random vector~U [0,0.5]
𝛾 = 1.5
𝛿 = 0.1
n = 10
27
www.teuscher-lab.com Portland State University
MSE = mean((y-ŷ)2)
❖ Mean Squared error for:
❖ Ridge regression linear models
❖ Reservoir Layers:
❖ Chemical Oscillator 
❖ Echo State Network
❖ Oscillating Neural Network
Results:
NARMA-10
Best
28
❖ Plot showing the relation of size of the
network with performance.
❖ This was for a fully connected ESN.  
❖ ESN’s tend to have larger error bars for
smaller networks as the random
initializations can have a bigger impact
on performance.
www.teuscher-lab.com Portland State University
Results:
NARMA-10 ESN
29
❖ Plot showing the relation of size and
sparsity of the network with
performance.
❖ Sparsity = 1.0 is fully connected.
❖ There is not a significant deterioration in
accuracy as network connectivity is
reduced, network with fewer
connections could be easier to
implement.
www.teuscher-lab.com Portland State University
Results:
NARMA-10 ESN
= Linear, memoryless
= Linear, input memory = 50
30
❖ Plot showing the relation of size and spectral
radius of the network with performance.
❖ This was for an oscillating neural network  
❖ The spectral radius could translate to
characterizing the chemical efflux rate from
a micro-reactor setup
www.teuscher-lab.com Portland State University
Results:
NARMA-10 ONN
= Linear, memoryless
= Linear, input memory = 50
Dengue Virus.
www.teuscher-lab.com Portland State University
32
Structure of Dengue Virus
❖ ~11,000 base pairs, 4 Serotypes.
❖ Immunity to one serotype does not grant immunity to
others. 65% genome similarity. 3-5% variation in same
class. 
❖ 200 base pair subsampling of the untranslated region
(UTR) capsid regions of the genome.
Untranslated Region (UTR):
Important in the regulation of
translation and replication of
the viral genome.
Capsid: Protein shell that
protects the genetic material
of a virus. [Guzman]
Task: Classify 200 base pair subsample
from the untranslated region (UTR)
capsid regions of the genome
M.G. Guzman et al. “Dengue: A continuing global threat.” Nature Reviews Microbiology, 2010.
33
DNA Encoding
For physical
implementation, this
translates into an
amplification step with
the conversion of each
nucleotide into a
corresponding
oligonucleotide.
www.teuscher-lab.com Portland State University
❖ One-hot encoding of the nucleotides:
❖ A = [1, 0, 0, 0]
❖ T = [0, 1, 0, 0]
❖ C = [0, 0, 1, 0]
❖ G = [0, 0, 0, 1]
34
www.teuscher-lab.com Portland State University
Results:
Dengue Virus
❖ While the simulated chemical
oscillator struggled to effectively
differentiate the classes, relatively
small echo state networks and
oscillating neural networks
provided improvements in
separability of the classes.
Accuracies of the different classifiers on the task of Dengue serotype classification.
35
www.teuscher-lab.com Portland State University
Results:
Dengue Virus ESN
❖ Large error bars evident in smaller
networks due to initialization
variation.
❖ Networks larger than 50 nodes,
improvements over linear models
are evident.
36
www.teuscher-lab.com Portland State University
Results:
Dengue Virus ONN
❖ The improved performance at higher
spectral radius shows that having a
longer memory of previous inputs to
the network can improve performance.
❖ This increased memory and
computation accomplished with higher
node values allows for the extraction of
effective genomic features for
distinguishing serotype.
= Logistic regression threshold
Chemical Reaction
Networks.
www.teuscher-lab.com Portland State University
38
Portland State University
The
Belousov-
Zhabotinsky
Reaction
www.teuscher-lab.com
39
Molecular
Dynamics
❖ DNA can be used to construct
arbitrary chemical reactions.
❖ Rock, paper, scissors reaction as
described capable of non-linear
dynamics. [Srinivas] 
❖ Stochastic simulation can also be
explored for small numbers of
molecules using the Gillespie
algorithm [Rao]. 
N. Srinivas, et al,  “Enzyme-free nucleic acid dynamical systems,” Science, 2017.
C. V. Rao et al,  “Stochastic chemical kinetics and the quasi-steady-state assumption:  Application to the Gillespie algorithm,” The Journal of Chemical Physics, 2003.
40
Toehold Mediated Strand
Displacement (TMSD)
Boolean logic and arbitrary chemical reaction equations can be
implemented by cascading these circuits together. [Zhang, Srinivas]
How this type
of network
would be
physically
implemented.
D. Y. Zhang et al,. Dynamic DNA nanotechnology using strand displacement reactions. Nature Chemistry, 2011.
N. Srinivas, et al,  “Enzyme-free nucleic acid dynamical systems,” Science, 2017.
Conclusion.
www.teuscher-lab.com Portland State University
42
Contributions.
❖ Prepared tutorials for others in the lab for reading and encoding genomic data files from the splice junction gene
sequences dataset. Along with examples for creating echo state networks using PyTorch and Keras at
https:/
/github.com/Neighborc/tlabTutorial
❖ Evaluate the simulated use of a chemical oscillator as a reservoir layer for sequence prediction tasks. Showing that these
chemical oscillators can improve performance above the memoryless linear baseline for NARMA-A from MSE= 0.42 to 0.34.
and for NARMA-10 from MSE=8.8x103
to 6.6x103
.
❖ Evaluate the simulated use of a chemical oscillator as a reservoir layer for the MNIST dataset achieving 90.6% classification
accuracy
❖ Explored the use of Echo State Networks for the multivariate sequence classification task for dengue genome serotype
classification achieving 96.4% accuracy 
❖ Presented a research poster at the 2019 International Conference on DNA Computing and Molecular Programming in
Seattle, WA, where I published a peer-reviewed abstract for my poster in the conference poster proceedings. Title:
“Reservoir Computing for Genome Sequence Classification."
❖ Currently drafting a paper based on this research for submission to the Artificial Life journal
www.teuscher-lab.com Portland State University
43
Conclusions.
❖ By implementing these networks in
DNA and perturbing them with a
nucleotide input sequence, the task
of pathogen classification can be
accomplished 
❖ This work demonstrates that
randomized non-linear networks of
limited size and complexity can
achieve an improvement over linear
baseline models
www.teuscher-lab.com Portland State University
Future work
❖ Expand the section of genome
sampled.
❖ Adjust subsample length to be
smaller and characterize accuracy
loss.
❖ Encoding input in a different way
such as through k-mer one-hot
encoding.
Questions?
www.teuscher-lab.com Portland State University

More Related Content

What's hot

NRNB Annual Report 2018
NRNB Annual Report 2018NRNB Annual Report 2018
NRNB Annual Report 2018Alexander Pico
 
NRNB Annual Report 2016: Overall
NRNB Annual Report 2016: OverallNRNB Annual Report 2016: Overall
NRNB Annual Report 2016: OverallAlexander Pico
 
NetBioSIG2014-Talk by Hyunghoon Cho
NetBioSIG2014-Talk by Hyunghoon ChoNetBioSIG2014-Talk by Hyunghoon Cho
NetBioSIG2014-Talk by Hyunghoon ChoAlexander Pico
 
NetBioSIG2013-Talk Vuk Janjic
NetBioSIG2013-Talk Vuk JanjicNetBioSIG2013-Talk Vuk Janjic
NetBioSIG2013-Talk Vuk JanjicAlexander Pico
 
Technology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive NetworksTechnology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive NetworksAlexander Pico
 
NetBioSIG2014-Talk by Tijana Milenkovic
NetBioSIG2014-Talk by Tijana MilenkovicNetBioSIG2014-Talk by Tijana Milenkovic
NetBioSIG2014-Talk by Tijana MilenkovicAlexander Pico
 
NetBioSIG2013-KEYNOTE Benno Schwikowski
NetBioSIG2013-KEYNOTE Benno SchwikowskiNetBioSIG2013-KEYNOTE Benno Schwikowski
NetBioSIG2013-KEYNOTE Benno SchwikowskiAlexander Pico
 
NRNB Annual Report 2017
NRNB Annual Report 2017NRNB Annual Report 2017
NRNB Annual Report 2017Alexander Pico
 
NetBioSIG2013-Talk Gang Su
NetBioSIG2013-Talk Gang SuNetBioSIG2013-Talk Gang Su
NetBioSIG2013-Talk Gang SuAlexander Pico
 
NetBioSIG2013-Talk Tijana Milenkovic
NetBioSIG2013-Talk Tijana MilenkovicNetBioSIG2013-Talk Tijana Milenkovic
NetBioSIG2013-Talk Tijana MilenkovicAlexander Pico
 
A Reliable Password-based User Authentication Scheme for Web-based Human Geno...
A Reliable Password-based User Authentication Scheme for Web-based Human Geno...A Reliable Password-based User Authentication Scheme for Web-based Human Geno...
A Reliable Password-based User Authentication Scheme for Web-based Human Geno...Thitichai Sripan
 
Brain Network - Thalamocortical Motif
Brain Network - Thalamocortical MotifBrain Network - Thalamocortical Motif
Brain Network - Thalamocortical MotifJimmy Lu
 
NetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina KutmonNetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina KutmonAlexander Pico
 
NetBioSIG2012 chrisevelo
NetBioSIG2012 chriseveloNetBioSIG2012 chrisevelo
NetBioSIG2012 chriseveloAlexander Pico
 
Exploring Complex Networks
Exploring Complex NetworksExploring Complex Networks
Exploring Complex NetworksJimmy Lu
 
Presentation
PresentationPresentation
Presentationsidra ali
 
NetBioSIG2013-Talk David Amar
NetBioSIG2013-Talk David AmarNetBioSIG2013-Talk David Amar
NetBioSIG2013-Talk David AmarAlexander Pico
 

What's hot (20)

NRNB Annual Report 2018
NRNB Annual Report 2018NRNB Annual Report 2018
NRNB Annual Report 2018
 
NRNB EAC Report 2011
NRNB EAC Report 2011NRNB EAC Report 2011
NRNB EAC Report 2011
 
NRNB Annual Report 2016: Overall
NRNB Annual Report 2016: OverallNRNB Annual Report 2016: Overall
NRNB Annual Report 2016: Overall
 
NetBioSIG2014-Talk by Hyunghoon Cho
NetBioSIG2014-Talk by Hyunghoon ChoNetBioSIG2014-Talk by Hyunghoon Cho
NetBioSIG2014-Talk by Hyunghoon Cho
 
NetBioSIG2013-Talk Vuk Janjic
NetBioSIG2013-Talk Vuk JanjicNetBioSIG2013-Talk Vuk Janjic
NetBioSIG2013-Talk Vuk Janjic
 
Technology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive NetworksTechnology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive Networks
 
NetBioSIG2014-Talk by Tijana Milenkovic
NetBioSIG2014-Talk by Tijana MilenkovicNetBioSIG2014-Talk by Tijana Milenkovic
NetBioSIG2014-Talk by Tijana Milenkovic
 
NetBioSIG2013-KEYNOTE Benno Schwikowski
NetBioSIG2013-KEYNOTE Benno SchwikowskiNetBioSIG2013-KEYNOTE Benno Schwikowski
NetBioSIG2013-KEYNOTE Benno Schwikowski
 
NRNB Annual Report 2017
NRNB Annual Report 2017NRNB Annual Report 2017
NRNB Annual Report 2017
 
NetBioSIG2013-Talk Gang Su
NetBioSIG2013-Talk Gang SuNetBioSIG2013-Talk Gang Su
NetBioSIG2013-Talk Gang Su
 
10.1.1.80.2149
10.1.1.80.214910.1.1.80.2149
10.1.1.80.2149
 
NetBioSIG2013-Talk Tijana Milenkovic
NetBioSIG2013-Talk Tijana MilenkovicNetBioSIG2013-Talk Tijana Milenkovic
NetBioSIG2013-Talk Tijana Milenkovic
 
A Reliable Password-based User Authentication Scheme for Web-based Human Geno...
A Reliable Password-based User Authentication Scheme for Web-based Human Geno...A Reliable Password-based User Authentication Scheme for Web-based Human Geno...
A Reliable Password-based User Authentication Scheme for Web-based Human Geno...
 
Brain Network - Thalamocortical Motif
Brain Network - Thalamocortical MotifBrain Network - Thalamocortical Motif
Brain Network - Thalamocortical Motif
 
NetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina KutmonNetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina Kutmon
 
NetBioSIG2012 chrisevelo
NetBioSIG2012 chriseveloNetBioSIG2012 chrisevelo
NetBioSIG2012 chrisevelo
 
Exploring Complex Networks
Exploring Complex NetworksExploring Complex Networks
Exploring Complex Networks
 
Presentation
PresentationPresentation
Presentation
 
A tutorial in Connectome Analysis (2) - Marcus Kaiser
A tutorial in Connectome Analysis (2) - Marcus KaiserA tutorial in Connectome Analysis (2) - Marcus Kaiser
A tutorial in Connectome Analysis (2) - Marcus Kaiser
 
NetBioSIG2013-Talk David Amar
NetBioSIG2013-Talk David AmarNetBioSIG2013-Talk David Amar
NetBioSIG2013-Talk David Amar
 

Similar to Minimum Complexity Echo State Networks For Genome and Sequence Analysis

Open Source Networking Solving Molecular Analysis of Cancer
Open Source Networking Solving Molecular Analysis of CancerOpen Source Networking Solving Molecular Analysis of Cancer
Open Source Networking Solving Molecular Analysis of CancerOpen Networking Summit
 
NARCO ANALYSIS AND DEEP LEARNING
NARCO ANALYSIS AND DEEP LEARNINGNARCO ANALYSIS AND DEEP LEARNING
NARCO ANALYSIS AND DEEP LEARNINGIRJET Journal
 
Chapter 5 applications of neural networks
Chapter 5           applications of neural networksChapter 5           applications of neural networks
Chapter 5 applications of neural networksPunit Saini
 
Dissertation character recognition - Report
Dissertation character recognition - ReportDissertation character recognition - Report
Dissertation character recognition - Reportsachinkumar Bharadva
 
Quantum neural network
Quantum neural networkQuantum neural network
Quantum neural networksurat murthy
 
DNA sequencing: rapid improvements and their implications
DNA sequencing: rapid improvements and their implicationsDNA sequencing: rapid improvements and their implications
DNA sequencing: rapid improvements and their implicationsJeffrey Funk
 
Study of Membrane Transport for Protein Filtration Using Artificial Neural Ne...
Study of Membrane Transport for Protein Filtration Using Artificial Neural Ne...Study of Membrane Transport for Protein Filtration Using Artificial Neural Ne...
Study of Membrane Transport for Protein Filtration Using Artificial Neural Ne...IJERDJOURNAL
 
Reliable and Efficient Data Acquisition in Wireless Sensor Network
Reliable and Efficient Data Acquisition in Wireless Sensor NetworkReliable and Efficient Data Acquisition in Wireless Sensor Network
Reliable and Efficient Data Acquisition in Wireless Sensor NetworkIJMTST Journal
 
Cao report 2007-2012
Cao report 2007-2012Cao report 2007-2012
Cao report 2007-2012Elif Ceylan
 
informatics_future.pdf
informatics_future.pdfinformatics_future.pdf
informatics_future.pdfAdhySugara2
 
28 15017 estimation of turbidity in water(edit)
28 15017 estimation of turbidity in water(edit)28 15017 estimation of turbidity in water(edit)
28 15017 estimation of turbidity in water(edit)nooriasukmaningtyas
 
A Study of Deep Learning Applications
A Study of Deep Learning ApplicationsA Study of Deep Learning Applications
A Study of Deep Learning Applicationsijtsrd
 
Biotechnophysics: DNA Nanopore Sequencing
Biotechnophysics: DNA Nanopore SequencingBiotechnophysics: DNA Nanopore Sequencing
Biotechnophysics: DNA Nanopore SequencingMelanie Swan
 
20131019 生物物理若手 Journal Club
20131019 生物物理若手 Journal Club20131019 生物物理若手 Journal Club
20131019 生物物理若手 Journal ClubMed_KU
 

Similar to Minimum Complexity Echo State Networks For Genome and Sequence Analysis (20)

Open Source Networking Solving Molecular Analysis of Cancer
Open Source Networking Solving Molecular Analysis of CancerOpen Source Networking Solving Molecular Analysis of Cancer
Open Source Networking Solving Molecular Analysis of Cancer
 
NARCO ANALYSIS AND DEEP LEARNING
NARCO ANALYSIS AND DEEP LEARNINGNARCO ANALYSIS AND DEEP LEARNING
NARCO ANALYSIS AND DEEP LEARNING
 
Chapter 5 applications of neural networks
Chapter 5           applications of neural networksChapter 5           applications of neural networks
Chapter 5 applications of neural networks
 
Dissertation character recognition - Report
Dissertation character recognition - ReportDissertation character recognition - Report
Dissertation character recognition - Report
 
Collins seattle-2014-final
Collins seattle-2014-finalCollins seattle-2014-final
Collins seattle-2014-final
 
Quantum neural network
Quantum neural networkQuantum neural network
Quantum neural network
 
DNA sequencing: rapid improvements and their implications
DNA sequencing: rapid improvements and their implicationsDNA sequencing: rapid improvements and their implications
DNA sequencing: rapid improvements and their implications
 
Study of Membrane Transport for Protein Filtration Using Artificial Neural Ne...
Study of Membrane Transport for Protein Filtration Using Artificial Neural Ne...Study of Membrane Transport for Protein Filtration Using Artificial Neural Ne...
Study of Membrane Transport for Protein Filtration Using Artificial Neural Ne...
 
UNMSymposium2014
UNMSymposium2014UNMSymposium2014
UNMSymposium2014
 
Anuj 21PCE039.pptx
Anuj 21PCE039.pptxAnuj 21PCE039.pptx
Anuj 21PCE039.pptx
 
Reliable and Efficient Data Acquisition in Wireless Sensor Network
Reliable and Efficient Data Acquisition in Wireless Sensor NetworkReliable and Efficient Data Acquisition in Wireless Sensor Network
Reliable and Efficient Data Acquisition in Wireless Sensor Network
 
Cao report 2007-2012
Cao report 2007-2012Cao report 2007-2012
Cao report 2007-2012
 
informatics_future.pdf
informatics_future.pdfinformatics_future.pdf
informatics_future.pdf
 
28 15017 estimation of turbidity in water(edit)
28 15017 estimation of turbidity in water(edit)28 15017 estimation of turbidity in water(edit)
28 15017 estimation of turbidity in water(edit)
 
A Study of Deep Learning Applications
A Study of Deep Learning ApplicationsA Study of Deep Learning Applications
A Study of Deep Learning Applications
 
presentation
presentationpresentation
presentation
 
Biotechnophysics: DNA Nanopore Sequencing
Biotechnophysics: DNA Nanopore SequencingBiotechnophysics: DNA Nanopore Sequencing
Biotechnophysics: DNA Nanopore Sequencing
 
20131019 生物物理若手 Journal Club
20131019 生物物理若手 Journal Club20131019 生物物理若手 Journal Club
20131019 生物物理若手 Journal Club
 
PhD Defense
PhD DefensePhD Defense
PhD Defense
 
Lodhi final viva voce
Lodhi final viva voceLodhi final viva voce
Lodhi final viva voce
 

Recently uploaded

Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service LucknowAminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknowmakika9823
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...Suhani Kapoor
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 

Recently uploaded (20)

Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service LucknowAminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Decoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in ActionDecoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in Action
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 

Minimum Complexity Echo State Networks For Genome and Sequence Analysis

  • 1. Minimum Complexity Echo State Networks For  Genome and Sequence Analysis DR. CHRISTOF TEUSCHER PORTLAND STATE UNIVERSITY DEPARTMENT OF ELECTRICAL & COMPUTER ENGINEERING (ECE) www.teuscher-lab.com Portland State University 1 This work was supported by the National Science Foundation under grant no. 1518833 Chris Neighbor
  • 2. 2 www.teuscher-lab.com Portland State University 01 02 03 04 05 06 07 08 Introduction. Neural Networks. Reservoir Computing. Oscillating Neural Networks. Results. Dengue Virus. Chemical Reaction Networks. Conclusion.
  • 4. 4 What’s the problem? ❖ Increasing viral illnesses threatens global human health and welfare.  ❖ Dengue virus infects 390 million people a year with reports increasing 6-fold from 2010 to 2016. Around 9,000 deaths per year. [WHO] ❖ Developing tools which can directly detect the virus can aid in early detection. The Coronavirus -- www.cdc.gov/coronavirus  https:/ /www.who.int/news-room/fact-sheets/detail/dengue- and-severe-dengue, 
  • 5. 5 What’s the solution? DNA Computing. A tool implemented using DNA molecules which has promising applications in medical situations due to its biocompatibility and potential for the direct sensing of biological molecules. www.teuscher-lab.com Portland State University
  • 6. 6 How is it currently done? ❖ Require viral isolation and incubation for sensing and can still have an only 40% detection rate. [Tang] ❖ Other methods like Polymerase Chain Reaction (PCR) requires special equipment and trained technicians for detection. Current Practices DNA Computing ❖ Can perform sensing, computation, and classification in a self-contained system biocompatible with the body. ❖ Has potential for early detection before symptoms appear. K. Tang et al, “Diagnosis of dengue: an update”, Expert Review of Anti-infective Therapy, 2012
  • 7. 7 What are we trying to do? ❖ Implement a machine learning architecture on DNA. ❖ Explore the use of dynamic randomized networks in order to accomplish the task of pathogen classification.  ❖ With the aim that these kinds of randomized computing systems could be more easily implemented in DNA.  ❖ Identify accuracy and complexity tradeoffs by exploring impact size has on network performance.  www.teuscher-lab.com Portland State University
  • 9. 9 What is a “Neuron”? www.teuscher-lab.com Portland State University https:/ /en.wikipedia.org/wiki/Biological_neuron_model Activation Functions: we predominantly used tanh
  • 10. 10 What is a “Neural Network”? ❖ Feed Forward multi-layered perceptron network, “Deep Learning”.  ❖ The input layer is passed forward through the hidden layer then on to the output layer. ❖ Each of the nodes within the hidden layer are a perceptron.  ❖ Due to its multi-layered structure it is capable of solving non-linear separation problems. www.teuscher-lab.com Portland State University
  • 11. 11 What is a “Recurrent Neural Network”? ❖ By adding a feedback into the hidden layer, features across time can be extracted. ❖ In this model, memory and computation are linked together in the network.  ❖ Other nodes can be used which can have a separate memory state. www.teuscher-lab.com Portland State University
  • 13. 13 What is “Reservoir Computing”? ❖ An input layer to the system which is randomly initialized. ❖ A fixed dynamic hidden layer known as the “reservoir” which extracts features. ❖ Liquid State Machines: spiking nodes  ❖ Echo State Networks: simple recurrent nodes ❖ Other dynamical systems and substrates, e.g. oscillators, self-assembled nanowires, and random chemical reaction network [Lukoševičius] ❖ A final linear readout layer which can be optimized to match a target output. ❖ A main advantage of reservoir computing is the low learning complexity because only the output layer is trained. Three components: M. Lukoševičius, “A practical guide to applying echo state networks,” in Neural Networks: Tricks of the Trade, 2012 X[n + 1] =f(Win U[n] +Wres X[n]) Y [n] =g(X[n]Wout )
  • 14. 14 Optimizing the Linear Readout Layer. ❖ Classification: Logistic Regression & Support Vector Machine. ❖ Output probability of each class using softmax layer.  ❖ Can use gradient descent to train the readout layer. www.teuscher-lab.com Portland State University Training the final readout layer: Ridge Regression
  • 15. 15 Why Reservoir Computing? F. Wang, et al, “Implementing digital computing with DNA-based switching circuits,” Nature Communications , 2020 ❖ Molecular circuits can be difficult to scale due to: ❖ Cross-talk (network pathway signal interference). ❖ Errors from manufacturing of DNA strands which require purification. ❖ Requirement of orthogonality of the strands. [Wang] ❖ The reservoir layer is unstructured allowing for easier biochemical implementation. ❖ Fully or partially random networks of chemical circuits with a trained linear output layer can be used for complex tasks such as pathogen detection.
  • 16. Bridging the Gap: Reservoir Computing & Chemical Implementation. 16 Machine Learning Methods G. Dion et al, “Reservoir computing with a single delay-coupled non-linear mechanical oscillator,” Journal of Applied Physics, 2018 H. Jaeger, “The echo state approach to analysing and training recurrent neural networks-with an erratum note,” GMD Technical Report, 2001 K. M. Cherry et al, “Scaling up molecular pattern recognition with DNA-based winner-take-all neural networks,” Nature, 2018 L.  Qian et al,  “Neural network  computation with DNA  strand displacement cascades,” Nature, 2011. A. Hjelmfelt et al, “Chemical implementation of neural networks and turing machines,” Proceedings of the National Academy of Sciences, 1991. R. Lopez et al, “A molecular multi-gene classifier for disease diagnostics,” Nature Chemistry, 2018. A.  Goudarzi et al,  “DNA reservoir computing:   a novel molecular computing approach,” in International Workshop on DNA-Based Computers, 2013 W. Yahiro et al, “A reservoir computing approach for molecular computing,” in Artificial Life Conference Proceedings,  2018 D. Soloveichik et al, “DNA as a universal substrate for chemical kinetics, Proceedings of the National Academy of Sciences, 2010. Construction
  • 17. 17 What is an “Echo State Network”? ❖ From this variety of Neural Networks, we zoom in on one particular architecture. ❖ Echo State Networks (ESN), the hidden layer consists of randomized recurrent neural networks with a trained output layer. The ESN is a subset of reservoir computing. www.teuscher-lab.com Portland State University Echo State Network (ESN) https:/ /www.asimovinstitute.org/neural-network-zoo/
  • 18. 18 ESN – Initialization Property. ❖ Initialize the weight matrix from Uniform[-1,1]. ❖ Divide by the absolute value of the largest eigenvalue. ❖ Scale by the desired spectral radius value <1. www.teuscher-lab.com Portland State University ❖ Randomness allows for less precise constraints for implementation as the scale of these networks is increased. ❖ Readout complexity increases linearly as network size increases.  Echo State Property Initialization Process
  • 19. 19 ESN – Effect of Spectral Radius. www.teuscher-lab.com Portland State University Spectral radius = 0.95 Spectral radius = 1.1 Input: 𝛿[n], (1 at n=0 and 0 afterwards), Nodes=100 The spectral radius is the largest eigenvalue of the hidden weight matrix.
  • 20. 20 ESN – Effect of Sparsity. Sparsity = 0.1 Fully Connected, Sparsity = 1.0 Input: 𝛿[n] N=100 Local networks can form with oscillations between them. [Lukoševičius] M. Lukoševičius, “A practical guide to applying echo state networks,” in Neural Networks: Tricks of the Trade, pp. 659–686, Springer, 2012.
  • 22. 22 Oscillating Neural Network ❖ Direct conversions between weights and reaction pathways. ❖ Chemical Reaction Concepts: ❖ Autocatalysis: Feedback to its own reaction,  ❖ Inhibition: Negative feedback to other reactions,  ❖ Activation: Positive feedback to reactions Based on three DNA inhibitor circuits with a product and substrate reaction. [Goudarzi, Yahiro] A.  Goudarzi et al,,  “DNA reservoir computing:   a novel molecular computing approach,” in International Workshop on DNA-Based Computers, 2013 W. Yahiro et al, “A reservoir computing approach for molecular computing,” in Artificial Life Conference Proceedings, 2018
  • 23. 23 Why ONN? ❖ Examining impact of size and spectral radius on the computational capabilities of an oscillating system, we can translate these insights into chemical oscillator scale, rate of reactions, and efflux rate. ❖ Implement in chemistry as it has a direct one to one conversion capability. ❖ These ONNs can have qualitatively similar outputs to chemical oscillators once equilibrium is shifted and normalized. www.teuscher-lab.com Portland State University
  • 24. 24 Tasks. ❖ NARMA-Task A ❖ NARMA-10 ❖ MNIST ❖ Splice Junction Gene Sequences ❖ Dengue Virus Genome Serotypes www.teuscher-lab.com Portland State University *For others, please see Thesis.
  • 26. 26 What is “NARMA”? NARMA-10 - The Nonlinear Autoregressive Moving Average has long term non-linear dependencies and is generated using the equation: www.teuscher-lab.com Portland State University 𝛼 = 0.3 𝛽 = 0.05  u is a random vector~U [0,0.5] 𝛾 = 1.5 𝛿 = 0.1 n = 10
  • 27. 27 www.teuscher-lab.com Portland State University MSE = mean((y-ŷ)2) ❖ Mean Squared error for: ❖ Ridge regression linear models ❖ Reservoir Layers: ❖ Chemical Oscillator  ❖ Echo State Network ❖ Oscillating Neural Network Results: NARMA-10 Best
  • 28. 28 ❖ Plot showing the relation of size of the network with performance. ❖ This was for a fully connected ESN.   ❖ ESN’s tend to have larger error bars for smaller networks as the random initializations can have a bigger impact on performance. www.teuscher-lab.com Portland State University Results: NARMA-10 ESN
  • 29. 29 ❖ Plot showing the relation of size and sparsity of the network with performance. ❖ Sparsity = 1.0 is fully connected. ❖ There is not a significant deterioration in accuracy as network connectivity is reduced, network with fewer connections could be easier to implement. www.teuscher-lab.com Portland State University Results: NARMA-10 ESN = Linear, memoryless = Linear, input memory = 50
  • 30. 30 ❖ Plot showing the relation of size and spectral radius of the network with performance. ❖ This was for an oscillating neural network   ❖ The spectral radius could translate to characterizing the chemical efflux rate from a micro-reactor setup www.teuscher-lab.com Portland State University Results: NARMA-10 ONN = Linear, memoryless = Linear, input memory = 50
  • 32. 32 Structure of Dengue Virus ❖ ~11,000 base pairs, 4 Serotypes. ❖ Immunity to one serotype does not grant immunity to others. 65% genome similarity. 3-5% variation in same class.  ❖ 200 base pair subsampling of the untranslated region (UTR) capsid regions of the genome. Untranslated Region (UTR): Important in the regulation of translation and replication of the viral genome. Capsid: Protein shell that protects the genetic material of a virus. [Guzman] Task: Classify 200 base pair subsample from the untranslated region (UTR) capsid regions of the genome M.G. Guzman et al. “Dengue: A continuing global threat.” Nature Reviews Microbiology, 2010.
  • 33. 33 DNA Encoding For physical implementation, this translates into an amplification step with the conversion of each nucleotide into a corresponding oligonucleotide. www.teuscher-lab.com Portland State University ❖ One-hot encoding of the nucleotides: ❖ A = [1, 0, 0, 0] ❖ T = [0, 1, 0, 0] ❖ C = [0, 0, 1, 0] ❖ G = [0, 0, 0, 1]
  • 34. 34 www.teuscher-lab.com Portland State University Results: Dengue Virus ❖ While the simulated chemical oscillator struggled to effectively differentiate the classes, relatively small echo state networks and oscillating neural networks provided improvements in separability of the classes. Accuracies of the different classifiers on the task of Dengue serotype classification.
  • 35. 35 www.teuscher-lab.com Portland State University Results: Dengue Virus ESN ❖ Large error bars evident in smaller networks due to initialization variation. ❖ Networks larger than 50 nodes, improvements over linear models are evident.
  • 36. 36 www.teuscher-lab.com Portland State University Results: Dengue Virus ONN ❖ The improved performance at higher spectral radius shows that having a longer memory of previous inputs to the network can improve performance. ❖ This increased memory and computation accomplished with higher node values allows for the extraction of effective genomic features for distinguishing serotype. = Logistic regression threshold
  • 39. 39 Molecular Dynamics ❖ DNA can be used to construct arbitrary chemical reactions. ❖ Rock, paper, scissors reaction as described capable of non-linear dynamics. [Srinivas]  ❖ Stochastic simulation can also be explored for small numbers of molecules using the Gillespie algorithm [Rao].  N. Srinivas, et al,  “Enzyme-free nucleic acid dynamical systems,” Science, 2017. C. V. Rao et al,  “Stochastic chemical kinetics and the quasi-steady-state assumption:  Application to the Gillespie algorithm,” The Journal of Chemical Physics, 2003.
  • 40. 40 Toehold Mediated Strand Displacement (TMSD) Boolean logic and arbitrary chemical reaction equations can be implemented by cascading these circuits together. [Zhang, Srinivas] How this type of network would be physically implemented. D. Y. Zhang et al,. Dynamic DNA nanotechnology using strand displacement reactions. Nature Chemistry, 2011. N. Srinivas, et al,  “Enzyme-free nucleic acid dynamical systems,” Science, 2017.
  • 42. 42 Contributions. ❖ Prepared tutorials for others in the lab for reading and encoding genomic data files from the splice junction gene sequences dataset. Along with examples for creating echo state networks using PyTorch and Keras at https:/ /github.com/Neighborc/tlabTutorial ❖ Evaluate the simulated use of a chemical oscillator as a reservoir layer for sequence prediction tasks. Showing that these chemical oscillators can improve performance above the memoryless linear baseline for NARMA-A from MSE= 0.42 to 0.34. and for NARMA-10 from MSE=8.8x103 to 6.6x103 . ❖ Evaluate the simulated use of a chemical oscillator as a reservoir layer for the MNIST dataset achieving 90.6% classification accuracy ❖ Explored the use of Echo State Networks for the multivariate sequence classification task for dengue genome serotype classification achieving 96.4% accuracy  ❖ Presented a research poster at the 2019 International Conference on DNA Computing and Molecular Programming in Seattle, WA, where I published a peer-reviewed abstract for my poster in the conference poster proceedings. Title: “Reservoir Computing for Genome Sequence Classification." ❖ Currently drafting a paper based on this research for submission to the Artificial Life journal www.teuscher-lab.com Portland State University
  • 43. 43 Conclusions. ❖ By implementing these networks in DNA and perturbing them with a nucleotide input sequence, the task of pathogen classification can be accomplished  ❖ This work demonstrates that randomized non-linear networks of limited size and complexity can achieve an improvement over linear baseline models www.teuscher-lab.com Portland State University Future work ❖ Expand the section of genome sampled. ❖ Adjust subsample length to be smaller and characterize accuracy loss. ❖ Encoding input in a different way such as through k-mer one-hot encoding.