Innovations in Sequencing & Bioinformatics
Talk for
Healthy Central Valley Together
Research Workshop
Jonathan A. Eisen
University of California, Davis
January 31, 2024
linktr.ee/jonathaneisen
Sequence-Based Surveillance Workflow
DNA
RNA
Amplicon
Shotgun
Target Enrichment
Sequence IFX
Sequencing Overview
Cost / Mb
https://www.genome.gov/about-genomics/fact-sheets/DNA-Sequencing-Costs-Data
Key Issues
• Cost / bp
• Read length
• Output per run
• Paired end
• Ease of feeding
• Error profiles and accuracy
• Barcoding potential
• Detection of modifications
• Sensitivity to low quality DNA/RNA
• RNA, DNA, or mixed RNA/DNA
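Two of these issues, cost per bp and output per run, are linked by simple arithmetic: cost per base is the run cost divided by the bases produced. A minimal Python sketch of that relationship follows. The platform names, run costs, and outputs are hypothetical placeholders, not figures from the talk or from any vendor.

```python
def cost_per_bp(run_cost_usd: float, output_bp: float) -> float:
    """Cost per base pair for one run: total run cost divided by bases produced."""
    if output_bp <= 0:
        raise ValueError("output_bp must be positive")
    return run_cost_usd / output_bp


def cost_per_mb(run_cost_usd: float, output_bp: float) -> float:
    """Cost per megabase (1 Mb = 1,000,000 bp)."""
    return cost_per_bp(run_cost_usd, output_bp) * 1_000_000


# Hypothetical runs for illustration only; these are not real prices or yields.
hypothetical_runs = {
    "platform_A": {"run_cost_usd": 1000.0, "output_bp": 10e9},
    "platform_B": {"run_cost_usd": 500.0, "output_bp": 1e9},
}

for name, run in hypothetical_runs.items():
    print(f"{name}: ${cost_per_mb(**run):.4f} per Mb")
```

A higher output per run lowers cost per bp only if the run is actually filled, which is where barcoding potential and ease of feeding come in.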
Evolution of Sequencing
Gen 0: Proto Sequencing
Gen 1: Manual Sequencing
• Sanger
• Maxam-Gilbert
Gen 2: Automated Sanger
Gen 3: Clusters Not Clones
• ABI-SOLiD
• Solexa-Illumina
• 454-Roche
• Ion Torrent
Gen 4: Single Molecule
• Oxford Nanopore
• PacBio
• Helicos
Output per Run
https://www.mdpi.com/2079-7737/12/7/997
Sample Prep Differences
https://www.sciencedirect.com/science/article/pii/S0048969723044923
Library Overview
Library Preparation
Multiplex PCR vs Capture
https://www.celemics.com/resources/blogs/target-capture-multiplex-pcr-vs-hybridization-capture/
Capture Methods
High throughput sample preparation
Illumina GAII workflow: sample preparation, cluster amplification, sequencing by synthesis, analysis pipeline
Nature Methods, 2010, 7: 111-118
• RainDance: microdroplet PCR. Reported capture efficiency: 84%
• Roche Nimblegen: solid-phase capture with custom-designed oligonucleotide microarray. Reported capture efficiency: 65-90%
• Agilent SureSelect: solution-phase capture with streptavidin-coated magnetic beads. Reported capture efficiency: 60-80%
From Slideshare presentation of Cosentino Cristian
http://www.slideshare.net/cosentia/high-throughput-equencing
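The capture efficiencies quoted above are percentages. One common way to arrive at such a figure is the share of mapped reads that fall on the targeted regions. That definition is an assumption here, not something stated in the cited slides, and the function name and read counts below are hypothetical.

```python
def capture_efficiency(on_target_reads: int, total_mapped_reads: int) -> float:
    """Percentage of mapped reads that overlap the targeted regions.

    Assumed definition: 100 * on-target reads / total mapped reads.
    Other definitions (for example, per-base rather than per-read) also exist.
    """
    if total_mapped_reads <= 0:
        raise ValueError("total_mapped_reads must be positive")
    if not 0 <= on_target_reads <= total_mapped_reads:
        raise ValueError("on_target_reads must be between 0 and total_mapped_reads")
    return 100.0 * on_target_reads / total_mapped_reads


# Hypothetical counts for illustration only.
print(f"{capture_efficiency(840_000, 1_000_000):.1f}%")
```

When comparing methods, it helps to confirm that each reported percentage uses the same definition before reading much into the differences.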
Small Amounts of DNA
http://www.epibio.com/docs/default-source/protocols/nextera-dna-sample-prep-kit-(illumina--compatible).pdf?sfvrsn=4
HiC
Suggested by Carlos Bustamante and Keith Bradnam
Dovetail “Chicago” Libraries
http://arxiv.org/abs/1502.05331
Suggested by
Single Cell Sequencing
Suggested by Keith Robison
Library Prep Questions
• Specific targets?
• Taxa?
• Genes (e.g., antibiotic resistance)
• How many targets?
• Total community analysis?
• Looking for novel taxa or subtypes?
• Looking for novel genes? (e.g., drug resistance)
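One way to read these questions is as a rough decision aid among the three library strategies in the workflow (amplicon, shotgun, target enrichment). The Python sketch below is an illustrative assumption, not a rule from the talk: the class, the function, the cutoff of 50 targets, and the mapping from answers to strategies are all hypothetical and would need tuning for a real surveillance program.

```python
from dataclasses import dataclass


@dataclass
class SurveillanceGoal:
    """Answers to the library prep questions (hypothetical structure)."""
    specific_targets: bool        # Known taxa or genes of interest?
    n_targets: int = 0            # How many targets?
    total_community: bool = False # Total community analysis?
    novel_discovery: bool = False # Looking for novel taxa, subtypes, or genes?


def suggest_library_strategy(goal: SurveillanceGoal,
                             max_amplicon_targets: int = 50) -> str:
    """Return a starting-point strategy; the cutoff is an assumed placeholder."""
    # Untargeted goals point toward sequencing everything in the sample.
    if goal.total_community or goal.novel_discovery:
        return "shotgun"
    # A small, well-defined target set is assumed to suit multiplex PCR amplicons.
    if goal.specific_targets and goal.n_targets <= max_amplicon_targets:
        return "amplicon"
    # Larger target sets are assumed to favor hybridization capture.
    if goal.specific_targets:
        return "target enrichment"
    return "shotgun"


print(suggest_library_strategy(SurveillanceGoal(specific_targets=True, n_targets=12)))
print(suggest_library_strategy(SurveillanceGoal(specific_targets=True, n_targets=5000)))
print(suggest_library_strategy(SurveillanceGoal(specific_targets=False, total_community=True)))
```

In practice the answers often conflict (for example, known targets plus a wish to find novel subtypes), so a sketch like this is at most a prompt for discussion.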
Informatics Overview
Shotgun Sequencing
DNA
inputs of fixed carbon or nitrogen from external sources. As with Leptospirillum group I, both Leptospirillum group II and III have the genes needed to fix carbon by means of the Calvin–Benson–Bassham cycle (using type II ribulose 1,5-bisphosphate carboxylase–oxygenase). All genomes recovered from the AMD system contain formate hydrogenlyase complexes. These, in combination with carbon monoxide dehydrogenase, may be used for carbon fixation via the reductive acetyl coenzyme A (acetyl-CoA) pathway by some, or all, organisms. Given the large number of ABC-type sugar and amino acid transporters encoded in the Ferroplasma type
Figure 4 Cell metabolic cartoons constructed from the annotation of 2,180 ORFs identified in the Leptospirillum group II genome (63% with putative assigned function) and 1,931 ORFs in the Ferroplasma type II genome (58% with assigned function). The cell cartoons are shown within a biofilm that is attached to the surface of an acid mine drainage stream (viewed in cross-section). Tight coupling between ferrous iron oxidation, pyrite dissolution and acid generation is indicated. Rubisco, ribulose 1,5-bisphosphate carboxylase–oxygenase. THF, tetrahydrofolate.
NATURE | doi:10.1038/nature02340 | www.nature.com/nature
©2004 Nature Publishing Group
Issues in Informatics
• Compute resources
• Remote analysis?
• Linking to other data (e.g., clinical, environmental)
• Reference data / databases
• Need support for FTEs: many “automated” analyses are good but need tuning
• Shotgun and targeted analyses are very different
Informatics Overview
https://www.sciencedirect.com/science/article/pii/S0048969723044923
