Applied Bioinformatics Journal Club Pacbio RNA-Seq

•Download as PPTX, PDF•

2 likes•1,152 views

Jennifer Shelton

Education

Journal Club
A single-molecule long-read survey of
the human transcriptome
Sharon et al., Nature Biotechnology 31, 1009–1014 (2013)
Sanzhen Liu
Plant Pathology
3/12/2014

PacBio technology
• Amplification-free sequencing
• Very long (up to 20kb, peak on 2-6 kb)
• High errors (random, no-context-specific errors)
PacBio website

CCS approach
• High-quality, single-molecule, circular-
consensus (CCS) reads
http://flxlexblog.wordpress.com/2013/02/11/applications-for-pacbio-circular-consensus-sequencing/

Figure 1
• Input: pooled RNAs from 20 tissues
• Approach: prepare double-stranded cDNAs -> CCS library -> PacBio sequencing
• Output: 476,000 CCS reads, mean=1kb
• 61% reads cover all introns and most first and last exons
• CCS reads well cover (generally >90%) short transcripts (<1.2 kb) but stay low
coverage for long transcripts, especially for those with >2.4 kb

Figure 2
Missing 3’ ends
Missing 5’ ends
The correlations
of the number of
reads and …
ERCC, mixture of known/quantified RNAs

Figure 3
• 67% molecules with splicing sites were estimated
• CSMM: consensus split-mapped molecule (accurate CCS reads with splicing sites?)
• Splicing sites well match annotated splicing sites
• PacBio (versus 454) exhibits much higher power to detect isoforms with >=10 introns
• Estimate: 21,000 genes and 139,000 isoforms can be detected with high-depth seq

Summary
• Full-length RNA of up to 1.5kb can readily be
monitored with little sequence loss at the 5’
ends
• With 476k CCS reads (>300bp), 14,000 spliced
genes were identified.
• The majority of introns are consistent with
annotations, but >10% are novel.

Conclusion
• Isoforms can be monitored at a single-molecule level
without amplification or fragmentation
• The majority of reads represent all splice sites of the
original transcripts
• Unannotated splice isoforms: long non-coding RNAs
with few introns and isoforms of known protein-
coding genes with many introns

Similar to Applied Bioinformatics Journal Club Pacbio RNA-Seq

Introduction to 16S Microbiome AnalysisBioinformatics and Computational Biosciences Branch

BioSB meeting 2015hansjansen9999

Exploiting long read sequencing technology to build a substantially improved ...Genome Reference Consortium

NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...VHIR Vall d’Hebron Institut de Recerca

Open pacbiomodelorgpaper j_landolin_20150121Jane Landolin

High Throughput Sequencing Technologies: What We Can KnowBrian Krueger

Microbial Phylogenomics (EVE161) Class 10-11: Genome SequencingJonathan Eisen

Next Generation Sequencing methods Zohaib HUSSAIN

Simple, rapid preparation of genomic libraries from single human cells and ba...Thermo Fisher Scientific

Long read sequencing - WEHI bioinformatics seminar - tue 16 june 2015Torsten Seemann

High Throughput Sequencing Technologies: On the path to the $0* genomeBrian Krueger

Johannes Bergsten Dna Barcodingbioinfocourse

So you want to do a: RNAseq experiment, Differential Gene Expression AnalysisUniversity of California, Davis

2013 pag-equine-workshopc.titus.brown

Making powerful science: an introduction to NGS and beyondAdamCribbs1

whole-genome-sequencing-guide-small-genomes.pdf.pdfCRISTIANALONSORODRIG1

Telomere-to-telomere assembly of a complete human chromosomesGenome Reference Consortium

PacMin @ AMPLab All-Handsfnothaft

GLBIO/CCBC Metagenomics WorkshopMorgan Langille

CALS_Stewards_of_Future_2015_Yow_IsoSeqAshley Yow

Similar to Applied Bioinformatics Journal Club Pacbio RNA-Seq (20)

Introduction to 16S Microbiome Analysis

BioSB meeting 2015

Exploiting long read sequencing technology to build a substantially improved ...

NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...

Open pacbiomodelorgpaper j_landolin_20150121

High Throughput Sequencing Technologies: What We Can Know

Microbial Phylogenomics (EVE161) Class 10-11: Genome Sequencing

Next Generation Sequencing methods

Simple, rapid preparation of genomic libraries from single human cells and ba...

Long read sequencing - WEHI bioinformatics seminar - tue 16 june 2015

High Throughput Sequencing Technologies: On the path to the $0* genome

Johannes Bergsten Dna Barcoding

So you want to do a: RNAseq experiment, Differential Gene Expression Analysis

2013 pag-equine-workshop

Making powerful science: an introduction to NGS and beyond

whole-genome-sequencing-guide-small-genomes.pdf.pdf

Telomere-to-telomere assembly of a complete human chromosomes

PacMin @ AMPLab All-Hands

GLBIO/CCBC Metagenomics Workshop

CALS_Stewards_of_Future_2015_Yow_IsoSeq

Recently uploaded

CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2

“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr

internship ppt on smartinternz platform as salesforce developerunnathinaik

Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN

9953330565 Low Rate Call Girls In Rohini Delhi NCR9953056974 Low Rate Call Girls In Saket, Delhi NCR

CELL CYCLE Division Science 8 quarter IV.pptxJiesonDelaCerna

भारत-रोम व्यापार.pptx, Indo-Roman Trade,Virag Sontakke

Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos

Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth

ESSENTIAL of (CS/IT/IS) class 06 (database)Dr. Mazin Mohamed alkathiri

Roles & Responsibilities in PharmacovigilanceSamikshaHamane

Painted Grey Ware.pptx, PGW Culture of IndiaVirag Sontakke

How to Configure Email Server in Odoo 17Celine George

Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1

Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari

Biting mechanism of poisonous snakes.pdfadityarao40181

18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a

Types of Journalistic Writing Grade 8.pptxEyham Joco

POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar

Alper Gobel In Media Res Media ComponentInMediaRes1

Recently uploaded (20)

CARE OF CHILD IN INCUBATOR..........pptx

“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...

internship ppt on smartinternz platform as salesforce developer

Solving Puzzles Benefits Everyone (English).pptx

9953330565 Low Rate Call Girls In Rohini Delhi NCR

CELL CYCLE Division Science 8 quarter IV.pptx

भारत-रोम व्यापार.pptx, Indo-Roman Trade,

Final demo Grade 9 for demo Plan dessert.pptx

Introduction to ArtificiaI Intelligence in Higher Education

ESSENTIAL of (CS/IT/IS) class 06 (database)

Roles & Responsibilities in Pharmacovigilance

Painted Grey Ware.pptx, PGW Culture of India

How to Configure Email Server in Odoo 17

Employee wellbeing at the workplace.pptx

Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf

Biting mechanism of poisonous snakes.pdf

18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf

Types of Journalistic Writing Grade 8.pptx

POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx

Alper Gobel In Media Res Media Component

Applied Bioinformatics Journal Club Pacbio RNA-Seq

1. Journal Club A single-molecule long-read survey of the human transcriptome Sharon et al., Nature Biotechnology 31, 1009–1014 (2013) Sanzhen Liu Plant Pathology 3/12/2014

2. PacBio technology • Amplification-free sequencing • Very long (up to 20kb, peak on 2-6 kb) • High errors (random, no-context-specific errors) PacBio website

3. CCS approach • High-quality, single-molecule, circular- consensus (CCS) reads http://flxlexblog.wordpress.com/2013/02/11/applications-for-pacbio-circular-consensus-sequencing/

4. Figure 1 • Input: pooled RNAs from 20 tissues • Approach: prepare double-stranded cDNAs -> CCS library -> PacBio sequencing • Output: 476,000 CCS reads, mean=1kb • 61% reads cover all introns and most first and last exons • CCS reads well cover (generally >90%) short transcripts (<1.2 kb) but stay low coverage for long transcripts, especially for those with >2.4 kb

5. Figure 2 Missing 3’ ends Missing 5’ ends The correlations of the number of reads and … ERCC, mixture of known/quantified RNAs

6. Figure 3 • 67% molecules with splicing sites were estimated • CSMM: consensus split-mapped molecule (accurate CCS reads with splicing sites?) • Splicing sites well match annotated splicing sites • PacBio (versus 454) exhibits much higher power to detect isoforms with >=10 introns • Estimate: 21,000 genes and 139,000 isoforms can be detected with high-depth seq

7. Summary • Full-length RNA of up to 1.5kb can readily be monitored with little sequence loss at the 5’ ends • With 476k CCS reads (>300bp), 14,000 spliced genes were identified. • The majority of introns are consistent with annotations, but >10% are novel.

8. Conclusion • Isoforms can be monitored at a single-molecule level without amplification or fragmentation • The majority of reads represent all splice sites of the original transcripts • Unannotated splice isoforms: long non-coding RNAs with few introns and isoforms of known protein- coding genes with many introns

Applied Bioinformatics Journal Club Pacbio RNA-Seq

Recommended

Recommended

More Related Content

Similar to Applied Bioinformatics Journal Club Pacbio RNA-Seq

Similar to Applied Bioinformatics Journal Club Pacbio RNA-Seq (20)

More from Jennifer Shelton

More from Jennifer Shelton (16)

Recently uploaded

Recently uploaded (20)

Applied Bioinformatics Journal Club Pacbio RNA-Seq