Yaoyu E. Wang, Ph.D
Center for Cancer Computational Biology, DFCI
SPECSII webinar
June 05, 2013
- Transcriptome profiling represents a static gene expression
state of a biological sample across the genome
- Allows for direct genomic comparisons across multiple samples
to determine genes that exhibit differential expression in
different states (e.g. normal vs. tumor)
- Allows for hypothesis generation on molecular abnormalities
and mechanisms that may contribute to the tumor phenotype
- Provides information on molecular subtypes, the development
of prognostic and predictive molecular signatures
- Two main technologies:
a. Microarray
b. RNA-Sequencing (RNASeq) using next generation
sequencing
Affymetrix
GeneChip
scanner
Blencowe B J et al. Genes Dev. 2009;23:1379-1386
Illumina HiSeq
.bcl files: raw files containing base calls and quality scores
CASAVA processing
•Demultiplexing
•Fastq file generation
•Sequencing filtering (Illumina-defined quality filters)
Split into Project and Sample Folders
Jones_Lab: ChIP_A, ChIP-B (Fastq Files)
Marcus_Lab: RNA-SeqA, RNA-SeqB, RNA-SeqC (Fastq Files)
Williams_Lab: Exome1, Exome2 (Fastq Files)
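The demultiplexing and filtering steps listed above can be illustrated with a minimal sketch. This is not CASAVA's implementation (CASAVA works from the .bcl files directly); it only shows the idea on already-generated FASTQ text, assuming CASAVA 1.8-style read headers whose comment field is `<read>:<is filtered>:<control number>:<index sequence>`. The barcodes, the barcode-to-sample mapping, and the example reads are hypothetical.

```python
from collections import defaultdict
from io import StringIO

# Hypothetical barcode-to-sample mapping; a real run would take this from a sample sheet.
SAMPLE_SHEET = {"ATCACG": "RNA-SeqA", "CGATGT": "RNA-SeqB"}


def read_fastq(handle):
    """Yield (header, sequence, quality) tuples from 4-line FASTQ records."""
    while True:
        header = handle.readline().rstrip()
        if not header:
            return
        seq = handle.readline().rstrip()
        handle.readline()  # '+' separator line, ignored
        qual = handle.readline().rstrip()
        yield header, seq, qual


def demultiplex(handle, sample_sheet):
    """Group reads by sample using the index in the header; drop reads flagged as filtered."""
    bins = defaultdict(list)
    for header, seq, qual in read_fastq(handle):
        # Assumed comment layout: <read>:<is filtered>:<control number>:<index sequence>
        comment = header.split(" ", 1)[1]
        _read, is_filtered, _control, index = comment.split(":")
        if is_filtered == "Y":  # read did not pass the instrument's quality filter
            continue
        sample = sample_sheet.get(index, "Undetermined")
        bins[sample].append((header, seq, qual))
    return bins


# Three made-up reads: two pass the filter, one is flagged 'Y' and is dropped.
EXAMPLE = """@HWI:1:FC:1:1:100:200 1:N:0:ATCACG
ACGT
+
IIII
@HWI:1:FC:1:1:101:201 1:Y:0:ATCACG
ACGA
+
####
@HWI:1:FC:1:1:102:202 1:N:0:CGATGT
TTGA
+
IIII
"""

bins = demultiplex(StringIO(EXAMPLE), SAMPLE_SHEET)
for sample, reads in sorted(bins.items()):
    print(sample, len(reads))
```

Reads whose index is not in the mapping are collected under "Undetermined" rather than discarded, so that barcode problems remain visible.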
Haas & Zody. Nature Biotechnology 28, 421–423 (2010)
Using known
annotations
And compare to
known annotations
•Differential Expression
•Differential Isoform Abundance
•RNA editing
•SNP, indel detection
Technology                                                  RNASeq       Microarray
High run-to-run reproducibility                             Yes          Yes
Dynamic range comparable to actual transcript abundance     >8000-fold   Hundred-fold
Able to detect alternative splice sites and novel isoforms  Yes          No
De novo analysis of samples without reference genome        Yes          No
Multiplexing samples in one run                             Yes          No
Required amount of total RNA                                >100 ng      ~1 ug
Re-analyzable data                                          Yes          No
Technology                                                  RNASeq       Microarray
Heterogeneity of read coverage across an expressed region   Yes          No
Well-understood sources of experimental bias                No           Yes
Data portable on a flash drive (~4 GB)                      No           Yes
Data is analyzable on any PC                                No           Yes
Cheaper cost per sample                                     No(?)        Yes(?)
RNA-Seq Experiment
GEO Database
White paper, Illumina
White paper, Illumina
Comparing Expression Profiles from Microarrays to RNASeq
n=7532 n=4537
Mooney M, PLoS ONE (2013)
10 Lymphoma (3 T-cell, 7 B-cell)
4 Normal lymph node
Total RNA
PE100 run
50-100 million
mapped reads
Compare 15,092 annotated genes on chip
Mooney M, PLoS ONE (2013)
T
NB
r=0.6; p<10^-15
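A correlation coefficient like the one reported above can be computed as in the following sketch. The slide does not say which correlation measure or transformation was used; a Pearson correlation on log2-transformed values is assumed here, and the per-gene values are made up for illustration.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical measurements for the same five genes on both platforms.
array_intensity = [120.0, 850.0, 40.0, 2300.0, 15.0]
rnaseq_counts = [35, 410, 6, 1900, 0]

# Log-transform both scales; the +1 pseudocount keeps zero counts defined.
log_array = [math.log2(v) for v in array_intensity]
log_rnaseq = [math.log2(c + 1) for c in rnaseq_counts]

r = pearson_r(log_array, log_rnaseq)
print(f"r = {r:.3f}")
```

In a real comparison the two vectors would hold one value per gene shared by both platforms, such as the annotated genes on the chip.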
C. elegans
Biological Replicates for
L2 and YA stages
Affy Tiling Arrays* Illumina RNASeq
Agarwal, BMC Genomics (2010)
* Covers the whole C. elegans genome
Differentially expressed genes between the L2 and YA stages
Agarwal, BMC Genomics (2010)
RNA-Seq and tiling arrays
Tiling Array
Microarray
Maximum
Sensitivity
RNASeq 11-plex
RNASeq 6-plex
Agarwal, BMC Genomics (2010)
Per Sample         Microarray   Illumina HiSeq
1 per Chip/Lane    $670         $4,010.00
2-plex             NA           $2,097.50
4-plex             NA           $1,141.25
6-plex             NA           $822.50
8-plex             NA           $663.13
6-plex
11-plex
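The per-sample HiSeq prices in the table are consistent with a simple model: a fixed lane cost shared by all samples in the lane plus a fixed per-sample cost. The two constants below are not stated on the slide; they are inferred from the table's numbers ($3,825 per lane and $185 per sample reproduce every row) and should be read as an assumption about how the prices were built.

```python
# Inferred from the table, not stated in the source:
LANE_COST = 3825.00        # cost of one HiSeq lane, shared by all multiplexed samples
PER_SAMPLE_COST = 185.00   # fixed cost per sample (e.g. library preparation)
MICROARRAY_COST = 670.00   # per-sample microarray price from the table


def cost_per_sample(plex):
    """Per-sample RNASeq cost when `plex` samples share one lane."""
    return LANE_COST / plex + PER_SAMPLE_COST


for plex in (1, 2, 4, 6, 8):
    print(f"{plex}-plex: ${cost_per_sample(plex):,.3f}")

# Smallest multiplexing level at which RNASeq is cheaper per sample than a microarray.
break_even = next(p for p in range(1, 100) if cost_per_sample(p) < MICROARRAY_COST)
print("break-even plex:", break_even)
```

Under this model RNASeq becomes cheaper per sample than the microarray only at 8-plex, which matches the last row of the table. Higher multiplexing also means fewer reads per sample.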
             Data per     Time to download   Time to download   Cost to store on the
             sample       1 sample           100 samples        cloud per month
RNASeq       30-65 GB     1 hr               6 days             $270
Microarray   30 MB        5 seconds          8 minutes          $0.30
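The download and storage figures above can be approximated with a back-of-the-envelope calculation. Everything in this sketch beyond the table's own numbers is an assumption: the bandwidth is inferred from the microarray row (30 MB in 5 seconds), the storage price of $0.09 per GB per month is a hypothetical rate chosen because it reproduces the $270 figure, and the monthly cost is assumed to cover 100 samples at the low end of the RNASeq size range.

```python
BANDWIDTH_MB_PER_S = 30 / 5      # inferred from the microarray row: 30 MB in 5 seconds
PRICE_PER_GB_MONTH = 0.09        # hypothetical cloud storage price, not from the source
N_SAMPLES = 100

RNASEQ_MB = 30_000               # low end of the 30-65 GB range (1 GB = 1000 MB)
MICROARRAY_MB = 30


def download_seconds(size_mb, n_samples=1):
    """Time to download n_samples files of size_mb each at the assumed bandwidth."""
    return size_mb * n_samples / BANDWIDTH_MB_PER_S


def monthly_storage_cost(size_mb, n_samples):
    """Monthly cost of storing n_samples files of size_mb each at the assumed price."""
    return size_mb / 1000 * n_samples * PRICE_PER_GB_MONTH


rnaseq_days = download_seconds(RNASEQ_MB, N_SAMPLES) / 86_400
microarray_minutes = download_seconds(MICROARRAY_MB, N_SAMPLES) / 60

print(f"RNASeq, 100 samples: {rnaseq_days:.1f} days to download")
print(f"Microarray, 100 samples: {microarray_minutes:.1f} minutes to download")
print(f"RNASeq storage: ${monthly_storage_cost(RNASEQ_MB, N_SAMPLES):.2f} per month")
print(f"Microarray storage: ${monthly_storage_cost(MICROARRAY_MB, N_SAMPLES):.2f} per month")
```

With these assumptions the sketch lands close to the table's 6 days, 8 minutes, $270, and $0.30. Whatever the exact rates, the gap between the two technologies is roughly three orders of magnitude.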
http://www.ncbi.nlm.nih.gov/genbank/statistics
- Applications with a user interface for RNA-Seq analysis (e.g. Galaxy) can only
handle very few samples
- Knowledge of Linux servers, a scripting language, and a programming language is
absolutely REQUIRED
- Lack of detailed understanding of NGS technology and data leads to
diverse bioinformatics tools with different characteristics
Law WC, Voom!, Bioconductor (2013)
The answer is Yes
- Transcriptome profiles generated by microarray and RNASeq
are in strong concordance
- Microarray data generated in the last decades is durable
- RNASeq offers a lot more biological information
than microarray, and its data is re-analyzable
- NGS is getting cheaper
However, the devil is in the data
- NGS data is a lot more expensive to store and analyze
- Specialized computing infrastructure and personnel are
required to take advantage of the information from NGS data

Comparison between RNASeq and Microarray for Gene Expression Analysis

  • 1. Yaoyu E. Wang, Ph.D Center for Cancer Computational Biology, DFCI SPECSII webinar June 05, 2013
  • 2. - Transcriptome profiling represents a static gene expression state of a biological sample across the genome - Allows for direct genomic comparisons with multiple samples to determine genes that exhibit differential expression in different state (i.e. normal vs. tumor) - Allows for hypothesis generation on molecular abnormalities and mechanisms that may contribute to the tumor phenotype - Provides information on molecular subtypes, the development of prognostic and predictive molecular signatures - Two main technologies: a. Microarray b. RNA-Sequencing (RNASeq) using next generation sequencing
  • 4. Blencowe B J et al. Genes Dev. 2009;23:1379-1386 Illumina HiSeq
  • 5. .bcl files CASAVA processing •Demultiplexing •Fastq file generation •Sequencing filtering Raw files containing base calls and quality scores Illumina defined quality filters Split into Project and Sample Folders Jones_Lab ChIP_A ChIP-B Marcus_Lab RNA-SeqA RNA-SeqB RNA-SeqC Williams_Lab Exome1 Exome2 Fastq Files Fastq Files Fastq Files
  • 6. Haas & Zody. Nature Biotechnology 28, 421–423 (2010) Using known annotations And compare to known annotations •Differential Expression •Differential Isoform Abundance •RNA editing •SNP, indel detection
  • 7. Technology RNASeq Microarray High run-to run reproducibility Yes Yes Dynamic Range Comparable to actual transcript abundance >8000-fold Hundred fold Able to detect alternative splice site and novel isoforms Yes No De novo analysis of samples without reference genome Yes No Multiplexing Samples in one run Yes No Required amount of total RNA >100 ng ~1 ug Re-analyzable data Yes No
  • 8. Technology RNASeq Microarray Heterogeneity of read coverage across an expressed region Yes No Well understood sources of experimental bias No Yes Data portable on a flush drive (~4G) No Yes Data is analyzable by any PC No Yes Cheaper cost per sample No(?) Yes(?)
  • 12. Comparing Expression Profiles from Microarrays to RNASeq n=7532 n=4537
  • 13. Mooney M, PLoS ONE (2013) 10 Lymphoma (3 T-cell, 7 B-cell) 4 Normal lymph node Total RNA PE100 run 50-100 million mapped reads Compare 15,092 annotated genes on chip
  • 14. Mooney M, PLoS ONE (2013) T N B r=0.6; p<10^-15
  • 15. C. elegans Biological Replicates for L2 and YA stages Affy Tiling Arrays* Illumina RNASeq Agarwal, BMC Genomics (2010) * Covers whole C. elegans genome
  • 16. Differentially expressed genes between the L2 and YA stages Agarwal, BMC Genomics (2010)
  • 17. RNA-Seq and tiling arrays Tiling Array Microarray Maximum Sensitivity RNASeq 11-plex RNASeq 6-plex Agarwal, BMC Genomics (2010)
  • 18.
  • 19. Cost per sample: Microarray | Illumina HiSeq. 1 per chip/lane: $670 | $4,010.00. 2-plex: NA | $2,097.50. 4-plex: NA | $1,141.25. 6-plex: NA | $822.50. 8-plex: NA | $663.13. 6-plex 11-plex
  • 20. Cost per sample: Microarray | Illumina HiSeq. 1 per chip/lane: $670 | $4,010.00. 2-plex: NA | $2,097.50. 4-plex: NA | $1,141.25. 6-plex: NA | $822.50. 8-plex: NA | $663.13.
  • 21.
  • 22. Columns: Data per sample | Time to download 1 sample | Time to download 100 samples | Cost to store on the cloud per month. RNASeq: 30-65 GB | 1 hr | 6 days | $270. Microarray: 30 MB | 5 seconds | 8 minutes | $0.30. http://www.ncbi.nlm.nih.gov/genbank/statistics
  • 23. - Applications with a user interface for RNA-Seq analysis (e.g. Galaxy) can only handle very few samples - Knowledge of Linux servers, a scripting language, and a programming language is absolutely REQUIRED - Lack of detailed understanding of NGS technology and data leads to diverse bioinformatics tools with different characteristics Law WC, Voom!, Bioconductor (2013)
  • 24.
  • 25. The answer is Yes - Transcriptome profiles generated by microarray and RNASeq are strongly concordant - Microarray data generated over the last decades remains durable - RNASeq offers a lot more biological information than microarray, and its data is re-analyzable - NGS is getting cheaper However, the devil is in the data - NGS data is a lot more expensive to store and analyze - Specialized computing infrastructure and personnel are required to take advantage of the information from NGS data
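
The per-sample HiSeq prices on slides 19 and 20 are consistent with a simple model in which one lane cost is shared by the multiplexed samples and a fixed preparation cost is paid for every sample. The sketch below assumes that model. The $3,825 lane cost and the $185 per-sample cost are inferred from the table, not stated in the deck, and the function names are hypothetical.

```python
# Assumed model (inferred from the slide table, not stated by the author):
#   cost per sample = LANE_COST / plex + PREP_COST
LANE_COST = 3825.00        # assumption: cost of one HiSeq lane
PREP_COST = 185.00         # assumption: fixed per-sample preparation cost
MICROARRAY_COST = 670.00   # per chip, taken from the slide


def rnaseq_cost_per_sample(plex: int) -> float:
    """Per-sample RNASeq cost when `plex` samples share one lane."""
    if plex < 1:
        raise ValueError("plex must be at least 1")
    return LANE_COST / plex + PREP_COST


def break_even_plex(max_plex: int = 24):
    """Smallest plex level at which RNASeq is cheaper than a microarray.

    Returns None if no level up to max_plex is cheaper.
    """
    for plex in range(1, max_plex + 1):
        if rnaseq_cost_per_sample(plex) < MICROARRAY_COST:
            return plex
    return None


if __name__ == "__main__":
    for plex in (1, 2, 4, 6, 8):
        print(f"{plex}-plex: {rnaseq_cost_per_sample(plex):.3f} USD per sample")
    print("break-even plex:", break_even_plex())
```

Under these assumed constants the model reproduces each table entry to within a cent, and 8-plex is the first level at which RNASeq drops below the $670 microarray price.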

Editor's Notes

  1. The basic concept behind the use of GeneChip arrays for gene expression is simple: labeled cDNA or cRNA targets derived from the mRNA of an experimental sample are hybridized to nucleic acid probes attached to a solid support. By monitoring the amount of dye label associated with each DNA location, it is possible to infer the abundance of each mRNA species represented. For transcriptome profiling, the input is usually about 1 µg of total RNA, which is poly-A selected to ensure that only mature mRNA is assayed.
  2. Poly(A)+ mRNA is purified, fragmented, and then converted to a cDNA library with 5′ and 3′ adapter sequences. Short sequence reads are generated from the cDNA library. Normally, reads are mapped to previously annotated known transcripts, and the pile of unmapped reads is kept. That pile includes reads that map to novel expressed sequences, including alternative exons and corresponding splice junction sequences.
  3. Two RNA sample types, MAQC brain and Universal Human Reference RNA, were processed using 5 technical replicates on both microarray and RNA-Seq. Once the data was generated, the microarray data was processed using MAQC. For RNA-Seq, the sample cDNA libraries were prepared with the Illumina protocol and sequenced to a depth of ~30 million mapped reads.
  4. This is the scatter plot of technical replicates of the samples analyzed by RNA-Seq and microarray. The false positive rates are comparable between the two methods, and both methods have extremely high correlation between replicates (R > 0.99). The plots demonstrate that RNA-Seq identifies more genes and spans a wider dynamic range compared to the microarray.
  5. Scatterplot of fold change per gene as measured by RNASeq and microarray. Genes identified as differentially expressed by both platforms are plotted in red, genes identified only by RNASeq in blue, only by microarray in yellow, and by neither in green. While the correlation between the two platforms in identifying differentially expressed genes is high, this figure clearly shows a discrepancy between the platforms in the ability to identify genes as differentially expressed. The gene subset segmentation reveals that RNA-Seq counts identified significantly more differentially expressed genes. However, microarray does detect gene expression differences. In further validation on a subset of 1000 genes for which PCR data is available, RNASeq data shows higher concordance with PCR results than microarray.
  6. A study by Mooney et al. uses a paired RNA sequencing (RNA-Seq)/microarray analysis of a set of four normal canine lymph nodes and ten canine lymphoma fine needle aspirates to identify technical biases and variation between the technologies, comparing the 15,092 annotated genes on the chip.
  7. Both RNA-Seq and microarray observations provide present detection calls for 15,092 genes in each of the 14 samples. The percent present detection calls provided by the two technologies agreed with high frequency (73%) and were statistically associated (Table 3; p < 10^-15, odds ratio > 40). Among genes probed by both methods, percent present detection frequencies of 69% and 44% were obtained by RNA-Seq and microarray, respectively. Among genes called present using microarray, over 97% were detected using RNA-Seq. Variation among expression profiles obtained using RNASeq is similar to that obtained using microarray after removing contributions of the first surrogate variable [42]. Each letter denotes a sample from a dog having a normal (N), B-cell (B), or T-cell (T) diagnosis as in the legend, with subscript ‘m’ run on the microarray platform and subscript ‘r’ run on the NGS platform. a) Principal component scores b) Hierarchical clustering
  8. Here, we compare these two platforms using matched samples of poly(A)-enriched RNA isolated from C. elegans at the second larval stage (L2) and the young adult (YA) stage, for all genes. Each point represents a gene from the composite model. RNA-Seq expression levels per gene were measured using RPKM, and tiling array levels were measured using the mean intensity of probes falling within composite exons. The Spearman's coefficient is 0.90, indicating that the platforms correlate well on identical samples. The disproportionate number of genes in the upper left likely represents cross-hybridization.
  9. Differential expression of genes between the L2 and YA stages. (a) Correlation of log2(YA/L2) ratios between RNA-Seq and tiling arrays. Black: not significantly differentially expressed between samples. Blue: significantly differentially expressed (q ≤ 0.01). The ratio of expression levels is well-correlated, but RNA-Seq has a larger dynamic range. (b) Venn diagram of genes called differentially expressed by each platform. There is significant overlap (8,976) between the two platforms, but more genes were called differentially expressed by RNA-Seq (14,201) than by tiling arrays (10,283), likely reflecting its greater dynamic range. A total of 4,326 genes were not called differentially expressed by either technology.
  10. ROC curve analysis. Black: tiling array. Red: RNA-Seq with all 32 million reads. RNA-Seq substantially outperforms the tiling array, with consistently higher sensitivity at lower FPR. The remaining curves are for RNA-Seq with only a subset of reads utilized. At an FPR of 0.05, just 4 million reads (blue) are required to attain the same sensitivity as two tiling array replicates.
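
Note 10 compares platforms by their sensitivity at a fixed false positive rate. As a minimal sketch of how such a value can be read off ranked per-gene calls, assume every gene has a score (higher means more likely differentially expressed) and a known true or false label. The function name and the toy data are hypothetical, and this is not the procedure used by Agarwal et al.

```python
def sensitivity_at_fpr(scores, labels, max_fpr=0.05):
    """Highest sensitivity (TPR) reachable while FPR stays <= max_fpr.

    scores: per-gene scores, higher = more likely differentially expressed.
    labels: 1 for truly differentially expressed genes, 0 otherwise.
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("need at least one positive and one negative gene")

    # Sweep the decision threshold from the highest score downwards.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    tp = fp = 0
    best = 0.0
    i = 0
    while i < len(ranked):
        # Genes with tied scores cross the threshold together.
        j = i
        while j < len(ranked) and ranked[j][0] == ranked[i][0]:
            if ranked[j][1]:
                tp += 1
            else:
                fp += 1
            j += 1
        if fp / negatives > max_fpr:
            break  # FPR only grows from here on
        best = tp / positives
        i = j
    return best


if __name__ == "__main__":
    # Hypothetical toy data: six genes, three of them truly differential.
    toy_scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4]
    toy_labels = [1, 1, 0, 1, 0, 0]
    print(sensitivity_at_fpr(toy_scores, toy_labels, max_fpr=0.05))
```

Running the same function on the calls from each platform, or from each read-depth subsample, gives the single-number comparison described in the note.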