SlideShare a Scribd company logo
1 of 1
Download to read offline
Introduction Experimental Design
Gene expression profiling is being widely applied in cancer
research to identify biomarkers for clinical endpoint prediction
such as diagnosis, prognosis, or prediction of therapeutic
response. The advent of RNA deep sequencing (RNA-Seq)
technologies has revolutionized eukaryotic transcriptome analysis
by providing the ability to decipher global gene expression
patterns beyond the limitation of microarrays, including the ability
to discover novel genes and splice variants. However, data
generated by the Sequencing Quality Control (SEQC) consortium
has shown that RNA-Seq and microarray-based gene expression
models are comparable in clinical endpoint prediction
performance, even when the additional information unique to
RNA-Seq is included in the prediction models.
In order to understand and characterize the reproducibility and
validity of gene expression experiments, exogenous RNA spike-in
controls and associated ratio performance metrics have been
introduced by the External RNA Controls Consortium (ERCC). The
ERCC developed a library of 96 exogenous control transcripts from
synthetic or bacterial sequences that are spiked into RNA at
defined abundances and ratios, providing data that measures the
technical performance of gene expression profiling methods.
Scientists from the National Institute of Standards and Technology
(NIST) have created a Bioconductor software tool called
“ERCCdashboard” that produces ERCC performance metrics from
expression values such as microarray signals or RNA-Seq read
counts. Using the ERCC spike-in control transcripts and the
“ERCCdashboard” analysis tool, we compare the performance of
Agilent gene expression microarray and Agilent RNA-Seq
platforms run with the same total RNA samples, and highlight
performance differences between the two different Agilent gene
expression measurement technologies.
Using ERCC Spike-In Control Transcripts Provides Confidence in Agilent Microarray and RNA-Seq Gene Expression Data
Anne Bergstrom Lucas, Namrata Joglekar, Bilan Hsue, Bahram Arezi, Mary Napier, Paul Wolber and Scott Happe
Agilent Technologies Genomics R&D, Santa Clara, California
ERCC RNA Ratio/Abundance in Mixtures
Two mixtures (Mix 1 and Mix 2) were prepared from 92 ERCC RNA
control transcripts that were placed in different subpools (a to d) in four
defined abundance ratios of 4:1, 1:2, 1:1.5 and 1:1 between the mixtures
(Figure a). Within each subpool there are 23 controls that span a large
dynamic range of 220 or 6 orders of magnitude (Figure b). These ERCC
control transcript mixes are technology-independent and can be spiked
into the RNA at the beginning of any gene expression analysis
experiment. (Munro, S., et al. Assessing technical performance in
differential gene expression experiments with external spike-in RNA
control ratio mixtures. (2014) Nature Communications, 5:5125)
The technology-independent spike-in control
transcripts developed by the ERCC can serve as a
truth set to benchmark the accuracy of endogenous
transcript ratio measurements.
The ERCCdashboard tool enables rapid, reproducible,
and automated analysis of the ERCC spike-in control
transcripts in any differential expression experiment.
The ERCC controls demonstrate that both the Agilent
One-Color Gene Expression Microarray platform and
the Agilent RNA WT NGS platform can generate
linear gene expression data spanning 5 orders of
magnitude with the ability to detect sensitive 1.5-fold
changes at levels as low as:
36 counts with the microarray workflow
55 mapped reads with the RNA-Seq workflow
For Research Use Only. Not for use in diagnostic procedures.
This figure demonstrates the ability of each platform to detect small changes
in ERCC spike-in concentration between the UHRR and HBRR samples. Shown
are the average fluorescence intensity (microarrays) or the average counts
(RNA-Seq) versus the differential expression test P-values calculated by the
ERCCdashboard program. The heavy black dashed line indicates the false
discovery rate (FDR) cutoff of FDR = 0.01 chosen for this analysis. Colored
arrows indicate the LODR estimate (average intensity or average counts) for
each fold change that crosses the line indicating the upper boundary of the
90% confidence interval. LODR results and confidence intervals derived from
the plots are shown in the annotation tables below the plots.
ERCCdashboard derived signal-abundance plots compare the dynamic range
capabilities of the two different gene expression experimental platforms using
the same RNA samples. In these figures the points are colored by ratio subpool,
error bars represent the standard deviation of the replicate ratio measurements,
and shape represents the sample type (UHRR or HBRR). In the RNA-Seq results,
ERCC controls that did not have at least one count in three libraries for either
sample were not included in the signal-abundance plot. There are several ERCC
control transcripts that consistently give outlier signals (ERCC-00022, ERCC-00131,
ERCC-00144) on the microarray experiments. Note that the linear dynamic range
on these plots spans 220 (microarrays) or 216 (RNA-Seq).
Agilent Gene Expression Microarrays Agilent SureSelect WT RNA-Seq NGS
Using the ERCCdashboard program, true-positive and true-negative
ERCC control ratios can be used in receiver operator characteristic
(ROC) curve analysis of the rank-ordered differential expression test
P-values. ROC curves and the corresponding area under the curve
(AUC) statistics change based on the discrimination of true-positive
values and true-negative values, where ideal performance is
represented by AUC = 1.0 for all 23 controls spiked at a given ratio.
Note that the microarray experiments were able to detect all 23 ERCC
spike-in controls at each fold change tested with AUCs >0.90 whereas
the RNA-Seq experiments at a read depth of 100M reads detected 16
(or 17) of the 23 spike-in controls with AUCs of <0.90 for the smaller
fold changes (1:2 fold and 1:1.5 fold) tested.
ERCC Signal-Abundance Plot Comparisons
Estimating Limit of Detection Rates (LODR)
MA Plots Demonstrate Measurement Variability Conclusions
Magnitude of fold-change (log ratio) versus average signal/count (MA) plots
contain ERCC data points colored by expected ratio with error bars that
represent the standard deviation of the replicate ratio measurements. Filled
circles indicate ERCC ratios above the limit of detection (LODR) for the 4:1, 1:2,
and the 1:1.5 ratios. Endogenous transcript ratio measurements are shown as
gray points. The estimate of mRNA fraction differences between the UHRR and
HBRR RNA samples (rm) with weighted standard errors is provided in an inset
table and is used to adjust the nominal ERCC ratios. The nominal ratios are
annotated with the light solid color lines for each ratio subpool and the adjusted
ratios are annotated with the heavy dashed lines.
Agilent Gene Expression Microarrays Agilent SureSelect WT RNA-Seq NGS
ROC Curves and AUC Statistics
Agilent Gene Expression Microarrays Agilent SureSelect WT RNA-Seq NGS
Agilent Gene Expression Microarrays Agilent SureSelect WT RNA-Seq NGS
Two ERCC spike-in RNA transcript mixtures (Mix 1 and Mix 2) were obtained
from Ambion (p/n 4456739) and 2 µl of a 1:10 dilution of each respective
spike-in mix was added to 10 µg of either the Universal Human Reference RNA
(UHRR, Agilent p/n 740000) or to the Human Brain Reference RNA (HBRR,
Ambion p/n AM6050) as noted in the figure above.
Four (4x) technical replicate 100 ng aliquots of total RNA from each sample
were prepared for Agilent One-Color Gene Expression microarray analysis
using the Agilent Low Input Quick Amp kit (p/n 5190-2305) and run on the
SurePrint G3 Human Gene Expression v3 8x60K Microarrays (p/n G4851C).
Three (3x) technical replicate 2 µg aliquots of total RNA from each sample
(one less technical replicate as compared to the figure above) were prepared
for RNA-Seq analysis using the Agilent SureSelect Strand-Specific RNA Library
Preparation Kit kit (p/n G9691A/G9692A). The barcoded libraries for both the
UHRR and HBRR technical replicates were pooled and prepared for paired-end
multiplexed sequencing on the Illumina platform. The mean number of
mapped reads per indexed library was 151,155,130 for UHRR samples and
115,602,753 for HBRR samples.
ERCC Mix 1 ERCC Mix 2
UHRR HBRR
ERCC-00022
ERCC-00131
ERCC-00144

More Related Content

What's hot

Q pcr poster-20070314
Q pcr poster-20070314Q pcr poster-20070314
Q pcr poster-20070314Elsa von Licy
 
Q biomarkersomaticmutation
Q biomarkersomaticmutationQ biomarkersomaticmutation
Q biomarkersomaticmutationElsa von Licy
 
Verification of an Ion AmpliSeq™ RNA Fusion Lung Cancer Research Panel, workf...
Verification of an Ion AmpliSeq™ RNA Fusion Lung Cancer Research Panel, workf...Verification of an Ion AmpliSeq™ RNA Fusion Lung Cancer Research Panel, workf...
Verification of an Ion AmpliSeq™ RNA Fusion Lung Cancer Research Panel, workf...Thermo Fisher Scientific
 

What's hot (9)

Aacr2009 cignal
Aacr2009 cignalAacr2009 cignal
Aacr2009 cignal
 
Pcrarraywhitepaper
PcrarraywhitepaperPcrarraywhitepaper
Pcrarraywhitepaper
 
Q pcr poster-20070314
Q pcr poster-20070314Q pcr poster-20070314
Q pcr poster-20070314
 
20140711 2 j_willey_ercc2.0_workshop
20140711 2 j_willey_ercc2.0_workshop20140711 2 j_willey_ercc2.0_workshop
20140711 2 j_willey_ercc2.0_workshop
 
Q biomarkersomaticmutation
Q biomarkersomaticmutationQ biomarkersomaticmutation
Q biomarkersomaticmutation
 
Verification of an Ion AmpliSeq™ RNA Fusion Lung Cancer Research Panel, workf...
Verification of an Ion AmpliSeq™ RNA Fusion Lung Cancer Research Panel, workf...Verification of an Ion AmpliSeq™ RNA Fusion Lung Cancer Research Panel, workf...
Verification of an Ion AmpliSeq™ RNA Fusion Lung Cancer Research Panel, workf...
 
Aacr poster2007
Aacr poster2007Aacr poster2007
Aacr poster2007
 
Abrf poster2007
Abrf poster2007Abrf poster2007
Abrf poster2007
 
Pcrhandbook ver4
Pcrhandbook ver4Pcrhandbook ver4
Pcrhandbook ver4
 

Viewers also liked

The importance of controls and novel solutions for successful real-time qPCR
The importance of controls and novel solutions for successful real-time qPCRThe importance of controls and novel solutions for successful real-time qPCR
The importance of controls and novel solutions for successful real-time qPCRQIAGEN
 
Genome in a Bottle Consortium Workshop Welcome Aug. 16
Genome in a Bottle Consortium Workshop Welcome Aug. 16Genome in a Bottle Consortium Workshop Welcome Aug. 16
Genome in a Bottle Consortium Workshop Welcome Aug. 16GenomeInABottle
 
Quality Control of Sequencing Data
Quality Control of Sequencing DataQuality Control of Sequencing Data
Quality Control of Sequencing DataSurya Saha
 
Aug2013 reference material selection and design working group
Aug2013 reference material selection and design working groupAug2013 reference material selection and design working group
Aug2013 reference material selection and design working groupGenomeInABottle
 
Sept2016 plenary mercer_sequins
Sept2016 plenary mercer_sequinsSept2016 plenary mercer_sequins
Sept2016 plenary mercer_sequinsGenomeInABottle
 
Next-generation sequencing and quality control: An Introduction (2016)
Next-generation sequencing and quality control: An Introduction (2016)Next-generation sequencing and quality control: An Introduction (2016)
Next-generation sequencing and quality control: An Introduction (2016)Sebastian Schmeier
 
RNA Integrity and Quality – Standardize RNA Quality Control
RNA Integrity and Quality – Standardize RNA Quality Control RNA Integrity and Quality – Standardize RNA Quality Control
RNA Integrity and Quality – Standardize RNA Quality Control QIAGEN
 
GIAB-GRC workshop oct2015 giab introduction 151005
GIAB-GRC workshop oct2015 giab introduction 151005GIAB-GRC workshop oct2015 giab introduction 151005
GIAB-GRC workshop oct2015 giab introduction 151005GenomeInABottle
 
RNA-seq quality control and pre-processing
RNA-seq quality control and pre-processingRNA-seq quality control and pre-processing
RNA-seq quality control and pre-processingmikaelhuss
 
Circulating Cell Free DNA Targeted Sequencing Workflow | ESHG 2015 Poster PM1...
Circulating Cell Free DNA Targeted Sequencing Workflow | ESHG 2015 Poster PM1...Circulating Cell Free DNA Targeted Sequencing Workflow | ESHG 2015 Poster PM1...
Circulating Cell Free DNA Targeted Sequencing Workflow | ESHG 2015 Poster PM1...Thermo Fisher Scientific
 
An introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAn introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAGRF_Ltd
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...VHIR Vall d’Hebron Institut de Recerca
 

Viewers also liked (19)

The importance of controls and novel solutions for successful real-time qPCR
The importance of controls and novel solutions for successful real-time qPCRThe importance of controls and novel solutions for successful real-time qPCR
The importance of controls and novel solutions for successful real-time qPCR
 
Genome in a Bottle Consortium Workshop Welcome Aug. 16
Genome in a Bottle Consortium Workshop Welcome Aug. 16Genome in a Bottle Consortium Workshop Welcome Aug. 16
Genome in a Bottle Consortium Workshop Welcome Aug. 16
 
Quality Control of Sequencing Data
Quality Control of Sequencing DataQuality Control of Sequencing Data
Quality Control of Sequencing Data
 
Aug2013 reference material selection and design working group
Aug2013 reference material selection and design working groupAug2013 reference material selection and design working group
Aug2013 reference material selection and design working group
 
Sept2016 plenary mercer_sequins
Sept2016 plenary mercer_sequinsSept2016 plenary mercer_sequins
Sept2016 plenary mercer_sequins
 
Next-generation sequencing and quality control: An Introduction (2016)
Next-generation sequencing and quality control: An Introduction (2016)Next-generation sequencing and quality control: An Introduction (2016)
Next-generation sequencing and quality control: An Introduction (2016)
 
RNA Integrity and Quality – Standardize RNA Quality Control
RNA Integrity and Quality – Standardize RNA Quality Control RNA Integrity and Quality – Standardize RNA Quality Control
RNA Integrity and Quality – Standardize RNA Quality Control
 
20140710 3 l_paul_ercc2.0_workshop
20140710 3 l_paul_ercc2.0_workshop20140710 3 l_paul_ercc2.0_workshop
20140710 3 l_paul_ercc2.0_workshop
 
20140710 1 day1_nist_ercc2.0workshop
20140710 1 day1_nist_ercc2.0workshop20140710 1 day1_nist_ercc2.0workshop
20140710 1 day1_nist_ercc2.0workshop
 
20140710 5 k_thompson_ercc2.0_workshop
20140710 5 k_thompson_ercc2.0_workshop20140710 5 k_thompson_ercc2.0_workshop
20140710 5 k_thompson_ercc2.0_workshop
 
GIAB-GRC workshop oct2015 giab introduction 151005
GIAB-GRC workshop oct2015 giab introduction 151005GIAB-GRC workshop oct2015 giab introduction 151005
GIAB-GRC workshop oct2015 giab introduction 151005
 
20140710 4 a_bergstrom_lucas_ercc2.0_workshop
20140710 4 a_bergstrom_lucas_ercc2.0_workshop20140710 4 a_bergstrom_lucas_ercc2.0_workshop
20140710 4 a_bergstrom_lucas_ercc2.0_workshop
 
Algamantas juozapavcius
Algamantas juozapavciusAlgamantas juozapavcius
Algamantas juozapavcius
 
ICP QC protocol
ICP  QC  protocolICP  QC  protocol
ICP QC protocol
 
RNA-seq quality control and pre-processing
RNA-seq quality control and pre-processingRNA-seq quality control and pre-processing
RNA-seq quality control and pre-processing
 
Circulating Cell Free DNA Targeted Sequencing Workflow | ESHG 2015 Poster PM1...
Circulating Cell Free DNA Targeted Sequencing Workflow | ESHG 2015 Poster PM1...Circulating Cell Free DNA Targeted Sequencing Workflow | ESHG 2015 Poster PM1...
Circulating Cell Free DNA Targeted Sequencing Workflow | ESHG 2015 Poster PM1...
 
An introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAn introduction to RNA-seq data analysis
An introduction to RNA-seq data analysis
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
 
Introduction to next generation sequencing
Introduction to next generation sequencingIntroduction to next generation sequencing
Introduction to next generation sequencing
 

Similar to NIEHS2016_ERCC_Agilent_RUO_Final

Ascb 2007-pcr array-poster
Ascb 2007-pcr array-posterAscb 2007-pcr array-poster
Ascb 2007-pcr array-posterElsa von Licy
 
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-Seq
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-SeqNUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-Seq
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-SeqHimanshu Sethi
 
Whole Transcriptome Analysis of Testicular Germ Cell Tumors
Whole Transcriptome Analysis of Testicular Germ Cell TumorsWhole Transcriptome Analysis of Testicular Germ Cell Tumors
Whole Transcriptome Analysis of Testicular Germ Cell TumorsThermo Fisher Scientific
 
Rapid and accurate Cancer somatic mutation profiling with the qBiomarker Soma...
Rapid and accurate Cancer somatic mutation profiling with the qBiomarker Soma...Rapid and accurate Cancer somatic mutation profiling with the qBiomarker Soma...
Rapid and accurate Cancer somatic mutation profiling with the qBiomarker Soma...QIAGEN
 
6-8-2015 AACC Poster HIV p24 S-PLEX - Stengelin_final
6-8-2015 AACC Poster HIV p24 S-PLEX - Stengelin_final6-8-2015 AACC Poster HIV p24 S-PLEX - Stengelin_final
6-8-2015 AACC Poster HIV p24 S-PLEX - Stengelin_finalLawrence Hwang
 
Multicopy reference assay (MRef) — a superior normalizer of sample input in D...
Multicopy reference assay (MRef) — a superior normalizer of sample input in D...Multicopy reference assay (MRef) — a superior normalizer of sample input in D...
Multicopy reference assay (MRef) — a superior normalizer of sample input in D...QIAGEN
 
Purification of total RNA from peripheral blood mononuclear cells - Download ...
Purification of total RNA from peripheral blood mononuclear cells - Download ...Purification of total RNA from peripheral blood mononuclear cells - Download ...
Purification of total RNA from peripheral blood mononuclear cells - Download ...QIAGEN
 
Technical Guide to Qiagen PCR Arrays - Download the Guide
Technical Guide to Qiagen PCR Arrays - Download the GuideTechnical Guide to Qiagen PCR Arrays - Download the Guide
Technical Guide to Qiagen PCR Arrays - Download the GuideQIAGEN
 
Global Gene Expression Profiles from Bladder Tumor FFPE Samples
Global Gene Expression Profiles from Bladder Tumor FFPE SamplesGlobal Gene Expression Profiles from Bladder Tumor FFPE Samples
Global Gene Expression Profiles from Bladder Tumor FFPE SamplesThermo Fisher Scientific
 
Hotspot mutation and fusion transcript detection from the same non-small cell...
Hotspot mutation and fusion transcript detection from the same non-small cell...Hotspot mutation and fusion transcript detection from the same non-small cell...
Hotspot mutation and fusion transcript detection from the same non-small cell...Thermo Fisher Scientific
 
RT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferationRT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferationIJAEMSJORNAL
 
Ashg poster sp_compressed
Ashg poster sp_compressedAshg poster sp_compressed
Ashg poster sp_compressedAmy Cullinan
 
Overview of the glossary related to pcr
Overview of the glossary related to pcrOverview of the glossary related to pcr
Overview of the glossary related to pcrMohammadAtif41
 
R Gene Expression and Transcription Profiling
R Gene Expression and Transcription ProfilingR Gene Expression and Transcription Profiling
R Gene Expression and Transcription ProfilingShraddha Karcho
 
RT2 Profiler PCR Arrays: Pathway-focused Gene Expression Profiling with qRT-P...
RT2 Profiler PCR Arrays: Pathway-focused Gene Expression Profiling with qRT-P...RT2 Profiler PCR Arrays: Pathway-focused Gene Expression Profiling with qRT-P...
RT2 Profiler PCR Arrays: Pathway-focused Gene Expression Profiling with qRT-P...QIAGEN
 

Similar to NIEHS2016_ERCC_Agilent_RUO_Final (20)

Ascb 2007-pcr array-poster
Ascb 2007-pcr array-posterAscb 2007-pcr array-poster
Ascb 2007-pcr array-poster
 
Chigot poster2007
Chigot poster2007Chigot poster2007
Chigot poster2007
 
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-Seq
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-SeqNUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-Seq
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-Seq
 
Whole Transcriptome Analysis of Testicular Germ Cell Tumors
Whole Transcriptome Analysis of Testicular Germ Cell TumorsWhole Transcriptome Analysis of Testicular Germ Cell Tumors
Whole Transcriptome Analysis of Testicular Germ Cell Tumors
 
Rapid and accurate Cancer somatic mutation profiling with the qBiomarker Soma...
Rapid and accurate Cancer somatic mutation profiling with the qBiomarker Soma...Rapid and accurate Cancer somatic mutation profiling with the qBiomarker Soma...
Rapid and accurate Cancer somatic mutation profiling with the qBiomarker Soma...
 
6-8-2015 AACC Poster HIV p24 S-PLEX - Stengelin_final
6-8-2015 AACC Poster HIV p24 S-PLEX - Stengelin_final6-8-2015 AACC Poster HIV p24 S-PLEX - Stengelin_final
6-8-2015 AACC Poster HIV p24 S-PLEX - Stengelin_final
 
Multicopy reference assay (MRef) — a superior normalizer of sample input in D...
Multicopy reference assay (MRef) — a superior normalizer of sample input in D...Multicopy reference assay (MRef) — a superior normalizer of sample input in D...
Multicopy reference assay (MRef) — a superior normalizer of sample input in D...
 
Validaternai
ValidaternaiValidaternai
Validaternai
 
Purification of total RNA from peripheral blood mononuclear cells - Download ...
Purification of total RNA from peripheral blood mononuclear cells - Download ...Purification of total RNA from peripheral blood mononuclear cells - Download ...
Purification of total RNA from peripheral blood mononuclear cells - Download ...
 
Technical Guide to Qiagen PCR Arrays - Download the Guide
Technical Guide to Qiagen PCR Arrays - Download the GuideTechnical Guide to Qiagen PCR Arrays - Download the Guide
Technical Guide to Qiagen PCR Arrays - Download the Guide
 
Prabhakar singh ii sem-paper v-bioanalyzer
Prabhakar singh  ii sem-paper v-bioanalyzerPrabhakar singh  ii sem-paper v-bioanalyzer
Prabhakar singh ii sem-paper v-bioanalyzer
 
KO poster 8:13
KO poster 8:13KO poster 8:13
KO poster 8:13
 
Global Gene Expression Profiles from Bladder Tumor FFPE Samples
Global Gene Expression Profiles from Bladder Tumor FFPE SamplesGlobal Gene Expression Profiles from Bladder Tumor FFPE Samples
Global Gene Expression Profiles from Bladder Tumor FFPE Samples
 
Hotspot mutation and fusion transcript detection from the same non-small cell...
Hotspot mutation and fusion transcript detection from the same non-small cell...Hotspot mutation and fusion transcript detection from the same non-small cell...
Hotspot mutation and fusion transcript detection from the same non-small cell...
 
RT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferationRT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferation
 
Ashg poster sp_compressed
Ashg poster sp_compressedAshg poster sp_compressed
Ashg poster sp_compressed
 
Overview of the glossary related to pcr
Overview of the glossary related to pcrOverview of the glossary related to pcr
Overview of the glossary related to pcr
 
R Gene Expression and Transcription Profiling
R Gene Expression and Transcription ProfilingR Gene Expression and Transcription Profiling
R Gene Expression and Transcription Profiling
 
RT2 Profiler PCR Arrays: Pathway-focused Gene Expression Profiling with qRT-P...
RT2 Profiler PCR Arrays: Pathway-focused Gene Expression Profiling with qRT-P...RT2 Profiler PCR Arrays: Pathway-focused Gene Expression Profiling with qRT-P...
RT2 Profiler PCR Arrays: Pathway-focused Gene Expression Profiling with qRT-P...
 
Real Time PCR
Real Time PCRReal Time PCR
Real Time PCR
 

NIEHS2016_ERCC_Agilent_RUO_Final

  • 1. Introduction Experimental Design Gene expression profiling is being widely applied in cancer research to identify biomarkers for clinical endpoint prediction such as diagnosis, prognosis, or prediction of therapeutic response. The advent of RNA deep sequencing (RNA-Seq) technologies has revolutionized eukaryotic transcriptome analysis by providing the ability to decipher global gene expression patterns beyond the limitation of microarrays, including the ability to discover novel genes and splice variants. However, data generated by the Sequencing Quality Control (SEQC) consortium has shown that RNA-Seq and microarray-based gene expression models are comparable in clinical endpoint prediction performance, even when the additional information unique to RNA-Seq is included in the prediction models. In order to understand and characterize the reproducibility and validity of gene expression experiments, exogenous RNA spike-in controls and associated ratio performance metrics have been introduced by the External RNA Controls Consortium (ERCC). The ERCC developed a library of 96 exogenous control transcripts from synthetic or bacterial sequences that are spiked into RNA at defined abundances and ratios, providing data that measures the technical performance of gene expression profiling methods. Scientists from the National Institute of Standards and Technology (NIST) have created a Bioconductor software tool called “ERCCdashboard” that produces ERCC performance metrics from expression values such as microarray signals or RNA-Seq read counts. Using the ERCC spike-in control transcripts and the “ERCCdashboard” analysis tool, we compare the performance of Agilent gene expression microarray and Agilent RNA-Seq platforms run with the same total RNA samples, and highlight performance differences between the two different Agilent gene expression measurement technologies. Using ERCC Spike-In Control Transcripts Provides Confidence in Agilent Microarray and RNA-Seq Gene Expression Data Anne Bergstrom Lucas, Namrata Joglekar, Bilan Hsue, Bahram Arezi, Mary Napier, Paul Wolber and Scott Happe Agilent Technologies Genomics R&D, Santa Clara, California ERCC RNA Ratio/Abundance in Mixtures Two mixtures (Mix 1 and Mix 2) were prepared from 92 ERCC RNA control transcripts that were placed in different subpools (a to d) in four defined abundance ratios of 4:1, 1:2, 1:1.5 and 1:1 between the mixtures (Figure a). Within each subpool there are 23 controls that span a large dynamic range of 220 or 6 orders of magnitude (Figure b). These ERCC control transcript mixes are technology-independent and can be spiked into the RNA at the beginning of any gene expression analysis experiment. (Munro, S., et al. Assessing technical performance in differential gene expression experiments with external spike-in RNA control ratio mixtures. (2014) Nature Communications, 5:5125) The technology-independent spike-in control transcripts developed by the ERCC can serve as a truth set to benchmark the accuracy of endogenous transcript ratio measurements. The ERCCdashboard tool enables rapid, reproducible, and automated analysis of the ERCC spike-in control transcripts in any differential expression experiment. The ERCC controls demonstrate that both the Agilent One-Color Gene Expression Microarray platform and the Agilent RNA WT NGS platform can generate linear gene expression data spanning 5 orders of magnitude with the ability to detect sensitive 1.5-fold changes at levels as low as: 36 counts with the microarray workflow 55 mapped reads with the RNA-Seq workflow For Research Use Only. Not for use in diagnostic procedures. This figure demonstrates the ability of each platform to detect small changes in ERCC spike-in concentration between the UHRR and HBRR samples. Shown are the average fluorescence intensity (microarrays) or the average counts (RNA-Seq) versus the differential expression test P-values calculated by the ERCCdashboard program. The heavy black dashed line indicates the false discovery rate (FDR) cutoff of FDR = 0.01 chosen for this analysis. Colored arrows indicate the LODR estimate (average intensity or average counts) for each fold change that crosses the line indicating the upper boundary of the 90% confidence interval. LODR results and confidence intervals derived from the plots are shown in the annotation tables below the plots. ERCCdashboard derived signal-abundance plots compare the dynamic range capabilities of the two different gene expression experimental platforms using the same RNA samples. In these figures the points are colored by ratio subpool, error bars represent the standard deviation of the replicate ratio measurements, and shape represents the sample type (UHRR or HBRR). In the RNA-Seq results, ERCC controls that did not have at least one count in three libraries for either sample were not included in the signal-abundance plot. There are several ERCC control transcripts that consistently give outlier signals (ERCC-00022, ERCC-00131, ERCC-00144) on the microarray experiments. Note that the linear dynamic range on these plots spans 220 (microarrays) or 216 (RNA-Seq). Agilent Gene Expression Microarrays Agilent SureSelect WT RNA-Seq NGS Using the ERCCdashboard program, true-positive and true-negative ERCC control ratios can be used in receiver operator characteristic (ROC) curve analysis of the rank-ordered differential expression test P-values. ROC curves and the corresponding area under the curve (AUC) statistics change based on the discrimination of true-positive values and true-negative values, where ideal performance is represented by AUC = 1.0 for all 23 controls spiked at a given ratio. Note that the microarray experiments were able to detect all 23 ERCC spike-in controls at each fold change tested with AUCs >0.90 whereas the RNA-Seq experiments at a read depth of 100M reads detected 16 (or 17) of the 23 spike-in controls with AUCs of <0.90 for the smaller fold changes (1:2 fold and 1:1.5 fold) tested. ERCC Signal-Abundance Plot Comparisons Estimating Limit of Detection Rates (LODR) MA Plots Demonstrate Measurement Variability Conclusions Magnitude of fold-change (log ratio) versus average signal/count (MA) plots contain ERCC data points colored by expected ratio with error bars that represent the standard deviation of the replicate ratio measurements. Filled circles indicate ERCC ratios above the limit of detection (LODR) for the 4:1, 1:2, and the 1:1.5 ratios. Endogenous transcript ratio measurements are shown as gray points. The estimate of mRNA fraction differences between the UHRR and HBRR RNA samples (rm) with weighted standard errors is provided in an inset table and is used to adjust the nominal ERCC ratios. The nominal ratios are annotated with the light solid color lines for each ratio subpool and the adjusted ratios are annotated with the heavy dashed lines. Agilent Gene Expression Microarrays Agilent SureSelect WT RNA-Seq NGS ROC Curves and AUC Statistics Agilent Gene Expression Microarrays Agilent SureSelect WT RNA-Seq NGS Agilent Gene Expression Microarrays Agilent SureSelect WT RNA-Seq NGS Two ERCC spike-in RNA transcript mixtures (Mix 1 and Mix 2) were obtained from Ambion (p/n 4456739) and 2 µl of a 1:10 dilution of each respective spike-in mix was added to 10 µg of either the Universal Human Reference RNA (UHRR, Agilent p/n 740000) or to the Human Brain Reference RNA (HBRR, Ambion p/n AM6050) as noted in the figure above. Four (4x) technical replicate 100 ng aliquots of total RNA from each sample were prepared for Agilent One-Color Gene Expression microarray analysis using the Agilent Low Input Quick Amp kit (p/n 5190-2305) and run on the SurePrint G3 Human Gene Expression v3 8x60K Microarrays (p/n G4851C). Three (3x) technical replicate 2 µg aliquots of total RNA from each sample (one less technical replicate as compared to the figure above) were prepared for RNA-Seq analysis using the Agilent SureSelect Strand-Specific RNA Library Preparation Kit kit (p/n G9691A/G9692A). The barcoded libraries for both the UHRR and HBRR technical replicates were pooled and prepared for paired-end multiplexed sequencing on the Illumina platform. The mean number of mapped reads per indexed library was 151,155,130 for UHRR samples and 115,602,753 for HBRR samples. ERCC Mix 1 ERCC Mix 2 UHRR HBRR ERCC-00022 ERCC-00131 ERCC-00144