SlideShare a Scribd company logo
Intensity Screening.
We eliminated all probe sets from the analysis that
produced low-intensity signals. For this study, we
consider active probe sets only - probe sets that
detected a target mRNA in at least one of the
experimental conditions.
Statistical Methodology
We used mixed effect ANOVA model to identify genes
in which the redundant probe sets produced
discordant differential expression readings across
strains. For each gene, we fitted a mixed effects
linear model:
yijk = μ + Si + Pk + Si*Pk + Mj(i) + εjk
where yijk is a measure of gene expression produced
by an individual probe set; μ is the grand mean of all
probe set readings for the gene being considered;
Genome-based analysis of expression microarray data
Ann Loraine (1), Xiangqin Cui (1) and Gary Churchill (2)
(1) Section on Statistical Genetics, Dept. of Biostatistics, University of Alabama at Birmingham; (2) The Jackson Laboratory, Bar Harbor, ME
Abstract Methods Methods
Conclusion
We developed a genome-based analysis method
that uses data from expression microarray
experiments (Affymetrix platform) to investigate
alternative mRNA processing. This approach
revealed numerous examples of genes whose
expression patterns suggest strain-dependent
differential mRNA processing in mouse.
Introduction
References
Current Affymetrix expression microarray designs
typically include numerous redundant probe sets
that interrogate different regions of the same gene.
This is a side effect of a design process which
creates probe sets matching alternative splicing or
other types of mRNA variants. Thus, when
redundant probe sets produce discordant results,
such as differential expression in opposite
directions, the most logical explanation is that the
condition under investigation affects mRNA
processing pathways, such as alternative splicing
and alternative polyadenylation site choice.
Both types of alternative mRNA processing can
have profound impacts on gene function when
important functional motifs, such as conserved
protein motifs or mRNA stability determinants, are
affected. Discordant behavior of redundant probe
sets therefore may represent an opportunity to
study how alternative mRNA processing pathways
operate. Here we investigate this idea using
expression data collected from two mouse strains.
Data collection and pre-processing.
Labeled mRNA samples were prepared from the
livers of six mice, three of strain AJ and three of
strain B6. Labeled samples were hybridized to two
Affymetrix 430_2 microarrays per mouse. The
array data were processed using RMA and
normalized using quantile-quantile normalization.
Redundant probe sets.
Affymetrix provides annotation data files that map
probe set ids to Entrez Gene ids as well as ‘psl’
files that map probe set design sequences onto the
May 2004 mouse genome. We used these files to
create and then screen a provisional list of
redundant probe sets for use in our study.
Methods
Probe sets
per Entrez
Gene ID
# Entrez Gene
IDS
1 11,511
2 4,703
3 2,309
4 1,062
5 454
6 257
7 109
>7 87
TOTAL 20,492
The 430_2 mouse
array contains
37,449 probe set ids
that map onto 20,492
Entrez Gene ids.
Thus, according to
the Affymetrix
annotations, over
9,000 Entrez Gene
ids are associated
with two or more
probe sets.
Screening step # Probe Sets
One genomic location, one Entrez Gene id 34,196
Inconsistent with Refseq genomic alignment -2036
Reliable Refseq alignment not available -7,895
TOTAL remaining 24,265
These annotation files contain a number of
inconsistencies, such as probe sets mapping to an
unexpected chromosome or strand. To remove
these, we compared genomic alignments of the
probe set design sequences with alignments for
Refseq mRNAs associated with the annotated Entrez
Gene ids. Probe sets whose design sequence
alignments were inconsistent with a reliable RefSeq
alignment were eliminated.
Results
The analysis identified 1,400 genes with adjusted p-
values less than or equal to 0.05. We examined a
number of these using the Integrated Genome
Browser - see below [1]. In some cases, alternative
mRNA processing alters the coding region and
modifies conserved functional motifs, a common
occurence among alternatively spliced genes [2,3]. probe set AJ B6 orientation
1424176_a_at 8.76 8.43 -0.33 proximal
1421223_a_at 9.01 9.62 +0.61 distal
annexin A4
probe set average intensity readings
design sequence alignments
probes
Results
Si, Mj(i) and εjk are the strain, mouse and residual
effects; Pk is the probe set effect; and Si*Pk is the
interaction between strain and probe set.
In this setting, different probe sets can provide
expression measurements for a single gene, and
these measurements may vary across different mouse
strains (fixed effects) as well as across individual mice
(random effects). However, some probe sets -
because they interrogate different regions of the same
gene - may respond differently in the different strains.
The interaction term Si*Pk captures any differential
expression arising from the combined effect of probe
set and strain. Our null hypothesis was that this strain
by probe set interaction term is zero. We tested this
using an F test for mixed effect ANOVA models and
used an adaptive FDR method to adjust p-values for a
desired FDR of 0.05. Note that this test does not
identify the individual probe sets which are discordant
but instead identifies genes in which not all probe sets
respond to the strain difference in the same way.
[1] The Integrated Genome Browser is open source software available from
www.genoviz.org.
[2] Loraine, et al. (2002) Protein-based analysis of alternative splicing in the
human genome. IEEE Comput So Bioinform Conf. pp. 321-326
[3] Loraine, et al. (2003) Exploring alternative transcript structure in the human
genome using BLOCKS and InterPro. J Bioinform Comput Biol. pp. 289-306.
Based on these preliminary results, we conclude that
analyzing the relative expression of redundant probe
sets from Affymetrix expression microarray designs
has the potential to reveal how diverse experimental
conditions affect differential mRNA processing and
degradation pathways.
probe set AJ B6 orientation
1450502_at 8.05 7.77 -0.28 proximal
1433770_at 9.32 9.58 +0.26 distal
Dpysl2 dihydropyrimidinase-like 2
probe set average intensity readings
design sequence alignments
probes
Nipsnap1 4-nitrophenylphosphatase
domain and non-neuronal SNAP25-like
probe set AJ B6 orientation
1448465_at 10.4 10.6 +0.2 proximal
1450184_s_at 11.8 11.4 -0.4 distal
probe set average intensity readings
design sequence alignments
probes
UTR
coding region

More Related Content

What's hot

Making Protein Function and Subcellular Localization Predictions: Challenges ...
Making Protein Function and Subcellular Localization Predictions: Challenges ...Making Protein Function and Subcellular Localization Predictions: Challenges ...
Making Protein Function and Subcellular Localization Predictions: Challenges ...
fionabrinkman
 
Inferring gene functions and regulatory interactions in plants using differen...
Inferring gene functions and regulatory interactions in plants using differen...Inferring gene functions and regulatory interactions in plants using differen...
Inferring gene functions and regulatory interactions in plants using differen...
Klaas Vandepoele
 
Yasset iso point-cigb-2012
Yasset iso point-cigb-2012Yasset iso point-cigb-2012
Yasset iso point-cigb-2012
Yasset Perez-Riverol
 
Gene Array Analyzer
Gene Array AnalyzerGene Array Analyzer
Gene Array Analyzer
university of education,Lahore
 
Bacteriology exam (1) examination
Bacteriology exam (1) examination Bacteriology exam (1) examination
Bacteriology exam (1) examination
Ahmed Zaahir
 
Kishor Presentation
Kishor PresentationKishor Presentation
Kishor Presentation
Kishor Tappita
 
Bioinformatic jc 08_14_2013_formal
Bioinformatic jc 08_14_2013_formalBioinformatic jc 08_14_2013_formal
Bioinformatic jc 08_14_2013_formal
Jennifer Shelton
 
QSAR Studies of the Inhibitory Activity of a Series of Substituted Indole and...
QSAR Studies of the Inhibitory Activity of a Series of Substituted Indole and...QSAR Studies of the Inhibitory Activity of a Series of Substituted Indole and...
QSAR Studies of the Inhibitory Activity of a Series of Substituted Indole and...
inventionjournals
 
Determining stable ligand orientation
Determining stable ligand orientationDetermining stable ligand orientation
Determining stable ligand orientation
ijaia
 
Computational approach of the homeostatic turnover of memory B cells.
Computational approach of the homeostatic turnover of memory B cells.Computational approach of the homeostatic turnover of memory B cells.
Computational approach of the homeostatic turnover of memory B cells.
IJERA Editor
 
Advanced Systems Biology Methods in Drug Discovery
Advanced Systems Biology Methods in Drug DiscoveryAdvanced Systems Biology Methods in Drug Discovery
Advanced Systems Biology Methods in Drug Discovery
Mikel Txopitea Elorriaga
 
BITS: Basics of sequence analysis
BITS: Basics of sequence analysisBITS: Basics of sequence analysis
BITS: Basics of sequence analysis
BITS
 
Proteomics 2 d gel, mass spectrometry, maldi tof
Proteomics 2 d gel, mass spectrometry, maldi tofProteomics 2 d gel, mass spectrometry, maldi tof
Proteomics 2 d gel, mass spectrometry, maldi tof
nirvarna gr
 
Genome annotation 2013
Genome annotation 2013Genome annotation 2013
Genome annotation 2013
Karan Veer Singh
 
Tyler functional annotation thurs 1120
Tyler functional annotation thurs 1120Tyler functional annotation thurs 1120
Tyler functional annotation thurs 1120Sucheta Tripathy
 
Genome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattleGenome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattle
Laurence Dawkins-Hall
 
Applications of protein array in diagnostics and genomic and proteomic
Applications of protein array in diagnostics and genomic and proteomicApplications of protein array in diagnostics and genomic and proteomic
Applications of protein array in diagnostics and genomic and proteomic
Susan Rey
 
Mapping and association mapping
Mapping and association mappingMapping and association mapping
Mapping and association mapping
FAO
 

What's hot (20)

Making Protein Function and Subcellular Localization Predictions: Challenges ...
Making Protein Function and Subcellular Localization Predictions: Challenges ...Making Protein Function and Subcellular Localization Predictions: Challenges ...
Making Protein Function and Subcellular Localization Predictions: Challenges ...
 
Inferring gene functions and regulatory interactions in plants using differen...
Inferring gene functions and regulatory interactions in plants using differen...Inferring gene functions and regulatory interactions in plants using differen...
Inferring gene functions and regulatory interactions in plants using differen...
 
Yasset iso point-cigb-2012
Yasset iso point-cigb-2012Yasset iso point-cigb-2012
Yasset iso point-cigb-2012
 
Gene Array Analyzer
Gene Array AnalyzerGene Array Analyzer
Gene Array Analyzer
 
Bacteriology exam (1) examination
Bacteriology exam (1) examination Bacteriology exam (1) examination
Bacteriology exam (1) examination
 
Kishor Presentation
Kishor PresentationKishor Presentation
Kishor Presentation
 
Bioinformatic jc 08_14_2013_formal
Bioinformatic jc 08_14_2013_formalBioinformatic jc 08_14_2013_formal
Bioinformatic jc 08_14_2013_formal
 
MI-4
MI-4MI-4
MI-4
 
QSAR Studies of the Inhibitory Activity of a Series of Substituted Indole and...
QSAR Studies of the Inhibitory Activity of a Series of Substituted Indole and...QSAR Studies of the Inhibitory Activity of a Series of Substituted Indole and...
QSAR Studies of the Inhibitory Activity of a Series of Substituted Indole and...
 
Determining stable ligand orientation
Determining stable ligand orientationDetermining stable ligand orientation
Determining stable ligand orientation
 
Computational approach of the homeostatic turnover of memory B cells.
Computational approach of the homeostatic turnover of memory B cells.Computational approach of the homeostatic turnover of memory B cells.
Computational approach of the homeostatic turnover of memory B cells.
 
Advanced Systems Biology Methods in Drug Discovery
Advanced Systems Biology Methods in Drug DiscoveryAdvanced Systems Biology Methods in Drug Discovery
Advanced Systems Biology Methods in Drug Discovery
 
BITS: Basics of sequence analysis
BITS: Basics of sequence analysisBITS: Basics of sequence analysis
BITS: Basics of sequence analysis
 
Proteomics 2 d gel, mass spectrometry, maldi tof
Proteomics 2 d gel, mass spectrometry, maldi tofProteomics 2 d gel, mass spectrometry, maldi tof
Proteomics 2 d gel, mass spectrometry, maldi tof
 
15 arrays
15 arrays15 arrays
15 arrays
 
Genome annotation 2013
Genome annotation 2013Genome annotation 2013
Genome annotation 2013
 
Tyler functional annotation thurs 1120
Tyler functional annotation thurs 1120Tyler functional annotation thurs 1120
Tyler functional annotation thurs 1120
 
Genome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattleGenome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattle
 
Applications of protein array in diagnostics and genomic and proteomic
Applications of protein array in diagnostics and genomic and proteomicApplications of protein array in diagnostics and genomic and proteomic
Applications of protein array in diagnostics and genomic and proteomic
 
Mapping and association mapping
Mapping and association mappingMapping and association mapping
Mapping and association mapping
 

Similar to Arrays and alternative splicing

RNA Sequencing Research
RNA Sequencing ResearchRNA Sequencing Research
RNA Sequencing ResearchTanmay Ghai
 
Analysis of gene expression microarray data of patients with Spinal Muscular ...
Analysis of gene expression microarray data of patients with Spinal Muscular ...Analysis of gene expression microarray data of patients with Spinal Muscular ...
Analysis of gene expression microarray data of patients with Spinal Muscular ...
Anton Yuryev
 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data sets
improvemed
 
Allert2010-Multifactorial_determinants_of_protein_expression_in_prokaryotic_o...
Allert2010-Multifactorial_determinants_of_protein_expression_in_prokaryotic_o...Allert2010-Multifactorial_determinants_of_protein_expression_in_prokaryotic_o...
Allert2010-Multifactorial_determinants_of_protein_expression_in_prokaryotic_o...J. Colin Cox
 
Identifying the Coding and Non Coding Regions of DNA Using Spectral Analysis
Identifying the Coding and Non Coding Regions of DNA Using Spectral AnalysisIdentifying the Coding and Non Coding Regions of DNA Using Spectral Analysis
Identifying the Coding and Non Coding Regions of DNA Using Spectral Analysis
IJMER
 
RT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferationRT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferation
IJAEMSJORNAL
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGLong Pei
 
ASHG 2015 - Redundant Annotations in Tertiary Analysis
ASHG 2015 - Redundant Annotations in Tertiary AnalysisASHG 2015 - Redundant Annotations in Tertiary Analysis
ASHG 2015 - Redundant Annotations in Tertiary Analysis
James Warren
 
EiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.DEiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.D
VHIR Vall d’Hebron Institut de Recerca
 
Liu_Jiangyuan_1201662_FR
Liu_Jiangyuan_1201662_FRLiu_Jiangyuan_1201662_FR
Liu_Jiangyuan_1201662_FR姜圆 刘
 
Functional genomics, and tools
Functional genomics, and toolsFunctional genomics, and tools
Functional genomics, and tools
KAUSHAL SAHU
 
MCB 432 Final Table PP 01.06.16
MCB 432 Final Table PP 01.06.16MCB 432 Final Table PP 01.06.16
MCB 432 Final Table PP 01.06.16Keegan McAuliffe
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
IJERD Editor
 
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptxBTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
ChijiokeNsofor
 
Prediction of protein function
Prediction of protein functionPrediction of protein function
Prediction of protein function
Lars Juhl Jensen
 
Journal club Aug04 2015 GeneMarkET
Journal club Aug04 2015 GeneMarkETJournal club Aug04 2015 GeneMarkET
Journal club Aug04 2015 GeneMarkET
Hiroya Morimoto
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
saswat tripathy
 
A Methodology For Motif Discovery Employing Iterated Cluster Re-Assignment
A Methodology For Motif Discovery Employing Iterated Cluster Re-AssignmentA Methodology For Motif Discovery Employing Iterated Cluster Re-Assignment
A Methodology For Motif Discovery Employing Iterated Cluster Re-Assignment
Angela Tyger
 

Similar to Arrays and alternative splicing (20)

RNA Sequencing Research
RNA Sequencing ResearchRNA Sequencing Research
RNA Sequencing Research
 
Analysis of gene expression microarray data of patients with Spinal Muscular ...
Analysis of gene expression microarray data of patients with Spinal Muscular ...Analysis of gene expression microarray data of patients with Spinal Muscular ...
Analysis of gene expression microarray data of patients with Spinal Muscular ...
 
Poster2
Poster2Poster2
Poster2
 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data sets
 
Allert2010-Multifactorial_determinants_of_protein_expression_in_prokaryotic_o...
Allert2010-Multifactorial_determinants_of_protein_expression_in_prokaryotic_o...Allert2010-Multifactorial_determinants_of_protein_expression_in_prokaryotic_o...
Allert2010-Multifactorial_determinants_of_protein_expression_in_prokaryotic_o...
 
Identifying the Coding and Non Coding Regions of DNA Using Spectral Analysis
Identifying the Coding and Non Coding Regions of DNA Using Spectral AnalysisIdentifying the Coding and Non Coding Regions of DNA Using Spectral Analysis
Identifying the Coding and Non Coding Regions of DNA Using Spectral Analysis
 
RT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferationRT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferation
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEG
 
ASHG 2015 - Redundant Annotations in Tertiary Analysis
ASHG 2015 - Redundant Annotations in Tertiary AnalysisASHG 2015 - Redundant Annotations in Tertiary Analysis
ASHG 2015 - Redundant Annotations in Tertiary Analysis
 
EiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.DEiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.D
 
Liu_Jiangyuan_1201662_FR
Liu_Jiangyuan_1201662_FRLiu_Jiangyuan_1201662_FR
Liu_Jiangyuan_1201662_FR
 
Functional genomics, and tools
Functional genomics, and toolsFunctional genomics, and tools
Functional genomics, and tools
 
presentation
presentationpresentation
presentation
 
MCB 432 Final Table PP 01.06.16
MCB 432 Final Table PP 01.06.16MCB 432 Final Table PP 01.06.16
MCB 432 Final Table PP 01.06.16
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptxBTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
 
Prediction of protein function
Prediction of protein functionPrediction of protein function
Prediction of protein function
 
Journal club Aug04 2015 GeneMarkET
Journal club Aug04 2015 GeneMarkETJournal club Aug04 2015 GeneMarkET
Journal club Aug04 2015 GeneMarkET
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
A Methodology For Motif Discovery Employing Iterated Cluster Re-Assignment
A Methodology For Motif Discovery Employing Iterated Cluster Re-AssignmentA Methodology For Motif Discovery Employing Iterated Cluster Re-Assignment
A Methodology For Motif Discovery Employing Iterated Cluster Re-Assignment
 

More from Ann Loraine

Use Integrated Genome Browser to explore, analyze, and publish genomic data
Use Integrated Genome Browser to explore, analyze, and publish genomic dataUse Integrated Genome Browser to explore, analyze, and publish genomic data
Use Integrated Genome Browser to explore, analyze, and publish genomic data
Ann Loraine
 
Visualize genomes with Integrated Genome Browser
Visualize genomes with Integrated Genome BrowserVisualize genomes with Integrated Genome Browser
Visualize genomes with Integrated Genome Browser
Ann Loraine
 
BINF 3121 Data Analysis Report How-To
BINF 3121 Data Analysis Report How-ToBINF 3121 Data Analysis Report How-To
BINF 3121 Data Analysis Report How-To
Ann Loraine
 
Giving great talks in Bioinformatics - from Professional Communication class ...
Giving great talks in Bioinformatics - from Professional Communication class ...Giving great talks in Bioinformatics - from Professional Communication class ...
Giving great talks in Bioinformatics - from Professional Communication class ...
Ann Loraine
 
Interviewing - why some questions are off limits
Interviewing - why some questions are off limitsInterviewing - why some questions are off limits
Interviewing - why some questions are off limits
Ann Loraine
 
RNA-Seq Analysis of Blueberry Fruit Development and Ripening
RNA-Seq Analysis of Blueberry Fruit Development and RipeningRNA-Seq Analysis of Blueberry Fruit Development and Ripening
RNA-Seq Analysis of Blueberry Fruit Development and Ripening
Ann Loraine
 
Introducing ProtAnnot - Araport workshop at PAG 2016
Introducing ProtAnnot - Araport workshop at PAG 2016Introducing ProtAnnot - Araport workshop at PAG 2016
Introducing ProtAnnot - Araport workshop at PAG 2016
Ann Loraine
 
Em pcr 16x9
Em pcr 16x9Em pcr 16x9
Em pcr 16x9
Ann Loraine
 
Visualizing the genome: Techniques for presenting genome data and annotations
Visualizing the genome: Techniques for presenting genome data and annotationsVisualizing the genome: Techniques for presenting genome data and annotations
Visualizing the genome: Techniques for presenting genome data and annotations
Ann Loraine
 
wings2014 Workshop 1 Design, sequence, align, count, visualize
wings2014 Workshop 1 Design, sequence, align, count, visualizewings2014 Workshop 1 Design, sequence, align, count, visualize
wings2014 Workshop 1 Design, sequence, align, count, visualize
Ann Loraine
 
WiNGS 2014 Workshop 2 R, RStudio, and reproducible research with knitr
WiNGS 2014 Workshop 2 R, RStudio, and reproducible research with knitrWiNGS 2014 Workshop 2 R, RStudio, and reproducible research with knitr
WiNGS 2014 Workshop 2 R, RStudio, and reproducible research with knitr
Ann Loraine
 
RNA-Seq data analysis at wings 2014 - Workshop 3 Biological Interpretation
RNA-Seq data analysis at wings 2014 - Workshop 3 Biological InterpretationRNA-Seq data analysis at wings 2014 - Workshop 3 Biological Interpretation
RNA-Seq data analysis at wings 2014 - Workshop 3 Biological Interpretation
Ann Loraine
 
Linking IGB with Galaxy
Linking IGB with GalaxyLinking IGB with Galaxy
Linking IGB with Galaxy
Ann Loraine
 
IGB genome genometry data models by Gregg Helt and Cyrus Harmon
IGB genome genometry data models by Gregg Helt and Cyrus HarmonIGB genome genometry data models by Gregg Helt and Cyrus Harmon
IGB genome genometry data models by Gregg Helt and Cyrus Harmon
Ann Loraine
 
RNA-Seq analysis of blueberry fruit identifies candidate genes involved in ri...
RNA-Seq analysis of blueberry fruit identifies candidate genes involved in ri...RNA-Seq analysis of blueberry fruit identifies candidate genes involved in ri...
RNA-Seq analysis of blueberry fruit identifies candidate genes involved in ri...
Ann Loraine
 

More from Ann Loraine (15)

Use Integrated Genome Browser to explore, analyze, and publish genomic data
Use Integrated Genome Browser to explore, analyze, and publish genomic dataUse Integrated Genome Browser to explore, analyze, and publish genomic data
Use Integrated Genome Browser to explore, analyze, and publish genomic data
 
Visualize genomes with Integrated Genome Browser
Visualize genomes with Integrated Genome BrowserVisualize genomes with Integrated Genome Browser
Visualize genomes with Integrated Genome Browser
 
BINF 3121 Data Analysis Report How-To
BINF 3121 Data Analysis Report How-ToBINF 3121 Data Analysis Report How-To
BINF 3121 Data Analysis Report How-To
 
Giving great talks in Bioinformatics - from Professional Communication class ...
Giving great talks in Bioinformatics - from Professional Communication class ...Giving great talks in Bioinformatics - from Professional Communication class ...
Giving great talks in Bioinformatics - from Professional Communication class ...
 
Interviewing - why some questions are off limits
Interviewing - why some questions are off limitsInterviewing - why some questions are off limits
Interviewing - why some questions are off limits
 
RNA-Seq Analysis of Blueberry Fruit Development and Ripening
RNA-Seq Analysis of Blueberry Fruit Development and RipeningRNA-Seq Analysis of Blueberry Fruit Development and Ripening
RNA-Seq Analysis of Blueberry Fruit Development and Ripening
 
Introducing ProtAnnot - Araport workshop at PAG 2016
Introducing ProtAnnot - Araport workshop at PAG 2016Introducing ProtAnnot - Araport workshop at PAG 2016
Introducing ProtAnnot - Araport workshop at PAG 2016
 
Em pcr 16x9
Em pcr 16x9Em pcr 16x9
Em pcr 16x9
 
Visualizing the genome: Techniques for presenting genome data and annotations
Visualizing the genome: Techniques for presenting genome data and annotationsVisualizing the genome: Techniques for presenting genome data and annotations
Visualizing the genome: Techniques for presenting genome data and annotations
 
wings2014 Workshop 1 Design, sequence, align, count, visualize
wings2014 Workshop 1 Design, sequence, align, count, visualizewings2014 Workshop 1 Design, sequence, align, count, visualize
wings2014 Workshop 1 Design, sequence, align, count, visualize
 
WiNGS 2014 Workshop 2 R, RStudio, and reproducible research with knitr
WiNGS 2014 Workshop 2 R, RStudio, and reproducible research with knitrWiNGS 2014 Workshop 2 R, RStudio, and reproducible research with knitr
WiNGS 2014 Workshop 2 R, RStudio, and reproducible research with knitr
 
RNA-Seq data analysis at wings 2014 - Workshop 3 Biological Interpretation
RNA-Seq data analysis at wings 2014 - Workshop 3 Biological InterpretationRNA-Seq data analysis at wings 2014 - Workshop 3 Biological Interpretation
RNA-Seq data analysis at wings 2014 - Workshop 3 Biological Interpretation
 
Linking IGB with Galaxy
Linking IGB with GalaxyLinking IGB with Galaxy
Linking IGB with Galaxy
 
IGB genome genometry data models by Gregg Helt and Cyrus Harmon
IGB genome genometry data models by Gregg Helt and Cyrus HarmonIGB genome genometry data models by Gregg Helt and Cyrus Harmon
IGB genome genometry data models by Gregg Helt and Cyrus Harmon
 
RNA-Seq analysis of blueberry fruit identifies candidate genes involved in ri...
RNA-Seq analysis of blueberry fruit identifies candidate genes involved in ri...RNA-Seq analysis of blueberry fruit identifies candidate genes involved in ri...
RNA-Seq analysis of blueberry fruit identifies candidate genes involved in ri...
 

Recently uploaded

做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
apvysm8
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
mzpolocfi
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Enterprise Wired
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
u86oixdj
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 

Recently uploaded (20)

做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 

Arrays and alternative splicing

  • 1. Intensity Screening. We eliminated all probe sets from the analysis that produced low-intensity signals. For this study, we consider active probe sets only - probe sets that detected a target mRNA in at least one of the experimental conditions. Statistical Methodology We used mixed effect ANOVA model to identify genes in which the redundant probe sets produced discordant differential expression readings across strains. For each gene, we fitted a mixed effects linear model: yijk = μ + Si + Pk + Si*Pk + Mj(i) + εjk where yijk is a measure of gene expression produced by an individual probe set; μ is the grand mean of all probe set readings for the gene being considered; Genome-based analysis of expression microarray data Ann Loraine (1), Xiangqin Cui (1) and Gary Churchill (2) (1) Section on Statistical Genetics, Dept. of Biostatistics, University of Alabama at Birmingham; (2) The Jackson Laboratory, Bar Harbor, ME Abstract Methods Methods Conclusion We developed a genome-based analysis method that uses data from expression microarray experiments (Affymetrix platform) to investigate alternative mRNA processing. This approach revealed numerous examples of genes whose expression patterns suggest strain-dependent differential mRNA processing in mouse. Introduction References Current Affymetrix expression microarray designs typically include numerous redundant probe sets that interrogate different regions of the same gene. This is a side effect of a design process which creates probe sets matching alternative splicing or other types of mRNA variants. Thus, when redundant probe sets produce discordant results, such as differential expression in opposite directions, the most logical explanation is that the condition under investigation affects mRNA processing pathways, such as alternative splicing and alternative polyadenylation site choice. Both types of alternative mRNA processing can have profound impacts on gene function when important functional motifs, such as conserved protein motifs or mRNA stability determinants, are affected. Discordant behavior of redundant probe sets therefore may represent an opportunity to study how alternative mRNA processing pathways operate. Here we investigate this idea using expression data collected from two mouse strains. Data collection and pre-processing. Labeled mRNA samples were prepared from the livers of six mice, three of strain AJ and three of strain B6. Labeled samples were hybridized to two Affymetrix 430_2 microarrays per mouse. The array data were processed using RMA and normalized using quantile-quantile normalization. Redundant probe sets. Affymetrix provides annotation data files that map probe set ids to Entrez Gene ids as well as ‘psl’ files that map probe set design sequences onto the May 2004 mouse genome. We used these files to create and then screen a provisional list of redundant probe sets for use in our study. Methods Probe sets per Entrez Gene ID # Entrez Gene IDS 1 11,511 2 4,703 3 2,309 4 1,062 5 454 6 257 7 109 >7 87 TOTAL 20,492 The 430_2 mouse array contains 37,449 probe set ids that map onto 20,492 Entrez Gene ids. Thus, according to the Affymetrix annotations, over 9,000 Entrez Gene ids are associated with two or more probe sets. Screening step # Probe Sets One genomic location, one Entrez Gene id 34,196 Inconsistent with Refseq genomic alignment -2036 Reliable Refseq alignment not available -7,895 TOTAL remaining 24,265 These annotation files contain a number of inconsistencies, such as probe sets mapping to an unexpected chromosome or strand. To remove these, we compared genomic alignments of the probe set design sequences with alignments for Refseq mRNAs associated with the annotated Entrez Gene ids. Probe sets whose design sequence alignments were inconsistent with a reliable RefSeq alignment were eliminated. Results The analysis identified 1,400 genes with adjusted p- values less than or equal to 0.05. We examined a number of these using the Integrated Genome Browser - see below [1]. In some cases, alternative mRNA processing alters the coding region and modifies conserved functional motifs, a common occurence among alternatively spliced genes [2,3]. probe set AJ B6 orientation 1424176_a_at 8.76 8.43 -0.33 proximal 1421223_a_at 9.01 9.62 +0.61 distal annexin A4 probe set average intensity readings design sequence alignments probes Results Si, Mj(i) and εjk are the strain, mouse and residual effects; Pk is the probe set effect; and Si*Pk is the interaction between strain and probe set. In this setting, different probe sets can provide expression measurements for a single gene, and these measurements may vary across different mouse strains (fixed effects) as well as across individual mice (random effects). However, some probe sets - because they interrogate different regions of the same gene - may respond differently in the different strains. The interaction term Si*Pk captures any differential expression arising from the combined effect of probe set and strain. Our null hypothesis was that this strain by probe set interaction term is zero. We tested this using an F test for mixed effect ANOVA models and used an adaptive FDR method to adjust p-values for a desired FDR of 0.05. Note that this test does not identify the individual probe sets which are discordant but instead identifies genes in which not all probe sets respond to the strain difference in the same way. [1] The Integrated Genome Browser is open source software available from www.genoviz.org. [2] Loraine, et al. (2002) Protein-based analysis of alternative splicing in the human genome. IEEE Comput So Bioinform Conf. pp. 321-326 [3] Loraine, et al. (2003) Exploring alternative transcript structure in the human genome using BLOCKS and InterPro. J Bioinform Comput Biol. pp. 289-306. Based on these preliminary results, we conclude that analyzing the relative expression of redundant probe sets from Affymetrix expression microarray designs has the potential to reveal how diverse experimental conditions affect differential mRNA processing and degradation pathways. probe set AJ B6 orientation 1450502_at 8.05 7.77 -0.28 proximal 1433770_at 9.32 9.58 +0.26 distal Dpysl2 dihydropyrimidinase-like 2 probe set average intensity readings design sequence alignments probes Nipsnap1 4-nitrophenylphosphatase domain and non-neuronal SNAP25-like probe set AJ B6 orientation 1448465_at 10.4 10.6 +0.2 proximal 1450184_s_at 11.8 11.4 -0.4 distal probe set average intensity readings design sequence alignments probes UTR coding region