Bioinformatics for
Biomedical Research
Ricardo Gonzalo Sanz
ricardo.gonzalo@vhir.org
20/05/14
Hospital Universitari Vall d’Hebron
Institut de Recerca - VHIR
Institut d’Investigació Sanitària de l’Instituto de Salud Carlos III (ISCIII)
Basic aspects of Microarray
technology
1 Introduction
2 Different types of arrays. Manufacturing. DNA/RNA/Protein
3 Affymetrix microarrays manufacture
4 Different types of Affymetrix arrays
5 Microarray experiment workflow
6 Quality Controls
1 Introduction
• reproducibility
• only show you what you’re looking for
• what about ‘indels’, inversions, translocations...
• accuracy
• sensitivity
1 Introduction
• RNA-Seq was superior in detecting low-abundance transcripts
• It was also better at differentiating biologically important isoforms
• RNA-Seq demonstrated a broader dynamic range than microarrays
1 Introduction
• Molecular biology has many techniques for measuring gene expression
(e.g. Northern blot)
• The key feature of microarrays when they were introduced (Schena et al. (1995)
Science 270:467-70) was not what could be measured, but the number of
measurements that could be made simultaneously.
• Before microarrays: genes were studied one by one
• After microarrays: all genes are studied together.
1 Introduction
• So, what is a microarray in a few words?
  - DNA fixed to a solid surface (nylon, silica, glass, ...)
  - The RNA sample under study is labeled and must bind specifically
    to the DNA fixed on the solid surface.
  - The fixed DNA is usually called the “probe”
  - The labeled RNA is usually called the “target”
Important to know in advance...
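The probe/target vocabulary above can be made concrete with a toy example. This is only a sketch: it assumes that binding "specifically" means perfect Watson-Crick pairing between antiparallel strands, whereas real hybridization tolerates some mismatches. The sequences and the helper names `reverse_complement_rna` and `binds` are hypothetical.

```python
# Toy model of probe/target hybridization.
# Assumption of this sketch: perfect Watson-Crick pairing, antiparallel strands.

# DNA base on the probe -> RNA base that pairs with it
DNA_TO_RNA_PAIR = {"A": "U", "T": "A", "G": "C", "C": "G"}


def reverse_complement_rna(dna_probe: str) -> str:
    """Return the RNA sequence (5'->3') that pairs perfectly with a DNA probe (5'->3')."""
    return "".join(DNA_TO_RNA_PAIR[base] for base in reversed(dna_probe))


def binds(dna_probe: str, rna_target: str) -> bool:
    """True if the labeled RNA target contains the perfect complement of the probe."""
    return reverse_complement_rna(dna_probe) in rna_target


probe = "ATGCGTAC"        # hypothetical 8-nt probe fixed on the surface
target = "GGGUACGCAUCC"   # hypothetical labeled RNA target

print(reverse_complement_rna(probe))
print(binds(probe, target))
```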
1 Introduction
• Microarrays are usually hypothesis-generating:
They highlight specific genes or features that are particularly
interesting for follow-up experiments.
An exception would be biomarker discovery studies.
• This does not reduce the importance of experimental design
2 Different types of arrays. Manufacturing. DNA/RNA
Two color microarrays (cDNA)
• Probes are usually long (cDNA fragments, typically hundreds of nucleotides)
• Probes are fixed to a glass slide
• Labeling uses two fluorochromes (Cy3/Cy5).
• The two samples are compared directly because
they are hybridized on the same array.
• Each gene appears only a few times on the array
• Long probes facilitate cross-hybridization
• Reproducibility is not very good.
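Because both samples share one array, each spot yields a pair of intensities that can be compared directly. A minimal sketch, assuming the usual log2 ratio of the Cy5 signal to the Cy3 signal as the per-spot measure. The intensity values, gene names and the function name `log_ratio` are invented for illustration.

```python
import math


def log_ratio(cy5: float, cy3: float) -> float:
    """log2(Cy5 / Cy3) for one spot; 0 means equal signal in both samples."""
    return math.log2(cy5 / cy3)


# Hypothetical background-corrected intensities: (Cy5 sample, Cy3 sample)
spots = {
    "geneA": (1600.0, 400.0),   # higher in the Cy5 sample
    "geneB": (500.0, 500.0),    # unchanged
    "geneC": (150.0, 1200.0),   # higher in the Cy3 sample
}

for gene, (cy5, cy3) in spots.items():
    print(gene, log_ratio(cy5, cy3))
```

Working on the log scale makes a fourfold increase and a fourfold decrease symmetric around zero.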
2 Different types of arrays. Manufacturing. DNA/RNA
One color microarrays
• Short probes (20-25 nt)
• The target is labeled with a single fluorochrome
• Only one sample is hybridized on each array.
• Each gene is represented by many probes
on the array
2 Different types of arrays. Manufacturing. DNA/RNA
DNA
• DNA polymorphism (GWAS)
• Transcription factors
• Resequencing
• Cytogenetics
RNA
• Expression
• Alternative splicing
• microRNA
2 Different types of Affymetrix arrays.
3’ IVT Arrays
• Biased measurement of gene expression
• The most widely used array type in the literature. Available for many species.
Only genes with a poly(A) tail and an intact 3’ end will
be amplified and have the chance to
hybridize correctly.
2 Different types of Affymetrix arrays.
Gene/Exon Arrays
• Gene arrays are the most widely used (good quality-to-price ratio)
• Gene 2.0 arrays use a more up-to-date library and also include lncRNAs
2 Different types of expression arrays.
miRNA array
• 153 organisms on the array (human, mouse, rat, canine, ...)
• 100% coverage of miRBase v17
• 2,216 snoRNAs and scaRNAs (human small nucleolar and small Cajal body-specific RNAs)
• Low input amount (130 ng total RNA)
• 2,999 probe sets unique to pre-miRNA hairpins
• Able to differentiate precursor and mature miRNAs
• Useful for FFPE samples
2 Different types of expression arrays.
HTA array
3 Affymetrix microarrays manufacture
Photolithography
5 Microarray experiment workflow
6 Quality Controls
Length of amplified cRNA
6 Quality Controls
Length of fragmented cRNA
Basic aspects of Microarray
Data Analysis
1 Introduction. Experimental design
2 Quality control
3 Normalization
4 Filtering
5 Statistical inference of differential expression
6 Clustering
7 Annotation
8 Biological interpretation
1 Introduction. Experimental design
Microarray Analysis Workflow
2 Quality Control
Was the experiment a success?
• Microarray experiments generate huge quantities of data
• The standard statistical approach is to check quality with plots, which:
  - show all data together
  - highlight structures
  - may help to detect problems (“unusual patterns”)
It is hard to decide if things “seem to be
all right” just by looking at the numbers.
2 Quality Control
Diagnostic plots for microarrays:
• Microarray data are usually considered at two levels
1. Low level. Data coming directly from the scanner
2. High level. Processed from low-level data. Expression values,
normalized or not.
• Some plots are specific to certain array types or to one level
2 Quality Control
Diagnostic plots for microarrays:
1. Low level:
  - Layout image
  - Degradation plots (only in 3’IVT)
  - Histogram/density plots
  - PCA, Boxplot
2. High level:
  - MA plots
  - Model-based plots (NUSE, RLE)
  - PCA, Boxplot
2 Quality Control
Diagnostic plots for microarrays. Low level. Layout image.
2 Quality Control
Diagnostic plots for microarrays. Low level. RNA degradation plot (3’IVT arrays)
2 Quality Control
Diagnostic plots for microarrays. Low level. Histogram/density plot
2 Quality Control
Diagnostic plots for microarrays. Low level. Boxplot
2 Quality Control
Diagnostic plots for microarrays. Low level. PCA
2 Quality Control
Diagnostic plots for microarrays. High level. RLE
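The slide names the RLE (relative log expression) plot without defining it. As a sketch, assuming the common definition in which each gene's log2 expression on an array is compared with that gene's median across all arrays, the values behind such a plot could be computed as follows. The small expression matrix is invented, and its fourth array is deliberately shifted.

```python
from statistics import median

# Hypothetical log2 expression values: one list per gene, one entry per array
expr = {
    "gene1": [8.0, 8.2, 7.9, 9.5],
    "gene2": [5.0, 5.1, 4.9, 6.4],
    "gene3": [10.0, 9.8, 10.1, 11.6],
}


def rle(matrix):
    """Subtract each gene's median across arrays from its values."""
    out = {}
    for gene, values in matrix.items():
        m = median(values)
        out[gene] = [v - m for v in values]
    return out


rle_values = rle(expr)
n_arrays = len(next(iter(expr.values())))

# Median RLE per array: expected to sit near 0 for a well-behaved array
array_medians = [
    median(rle_values[gene][j] for gene in rle_values) for j in range(n_arrays)
]
print(array_medians)
```

In a boxplot of these values, an array whose box is clearly shifted away from zero or much wider than the others would be a candidate for closer inspection.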
2 Quality Control
Diagnostic plots for microarrays. High level. NUSE
2 Quality Control
Diagnostic plots for microarrays. High level. MA plots
• MA plots allow pairwise comparison of the log-intensities of each array with a
reference array, and identification of intensity-dependent biases.
• The Y axis shows the log-ratio of the intensity of one array to the
reference (median) array, called “M”, while the X axis shows the
average log-intensity of both arrays, called “A”.
• Most probe levels are not expected to differ much, so we expect an MA plot centered
on the Y=0 line from low to high intensities.
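The M and A values described above can be computed directly from log-intensities. A minimal sketch, assuming the reference array is the probe-wise median across all arrays. The data and the function name `ma_values` are hypothetical.

```python
from statistics import median

# Hypothetical log2 intensities: rows = probes, columns = arrays
log_intensity = [
    [7.0, 7.2, 6.9],
    [9.0, 9.1, 9.4],
    [11.0, 10.8, 11.1],
    [5.0, 5.3, 4.9],
]


def ma_values(matrix, array_index):
    """Return (A, M) pairs for one array against the probe-wise median array."""
    pairs = []
    for row in matrix:
        ref = median(row)        # value of the reference "median array" for this probe
        x = row[array_index]
        m = x - ref              # M: log-ratio, i.e. a difference of logs
        a = (x + ref) / 2        # A: average log-intensity
        pairs.append((a, m))
    return pairs


for a, m in ma_values(log_intensity, 1):
    print(f"A={a:.2f}  M={m:+.2f}")
```

Plotting M against A for each array and checking that the cloud stays around M = 0 across the whole range of A is the visual check the slide describes.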
3 Normalization
The goal of normalization is to adjust for the effects that are due to variations in the
technology rather than the biology.
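The slides do not name a specific normalization method at this point. Purely as an illustration, the sketch below implements quantile normalization, a widely used technique for one-color arrays that forces every array to share the same intensity distribution. The function name `quantile_normalize`, the tie handling and the data are assumptions of this example.

```python
def quantile_normalize(columns):
    """Quantile-normalize a list of arrays (lists of equal length).

    Ties are broken by original position, a simplification of this sketch.
    """
    n = len(columns[0])
    # For each array, the probe indices sorted by increasing intensity
    orders = [sorted(range(n), key=lambda i: col[i]) for col in columns]
    # Reference distribution: mean of the k-th smallest values across arrays
    reference = [
        sum(col[order[k]] for col, order in zip(columns, orders)) / len(columns)
        for k in range(n)
    ]
    normalized = []
    for order in orders:
        new_col = [0.0] * n
        for k, i in enumerate(order):
            new_col[i] = reference[k]   # k-th smallest probe gets the k-th reference value
        normalized.append(new_col)
    return normalized


# Hypothetical intensities: three arrays, four probes each
arrays = [
    [5.0, 2.0, 3.0, 4.0],
    [4.0, 1.0, 4.5, 2.0],
    [3.0, 4.0, 6.0, 8.0],
]
normalized = quantile_normalize(arrays)
for col in normalized:
    print([round(v, 2) for v in col])
```

After this step the arrays differ only in which probe holds which rank, so global technical shifts in intensity between arrays are removed.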
4 Filtering
• In a microarray experiment, only a few hundred or a few thousand genes change their
expression between conditions
• The researcher wants to keep the number of tests/genes as low as possible
while retaining the interesting genes in the selected subset.
• If the truly differentially expressed genes are over-represented among those
selected in the filtering step, the FDR associated with a given threshold of the
test statistic will be lowered by the filtering.
Genes that do not change introduce
noise, so it is better to exclude them
before the statistical analysis is
done.
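The slide refers to the FDR without naming a procedure. Assuming the Benjamini-Hochberg adjustment, one common way to control the FDR, the sketch below shows the effect described above: the same raw p-values receive smaller adjusted values once uninformative genes have been removed. The p-values are invented, and the "filtering" is simulated by dropping five of the unchanged genes.

```python
def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    n = len(pvalues)
    order = sorted(range(n), key=lambda i: pvalues[i])
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity
    for rank in range(n, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * n / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical p-values: three genes that change plus seven that do not
all_genes = [0.001, 0.004, 0.012, 0.30, 0.45, 0.52, 0.61, 0.74, 0.88, 0.95]
filtered = all_genes[:5]   # pretend filtering removed five unchanged genes

print(bh_adjust(all_genes)[:3])
print(bh_adjust(filtered)[:3])
```

This benefit only holds when the filter does not use the group labels; filtering on the test result itself would bias the adjusted values.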
4 Filtering
There are different types of filtering:
• Annotation features (specific):
  - Specific gene features (e.g. GO term, presence of transcriptional regulatory
    elements in promoters, etc.)
Data derived from IPA
• Signal features (non-specific)
  - Percentage of intensities greater than a user-defined value
  - Interquartile range (IQR) greater than a defined value
4 Filtering
Signal filtering: the premise of this technique is to remove genes that are
deemed not expressed or unchanged, according to a specific criterion
set by the user.
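The two signal criteria listed earlier (fraction of intensities above a cutoff, and IQR above a cutoff) can be sketched as a simple per-gene filter. All thresholds, gene names and values below are hypothetical, and combining the two criteria with a logical AND is a choice made for this example rather than something the slides prescribe.

```python
from statistics import quantiles


def iqr(values):
    """Interquartile range (Q3 - Q1) of one gene's values across arrays."""
    q1, _, q3 = quantiles(values, n=4)
    return q3 - q1


def passes_filter(values, min_intensity, min_fraction, min_iqr):
    """Keep a gene if enough arrays exceed min_intensity and its IQR is large enough."""
    fraction_above = sum(v > min_intensity for v in values) / len(values)
    return fraction_above >= min_fraction and iqr(values) >= min_iqr


# Hypothetical log2 expression values across six arrays
expr = {
    "geneA": [8.1, 8.0, 8.2, 10.5, 10.7, 10.4],  # expressed and variable
    "geneB": [3.1, 3.0, 3.2, 3.1, 2.9, 3.0],     # not expressed
    "geneC": [9.0, 9.1, 9.0, 9.1, 9.0, 9.1],     # expressed but unchanged
}

kept = [
    gene
    for gene, values in expr.items()
    if passes_filter(values, min_intensity=6.0, min_fraction=0.5, min_iqr=0.5)
]
print(kept)
```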
5 Statistical inference of differential expression
• Indirect comparisons: 2 groups, unpaired
• Direct comparisons: 2 groups, paired
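The difference between the two designs shows up in how the test statistic is built. The sketch below uses plain t statistics as a stand-in (Welch's version for the unpaired case); it is not the moderated statistic computed by limma. The expression values for one gene in four hypothetical patients are invented.

```python
from math import sqrt
from statistics import mean, variance


def unpaired_t(group1, group2):
    """Welch t statistic for two independent groups."""
    se = sqrt(variance(group1) / len(group1) + variance(group2) / len(group2))
    return (mean(group1) - mean(group2)) / se


def paired_t(before, after):
    """Paired t statistic computed on the within-pair differences."""
    diffs = [a - b for b, a in zip(before, after)]
    return mean(diffs) / sqrt(variance(diffs) / len(diffs))


# Hypothetical log2 expression of one gene in four patients
before = [7.0, 8.0, 9.0, 10.0]
after = [7.5, 8.6, 9.4, 10.5]

print(unpaired_t(after, before))   # small: patient-to-patient spread hides the shift
print(paired_t(before, after))     # large: each patient acts as their own control
```

With the same numbers, the paired analysis detects a consistent shift that the unpaired analysis cannot separate from the variability between patients, which is why the design must be known before choosing the test.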
5 Statistical inference of differential expression
limma package (Gordon Smyth)
6 Clustering
Types:
• Supervised clustering tries to find the best partition for data that belong to a
known set of classes
• Unsupervised clustering tries to define the number and size of the classes
into which the transcription profiles can be fitted.
6 Clustering
Hierarchical Clustering (HCL)
• HCL is an agglomerative/divisive clustering method.
• The iterative process continues until all groups are
connected in a hierarchical tree.
• Samples that are more similar to each other are placed closer together.
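The iterative process can be sketched for the agglomerative variant. The slides do not specify a distance or a linkage rule, so this example assumes Euclidean distance and average linkage. The sample names and expression profiles are hypothetical.

```python
from itertools import combinations
from math import dist


def average_linkage(c1, c2, profiles):
    """Mean Euclidean distance between all sample pairs of two clusters."""
    total = sum(dist(profiles[a], profiles[b]) for a in c1 for b in c2)
    return total / (len(c1) * len(c2))


def agglomerate(profiles):
    """Merge the two closest clusters repeatedly; return the merges in order."""
    clusters = [(name,) for name in profiles]
    merges = []
    while len(clusters) > 1:
        c1, c2 = min(
            combinations(clusters, 2),
            key=lambda pair: average_linkage(pair[0], pair[1], profiles),
        )
        clusters.remove(c1)
        clusters.remove(c2)
        clusters.append(c1 + c2)   # the merged group becomes a new node of the tree
        merges.append((c1, c2))
    return merges


# Hypothetical expression profiles (three genes) for four samples
samples = {
    "ctrl1": [1.0, 5.0, 3.0],
    "ctrl2": [1.2, 5.1, 2.9],
    "trt1": [4.0, 2.0, 6.0],
    "trt2": [4.1, 2.2, 6.3],
}

for left, right in agglomerate(samples):
    print(left, "+", right)
```

The order of the merges is what a dendrogram draws: the most similar samples are joined first, and the last merge connects everything into a single tree.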
7 Annotation
8 Biological interpretation
Gene Ontology

More Related Content

What's hot

BLAST
BLASTBLAST
Transcriptomics approaches
Transcriptomics approachesTranscriptomics approaches
Transcriptomics approaches
CharupriyaChauhan1
 
Genomics
Genomics Genomics
Microarray Analysis
Microarray AnalysisMicroarray Analysis
Microarray Analysis
James McInerney
 
Next generation sequencing methods
Next generation sequencing methods Next generation sequencing methods
Next generation sequencing methods
Mrinal Vashisth
 
NGS data formats and analyses
NGS data formats and analysesNGS data formats and analyses
NGS data formats and analyses
rjorton
 
Protein identification - peptide mass fingerprinting
Protein identification - peptide mass fingerprintingProtein identification - peptide mass fingerprinting
Protein identification - peptide mass fingerprinting
Creative Proteomics
 
Microarray technique
Microarray techniqueMicroarray technique
Microarray technique
arunchacko14
 
Microarray
MicroarrayMicroarray
Microarray
Ankitha Hirematha
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
Uzma Jabeen
 
DNA MICROARRAY
DNA MICROARRAYDNA MICROARRAY
DNA MICROARRAY
rishabhaks
 
Microarray full detail
Microarray full detailMicroarray full detail
Microarray full detail
Devendra Choudhary
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
Dayananda Salam
 
Differential gene profiling methods
Differential gene profiling methodsDifferential gene profiling methods
Differential gene profiling methods
sonamyadav82
 
Next-generation sequencing format and visualization with ngs.plot
Next-generation sequencing format and visualization with ngs.plotNext-generation sequencing format and visualization with ngs.plot
Next-generation sequencing format and visualization with ngs.plot
Li Shen
 
DNA microarray ppt
DNA microarray pptDNA microarray ppt
DNA microarray ppt
Mohit Kohli
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
hemantbreeder
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
prateek kumar
 
Sage
SageSage
DNA Microarray introdution and application
DNA Microarray introdution and applicationDNA Microarray introdution and application
DNA Microarray introdution and application
Neeraj Sharma
 

What's hot (20)

BLAST
BLASTBLAST
BLAST
 
Transcriptomics approaches
Transcriptomics approachesTranscriptomics approaches
Transcriptomics approaches
 
Genomics
Genomics Genomics
Genomics
 
Microarray Analysis
Microarray AnalysisMicroarray Analysis
Microarray Analysis
 
Next generation sequencing methods
Next generation sequencing methods Next generation sequencing methods
Next generation sequencing methods
 
NGS data formats and analyses
NGS data formats and analysesNGS data formats and analyses
NGS data formats and analyses
 
Protein identification - peptide mass fingerprinting
Protein identification - peptide mass fingerprintingProtein identification - peptide mass fingerprinting
Protein identification - peptide mass fingerprinting
 
Microarray technique
Microarray techniqueMicroarray technique
Microarray technique
 
Microarray
MicroarrayMicroarray
Microarray
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
DNA MICROARRAY
DNA MICROARRAYDNA MICROARRAY
DNA MICROARRAY
 
Microarray full detail
Microarray full detailMicroarray full detail
Microarray full detail
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
Differential gene profiling methods
Differential gene profiling methodsDifferential gene profiling methods
Differential gene profiling methods
 
Next-generation sequencing format and visualization with ngs.plot
Next-generation sequencing format and visualization with ngs.plotNext-generation sequencing format and visualization with ngs.plot
Next-generation sequencing format and visualization with ngs.plot
 
DNA microarray ppt
DNA microarray pptDNA microarray ppt
DNA microarray ppt
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Sage
SageSage
Sage
 
DNA Microarray introdution and application
DNA Microarray introdution and applicationDNA Microarray introdution and application
DNA Microarray introdution and application
 

Similar to Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformatics Course - Session 3.2 - VHIR, Barcelona)

20100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_020100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_0
Computer Science Club
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
ajay301
 
12 arrays
12 arrays12 arrays
12 arrays
12 arrays12 arrays
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
Sreenivasa Reddy Thalla
 
qPCR Design Strategies for Specific Applications
qPCR Design Strategies for Specific ApplicationsqPCR Design Strategies for Specific Applications
qPCR Design Strategies for Specific Applications
Integrated DNA Technologies
 
Axt microarrays
Axt microarraysAxt microarrays
Axt microarrays
Heidi Barrier
 
Microarray @ujjwal sirohi
Microarray @ujjwal sirohiMicroarray @ujjwal sirohi
Microarray @ujjwal sirohi
ujjwal sirohi
 
A comprehensive study of microarray
A comprehensive study of microarrayA comprehensive study of microarray
A comprehensive study of microarray
PRABAL SINGH
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
Pawan Kumar
 
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
Ilya Klabukov
 
Toxicogenomics: microarray
Toxicogenomics: microarrayToxicogenomics: microarray
Toxicogenomics: microarray
Eden D'souza
 
Cignal webina
Cignal webinaCignal webina
Cignal webina
Elsa von Licy
 
Molecular profiling 2013
Molecular profiling 2013Molecular profiling 2013
Molecular profiling 2013
Prof. Wim Van Criekinge
 
170120 giab stanford genetics seminar
170120 giab stanford genetics seminar170120 giab stanford genetics seminar
170120 giab stanford genetics seminar
GenomeInABottle
 
Si rna 2013
Si rna 2013Si rna 2013
Si rna 2013
Elsa von Licy
 
Genomics_Aishwarya Teli.pptx
Genomics_Aishwarya Teli.pptxGenomics_Aishwarya Teli.pptx
Genomics_Aishwarya Teli.pptx
AishwaryaTeli5
 
Protein Microarrays: Approaches to Printing
Protein Microarrays: Approaches to PrintingProtein Microarrays: Approaches to Printing
Protein Microarrays: Approaches to Printing
SCHOTT
 
171017 giab for giab grc workshop
171017 giab for giab grc workshop171017 giab for giab grc workshop
171017 giab for giab grc workshop
GenomeInABottle
 
Seftah DNA fingerprint 2007NEW.ppt
Seftah DNA fingerprint 2007NEW.pptSeftah DNA fingerprint 2007NEW.ppt
Seftah DNA fingerprint 2007NEW.ppt
SamerPaser
 

Similar to Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformatics Course - Session 3.2 - VHIR, Barcelona) (20)

20100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_020100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_0
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
12 arrays
12 arrays12 arrays
12 arrays
 
12 arrays
12 arrays12 arrays
12 arrays
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
qPCR Design Strategies for Specific Applications
qPCR Design Strategies for Specific ApplicationsqPCR Design Strategies for Specific Applications
qPCR Design Strategies for Specific Applications
 
Axt microarrays
Axt microarraysAxt microarrays
Axt microarrays
 
Microarray @ujjwal sirohi
Microarray @ujjwal sirohiMicroarray @ujjwal sirohi
Microarray @ujjwal sirohi
 
A comprehensive study of microarray
A comprehensive study of microarrayA comprehensive study of microarray
A comprehensive study of microarray
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
 
Toxicogenomics: microarray
Toxicogenomics: microarrayToxicogenomics: microarray
Toxicogenomics: microarray
 
Cignal webina
Cignal webinaCignal webina
Cignal webina
 
Molecular profiling 2013
Molecular profiling 2013Molecular profiling 2013
Molecular profiling 2013
 
170120 giab stanford genetics seminar
170120 giab stanford genetics seminar170120 giab stanford genetics seminar
170120 giab stanford genetics seminar
 
Si rna 2013
Si rna 2013Si rna 2013
Si rna 2013
 
Genomics_Aishwarya Teli.pptx
Genomics_Aishwarya Teli.pptxGenomics_Aishwarya Teli.pptx
Genomics_Aishwarya Teli.pptx
 
Protein Microarrays: Approaches to Printing
Protein Microarrays: Approaches to PrintingProtein Microarrays: Approaches to Printing
Protein Microarrays: Approaches to Printing
 
171017 giab for giab grc workshop
171017 giab for giab grc workshop171017 giab for giab grc workshop
171017 giab for giab grc workshop
 
Seftah DNA fingerprint 2007NEW.ppt
Seftah DNA fingerprint 2007NEW.pptSeftah DNA fingerprint 2007NEW.ppt
Seftah DNA fingerprint 2007NEW.ppt
 

More from VHIR Vall d’Hebron Institut de Recerca

Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
VHIR Vall d’Hebron Institut de Recerca
 
Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...
Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...
Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...
VHIR Vall d’Hebron Institut de Recerca
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
VHIR Vall d’Hebron Institut de Recerca
 
Brief Overview to Amplicon Variant Analysis (UEB-UAT Bioinformatics Course - ...
Brief Overview to Amplicon Variant Analysis (UEB-UAT Bioinformatics Course - ...Brief Overview to Amplicon Variant Analysis (UEB-UAT Bioinformatics Course - ...
Brief Overview to Amplicon Variant Analysis (UEB-UAT Bioinformatics Course - ...
VHIR Vall d’Hebron Institut de Recerca
 
Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...
Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...
Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...
VHIR Vall d’Hebron Institut de Recerca
 
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
VHIR Vall d’Hebron Institut de Recerca
 
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
VHIR Vall d’Hebron Institut de Recerca
 
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
VHIR Vall d’Hebron Institut de Recerca
 
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
VHIR Vall d’Hebron Institut de Recerca
 
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
VHIR Vall d’Hebron Institut de Recerca
 
Introduction to Bioinformatics (UEB-UAT Bioinformatics Course - Session 1.1 -...
Introduction to Bioinformatics (UEB-UAT Bioinformatics Course - Session 1.1 -...Introduction to Bioinformatics (UEB-UAT Bioinformatics Course - Session 1.1 -...
Introduction to Bioinformatics (UEB-UAT Bioinformatics Course - Session 1.1 -...
VHIR Vall d’Hebron Institut de Recerca
 
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
VHIR Vall d’Hebron Institut de Recerca
 
Information management at vhir ueb using tiki-cms
Information management at vhir ueb using tiki-cmsInformation management at vhir ueb using tiki-cms
Information management at vhir ueb using tiki-cms
VHIR Vall d’Hebron Institut de Recerca
 
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
VHIR Vall d’Hebron Institut de Recerca
 
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de RT-qPCR
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de RT-qPCRCurso de Genómica - UAT (VHIR) 2012 - Análisis de datos de RT-qPCR
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de RT-qPCR
VHIR Vall d’Hebron Institut de Recerca
 
Curso de Genómica - UAT (VHIR) 2012 - RT-qPCR
Curso de Genómica - UAT (VHIR) 2012 - RT-qPCRCurso de Genómica - UAT (VHIR) 2012 - RT-qPCR
Curso de Genómica - UAT (VHIR) 2012 - RT-qPCR
VHIR Vall d’Hebron Institut de Recerca
 
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génicaCurso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
VHIR Vall d’Hebron Institut de Recerca
 
Curso de Genómica - UAT (VHIR) 2012 - Microarrays
Curso de Genómica - UAT (VHIR) 2012 - MicroarraysCurso de Genómica - UAT (VHIR) 2012 - Microarrays
Curso de Genómica - UAT (VHIR) 2012 - Microarrays
VHIR Vall d’Hebron Institut de Recerca
 
Curso de Genómica - UAT (VHIR) 2012 - Arrays de Proteínas Zeptosens
 Curso de Genómica - UAT (VHIR) 2012 - Arrays de Proteínas Zeptosens Curso de Genómica - UAT (VHIR) 2012 - Arrays de Proteínas Zeptosens
Curso de Genómica - UAT (VHIR) 2012 - Arrays de Proteínas Zeptosens
VHIR Vall d’Hebron Institut de Recerca
 
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de NGS
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de NGSCurso de Genómica - UAT (VHIR) 2012 - Análisis de datos de NGS
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de NGS
VHIR Vall d’Hebron Institut de Recerca
 

More from VHIR Vall d’Hebron Institut de Recerca (20)

Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
 
Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...
Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...
Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
 
Brief Overview to Amplicon Variant Analysis (UEB-UAT Bioinformatics Course - ...
Brief Overview to Amplicon Variant Analysis (UEB-UAT Bioinformatics Course - ...Brief Overview to Amplicon Variant Analysis (UEB-UAT Bioinformatics Course - ...
Brief Overview to Amplicon Variant Analysis (UEB-UAT Bioinformatics Course - ...
 
Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...
Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...
Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...
 
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
 
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
 
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
 
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
 
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
 
Introduction to Bioinformatics (UEB-UAT Bioinformatics Course - Session 1.1 -...
Introduction to Bioinformatics (UEB-UAT Bioinformatics Course - Session 1.1 -...Introduction to Bioinformatics (UEB-UAT Bioinformatics Course - Session 1.1 -...
Introduction to Bioinformatics (UEB-UAT Bioinformatics Course - Session 1.1 -...
 
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
 
Information management at vhir ueb using tiki-cms
Information management at vhir ueb using tiki-cmsInformation management at vhir ueb using tiki-cms
Information management at vhir ueb using tiki-cms
 
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
 
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de RT-qPCR
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de RT-qPCRCurso de Genómica - UAT (VHIR) 2012 - Análisis de datos de RT-qPCR
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de RT-qPCR
 
Curso de Genómica - UAT (VHIR) 2012 - RT-qPCR
Curso de Genómica - UAT (VHIR) 2012 - RT-qPCRCurso de Genómica - UAT (VHIR) 2012 - RT-qPCR
Curso de Genómica - UAT (VHIR) 2012 - RT-qPCR
 
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génicaCurso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
 
Curso de Genómica - UAT (VHIR) 2012 - Microarrays
Curso de Genómica - UAT (VHIR) 2012 - MicroarraysCurso de Genómica - UAT (VHIR) 2012 - Microarrays
Curso de Genómica - UAT (VHIR) 2012 - Microarrays
 
Curso de Genómica - UAT (VHIR) 2012 - Arrays de Proteínas Zeptosens
 Curso de Genómica - UAT (VHIR) 2012 - Arrays de Proteínas Zeptosens Curso de Genómica - UAT (VHIR) 2012 - Arrays de Proteínas Zeptosens
Curso de Genómica - UAT (VHIR) 2012 - Arrays de Proteínas Zeptosens
 
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de NGS
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de NGSCurso de Genómica - UAT (VHIR) 2012 - Análisis de datos de NGS
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de NGS
 

Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformatics Course - Session 3.2 - VHIR, Barcelona)

  • 1. Bioinformàtica per a la Recerca Biomèdica Ricardo Gonzalo Sanz ricardo.gonzalo@vhir.org 20/05/14 Hospital Universitari Vall d’Hebron Institut de Recerca - VHIR Institut d’Investigació Sanitària de l’Instituto de Salud Carlos III (ISCIII) Basic aspects of Microarray technology
  • 2. Outline: Introduction; Different types of arrays (manufacturing, DNA/RNA/protein); Different types of Affymetrix arrays; Affymetrix microarrays manufacture; Microarray experiment workflow; Quality controls.
  • 3. 1 Introduction • reproducibility • only shows you what you’re looking for • what about ‘indels’, inversions, translocations... • accuracy • sensitivity
  • 5. 1 Introduction • RNA-Seq was superior in detecting low-abundance transcripts • also better at differentiating biological isoforms • RNA-Seq demonstrated a broader dynamic range than microarrays.
  • 6. 1 Introduction • In molecular biology there are many techniques to measure gene expression (e.g. Northern blot) • The main novelty of microarrays (Schena et al. (1995) Science 270:467-70) was not what could be measured but the number of simultaneous measurements that could be made. • Before microarrays: genes were studied one by one • After microarrays: all genes are studied together.
  • 7. 1 Introduction • But... what is a microarray in a few words? • DNA fixed to a solid surface (nylon, silica, glass, ...) • The “problem” RNA (the sample) is labeled and has to bind specifically to the DNA fixed on the solid surface. • The surface-bound DNA is usually called the “probe” • The labeled RNA is usually called the “target”
  • 8. Important to know in advance... 1 Introduction • Microarrays are usually hypothesis-generating: they highlight specific genes or features that are particularly interesting for follow-up experiments. An exception would be biomarker discovery studies. • This does not reduce the importance of experimental design
  • 9. 2 Different types of arrays. Manufacturing. DNA/RNA. Two-color microarrays (cDNA) • Probes are usually long (typically hundreds of nucleotides) • The probe is fixed to a glass slide • Labeling uses two fluorochromes (Cy3/Cy5). • The two samples are compared directly because they are hybridized on the same array. • Each gene appears only a few times on the array • Long probes facilitate cross-hybridization • Reproducibility is not very good.
  • 10. 2 Different types of arrays. Manufacturing. DNA/RNA. One-color microarrays • Short probes (20-25 nt) • The target is labeled with a single fluorochrome • Only one sample is hybridized on each array. • Each gene is represented by many probes on the array
  • 11. 2 Different types of arrays. Manufacturing. DNA/RNA. DNA: • DNA polymorphism (GWAS) • Transcription factors • Resequencing • Cytogenetics. RNA: • Expression • Alternative splicing • microRNA
  • 12. 2 Different types of Affymetrix arrays. 3’ IVT Arrays • Biased measurement of gene expression • The most widely used array type in the literature, available for many species. Only genes with a polyA tail and a good 3’ site will be amplified and have the chance to hybridize correctly.
  • 13. 2 Different types of Affymetrix arrays. Gene/Exon Arrays (Gene Arrays, Exon Arrays) • Gene arrays are the most widely used (good quality/price ratio) • Gene arrays 2.0 have a more up-to-date library and also include lncRNAs
  • 14. 2 Different types of expression arrays. miRNA • 153 organisms on the array (human, mouse, rat, canine, …) • 100% miRBase v17 coverage • 2,216 snoRNAs and scaRNAs (human small nuclear RNAs) • Low input amounts (130 ng total RNA) • 2,999 probe sets unique to pre-miRNA hairpins • Able to differentiate pre- and mature miRNAs • Useful for FFPE samples
  • 15. 2 Different types of expression arrays. HTA array
  • 23. 6 Quality Controls Length of amplified cRNA
  • 24. 6 Quality Controls Length of fragmented cRNA
  • 25. Bioinformàtica per a la Recerca Biomèdica Ricardo Gonzalo Sanz ricardo.gonzalo@vhir.org 20/05/14 Hospital Universitari Vall d’Hebron Institut de Recerca - VHIR Institut d’Investigació Sanitària de l’Instituto de Salud Carlos III (ISCIII) Basic aspects of Microarray Data Analysis
  • 26. Outline: 1 Introduction. Experimental design; 2 Quality control; 3 Normalization; 4 Filtering; 5 Statistical inference of differential expression; 6 Clustering; 7 Annotation; 8 Biological interpretation
  • 32. 1 Introduction. Experimental design Microarrays Analysis Workflow
  • 34. 2 Quality Control Was the experiment a success? • Microarray experiments generate huge quantities of data • The standard statistical approach uses plots to check quality, because plots: ◦ show all data together ◦ highlight structures ◦ may help to detect problems (“unusual patterns”). It is hard to decide whether things “seem to be all right” just by looking at the numbers.
  • 35. 2 Quality Control Diagnostic plots for microarrays: • Microarray data are usually considered at two levels: 1. Low level: data coming directly from the scanner. 2. High level: data processed from the low-level data (expression values, normalized or not). • Some plots are specific to certain types of arrays or to one level
  • 36. 2 Quality Control Diagnostic plots for microarrays: 1. Low level: ◦ Layout image ◦ Degradation plots (only in 3’ IVT) ◦ Histogram/density plots ◦ PCA, boxplot 2. High level: ◦ MA plots ◦ Model-based plots (NUSE, RLE) ◦ PCA, boxplot
  • 37. 2 Quality Control Diagnostic plots for microarrays. Low level. Layout image.
  • 38. 2 Quality Control Diagnostic plots for microarrays. Low level. RNA degradation plot (3’ IVT arrays)
  • 39. 2 Quality Control Diagnostic plots for microarrays. Low level. Histogram/density plot
  • 40. 2 Quality Control Diagnostic plots for microarrays. Low level. Boxplot
  • 42. 2 Quality Control Diagnostic plots for microarrays. Low level. PCA
  • 43. 2 Quality Control Diagnostic plots for microarrays. Low level. PCA
  • 45. 2 Quality Control Diagnostic plots for microarrays. High level. RLE
  • 47. 2 Quality Control Diagnostic plots for microarrays. High level. NUSE
  • 48. 2 Quality Control Diagnostic plots for microarrays. High level. MA plots • MA plots allow pairwise comparison of the log-intensity of each array with a reference array and the identification of intensity-dependent biases. • The Y axis shows the log-ratio of the intensity of one array to the reference median array, called “M”, while the X axis shows the average log-intensity of both arrays, called “A”. • Probe levels are not expected to differ much, so we expect an MA plot centered on the Y=0 axis from low to high intensities.
  • 49. 2 Quality Control Diagnostic plots for microarrays. High level. MA plots
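The M and A values described on the MA-plot slide can be sketched in a few lines. This is a minimal illustration, not the course's own code: the function name `ma_values` and the toy intensities are assumptions, and the data are assumed to be already on the log2 scale. The reference is taken as the probe-wise median across all arrays.

```python
from statistics import median

def ma_values(log_intensities, array_index):
    """Return (A, M) lists for one array against the probe-wise median array.

    log_intensities: list of arrays, each a list of log2 intensities
    with the same probe order in every array.
    """
    n_probes = len(log_intensities[0])
    # Reference "median array": probe-wise median across all arrays.
    reference = [median(arr[p] for arr in log_intensities) for p in range(n_probes)]
    target = log_intensities[array_index]
    # M is the log-ratio (a difference on the log scale); A is the average log-intensity.
    m = [t - r for t, r in zip(target, reference)]
    a = [(t + r) / 2 for t, r in zip(target, reference)]
    return a, m

# Toy data (made up): 3 arrays x 4 probes, already log2-transformed.
arrays = [
    [8.0, 10.0, 12.0, 6.0],
    [8.5, 10.5, 12.5, 6.5],
    [7.5, 9.5, 11.5, 5.5],
]
a, m = ma_values(arrays, 1)
```

Plotting `m` against `a` gives the MA plot for that array. In this toy case every M value sits at the same offset from zero, which is the kind of systematic shift that normalization is meant to remove.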
  • 51. 3 Normalization The goal of normalization is to adjust for the effects that are due to variations in the technology rather than the biology.
  • 55. 4 Filtering • In a microarray experiment only a few hundred or a few thousand genes change their expression between conditions • The researcher wants to keep the number of tests/genes as low as possible while retaining the interesting genes in the selected subset. • If the truly differentially expressed genes are over-represented among those selected in the filtering step, the FDR associated with a given threshold of the test statistic will be lower because of the filtering. Genes that do not change introduce noise, so it is better to exclude them before the statistical analysis.
  • 56. 4 Filtering There are different types of filtering: • Annotation features (specific): ◦ Specific gene features (e.g. GO term, presence of transcriptional regulatory elements in promoters, etc.) ◦ Data derived from IPA • Signal features (non-specific): ◦ Percentage of intensities greater than a user-defined value ◦ Interquartile range (IQR) greater than a defined value
  • 57. 4 Filtering Signal filtering: this technique is based on removing genes that are deemed not expressed or unchanged according to a specific criterion chosen by the user.
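The two signal-based (non-specific) criteria listed above could look roughly like the following. This is a hedged sketch using only the Python standard library: the function names, gene names, expression values and thresholds are all hypothetical, and the slides do not say which quartile convention or cut-offs were used in practice.

```python
from statistics import quantiles

def iqr(values):
    """Interquartile range (Q3 - Q1) of one gene's expression values."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    return q3 - q1

def fraction_above(values, threshold):
    """Fraction of samples whose intensity exceeds a user-defined threshold."""
    return sum(v > threshold for v in values) / len(values)

def filter_by_iqr(expression, min_iqr):
    """Keep only genes whose IQR across samples is greater than min_iqr."""
    return {gene: vals for gene, vals in expression.items() if iqr(vals) > min_iqr}

# Toy log2 expression values (made up), one list of samples per gene.
expression = {
    "geneA": [7.0, 7.1, 7.0, 7.1, 7.0],  # nearly flat across samples
    "geneB": [5.0, 9.0, 5.5, 9.5, 7.0],  # varies strongly across samples
}
kept = filter_by_iqr(expression, min_iqr=0.5)
```

Neither criterion looks at the group labels, which is why this kind of filtering is called non-specific: it only removes genes that barely vary or are barely expressed.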
  • 59. 5 Statistical inference of differential expression • Indirect comparisons: 2 groups, unpaired • Direct comparisons: 2 groups, paired
  • 60. 5 Statistical inference of differential expression Limma package (Gordon Smyth)
  • 61. 5 Statistical inference of differential expression
  • 62. 5 Statistical inference of differential expression
  • 63. 5 Statistical inference of differential expression
  • 64. 5 Statistical inference of differential expression
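To make the unpaired/paired distinction concrete, here is a small sketch for a single gene. It is a deliberate simplification: limma fits linear models and uses moderated t-statistics, whereas this example computes ordinary t-statistics with the standard library. The function names and expression values are made up for illustration.

```python
from math import sqrt
from statistics import mean, variance

def unpaired_t(group1, group2):
    """Welch t-statistic for two independent groups."""
    se = sqrt(variance(group1) / len(group1) + variance(group2) / len(group2))
    return (mean(group1) - mean(group2)) / se

def paired_t(before, after):
    """Paired t-statistic: a one-sample t-test on the within-pair differences."""
    diffs = [a - b for b, a in zip(before, after)]
    return mean(diffs) / sqrt(variance(diffs) / len(diffs))

# Toy log2 expression of one gene in 4 subjects (made up), before and after treatment.
control = [7.0, 8.0, 9.0, 10.0]
treated = [7.5, 8.4, 9.6, 10.5]

t_unpaired = unpaired_t(treated, control)  # ignores which samples belong together
t_paired = paired_t(control, treated)      # uses the per-subject differences
```

With these numbers the subjects differ a lot from each other but respond consistently, so the paired statistic is far larger than the unpaired one. This is the practical reason for declaring the pairing in the design when samples are matched.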
  • 65. 6 Clustering Types: • Supervised clustering tries to find the best partition for data that belong to a known set of classes • Unsupervised clustering tries to define the number and size of the classes into which the transcription profiles can be fitted.
  • 67. 6 Clustering Hierarchical Clustering (HCL) • HCL is an agglomerative/divisive clustering method. • The iterative process continues until all groups are connected in a hierarchical tree. • Samples that are more similar to each other are placed closer together.
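The agglomerative variant of hierarchical clustering can be sketched as follows. The slides do not specify a distance measure or linkage rule, so Euclidean distance and average linkage are assumptions here, as are the function names, sample names and profiles.

```python
from itertools import combinations
from math import dist

def average_linkage(c1, c2, profiles):
    """Mean Euclidean distance over all cross-cluster pairs of samples."""
    total = sum(dist(profiles[i], profiles[j]) for i in c1 for j in c2)
    return total / (len(c1) * len(c2))

def agglomerate(profiles):
    """Repeatedly merge the two closest clusters; return the merge history."""
    clusters = [(name,) for name in profiles]  # start with one cluster per sample
    merges = []
    while len(clusters) > 1:
        c1, c2 = min(
            combinations(clusters, 2),
            key=lambda pair: average_linkage(pair[0], pair[1], profiles),
        )
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 + c2]
        merges.append((c1, c2))
    return merges

# Toy expression profiles (made up): two genes measured in four samples.
profiles = {
    "ctrl1": [1.0, 1.0], "ctrl2": [1.2, 0.9],
    "trt1": [5.0, 5.1], "trt2": [5.2, 4.9],
}
merges = agglomerate(profiles)
```

The merge history is what a dendrogram draws: the earliest merges join the most similar samples, and the last merge connects all groups into a single tree.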