SlideShare a Scribd company logo
1 of 71
Download to read offline
Bioinformàtica per a la
Recerca Biomèdica
Ricardo Gonzalo Sanz
ricardo.gonzalo@vhir.org
20/05/14
Hospital Universitari Vall d’Hebron
Institut de Recerca - VHIR
Institut d’Investigació Sanitària de l’Instituto de Salud Carlos III (ISCIII)
Basic aspects of Microarray
technology
Affymetrix microarrays manufacture.
2
3
4
5
6
Microarray experiment workflow.
Quality Controls.
Different types of Affymetrix arrays.
1 Introduction
Different types of arrays. Manufactoring. DNA/RNA/Protein
1 Introduction
 reproducibility
 only show you what you’re looking for
 what about ‘indels’, inversions, translocations...
 accuracy
 sensitivity
1 Introduction
1 Introduction
 RNA-Seq was superior in detecting low abundance transcripts
 also better detecting differentiating biologically isoforms
 RNA-Seq demonstrated a broader dynamic range than microarray.
1 Introduction
• In molecular biology exist a lot of techniques to measure the gene expression
(Northern blot)
• Main characteristic from the microarrays discovery (Schena et al. (1995)
Science 270:467-70), was not what could be measured, instead the quantity of
simultaneous measures that could be done.
• Pre microarrays time: study of genes was one by one
• Post microarrays time: all the genes together.
1 Introduction
• But.... what is a microarray in few words?
 DNA fixed to a solid surface (nylon, silica, glass,...)
 RNA “problem” is labeled and have to bind to DNA
fixed in the solid surface in an specific way.
 DNA binded usually is called “probe”
 Labeled RNA usually is called “target”
Important to know in advanced...
1 Introduction
• Microarrays are usually hypothesis-generating:
They highlight specific genes or features that are particularly
interesting for follow-up experiments.
An exception would be the biomarkers discovery studies.
• This does not reduce the importance of experimental design
2
Two color microarrays (cDNA)
• Usually probes are long (20nt)
• Probe is fixed to a glass
• Labeling is with two fluorocrom (Cy3/Cy5).
• Direct comparison of the two samples due
to they are hybridized in the same array.
• Each gene appear few times in the array
• Long probes facilitate crosshybridization
• Not very good reproducibility.
Different types of arrays. Manufactoring. DNA/RNA
2
One color microarrays
• Short probes (20-25 nt)
• Target is labeled with only one fluorocrom
• Only one sample is hybridized in each array.
• Each gene is represented by a lot of probes
in the array
Different types of arrays. Manufactoring. DNA/RNA
2 Different types of arrays. Manufactoring. DNA/RNA
• DNA Polymorphism (GWAS)
• Transcription Factors
• Resequencing
• Cytogenetics
• Expression
• Alternative splicing
• microRNA
DNA RNA
2 Different types of Affymetrix arrays.
3’5’
3’ IVT Arrays
• Biased measurement of the gene expression
• Array more used in the literature. A lot of species present.
Only genes with polyA tail and good 3’ site will
be amplified and will have the chance of
hybridize correctly.
2 Different types of Affymetrix arrays.
3’5’
Gene Arrays
Exon Arrays
Gene/Exon Arrays
• Gene arrays are the most used (good quality and price ratio)
• Gene arrays 2.0 more updated library and also includes lncRNAs
2 Different types of expression arrays.
•153 organisms in the array (human, mouse, rat, canine, ….)
•100% miRBase v17
•2.216 snoRNAs and scaRNAs (human small nuclear RNAs)
•Low inputs amounts (130 ng total RNA)
•2.999 probe sets unique to pre-miRNA hairpins
•Able to differentiate pre and mature miRNAs
•Useful for FFPE samples
miRNA
2 Different types of expression arrays.
HTA array
Affymetrix microarrays manufacture.3
Photolitografy
Affymetrix microarrays manufacture.3
5 Microarray experiment workflow
5 Microarray experiment workflow
5 Microarray experiment workflow
6 Quality Controls
6 Quality Controls
6 Quality Controls
Length of amplified cRNA
6 Quality Controls
Length of fragmented cRNA
Bioinformàtica per a la
Recerca Biomèdica
Ricardo Gonzalo Sanz
ricardo.gonzalo@vhir.org
20/05/14
Hospital Universitari Vall d’Hebron
Institut de Recerca - VHIR
Institut d’Investigació Sanitària de l’Instituto de Salud Carlos III (ISCIII)
Basic aspects of Microarray
Data Analysis
Filtering
2
3
4
5
6
Statistical inference of diferential expression
Clustering
Normalization
1 Introduction. Experimental design
Quality control
7
8
Annotation
Biological interpretation
1 Introduction. Experimental design
1 Introduction. Experimental design
1 Introduction. Experimental design
1 Introduction. Experimental design
1 Introduction. Experimental design
1 Introduction. Experimental design
Microarrays Analysis
Workflow
2 Quality Control
2 Quality Control
Was the experiment a success???
• Microarray experiments generate huge quantitites of data
• Standard statistical approach use plots to check the quality
 show all data together
 highlight structures
 may help to detect problems (“unusual patterns”)
It is hard to decide if things “seem to be
all right” just by looking at the numbers.
2 Quality Control
Diagnostics plots for microarrays:
• Microarray data usually considered at two levels
1. Low level. Data directly coming from the scanner
2. High level. Processed from low level data. Expression values,
normalized or not.
• Some plots are specific for some type of arrays or for some level
2 Quality Control
Diagnostics plots for microarrays:
1. Low level:
 Layout image
 Degradation plots (only in 3’IVT)
 Histogram/density plots
 PCA, Boxplot
2. High level:
 MA plots
 Model based plots (NUSE,RLE,)
 PCA, Boxplot
2 Quality Control
Diganostics plots for microarrays. Low level. Layout image.
2 Quality Control
Diagnostic plots for microarrays. Low level. RNA degradation plot (3’IVT arrays)
2 Quality Control
Diagnostics plots for microarrays. Low level. Histogram/density Plot
2 Quality Control
Diagnostics plots for microarrays. Low level. Boxplot
2 Quality Control
2 Quality Control
Diagnostics plots for microarrays. Low level. PCA
2 Quality Control
Diagnostics plots for microarrays. Low level. PCA
2 Quality Control
2 Quality Control
Diagnostics plots for microarrays. High level. RLE
2 Quality Control
2 Quality Control
Diagnostics plots for microarrays. High level. NUSE
2 Quality Control
Diagnostics plots for microarrays. High level. MA plots
• MA plots allow pair wise comparison of log-intensity of each array to a
reference array and identification of intensity-dependent biases.
• The Y axis of the plot contains the log-ratio intentsity of one array to the
reference median array, which is called “M” while the X axis contains the
average log-intensity of both arrays – called “A”.
• The probe levels are not likely to differ a lot so we expect a MA plot centered
on the Y=0 axis from low to high intensities.
2 Quality Control
Diagnostics plots for microarrays. High level. MA plots
2 Quality Control
3 Normalization
The goal of normalization is to adjust for the effects that are due to variations in the
technology rather than the biology.
3 Normalization
3 Normalization
3 Normalization
4 Filtering
• In a microarray experiment only a few hundreds/thousand of genes change their
expression due to the different conditions
•Researcher is interested in keeping the number of tests/genes as low as possible
while keeping the interesting genes in the selected subset.
•If the truly diferentially expressed genes are over-represented among those
selectec in the filtering step, the FDR associated with a certain threshold of the
statistic test will be lowered due to the filtering.
Genes that do not change introduce
noise, therefore is better not to be
present when the statistical analysis is
done
4 Filtering
Exists different types of filtering:
• Annotation features (specific):
 Specific gene features (i.e. GO term, presence of transcriptional regulative
elements in promoters, etc.)
Data derived from IPA
• Signal features (non specific)
 % intensities greater of a user defined value
 Interquantile range (IQR) greater of a defined value
4 Filtering
Signal filtering: This technique has as its premise the removal of genes that are
deemed to be not expressed or unchanged according to some specific criterion that
is under the control of the user.
5 Statistical inference of diferential expression
• Indirect comparisons: 2 groups, unpaired
• Direct comparsions: 2 groups. paired
5 Statistical inference of diferential expression
Limma package (Gordon Smith)
5 Statistical inference of diferential expression
5 Statistical inference of diferential expression
5 Statistical inference of diferential expression
5 Statistical inference of diferential expression
6 Clustering
Types:
 Supervised clustering try to find the best partition for data that belong to a
know set o classes
 Unsupervised clustering try to define the number and the size of the classes
in which the transcription profiles can be fitted in.
6 Clustering
6 Clustering
Hierarchical Clustering (HCL)
• HCL is an agglomerative /divise clustering method.
• The iterative process continues until all groups are
connected in a hierarchical tree.
• Samples more similar between them are closed.
6 Clustering
7 Annotation
8 Biological interpretation
Gene Ontology
8 Biological interpretation

More Related Content

What's hot

What's hot (20)

20160308 dtl ngs_focus_group_meeting_slideshare
20160308 dtl ngs_focus_group_meeting_slideshare20160308 dtl ngs_focus_group_meeting_slideshare
20160308 dtl ngs_focus_group_meeting_slideshare
 
Introduction to Next Generation Sequencing
Introduction to Next Generation SequencingIntroduction to Next Generation Sequencing
Introduction to Next Generation Sequencing
 
NGS Targeted Enrichment Technology in Cancer Research: NGS Tech Overview Webi...
NGS Targeted Enrichment Technology in Cancer Research: NGS Tech Overview Webi...NGS Targeted Enrichment Technology in Cancer Research: NGS Tech Overview Webi...
NGS Targeted Enrichment Technology in Cancer Research: NGS Tech Overview Webi...
 
Nanopore Sequencing
Nanopore SequencingNanopore Sequencing
Nanopore Sequencing
 
Single cell RNA sequencing; Methods and applications
Single cell RNA sequencing; Methods and applicationsSingle cell RNA sequencing; Methods and applications
Single cell RNA sequencing; Methods and applications
 
DNA microarray ppt
DNA microarray pptDNA microarray ppt
DNA microarray ppt
 
NGS.pptx
NGS.pptxNGS.pptx
NGS.pptx
 
Microarray technology, biochip, DNA chip
Microarray technology, biochip, DNA chip Microarray technology, biochip, DNA chip
Microarray technology, biochip, DNA chip
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
Rna seq
Rna seqRna seq
Rna seq
 
Sanger sequencing
Sanger sequencingSanger sequencing
Sanger sequencing
 
Microarray
MicroarrayMicroarray
Microarray
 
Microarray
MicroarrayMicroarray
Microarray
 
NANOPORE SEQUENCING
NANOPORE SEQUENCINGNANOPORE SEQUENCING
NANOPORE SEQUENCING
 
Illumina (sequencing by synthesis) method
Illumina (sequencing by synthesis) methodIllumina (sequencing by synthesis) method
Illumina (sequencing by synthesis) method
 
Introduction to next generation sequencing
Introduction to next generation sequencingIntroduction to next generation sequencing
Introduction to next generation sequencing
 
Microarray technique
Microarray techniqueMicroarray technique
Microarray technique
 
SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
DNA microarray final ppt.
DNA microarray final ppt.DNA microarray final ppt.
DNA microarray final ppt.
 

Similar to Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformatics Course - Session 3.2 - VHIR, Barcelona)

20100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_020100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_0
Computer Science Club
 
Protein Microarrays: Approaches to Printing
Protein Microarrays: Approaches to PrintingProtein Microarrays: Approaches to Printing
Protein Microarrays: Approaches to Printing
SCHOTT
 

Similar to Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformatics Course - Session 3.2 - VHIR, Barcelona) (20)

20100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_020100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_0
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
12 arrays
12 arrays12 arrays
12 arrays
 
12 arrays
12 arrays12 arrays
12 arrays
 
DNA Microarray introdution and application
DNA Microarray introdution and applicationDNA Microarray introdution and application
DNA Microarray introdution and application
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
qPCR Design Strategies for Specific Applications
qPCR Design Strategies for Specific ApplicationsqPCR Design Strategies for Specific Applications
qPCR Design Strategies for Specific Applications
 
Axt microarrays
Axt microarraysAxt microarrays
Axt microarrays
 
Microarray @ujjwal sirohi
Microarray @ujjwal sirohiMicroarray @ujjwal sirohi
Microarray @ujjwal sirohi
 
A comprehensive study of microarray
A comprehensive study of microarrayA comprehensive study of microarray
A comprehensive study of microarray
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
 
Toxicogenomics: microarray
Toxicogenomics: microarrayToxicogenomics: microarray
Toxicogenomics: microarray
 
Cignal webina
Cignal webinaCignal webina
Cignal webina
 
Molecular profiling 2013
Molecular profiling 2013Molecular profiling 2013
Molecular profiling 2013
 
170120 giab stanford genetics seminar
170120 giab stanford genetics seminar170120 giab stanford genetics seminar
170120 giab stanford genetics seminar
 
Si rna 2013
Si rna 2013Si rna 2013
Si rna 2013
 
Genomics_Aishwarya Teli.pptx
Genomics_Aishwarya Teli.pptxGenomics_Aishwarya Teli.pptx
Genomics_Aishwarya Teli.pptx
 
Protein Microarrays: Approaches to Printing
Protein Microarrays: Approaches to PrintingProtein Microarrays: Approaches to Printing
Protein Microarrays: Approaches to Printing
 
171017 giab for giab grc workshop
171017 giab for giab grc workshop171017 giab for giab grc workshop
171017 giab for giab grc workshop
 

More from VHIR Vall d’Hebron Institut de Recerca

Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génicaCurso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
VHIR Vall d’Hebron Institut de Recerca
 

More from VHIR Vall d’Hebron Institut de Recerca (20)

Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
 
Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...
Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...
Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
 
Brief Overview to Amplicon Variant Analysis (UEB-UAT Bioinformatics Course - ...
Brief Overview to Amplicon Variant Analysis (UEB-UAT Bioinformatics Course - ...Brief Overview to Amplicon Variant Analysis (UEB-UAT Bioinformatics Course - ...
Brief Overview to Amplicon Variant Analysis (UEB-UAT Bioinformatics Course - ...
 
Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...
Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...
Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...
 
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
 
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
 
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
 
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
 
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
 
Introduction to Bioinformatics (UEB-UAT Bioinformatics Course - Session 1.1 -...
Introduction to Bioinformatics (UEB-UAT Bioinformatics Course - Session 1.1 -...Introduction to Bioinformatics (UEB-UAT Bioinformatics Course - Session 1.1 -...
Introduction to Bioinformatics (UEB-UAT Bioinformatics Course - Session 1.1 -...
 
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
 
Information management at vhir ueb using tiki-cms
Information management at vhir ueb using tiki-cmsInformation management at vhir ueb using tiki-cms
Information management at vhir ueb using tiki-cms
 
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
 
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de RT-qPCR
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de RT-qPCRCurso de Genómica - UAT (VHIR) 2012 - Análisis de datos de RT-qPCR
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de RT-qPCR
 
Curso de Genómica - UAT (VHIR) 2012 - RT-qPCR
Curso de Genómica - UAT (VHIR) 2012 - RT-qPCRCurso de Genómica - UAT (VHIR) 2012 - RT-qPCR
Curso de Genómica - UAT (VHIR) 2012 - RT-qPCR
 
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génicaCurso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
 
Curso de Genómica - UAT (VHIR) 2012 - Microarrays
Curso de Genómica - UAT (VHIR) 2012 - MicroarraysCurso de Genómica - UAT (VHIR) 2012 - Microarrays
Curso de Genómica - UAT (VHIR) 2012 - Microarrays
 
Curso de Genómica - UAT (VHIR) 2012 - Arrays de Proteínas Zeptosens
 Curso de Genómica - UAT (VHIR) 2012 - Arrays de Proteínas Zeptosens Curso de Genómica - UAT (VHIR) 2012 - Arrays de Proteínas Zeptosens
Curso de Genómica - UAT (VHIR) 2012 - Arrays de Proteínas Zeptosens
 
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de NGS
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de NGSCurso de Genómica - UAT (VHIR) 2012 - Análisis de datos de NGS
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de NGS
 

Recently uploaded

Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Sérgio Sacani
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Lokesh Kothari
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
AlMamun560346
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
RizalinePalanog2
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
RohitNehra6
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
Lokesh Kothari
 

Recently uploaded (20)

Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
 

Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformatics Course - Session 3.2 - VHIR, Barcelona)

  • 1. Bioinformàtica per a la Recerca Biomèdica Ricardo Gonzalo Sanz ricardo.gonzalo@vhir.org 20/05/14 Hospital Universitari Vall d’Hebron Institut de Recerca - VHIR Institut d’Investigació Sanitària de l’Instituto de Salud Carlos III (ISCIII) Basic aspects of Microarray technology
  • 2. Affymetrix microarrays manufacture. 2 3 4 5 6 Microarray experiment workflow. Quality Controls. Different types of Affymetrix arrays. 1 Introduction Different types of arrays. Manufactoring. DNA/RNA/Protein
  • 3. 1 Introduction  reproducibility  only show you what you’re looking for  what about ‘indels’, inversions, translocations...  accuracy  sensitivity
  • 5. 1 Introduction  RNA-Seq was superior in detecting low abundance transcripts  also better detecting differentiating biologically isoforms  RNA-Seq demonstrated a broader dynamic range than microarray.
  • 6. 1 Introduction • In molecular biology exist a lot of techniques to measure the gene expression (Northern blot) • Main characteristic from the microarrays discovery (Schena et al. (1995) Science 270:467-70), was not what could be measured, instead the quantity of simultaneous measures that could be done. • Pre microarrays time: study of genes was one by one • Post microarrays time: all the genes together.
  • 7. 1 Introduction • But.... what is a microarray in few words?  DNA fixed to a solid surface (nylon, silica, glass,...)  RNA “problem” is labeled and have to bind to DNA fixed in the solid surface in an specific way.  DNA binded usually is called “probe”  Labeled RNA usually is called “target”
  • 8. Important to know in advanced... 1 Introduction • Microarrays are usually hypothesis-generating: They highlight specific genes or features that are particularly interesting for follow-up experiments. An exception would be the biomarkers discovery studies. • This does not reduce the importance of experimental design
  • 9. 2 Two color microarrays (cDNA) • Usually probes are long (20nt) • Probe is fixed to a glass • Labeling is with two fluorocrom (Cy3/Cy5). • Direct comparison of the two samples due to they are hybridized in the same array. • Each gene appear few times in the array • Long probes facilitate crosshybridization • Not very good reproducibility. Different types of arrays. Manufactoring. DNA/RNA
  • 10. 2 One color microarrays • Short probes (20-25 nt) • Target is labeled with only one fluorocrom • Only one sample is hybridized in each array. • Each gene is represented by a lot of probes in the array Different types of arrays. Manufactoring. DNA/RNA
  • 11. 2 Different types of arrays. Manufactoring. DNA/RNA • DNA Polymorphism (GWAS) • Transcription Factors • Resequencing • Cytogenetics • Expression • Alternative splicing • microRNA DNA RNA
  • 12. 2 Different types of Affymetrix arrays. 3’5’ 3’ IVT Arrays • Biased measurement of the gene expression • Array more used in the literature. A lot of species present. Only genes with polyA tail and good 3’ site will be amplified and will have the chance of hybridize correctly.
  • 13. 2 Different types of Affymetrix arrays. 3’5’ Gene Arrays Exon Arrays Gene/Exon Arrays • Gene arrays are the most used (good quality and price ratio) • Gene arrays 2.0 more updated library and also includes lncRNAs
  • 14. 2 Different types of expression arrays. •153 organisms in the array (human, mouse, rat, canine, ….) •100% miRBase v17 •2.216 snoRNAs and scaRNAs (human small nuclear RNAs) •Low inputs amounts (130 ng total RNA) •2.999 probe sets unique to pre-miRNA hairpins •Able to differentiate pre and mature miRNAs •Useful for FFPE samples miRNA
  • 15. 2 Different types of expression arrays. HTA array
  • 23. 6 Quality Controls Length of amplified cRNA
  • 24. 6 Quality Controls Length of fragmented cRNA
  • 25. Bioinformàtica per a la Recerca Biomèdica Ricardo Gonzalo Sanz ricardo.gonzalo@vhir.org 20/05/14 Hospital Universitari Vall d’Hebron Institut de Recerca - VHIR Institut d’Investigació Sanitària de l’Instituto de Salud Carlos III (ISCIII) Basic aspects of Microarray Data Analysis
  • 26. Filtering 2 3 4 5 6 Statistical inference of diferential expression Clustering Normalization 1 Introduction. Experimental design Quality control 7 8 Annotation Biological interpretation
  • 32. 1 Introduction. Experimental design Microarrays Analysis Workflow
  • 34. 2 Quality Control Was the experiment a success??? • Microarray experiments generate huge quantitites of data • Standard statistical approach use plots to check the quality  show all data together  highlight structures  may help to detect problems (“unusual patterns”) It is hard to decide if things “seem to be all right” just by looking at the numbers.
  • 35. 2 Quality Control Diagnostics plots for microarrays: • Microarray data usually considered at two levels 1. Low level. Data directly coming from the scanner 2. High level. Processed from low level data. Expression values, normalized or not. • Some plots are specific for some type of arrays or for some level
  • 36. 2 Quality Control Diagnostics plots for microarrays: 1. Low level:  Layout image  Degradation plots (only in 3’IVT)  Histogram/density plots  PCA, Boxplot 2. High level:  MA plots  Model based plots (NUSE,RLE,)  PCA, Boxplot
  • 37. 2 Quality Control Diganostics plots for microarrays. Low level. Layout image.
  • 38. 2 Quality Control Diagnostic plots for microarrays. Low level. RNA degradation plot (3’IVT arrays)
  • 39. 2 Quality Control Diagnostics plots for microarrays. Low level. Histogram/density Plot
  • 40. 2 Quality Control Diagnostics plots for microarrays. Low level. Boxplot
  • 42. 2 Quality Control Diagnostics plots for microarrays. Low level. PCA
  • 43. 2 Quality Control Diagnostics plots for microarrays. Low level. PCA
  • 45. 2 Quality Control Diagnostics plots for microarrays. High level. RLE
  • 47. 2 Quality Control Diagnostics plots for microarrays. High level. NUSE
  • 48. 2 Quality Control Diagnostics plots for microarrays. High level. MA plots • MA plots allow pair wise comparison of log-intensity of each array to a reference array and identification of intensity-dependent biases. • The Y axis of the plot contains the log-ratio intentsity of one array to the reference median array, which is called “M” while the X axis contains the average log-intensity of both arrays – called “A”. • The probe levels are not likely to differ a lot so we expect a MA plot centered on the Y=0 axis from low to high intensities.
  • 49. 2 Quality Control Diagnostics plots for microarrays. High level. MA plots
  • 51. 3 Normalization The goal of normalization is to adjust for the effects that are due to variations in the technology rather than the biology.
  • 55. 4 Filtering • In a microarray experiment only a few hundreds/thousand of genes change their expression due to the different conditions •Researcher is interested in keeping the number of tests/genes as low as possible while keeping the interesting genes in the selected subset. •If the truly diferentially expressed genes are over-represented among those selectec in the filtering step, the FDR associated with a certain threshold of the statistic test will be lowered due to the filtering. Genes that do not change introduce noise, therefore is better not to be present when the statistical analysis is done
  • 56. 4 Filtering Exists different types of filtering: • Annotation features (specific):  Specific gene features (i.e. GO term, presence of transcriptional regulative elements in promoters, etc.) Data derived from IPA • Signal features (non specific)  % intensities greater of a user defined value  Interquantile range (IQR) greater of a defined value
  • 57. 4 Filtering Signal filtering: This technique has as its premise the removal of genes that are deemed to be not expressed or unchanged according to some specific criterion that is under the control of the user.
  • 58.
  • 59. 5 Statistical inference of diferential expression • Indirect comparisons: 2 groups, unpaired • Direct comparsions: 2 groups. paired
  • 60. 5 Statistical inference of diferential expression Limma package (Gordon Smith)
  • 61. 5 Statistical inference of diferential expression
  • 62. 5 Statistical inference of diferential expression
  • 63. 5 Statistical inference of diferential expression
  • 64. 5 Statistical inference of diferential expression
  • 65. 6 Clustering Types:  Supervised clustering try to find the best partition for data that belong to a know set o classes  Unsupervised clustering try to define the number and the size of the classes in which the transcription profiles can be fitted in.
  • 67. 6 Clustering Hierarchical Clustering (HCL) • HCL is an agglomerative /divise clustering method. • The iterative process continues until all groups are connected in a hierarchical tree. • Samples more similar between them are closed.