SlideShare a Scribd company logo
1 of 49
Proteomics repositories
Dr. Juan Antonio Vizcaíno
PRIDE Group Coordinator
Proteomics Services Team
EMBL-EBI
Hinxton, Cambridge, UK
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
• Why sharing MS proteomics data?
• Types of information stored in MS proteomics
repositories.
• Main existing repositories and their main
characteristics
• No data reprocessing
• Data reprocessing
• Recently developed resources
• Other resources
Overview
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Genomics
Transcriptomics
Proteomics
From the genome to the proteome
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Corresponding public repositories
Genomics
Transcript-
omics
Proteomics
DNA sequence databases
(GenBank, EMBL, DDJB)
ArrayExpress (EBI), GEO (NCBI)
MS proteomics resources (ProteomeXchange)
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Data sharing in Proteomics
• Proteomics data can be very complex and its interpretation is
often troublesome and/or controversial.
• In other ‘omics’ fields, data sharing ‘culture’ is well established.
Generally, it is considered to be a good scientific practise.
• In proteomics, the ‘culture’ is definitely evolving in that direction.
A big shift is happening in the last few years.
• Scientific journals and funding agencies are two of the main
drivers.
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
• 1) Data producers are not always the best data analysts
Sharing of data allows analysts access to real data, and in turn allows
better analysis tools to be developed
• 2) Meta-analysis of data can recycle previous findings for new
tasks
Putting findings in the context of other findings increases their scope
• 3) Sharing data allows independent review of the findings
When actual replication of an experiment is often impossible, a re-
analysis or spot checks on the obtained data become vitally important
• 4) Direct benefit for the field
Development of fragmentation models, spectral libraries, SRM
assays, ...
Data sharing. Why?
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
What is a proteomics publication in 2015?
• Proteomics studies generate potentially large amounts of
data and results.
• Ideally, a proteomics publication needs to:
• Summarize the results of the study
• Provide supporting information for reliability of any
results reported
• Information in a publication:
• Manuscript
• Supplementary material
• Associated data submitted to a public repository
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
• Why sharing MS proteomics data?
• Types of information stored in MS
proteomics repositories
• Main existing repositories and their main
characteristics
• No data reprocessing
• Data reprocessing
• Recently developed resources
• Other resources
Overview
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Main types of information stored
• 1) Original experimental data recorded by the mass
spectrometer (primary data) -. Raw data and peak lists.
• 2) Identification results inferred from the original primary
data
• 3) Quantification information
• 4) Experimental and technical metadata
• 5) Any other type of information (e.g. scripts)
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Data types/ PSI standard formats
• mzTabFinal Results
• TraMLSRM
• mzQuantMLQuantitation
• mzIdentMLIdentification
• mzMLMS data
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
• Why sharing MS proteomics data?
• Types of information stored in MS proteomics
repositories.
• Main existing repositories and their main
characteristics
• No data reprocessing
• Data reprocessing
• Recently developed resources
• Other resources
Overview
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
• Main public MS-based proteomics repositories:
- PROteomics IDEntifications database (PRIDE Archive, EBI)
- Global Proteome Machine (GPMDB)
- PeptideAtlas (ISB, Seattle)
• Many others, more specialized:
Among others: Human Proteinpedia, Genome Annotation Proteomics Pipeline
(GAPP),…
• Recently developed ones: ProteomicsDB, CHORUS, MassIVE, iProx.
• Very diverse: different aims, functionalities,… but also complementary.
• Main focus is MS/MS data.
Proteomics repositories
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Proteomics repositories (2)
• Many different workflows need to be supported. They provide
complementary ‘views’.
• No data reprocessing. Data is stored as ‘published’ or
originally analysed:
• PRIDE Archive (MS/MS data)
• MassIVE (MS/MS data)
• PASSEL (SRM data)
• Data reprocessing (MS/MS data):
• PeptideAtlas
• GPMDB
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
• Why sharing MS proteomics data?
• Types of information stored in MS proteomics
repositories.
• Main existing repositories and their main
characteristics
• No data reprocessing
• Data reprocessing
• Recently developed resources
• Other resources
Overview
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Resources that don’t reprocess data
1) Resources that try to represent the authors’ analysis
view on the data.
• Various workflows are allowed and they can provide
complementary results.
• Data are not ‘updated’ in time. However, meta-analysis
on top is possible.
• Accumulation of FDRs when datasets are combined.
• Main representatives: PRIDE Archive and MassIVE
(MS/MS data) and PeptideAtlas/PASSEL (SRM data).
• Data standards are essential.
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
PRIDE (PRoteomics IDEntifications) Archive
http://www.ebi.ac.uk/pride
• PRIDE stores mass spectrometry based
proteomics data:
• Peptide and protein expression data
(identification and quantification)
• Post-translational modifications
• Mass spectra (raw data and peak
lists)
• Technical and biological metadata
• Any other related information
• Full support for tandem MS approaches
Martens et al., Proteomics, 2005
Vizcaíno et al., NAR, 2013
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
MassIVE (UCSD)
• Mass spectrometry Interactive Virtual Environment
• Project led by Nuno Bandeira (Center for Computational Mass
Spectrometry, UCSD)
• Dataset storage and data submission
• MassIVE 1.0 – Tranche-like functionality
• Imported all data from Tranche
• Under development (they want to explore interaction among
users). Not published yet.
http://proteomics.ucsd.edu/ProteoSAFe/datasets.jsp
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
MassIVE (UCSD)
http://proteomics.ucsd.edu/service/massive/
• Data repository for MS proteomics data
• Tools available for users to analyse their own data
• Joined ProteomeXchange on June 2014.
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
• Suitable for SRM assays
• Use the PSI standard TraML
plus the output of the most
popular vendor pipelines
• Just started in 2012
• Part of the PX consortium
http://www.peptideatlas.org/passel/
Farrah et al., Proteomics, 2012
PASSEL: repository for SRM data
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
• Why sharing MS proteomics data?
• Types of information stored in MS proteomics
repositories.
• Main existing repositories and their main
characteristics
• No data reprocessing
• Data reprocessing
• Recently developed resources
• Other resources
Overview
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Proteomics repositories (2)
17/12/2015 21
• Many different workflows need to be supported. They provide
complementary ‘views’.
• No data reprocessing. Data is stored as ‘published’ or
originally analysed:
• PRIDE (MS/MS data)
• MassIVE (MS/MS data)
• PASSEL (SRM data)
• Data reprocessing (MS/MS data):
• PeptideAtlas
• GPMDB
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Reprocessing repositories
• These resources collect MS raw data and reprocess it using
one given analysis pipeline, and an up to date protein
sequence database.
• Advantage: They provide a ‘standardized’ and updated view
on the experimental data available.
• Only one common analysis method is used and there can be
information loss.
• Different from the author’s view on the data.
• Main resources: GPMDB and PeptideAtlas (ISB, Seattle).
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
http://www.peptideatlas.org
- Developed at the Institute for Systems
Biology (ISB, Seattle, USA)
- Peptide identifications from MS/MS
approaches
- Data are reprocessed using the popular
Trans Proteomic Pipeline (TPP)
- Uses PeptideProphet to derive a
probability for the correct identification for
all contained peptides
PeptideAtlas
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
• All peptides IDs are mapped to
Ensembl using ProteinProphet
(to handle protein inference)
• Provides proteotypic peptide
predictions
• Limited metadata available
• Part of the HPP project
Deutsch et al., Proteomics, 2005
Desiere et al., NAR, 2006.
Deutsch et al., EMBO Rep, 2008
PeptideAtlas
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Builds are updated in a regular basis (usually once a
year)
Examples of builds:
- Human (HPP context)
- Human plasma
- Human urine
- Drosophila
- Mouse
- Mouse plasma
- Cow
- Yeast
…
PeptideAtlas builds
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
• Originally developed by R.
Beavis & R. Craig
• End point of the GPM
proteomics pipeline, to aid in
the process of validating
peptide MS/MS spectra and
protein coverage patterns.
http://gpmdb.thegpm.org/ Craig et al., J Proteome Res, 2004
GPMDB (Global Proteome Machine DB)
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
• Data are reprocessed using
the popular X!Tandem or
X!Hunter spectral searching
algorithm
• Also provides proteotypic
peptides
GPMDB
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
• Nice visualization features
• Provides very limited
annotation with GO, BTO
• Some support to targeted
approaches is available
• Part of the HPP consortium
GPMDB
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
http://thehpp.org/
The Human Proteome Project (HPP)
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Reprocesses data Reprocesses data No reprocessing
Editorial control Editorial control No editorial control
Limited annotation Limited annotation Detailed annotation
Main MS proteomics resources
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Reprocesses data Reprocesses data No reprocessing
Editorial control Editorial control No editorial control
Limited annotation Limited annotation Detailed annotation
Main MS proteomics resources
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
• Why sharing MS proteomics data?
• Types of information stored in MS proteomics
repositories.
• Main existing repositories and their main
characteristics
• No data reprocessing
• Data reprocessing
• Recently developed resources
• Other resources
Overview
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Draft Human proteome papers published in 2014
Wilhelm et al., Nature, 2014 Kim et al., Nature, 2014
•Two independent groups claimed to have produced the
first complete draft of the human proteome by MS.
• Some of their findings are controversial and need further
validation… but generated a lot of discussion and put
proteomics in the spotlight.
•Two proteomics resources have been developed:
proteomicsDB and the Human Proteome Map (HPM).Nature cover 29 May 2014
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
ProteomicsDB https://www.proteomicsdb.org/
• Data analysis using Mascot and MaxQuant
• The way the Protein FDR is calculated is controversial
•Quantification information using label free techniques
•New datasets are added in a regular basis
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
ProteomicsDB (2)
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Human Proteome Map (HPM)
• Developed by the Pandey group.
•Data reanalysis using Mascot.
• Protein FDR is not mentioned at all
in the corresponding Nature paper.
http://www.humanproteomemap.org/
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Chorus
https://chorusproject.org/pages/ind
ex.html
• Developed by M. MacCoss’ group
• Built on top of Amazon cloud
technologies
• Provides data analysis capabilities for
the users
• Free for public datasets. A fee needs
to be paid for storing private
information.
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
• Why sharing MS proteomics data?
• Types of information stored in MS proteomics
repositories.
• Main existing repositories and their main
characteristics
• No data reprocessing
• Data reprocessing
• Recently developed resources
• Other resources
Overview
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
MaxQB
Human Proteinpedia
Other repositories
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
COPaKB
Cardiac Organellar Protein Atlas Knowledgebase
International collaboration (EMBL-EBI involved)
Windows Client and iPad App
Submit data for analysis in dta and mzML formats
Data submitted to a ProLuCID pipeline
No MS data download
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
CPTAC data portal
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Pep2pro (Arabidopsis)
http://fgcz-pep2pro.uzh.ch/
Centered on Arabidopsis data
Download spectra by spectra
Quantitative information
Linked to gelmap.de (2DE)
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Example of a repository of supporting data
annotation: Steve Gygi’s lab Supporting data from publications
Spectra annotation results
and PTM evaluation data
Quantitative data
No data downloads
https://gygi.med.harvard.edu/phosphomouse
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
FINAL THOUGHTS
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Why are repositories not more popular?
1. Don’t want to share data
• Researchers don’t like to be shown that they did not analyze the
data as well as they could have.
• Their FDR may be higher than they reported/think.
• Researchers are worried that they missed something in the data
that they could discover if they go back to it at a later date
• Don’t want other authors to get a publication from their data.
• However, this philosophy is changing gradually…
Slide from R. Chalkley
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Why are repositories not more popular? (2)
2. Submission burden
• Getting data into correct format may require some work
• Author is not necessarily computer-savvy
• Having to also supply metadata is seen as a burden, if the
information is already present in an associated manuscript
• Associated raw data may be many GB in size; file transfer to
repository could take a while
Authors are impatient: want to spend time doing science, not
administration!
Slide from R. Chalkley
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
• Importance of sharing MS proteomics data
• The main existing proteomics repositories are
complementary in focus and functionality
• Main characteristics of:
• PeptideAtlas and GPMDB (Reprocess data)
• PASSEL, MassIVE and PRIDE Archive (at
present they do not reprocess data).
• New resources: proteomicsDB, HPM, Chorus
Conclusions
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
• Vizcaíno et al., J. Proteomics, 2010. PMID: 20615486
• Perez-Riverol et al., Proteomics, 2015. PMID: 25158685
Recommended reading
Juan A. Vizcaíno
juan@ebi.ac.uk
WT Proteomics Bioinformatics Course 2015
Hinxton, 10 December 2015
Questions?

More Related Content

What's hot

Proteomics public data resources: enabling "big data" analysis in proteomics
Proteomics public data resources: enabling "big data" analysis in proteomicsProteomics public data resources: enabling "big data" analysis in proteomics
Proteomics public data resources: enabling "big data" analysis in proteomicsJuan Antonio Vizcaino
 
Public proteomics data: a (mostly unexploited) gold mine for computational re...
Public proteomics data: a (mostly unexploited) gold mine for computational re...Public proteomics data: a (mostly unexploited) gold mine for computational re...
Public proteomics data: a (mostly unexploited) gold mine for computational re...Juan Antonio Vizcaino
 
ProteomeXchange_and_PRIDE_Semmeting_2015
ProteomeXchange_and_PRIDE_Semmeting_2015ProteomeXchange_and_PRIDE_Semmeting_2015
ProteomeXchange_and_PRIDE_Semmeting_2015Juan Antonio Vizcaino
 
Mining the hidden proteome using hundreds of public proteomics datasets
Mining the hidden proteome using hundreds of public proteomics datasetsMining the hidden proteome using hundreds of public proteomics datasets
Mining the hidden proteome using hundreds of public proteomics datasetsJuan Antonio Vizcaino
 
An overview of the PRIDE ecosystem of resources and computational tools for m...
An overview of the PRIDE ecosystem of resources and computational tools for m...An overview of the PRIDE ecosystem of resources and computational tools for m...
An overview of the PRIDE ecosystem of resources and computational tools for m...Juan Antonio Vizcaino
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsCarole Goble
 
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)Carole Goble
 
How to run and maintain a popular biological data repository?
How to run and maintain a popular biological data repository?How to run and maintain a popular biological data repository?
How to run and maintain a popular biological data repository?Juan Antonio Vizcaino
 
Reflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerReflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerCarole Goble
 
The ProteomeXchange Consoritum: 2017 update
The ProteomeXchange Consoritum: 2017 updateThe ProteomeXchange Consoritum: 2017 update
The ProteomeXchange Consoritum: 2017 updateJuan Antonio Vizcaino
 
PRIDE and ProteomeXchange: Training webinar
PRIDE and ProteomeXchange: Training webinarPRIDE and ProteomeXchange: Training webinar
PRIDE and ProteomeXchange: Training webinarJuan Antonio Vizcaino
 
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
2017 05 03 Implementing Pure at UWA - ANDS Webinar SeriesKatina Toufexis
 
Introduction to the Proteomics Bioinformatics Course 2017
Introduction to the Proteomics Bioinformatics Course 2017Introduction to the Proteomics Bioinformatics Course 2017
Introduction to the Proteomics Bioinformatics Course 2017Juan Antonio Vizcaino
 
Finding and Accessing Human Genomics Datasets
Finding and Accessing Human Genomics DatasetsFinding and Accessing Human Genomics Datasets
Finding and Accessing Human Genomics DatasetsManuel Corpas
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpCarole Goble
 
The Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of BioinformaticsThe Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of BioinformaticsDuncan Hull
 

What's hot (20)

Proteomics data standards
Proteomics data standardsProteomics data standards
Proteomics data standards
 
Proteomics public data resources: enabling "big data" analysis in proteomics
Proteomics public data resources: enabling "big data" analysis in proteomicsProteomics public data resources: enabling "big data" analysis in proteomics
Proteomics public data resources: enabling "big data" analysis in proteomics
 
Public proteomics data: a (mostly unexploited) gold mine for computational re...
Public proteomics data: a (mostly unexploited) gold mine for computational re...Public proteomics data: a (mostly unexploited) gold mine for computational re...
Public proteomics data: a (mostly unexploited) gold mine for computational re...
 
ProteomeXchange_and_PRIDE_Semmeting_2015
ProteomeXchange_and_PRIDE_Semmeting_2015ProteomeXchange_and_PRIDE_Semmeting_2015
ProteomeXchange_and_PRIDE_Semmeting_2015
 
Mining the hidden proteome using hundreds of public proteomics datasets
Mining the hidden proteome using hundreds of public proteomics datasetsMining the hidden proteome using hundreds of public proteomics datasets
Mining the hidden proteome using hundreds of public proteomics datasets
 
An overview of the PRIDE ecosystem of resources and computational tools for m...
An overview of the PRIDE ecosystem of resources and computational tools for m...An overview of the PRIDE ecosystem of resources and computational tools for m...
An overview of the PRIDE ecosystem of resources and computational tools for m...
 
Pride cluster presentation
Pride cluster presentation Pride cluster presentation
Pride cluster presentation
 
Royal society of chemistry activities to develop a data repository for chemis...
Royal society of chemistry activities to develop a data repository for chemis...Royal society of chemistry activities to develop a data repository for chemis...
Royal society of chemistry activities to develop a data repository for chemis...
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trends
 
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
 
How to run and maintain a popular biological data repository?
How to run and maintain a popular biological data repository?How to run and maintain a popular biological data repository?
How to run and maintain a popular biological data repository?
 
Better Data for a Better World
Better Data for a Better WorldBetter Data for a Better World
Better Data for a Better World
 
Reflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerReflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic career
 
The ProteomeXchange Consoritum: 2017 update
The ProteomeXchange Consoritum: 2017 updateThe ProteomeXchange Consoritum: 2017 update
The ProteomeXchange Consoritum: 2017 update
 
PRIDE and ProteomeXchange: Training webinar
PRIDE and ProteomeXchange: Training webinarPRIDE and ProteomeXchange: Training webinar
PRIDE and ProteomeXchange: Training webinar
 
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
 
Introduction to the Proteomics Bioinformatics Course 2017
Introduction to the Proteomics Bioinformatics Course 2017Introduction to the Proteomics Bioinformatics Course 2017
Introduction to the Proteomics Bioinformatics Course 2017
 
Finding and Accessing Human Genomics Datasets
Finding and Accessing Human Genomics DatasetsFinding and Accessing Human Genomics Datasets
Finding and Accessing Human Genomics Datasets
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects help
 
The Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of BioinformaticsThe Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of Bioinformatics
 

Viewers also liked

Clinical proteomics in diseases lecture, 2014
Clinical proteomics in diseases lecture, 2014Clinical proteomics in diseases lecture, 2014
Clinical proteomics in diseases lecture, 2014Mohammad Hessam Rafiee
 
Genomics and Proteomics - Impact on Drug Discovery
Genomics and Proteomics - Impact on Drug DiscoveryGenomics and Proteomics - Impact on Drug Discovery
Genomics and Proteomics - Impact on Drug DiscoveryPhilip Bourne
 
Genomics and proteomics II
Genomics and proteomics IIGenomics and proteomics II
Genomics and proteomics IINikolay Vyahhi
 
Data Integration, Mass Spectrometry Proteomics Software Development
Data Integration, Mass Spectrometry Proteomics Software DevelopmentData Integration, Mass Spectrometry Proteomics Software Development
Data Integration, Mass Spectrometry Proteomics Software DevelopmentNeil Swainston
 
Advanced bioinformatics of proteomics datasets
Advanced bioinformaticsof proteomics datasetsAdvanced bioinformaticsof proteomics datasets
Advanced bioinformatics of proteomics datasetsLars Juhl Jensen
 
OpenMS: Quantitative proteomics at large scale
OpenMS: Quantitative proteomics at large scaleOpenMS: Quantitative proteomics at large scale
OpenMS: Quantitative proteomics at large scaleYasset Perez-Riverol
 
Techniques used for separation in proteomics
Techniques used for separation in proteomicsTechniques used for separation in proteomics
Techniques used for separation in proteomicsNilesh Chandra
 
Proteomics course 1
Proteomics course 1Proteomics course 1
Proteomics course 1utpaltatu
 
Genomics and proteomics I
Genomics and proteomics IGenomics and proteomics I
Genomics and proteomics INikolay Vyahhi
 
Proteins – Basics you need to know for Proteomics
Proteins – Basics you need to know for ProteomicsProteins – Basics you need to know for Proteomics
Proteins – Basics you need to know for ProteomicsLionel Wolberger
 
proteomics, mass spectrometry, science, bioinformatics, electrophoresis, liqu...
proteomics, mass spectrometry, science, bioinformatics, electrophoresis, liqu...proteomics, mass spectrometry, science, bioinformatics, electrophoresis, liqu...
proteomics, mass spectrometry, science, bioinformatics, electrophoresis, liqu...Amit Yadav
 
Genomics and proteomics by shreeman
Genomics and proteomics by shreemanGenomics and proteomics by shreeman
Genomics and proteomics by shreemanshreeman cs
 
Proteomics analysis: Basics and Applications
Proteomics analysis: Basics and ApplicationsProteomics analysis: Basics and Applications
Proteomics analysis: Basics and ApplicationsCOST action BM1006
 

Viewers also liked (20)

Clinical proteomics in diseases lecture, 2014
Clinical proteomics in diseases lecture, 2014Clinical proteomics in diseases lecture, 2014
Clinical proteomics in diseases lecture, 2014
 
Genomics and Proteomics - Impact on Drug Discovery
Genomics and Proteomics - Impact on Drug DiscoveryGenomics and Proteomics - Impact on Drug Discovery
Genomics and Proteomics - Impact on Drug Discovery
 
Genomics and proteomics II
Genomics and proteomics IIGenomics and proteomics II
Genomics and proteomics II
 
Data Integration, Mass Spectrometry Proteomics Software Development
Data Integration, Mass Spectrometry Proteomics Software DevelopmentData Integration, Mass Spectrometry Proteomics Software Development
Data Integration, Mass Spectrometry Proteomics Software Development
 
Proteomics
ProteomicsProteomics
Proteomics
 
Advanced bioinformatics of proteomics datasets
Advanced bioinformaticsof proteomics datasetsAdvanced bioinformaticsof proteomics datasets
Advanced bioinformatics of proteomics datasets
 
OpenMS: Quantitative proteomics at large scale
OpenMS: Quantitative proteomics at large scaleOpenMS: Quantitative proteomics at large scale
OpenMS: Quantitative proteomics at large scale
 
Techniques used for separation in proteomics
Techniques used for separation in proteomicsTechniques used for separation in proteomics
Techniques used for separation in proteomics
 
Proteomics course 1
Proteomics course 1Proteomics course 1
Proteomics course 1
 
Genomics and proteomics I
Genomics and proteomics IGenomics and proteomics I
Genomics and proteomics I
 
Proteins – Basics you need to know for Proteomics
Proteins – Basics you need to know for ProteomicsProteins – Basics you need to know for Proteomics
Proteins – Basics you need to know for Proteomics
 
proteomics, mass spectrometry, science, bioinformatics, electrophoresis, liqu...
proteomics, mass spectrometry, science, bioinformatics, electrophoresis, liqu...proteomics, mass spectrometry, science, bioinformatics, electrophoresis, liqu...
proteomics, mass spectrometry, science, bioinformatics, electrophoresis, liqu...
 
Genomics & Proteomics Based Drug Discovery
Genomics & Proteomics Based Drug DiscoveryGenomics & Proteomics Based Drug Discovery
Genomics & Proteomics Based Drug Discovery
 
Genomics and proteomics by shreeman
Genomics and proteomics by shreemanGenomics and proteomics by shreeman
Genomics and proteomics by shreeman
 
Proteomics
ProteomicsProteomics
Proteomics
 
Proteomics
Proteomics Proteomics
Proteomics
 
Proteomics
ProteomicsProteomics
Proteomics
 
Proteomics ppt
Proteomics pptProteomics ppt
Proteomics ppt
 
Proteomics
ProteomicsProteomics
Proteomics
 
Proteomics analysis: Basics and Applications
Proteomics analysis: Basics and ApplicationsProteomics analysis: Basics and Applications
Proteomics analysis: Basics and Applications
 

Similar to Proteomics repositories

PRIDE and ProteomeXchange: supporting the cultural change in proteomics publi...
PRIDE and ProteomeXchange: supporting the cultural change in proteomics publi...PRIDE and ProteomeXchange: supporting the cultural change in proteomics publi...
PRIDE and ProteomeXchange: supporting the cultural change in proteomics publi...Juan Antonio Vizcaino
 
Data volumes in proteomics data resources: PRIDE and ProteomeXchange
Data volumes in proteomics data resources: PRIDE and ProteomeXchangeData volumes in proteomics data resources: PRIDE and ProteomeXchange
Data volumes in proteomics data resources: PRIDE and ProteomeXchangeJuan Antonio Vizcaino
 
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...Juan Antonio Vizcaino
 
PRIDE and ProteomeXchange: A golden age for working with public proteomics data
PRIDE and ProteomeXchange: A golden age for working with public proteomics dataPRIDE and ProteomeXchange: A golden age for working with public proteomics data
PRIDE and ProteomeXchange: A golden age for working with public proteomics dataJuan Antonio Vizcaino
 
Introduction to EBI for Proteomics in ELIXIR
Introduction to EBI for Proteomics in ELIXIRIntroduction to EBI for Proteomics in ELIXIR
Introduction to EBI for Proteomics in ELIXIRJuan Antonio Vizcaino
 
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015QBiC_Tue
 
Data discovery and sharing at UCLH
Data discovery and sharing at UCLHData discovery and sharing at UCLH
Data discovery and sharing at UCLHJisc
 
A Federated In-Memory Database System for Life Sciences
A Federated In-Memory Database System for Life SciencesA Federated In-Memory Database System for Life Sciences
A Federated In-Memory Database System for Life SciencesMatthieu Schapranow
 
The mzTab data standard format for reporting MS-based peptide, protein and sm...
The mzTab data standard format for reporting MS-based peptide, protein and sm...The mzTab data standard format for reporting MS-based peptide, protein and sm...
The mzTab data standard format for reporting MS-based peptide, protein and sm...Juan Antonio Vizcaino
 
Big Data in Genomics: Opportunities and Challenges
Big Data in Genomics: Opportunities and ChallengesBig Data in Genomics: Opportunities and Challenges
Big Data in Genomics: Opportunities and ChallengesMatthieu Schapranow
 

Similar to Proteomics repositories (20)

Proteomics repositories
Proteomics repositoriesProteomics repositories
Proteomics repositories
 
Proteomics repositories
Proteomics repositoriesProteomics repositories
Proteomics repositories
 
Proteomics repositories
Proteomics repositoriesProteomics repositories
Proteomics repositories
 
PRIDE and ProteomeXchange: supporting the cultural change in proteomics publi...
PRIDE and ProteomeXchange: supporting the cultural change in proteomics publi...PRIDE and ProteomeXchange: supporting the cultural change in proteomics publi...
PRIDE and ProteomeXchange: supporting the cultural change in proteomics publi...
 
Data volumes in proteomics data resources: PRIDE and ProteomeXchange
Data volumes in proteomics data resources: PRIDE and ProteomeXchangeData volumes in proteomics data resources: PRIDE and ProteomeXchange
Data volumes in proteomics data resources: PRIDE and ProteomeXchange
 
Pride and ProteomeXchange
Pride and ProteomeXchangePride and ProteomeXchange
Pride and ProteomeXchange
 
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
 
PRIDE and ProteomeXchange: A golden age for working with public proteomics data
PRIDE and ProteomeXchange: A golden age for working with public proteomics dataPRIDE and ProteomeXchange: A golden age for working with public proteomics data
PRIDE and ProteomeXchange: A golden age for working with public proteomics data
 
Introduction to EBI for Proteomics in ELIXIR
Introduction to EBI for Proteomics in ELIXIRIntroduction to EBI for Proteomics in ELIXIR
Introduction to EBI for Proteomics in ELIXIR
 
Reuse of public data in proteomics
Reuse of public data in proteomicsReuse of public data in proteomics
Reuse of public data in proteomics
 
Human microbiome project
Human microbiome projectHuman microbiome project
Human microbiome project
 
ProteomeXchange update HUPO 2016
ProteomeXchange update HUPO 2016ProteomeXchange update HUPO 2016
ProteomeXchange update HUPO 2016
 
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015
 
Data discovery and sharing at UCLH
Data discovery and sharing at UCLHData discovery and sharing at UCLH
Data discovery and sharing at UCLH
 
Proteomics data standards
Proteomics data standardsProteomics data standards
Proteomics data standards
 
Proteomexchange
ProteomexchangeProteomexchange
Proteomexchange
 
A Federated In-Memory Database System for Life Sciences
A Federated In-Memory Database System for Life SciencesA Federated In-Memory Database System for Life Sciences
A Federated In-Memory Database System for Life Sciences
 
ProteomeXchange update
ProteomeXchange updateProteomeXchange update
ProteomeXchange update
 
The mzTab data standard format for reporting MS-based peptide, protein and sm...
The mzTab data standard format for reporting MS-based peptide, protein and sm...The mzTab data standard format for reporting MS-based peptide, protein and sm...
The mzTab data standard format for reporting MS-based peptide, protein and sm...
 
Big Data in Genomics: Opportunities and Challenges
Big Data in Genomics: Opportunities and ChallengesBig Data in Genomics: Opportunities and Challenges
Big Data in Genomics: Opportunities and Challenges
 

More from Juan Antonio Vizcaino

Reusing and integrating public proteomics data to improve our knowledge of th...
Reusing and integrating public proteomics data to improve our knowledge of th...Reusing and integrating public proteomics data to improve our knowledge of th...
Reusing and integrating public proteomics data to improve our knowledge of th...Juan Antonio Vizcaino
 
Introduction to the PSI standard data formats
Introduction to the PSI standard data formatsIntroduction to the PSI standard data formats
Introduction to the PSI standard data formatsJuan Antonio Vizcaino
 
Introduction to the Proteomics Bioinformatics Course 2018
Introduction to the Proteomics Bioinformatics Course 2018Introduction to the Proteomics Bioinformatics Course 2018
Introduction to the Proteomics Bioinformatics Course 2018Juan Antonio Vizcaino
 
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...Juan Antonio Vizcaino
 
A proteomics data “gold mine” at your disposal: Now that the data is there, w...
A proteomics data “gold mine” at your disposal: Now that the data is there, w...A proteomics data “gold mine” at your disposal: Now that the data is there, w...
A proteomics data “gold mine” at your disposal: Now that the data is there, w...Juan Antonio Vizcaino
 
Is it feasible to identify novel biomarkers by mining public proteomics data?
Is it feasible to identify novel biomarkers by mining public proteomics data?Is it feasible to identify novel biomarkers by mining public proteomics data?
Is it feasible to identify novel biomarkers by mining public proteomics data?Juan Antonio Vizcaino
 
The spectra-cluster toolsuite: Enhancing proteomics analysis through spectrum...
The spectra-cluster toolsuite: Enhancing proteomics analysis through spectrum...The spectra-cluster toolsuite: Enhancing proteomics analysis through spectrum...
The spectra-cluster toolsuite: Enhancing proteomics analysis through spectrum...Juan Antonio Vizcaino
 
Enabling automated processing and analysis of large-scale proteomics data
Enabling automated processing and analysis of large-scale proteomics dataEnabling automated processing and analysis of large-scale proteomics data
Enabling automated processing and analysis of large-scale proteomics dataJuan Antonio Vizcaino
 
The Proteomics Standards Initiative (PSI)
The Proteomics Standards Initiative (PSI)The Proteomics Standards Initiative (PSI)
The Proteomics Standards Initiative (PSI)Juan Antonio Vizcaino
 
Introduction to the Proteomics Bioinformatics Course 2016
Introduction to the Proteomics Bioinformatics Course 2016Introduction to the Proteomics Bioinformatics Course 2016
Introduction to the Proteomics Bioinformatics Course 2016Juan Antonio Vizcaino
 

More from Juan Antonio Vizcaino (16)

Reusing and integrating public proteomics data to improve our knowledge of th...
Reusing and integrating public proteomics data to improve our knowledge of th...Reusing and integrating public proteomics data to improve our knowledge of th...
Reusing and integrating public proteomics data to improve our knowledge of th...
 
Introduction to the PSI standard data formats
Introduction to the PSI standard data formatsIntroduction to the PSI standard data formats
Introduction to the PSI standard data formats
 
Reuse of public proteomics data
Reuse of public proteomics dataReuse of public proteomics data
Reuse of public proteomics data
 
PRIDE resources and ProteomeXchange
PRIDE resources and ProteomeXchangePRIDE resources and ProteomeXchange
PRIDE resources and ProteomeXchange
 
Introduction to the Proteomics Bioinformatics Course 2018
Introduction to the Proteomics Bioinformatics Course 2018Introduction to the Proteomics Bioinformatics Course 2018
Introduction to the Proteomics Bioinformatics Course 2018
 
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
 
PSI-Proteome Informatics update
PSI-Proteome Informatics updatePSI-Proteome Informatics update
PSI-Proteome Informatics update
 
The ELIXIR Proteomics community
The ELIXIR Proteomics community The ELIXIR Proteomics community
The ELIXIR Proteomics community
 
The ELIXIR Proteomics Community
The ELIXIR Proteomics CommunityThe ELIXIR Proteomics Community
The ELIXIR Proteomics Community
 
A proteomics data “gold mine” at your disposal: Now that the data is there, w...
A proteomics data “gold mine” at your disposal: Now that the data is there, w...A proteomics data “gold mine” at your disposal: Now that the data is there, w...
A proteomics data “gold mine” at your disposal: Now that the data is there, w...
 
Is it feasible to identify novel biomarkers by mining public proteomics data?
Is it feasible to identify novel biomarkers by mining public proteomics data?Is it feasible to identify novel biomarkers by mining public proteomics data?
Is it feasible to identify novel biomarkers by mining public proteomics data?
 
The spectra-cluster toolsuite: Enhancing proteomics analysis through spectrum...
The spectra-cluster toolsuite: Enhancing proteomics analysis through spectrum...The spectra-cluster toolsuite: Enhancing proteomics analysis through spectrum...
The spectra-cluster toolsuite: Enhancing proteomics analysis through spectrum...
 
ProteomeXchange update 2017
ProteomeXchange update 2017ProteomeXchange update 2017
ProteomeXchange update 2017
 
Enabling automated processing and analysis of large-scale proteomics data
Enabling automated processing and analysis of large-scale proteomics dataEnabling automated processing and analysis of large-scale proteomics data
Enabling automated processing and analysis of large-scale proteomics data
 
The Proteomics Standards Initiative (PSI)
The Proteomics Standards Initiative (PSI)The Proteomics Standards Initiative (PSI)
The Proteomics Standards Initiative (PSI)
 
Introduction to the Proteomics Bioinformatics Course 2016
Introduction to the Proteomics Bioinformatics Course 2016Introduction to the Proteomics Bioinformatics Course 2016
Introduction to the Proteomics Bioinformatics Course 2016
 

Recently uploaded

Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)Jshifa
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsssuserddc89b
 
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaDashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaPraksha3
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxAleenaTreesaSaji
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trssuser06f238
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 

Recently uploaded (20)

Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physics
 
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaDashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptx
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 tr
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 

Proteomics repositories

  • 1. Proteomics repositories Dr. Juan Antonio Vizcaíno PRIDE Group Coordinator Proteomics Services Team EMBL-EBI Hinxton, Cambridge, UK
  • 2. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 • Why sharing MS proteomics data? • Types of information stored in MS proteomics repositories. • Main existing repositories and their main characteristics • No data reprocessing • Data reprocessing • Recently developed resources • Other resources Overview
  • 3. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Genomics Transcriptomics Proteomics From the genome to the proteome
  • 4. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Corresponding public repositories Genomics Transcript- omics Proteomics DNA sequence databases (GenBank, EMBL, DDJB) ArrayExpress (EBI), GEO (NCBI) MS proteomics resources (ProteomeXchange)
  • 5. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Data sharing in Proteomics • Proteomics data can be very complex and its interpretation is often troublesome and/or controversial. • In other ‘omics’ fields, data sharing ‘culture’ is well established. Generally, it is considered to be a good scientific practise. • In proteomics, the ‘culture’ is definitely evolving in that direction. A big shift is happening in the last few years. • Scientific journals and funding agencies are two of the main drivers.
  • 6. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 • 1) Data producers are not always the best data analysts Sharing of data allows analysts access to real data, and in turn allows better analysis tools to be developed • 2) Meta-analysis of data can recycle previous findings for new tasks Putting findings in the context of other findings increases their scope • 3) Sharing data allows independent review of the findings When actual replication of an experiment is often impossible, a re- analysis or spot checks on the obtained data become vitally important • 4) Direct benefit for the field Development of fragmentation models, spectral libraries, SRM assays, ... Data sharing. Why?
  • 7. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 What is a proteomics publication in 2015? • Proteomics studies generate potentially large amounts of data and results. • Ideally, a proteomics publication needs to: • Summarize the results of the study • Provide supporting information for reliability of any results reported • Information in a publication: • Manuscript • Supplementary material • Associated data submitted to a public repository
  • 8. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 • Why sharing MS proteomics data? • Types of information stored in MS proteomics repositories • Main existing repositories and their main characteristics • No data reprocessing • Data reprocessing • Recently developed resources • Other resources Overview
  • 9. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Main types of information stored • 1) Original experimental data recorded by the mass spectrometer (primary data) -. Raw data and peak lists. • 2) Identification results inferred from the original primary data • 3) Quantification information • 4) Experimental and technical metadata • 5) Any other type of information (e.g. scripts)
  • 10. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Data types/ PSI standard formats • mzTabFinal Results • TraMLSRM • mzQuantMLQuantitation • mzIdentMLIdentification • mzMLMS data
  • 11. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 • Why sharing MS proteomics data? • Types of information stored in MS proteomics repositories. • Main existing repositories and their main characteristics • No data reprocessing • Data reprocessing • Recently developed resources • Other resources Overview
  • 12. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 • Main public MS-based proteomics repositories: - PROteomics IDEntifications database (PRIDE Archive, EBI) - Global Proteome Machine (GPMDB) - PeptideAtlas (ISB, Seattle) • Many others, more specialized: Among others: Human Proteinpedia, Genome Annotation Proteomics Pipeline (GAPP),… • Recently developed ones: ProteomicsDB, CHORUS, MassIVE, iProx. • Very diverse: different aims, functionalities,… but also complementary. • Main focus is MS/MS data. Proteomics repositories
  • 13. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Proteomics repositories (2) • Many different workflows need to be supported. They provide complementary ‘views’. • No data reprocessing. Data is stored as ‘published’ or originally analysed: • PRIDE Archive (MS/MS data) • MassIVE (MS/MS data) • PASSEL (SRM data) • Data reprocessing (MS/MS data): • PeptideAtlas • GPMDB
  • 14. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 • Why sharing MS proteomics data? • Types of information stored in MS proteomics repositories. • Main existing repositories and their main characteristics • No data reprocessing • Data reprocessing • Recently developed resources • Other resources Overview
  • 15. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Resources that don’t reprocess data 1) Resources that try to represent the authors’ analysis view on the data. • Various workflows are allowed and they can provide complementary results. • Data are not ‘updated’ in time. However, meta-analysis on top is possible. • Accumulation of FDRs when datasets are combined. • Main representatives: PRIDE Archive and MassIVE (MS/MS data) and PeptideAtlas/PASSEL (SRM data). • Data standards are essential.
  • 16. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 PRIDE (PRoteomics IDEntifications) Archive http://www.ebi.ac.uk/pride • PRIDE stores mass spectrometry based proteomics data: • Peptide and protein expression data (identification and quantification) • Post-translational modifications • Mass spectra (raw data and peak lists) • Technical and biological metadata • Any other related information • Full support for tandem MS approaches Martens et al., Proteomics, 2005 Vizcaíno et al., NAR, 2013
  • 17. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 MassIVE (UCSD) • Mass spectrometry Interactive Virtual Environment • Project led by Nuno Bandeira (Center for Computational Mass Spectrometry, UCSD) • Dataset storage and data submission • MassIVE 1.0 – Tranche-like functionality • Imported all data from Tranche • Under development (they want to explore interaction among users). Not published yet. http://proteomics.ucsd.edu/ProteoSAFe/datasets.jsp
  • 18. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 MassIVE (UCSD) http://proteomics.ucsd.edu/service/massive/ • Data repository for MS proteomics data • Tools available for users to analyse their own data • Joined ProteomeXchange on June 2014.
  • 19. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 • Suitable for SRM assays • Use the PSI standard TraML plus the output of the most popular vendor pipelines • Just started in 2012 • Part of the PX consortium http://www.peptideatlas.org/passel/ Farrah et al., Proteomics, 2012 PASSEL: repository for SRM data
  • 20. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 • Why sharing MS proteomics data? • Types of information stored in MS proteomics repositories. • Main existing repositories and their main characteristics • No data reprocessing • Data reprocessing • Recently developed resources • Other resources Overview
  • 21. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Proteomics repositories (2) 17/12/2015 21 • Many different workflows need to be supported. They provide complementary ‘views’. • No data reprocessing. Data is stored as ‘published’ or originally analysed: • PRIDE (MS/MS data) • MassIVE (MS/MS data) • PASSEL (SRM data) • Data reprocessing (MS/MS data): • PeptideAtlas • GPMDB
  • 22. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Reprocessing repositories • These resources collect MS raw data and reprocess it using one given analysis pipeline, and an up to date protein sequence database. • Advantage: They provide a ‘standardized’ and updated view on the experimental data available. • Only one common analysis method is used and there can be information loss. • Different from the author’s view on the data. • Main resources: GPMDB and PeptideAtlas (ISB, Seattle).
  • 23. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 http://www.peptideatlas.org - Developed at the Institute for Systems Biology (ISB, Seattle, USA) - Peptide identifications from MS/MS approaches - Data are reprocessed using the popular Trans Proteomic Pipeline (TPP) - Uses PeptideProphet to derive a probability for the correct identification for all contained peptides PeptideAtlas
  • 24. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 • All peptides IDs are mapped to Ensembl using ProteinProphet (to handle protein inference) • Provides proteotypic peptide predictions • Limited metadata available • Part of the HPP project Deutsch et al., Proteomics, 2005 Desiere et al., NAR, 2006. Deutsch et al., EMBO Rep, 2008 PeptideAtlas
  • 25. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Builds are updated in a regular basis (usually once a year) Examples of builds: - Human (HPP context) - Human plasma - Human urine - Drosophila - Mouse - Mouse plasma - Cow - Yeast … PeptideAtlas builds
  • 26. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 • Originally developed by R. Beavis & R. Craig • End point of the GPM proteomics pipeline, to aid in the process of validating peptide MS/MS spectra and protein coverage patterns. http://gpmdb.thegpm.org/ Craig et al., J Proteome Res, 2004 GPMDB (Global Proteome Machine DB)
  • 27. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 • Data are reprocessed using the popular X!Tandem or X!Hunter spectral searching algorithm • Also provides proteotypic peptides GPMDB
  • 28. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 • Nice visualization features • Provides very limited annotation with GO, BTO • Some support to targeted approaches is available • Part of the HPP consortium GPMDB
  • 29. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 http://thehpp.org/ The Human Proteome Project (HPP)
  • 30. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Reprocesses data Reprocesses data No reprocessing Editorial control Editorial control No editorial control Limited annotation Limited annotation Detailed annotation Main MS proteomics resources
  • 31. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Reprocesses data Reprocesses data No reprocessing Editorial control Editorial control No editorial control Limited annotation Limited annotation Detailed annotation Main MS proteomics resources
  • 32. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 • Why sharing MS proteomics data? • Types of information stored in MS proteomics repositories. • Main existing repositories and their main characteristics • No data reprocessing • Data reprocessing • Recently developed resources • Other resources Overview
  • 33. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Draft Human proteome papers published in 2014 Wilhelm et al., Nature, 2014 Kim et al., Nature, 2014 •Two independent groups claimed to have produced the first complete draft of the human proteome by MS. • Some of their findings are controversial and need further validation… but generated a lot of discussion and put proteomics in the spotlight. •Two proteomics resources have been developed: proteomicsDB and the Human Proteome Map (HPM).Nature cover 29 May 2014
  • 34. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 ProteomicsDB https://www.proteomicsdb.org/ • Data analysis using Mascot and MaxQuant • The way the Protein FDR is calculated is controversial •Quantification information using label free techniques •New datasets are added in a regular basis
  • 35. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 ProteomicsDB (2)
  • 36. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Human Proteome Map (HPM) • Developed by the Pandey group. •Data reanalysis using Mascot. • Protein FDR is not mentioned at all in the corresponding Nature paper. http://www.humanproteomemap.org/
  • 37. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Chorus https://chorusproject.org/pages/ind ex.html • Developed by M. MacCoss’ group • Built on top of Amazon cloud technologies • Provides data analysis capabilities for the users • Free for public datasets. A fee needs to be paid for storing private information.
  • 38. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 • Why sharing MS proteomics data? • Types of information stored in MS proteomics repositories. • Main existing repositories and their main characteristics • No data reprocessing • Data reprocessing • Recently developed resources • Other resources Overview
  • 39. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 MaxQB Human Proteinpedia Other repositories
  • 40. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 COPaKB Cardiac Organellar Protein Atlas Knowledgebase International collaboration (EMBL-EBI involved) Windows Client and iPad App Submit data for analysis in dta and mzML formats Data submitted to a ProLuCID pipeline No MS data download
  • 41. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 CPTAC data portal
  • 42. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Pep2pro (Arabidopsis) http://fgcz-pep2pro.uzh.ch/ Centered on Arabidopsis data Download spectra by spectra Quantitative information Linked to gelmap.de (2DE)
  • 43. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Example of a repository of supporting data annotation: Steve Gygi’s lab Supporting data from publications Spectra annotation results and PTM evaluation data Quantitative data No data downloads https://gygi.med.harvard.edu/phosphomouse
  • 44. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 FINAL THOUGHTS
  • 45. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Why are repositories not more popular? 1. Don’t want to share data • Researchers don’t like to be shown that they did not analyze the data as well as they could have. • Their FDR may be higher than they reported/think. • Researchers are worried that they missed something in the data that they could discover if they go back to it at a later date • Don’t want other authors to get a publication from their data. • However, this philosophy is changing gradually… Slide from R. Chalkley
  • 46. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Why are repositories not more popular? (2) 2. Submission burden • Getting data into correct format may require some work • Author is not necessarily computer-savvy • Having to also supply metadata is seen as a burden, if the information is already present in an associated manuscript • Associated raw data may be many GB in size; file transfer to repository could take a while Authors are impatient: want to spend time doing science, not administration! Slide from R. Chalkley
  • 47. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 • Importance of sharing MS proteomics data • The main existing proteomics repositories are complementary in focus and functionality • Main characteristics of: • PeptideAtlas and GPMDB (Reprocess data) • PASSEL, MassIVE and PRIDE Archive (at present they do not reprocess data). • New resources: proteomicsDB, HPM, Chorus Conclusions
  • 48. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 • Vizcaíno et al., J. Proteomics, 2010. PMID: 20615486 • Perez-Riverol et al., Proteomics, 2015. PMID: 25158685 Recommended reading
  • 49. Juan A. Vizcaíno juan@ebi.ac.uk WT Proteomics Bioinformatics Course 2015 Hinxton, 10 December 2015 Questions?