SlideShare a Scribd company logo
Submitting your data to 
ProteomeXchange – A Mini-Tutorial 
Dr. Juan Antonio Vizcaíno 
PRIDE Group Coordinator 
Proteomics Services Team 
EMBL-EBI 
Hinxton, Cambridge, UK
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Overview 
• The ProteomeXchange (PX) consortium 
• How to submit and access data in PX via PRIDE 
• How to access PX data 
• Submitting data triggers data reuse
ProteomeXchange Consortium 
• Goal: Development of a framework to allow 
standard data submission and dissemination 
pipelines between the main existing proteomics 
repositories. 
• Includes PeptideAtlas (ISB, Seattle), PRIDE 
(Cambridge, UK) and (very recently) MassIVE 
(UCSD, San Diego). 
• Common identifier space (PXD identifiers) 
• Two supported data workflows: MS/MS and SRM. 
• Main objective: Make life easier for researchers 
http://www.proteomexchange.org 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014
ProteomeXchange data workflow 
Results 
Raw Data* 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
ProteomeCentral 
PRIDE 
(MS/MS data) 
13th HUPO World Congress 
Madrid, 7 October 2014 
Metadata / 
Manuscript 
Journals 
UniProt/ 
neXtProt 
Peptide Atlas 
Other DBs 
Receiving repositories 
PASSEL 
(SRM data) 
Other DBs 
GPMDB 
Researcher’s results 
Reprocessed results 
Raw data* 
Metadata 
MassIVE 
(MS/MS data) 
Vizcaíno et al., Nat Biotechnol, 2014
ProteomeXchange data workflow 
Results 
Raw Data* 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
ProteomeCentral 
PRIDE 
(MS/MS data) 
13th HUPO World Congress 
Madrid, 7 October 2014 
Metadata / 
Manuscript 
Journals 
UniProt/ 
neXtProt 
Peptide Atlas 
Other DBs 
Receiving repositories 
PASSEL 
(SRM data) 
Other DBs 
GPMDB 
Researcher’s results 
Reprocessed results 
Raw data* 
Metadata 
MassIVE 
(MS/MS data) 
Vizcaíno et al., Nat Biotechnol, 2014
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Overview 
• The ProteomeXchange (PX) consortium 
• How to submit and access data in PX via PRIDE 
• How to access PX data 
• Submitting data triggers data reuse
PRIDE (PRoteomics IDEntifications) database 
http://www.ebi.ac.uk/pride 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
• Focused on MS/MS 
approaches 
• Other data types can 
also be stored as 
“Partial submissions”. 
Martens et al., Proteomics, 2005 
Vizcaíno et al., NAR, 2013
Manuscript just out detailing the process 
http://www.proteomexchange.org/submission Ternent et al., Proteomics, 2014 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Example dataset: 
PXD000764 
- Title: “Discovery of new CSF biomarkers for meningitis in children” 
- 12 runs: 4 controls and 8 infected samples 
- Identification and quantification data
PX Data workflow for MS/MS data 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
1. Mass spectrometer output files: raw data (binary files) or 
peak list spectra in a standardized format (mzML, mzXML). 
2. Result files: 
a. Complete submissions: Result files can be converted to 
PRIDE XML or the mzIdentML data standard. 
b. Partial submissions: For workflows not yet supported by 
PRIDE, search engine output files will be stored and 
provided in their original form. 
3. Metadata: Sufficiently detailed description of sample origin, 
workflow, instrumentation, submitter. 
4. Other files: Optional files: 
a. QUANT: Quantification related results e. FASTA 
b. PEAK: Peak list files f. SP_LIBRARY 
c. GEL: Gel images 
d. OTHER: Any other file type 
13th HUPO World Congress 
Madrid, 7 October 2014 
Published 
Raw 
Files 
Other 
files
Complete vs Partial submissions: experimental metadata 
Complete Partial 
General experimental metadata about the projects is similar. 
However, at the assay level information in partial submissions is not so detailed 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014
Complete vs Partial submissions: processed results 
For complete submissions, it is possible to connect the spectra with the identification 
processed results and they can be visualized. 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Complete 
Partial
How to perform a complete PX submission to PRIDE 
• Decide between a complete/partial submission. 
• File conversion/export to PRIDE XML or mzIdentML 
• File check before submission (PRIDE Inspector) 
• Experimental annotation and actual file submission (PX 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
submission tool) 
• Post-submission steps
How to perform a complete PX submission to PRIDE 
• Decide between a complete/partial submission. 
• File conversion/export to PRIDE XML or mzIdentML 
• File check before submission (PRIDE Inspector) 
• Experimental annotation and actual file submission (PX 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
submission tool) 
• Post-submission steps
How to perform a complete PX submission to PRIDE 
• Complete submission: 
• MS/MS data. 
• Processed results can be converted to the PSI standard 
mzIdentML or PRIDE XML. 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
• Partial submission: 
• Any type of data (not SRM, which goes to PASSEL) 
• E.g. top down, data independent acquisition, MS Imaging (to 
come), etc. 
• Processed results cannot be converted to a data standard.
How to perform a complete PX submission to PRIDE 
• Decide between a complete/partial submission. 
• File conversion/export to mzIdentML or PRIDE XML 
• File check before submission (PRIDE Inspector) 
• Experimental annotation and actual file submission (PX 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
submission tool) 
• Post-submission steps
Juan A. Vizcaíno 
juan@ebi.ac.uk 
Complete submissions 
An increasing number of tools support export to mzIdentML 
1.1 
13th HUPO World Congress 
Madrid, 7 October 2014 
Search 
Engine 
Results + 
MS files 
Search 
engines 
mzIdentML 
- Mascot 
- MSGF+ 
- Myrimatch and related tools from D. Tabb’s lab 
- OpenMS 
- PEAKS 
- ProCon (ProteomeDiscoverer, Sequest) 
- Scaffold 
- TPP via the idConvert tool (ProteoWizard) 
- ProteinPilot (planned by the end of 2014) 
- Others: library for X!Tandem conversion, lab 
internal pipelines, … 
- Referenced spectral files need to be submitted as well 
(all open formats are supported). 
Updated list: http://www.psidev.info/tools-implementing-mzIdentML#.
Original data files ‘RESULT’ file generation Final ‘RESULT’ file 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Search 
output 
files 
Spectra 
files 
PRIDE 
XML 
‘RESULT’ 
Before: file conversion 
File conversion 
PRIDE 
Converter
Tools ‘RESULT’ file generation Final ‘RESULT’ file 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
mzIdentML 
‘RESULT’ 
Now: native file export 
Spectra 
files 
Mascot 
ProteinPilo 
t 
Scaffold 
PEAKS 
MSGF+ 
Others 
Native File export
How to perform a complete PX submission to PRIDE 
• Decide between a complete/partial submission. 
• File conversion/export to PRIDE XML or mzIdentML 
• File check before submission (PRIDE Inspector) 
• Experimental annotation and actual file submission (PX 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
submission tool) 
• Post-submission steps
Available for complete submissions 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Wang et al., Nat. Biotechnology, 2012 
PRIDE Inspector 2.0 
PRIDE Inspector 2.0 supports: 
- PRIDE XML 
- mzIdentML + all types of spectra files 
- mzML 
- mzTab Ident (work in progress) 
http://code.google.com/p/pride-toolsuite/ 
wiki/PRIDEInspector
How to perform a complete PX submission to PRIDE 
• Decide between a complete/partial submission. 
• File conversion/export to PRIDE XML or mzIdentML 
• File check before submission (PRIDE Inspector) 
• Experimental annotation and actual file submission (PX 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
submission tool) 
• Post-submission steps
PX submission tool 
• Capture the mappings between the different types of files. 
• Make the file upload process straightforward to the submitter (It transfers all the 
files using Aspera or FTP). 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Published 
Raw 
Other 
files 
http://www.proteomexchange.org/submission 
PX 
submission 
tool 
• Command line alternative: some scripting is needed
PX submission tool: step by step 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Step 1 
Step 2
PX submission tool: step by step 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Step 3 
Step 4
PX submission tool: step by step 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
Step 5 Step 6 
13th HUPO World Congress 
Madrid, 7 October 2014
PX submission tool: step by step 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Step 7 
Step 8
PX submission tool: step by step 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
Step 9 
13th HUPO World Congress 
Madrid, 7 October 2014
Fast file transfer with Aspera 
- Aspera is the default file transfer protocol to PRIDE: 
- Up to 50X faster than FTP 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
- PX Submission tool 
- Command line 
File transfer speed should 
not be a problem!! 
13th HUPO World Congress 
Madrid, 7 October 2014
PX submission tool: HPP tags 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014
Batch submissions on the command line 
• Generate on your own the PX summary file (generated 
by default by the PX submission tool). 
MTD first_name John Arthur 
MTD last_name Smith 
MTD email john.smith@cam.edu 
MTD affiliation University of Cambridge 
MTD title Human proteome 
MTD description An experiment about human proteome 
MTD keyword human, proteome 
MTD pubmed 12345 
MTD px 10.1000/182 
MTD pride_login pride-user 
FMH file_id file_type file_path file_mapping 
FME 1 result /path/to/pride/xml/files/pride-1.xml7,8,9 
FME 2 result /path/to/pride/xml/files/pride-2.xml4 
FME 3 result /path/to/mzidentml/files/mzidentml-1.xml 5,10 
FME 4 raw /path/to/raw/files/raw-1.bin 
FME 5 raw /path/to/raw/files/raw-2.bin 
FME 6 raw /path/to/raw/files/raw-3.bin 
FME 7 raw ftp://some.url/at/some/place/raw-4.bin 
FME 8 search/path/to/search/engine/output/search-1.out 
FME 9 other /path/to/other/file/other-1.e 
FME 10 peak /path/to/peak/list/mzml-1.xml 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014
Batch submissions on the command line 
• Generate on your own the PX summary file (generated 
by default by the PX submission tool). 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014
Batch submissions on the command line (2) 
• Generate on your own the PX summary file (generated 
by default by the PX submission tool). 
• Put together all the files plus the PX summary file. 
• Ask PRIDE team for a specific upload directory (pride-support@ 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
ebi.ac.uk) 
13th HUPO World Congress 
Madrid, 7 October 2014
How to perform a complete PX submission to PRIDE 
• Decide between a complete/partial submission. 
• File conversion/export to PRIDE XML or mzIdentML 
• File check before submission (PRIDE Inspector) 
• Experimental annotation and actual file submission (PX 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
submission tool) 
• Post-submission steps
Post-processing steps 
• PRIDE curators will check the files 
• Files must be valid to the schema 
• All the required annotations must be there 
• Basic QC check (e.g. detect errors in PTM annotation) 
• If everything is correct, submission to PRIDE is done 
• The author receives a PXD identifier, a reviewer username 
and a password, and a DOI (for complete submissions). 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Overview 
• The ProteomeXchange (PX) consortium 
• How to submit and access data in PX via PRIDE 
• How to access PX data 
• Submitting data triggers data reuse
ProteomeXchange data workflow 
Results 
Raw Data* 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
ProteomeCentral 
PRIDE 
(MS/MS data) 
13th HUPO World Congress 
Madrid, 7 October 2014 
Metadata / 
Manuscript 
Journals 
UniProt/ 
neXtProt 
Peptide Atlas 
Other DBs 
Receiving repositories 
PASSEL 
(SRM data) 
Other DBs 
GPMDB 
Researcher’s results 
Reprocessed results 
Raw data* 
Metadata 
MassIVE 
(MS/MS data) 
Vizcaíno et al., Nat Biotechnol, 2014
ProteomeCentral: Portal for all PX datasets 
http://proteomecentral.proteomexchange.org/cgi/GetDataset 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014
ProteomeXchange: 1,148 datasets up until August 2014 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Origin: 
235 USA 
142 Germany 
97 United Kingdom 
67 Switzerland 
64 Netherlands 
62 China 
60 France 
48 Canada 
43 Spain 
36 Belgium 
32 Sweden 
29 Australia 
26 Denmark 
23 Japan 
18 Taiwan 
17 India 
16 Ireland 
14 Norway 
14 Italy 
12 Finland 
11 Republic of Korea 
10 Brazil 
8 Austria 
7 Israel 
7 Singapore … 
Type: 
386 PRIDE complete 
687 PRIDE partial 
51 PeptideAtlas/PASSEL complete 
1 MassIVE 
23 reprocessed 
Publicly Accessible: 
544 datasets, 50% of all 
90% PRIDE 
10% PASSEL 
Top Species studied by at least 10 
datasets: 
510 Homo sapiens 
142 Mus musculus 
46 Saccharomyces cerevisiae 
45 Arabidopsis thaliana 
23 Rattus norvegicus 
16 Escherichia coli 
15 Bos taurus 
15 Mycobacterium tuberculosis 
13 Oryza sativa 
12 Drosophila melanogaster 
12 Glycine max 
~ 265 species in total 
Data volume: 
Total: ~51 TB 
Number of all files: ~130,000 
PXD000320-324: ~ 5 TB 
PXD000065: ~ 1.4TB 
Datasets/year: 
2012: 102 
2013: 527 
2014: 519
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Overview 
• The ProteomeXchange (PX) consortium 
• How to submit and access data in PX via PRIDE 
• How to access PX data 
• Submitting data triggers data reuse
Which are the most accessed datasets? 
PXD Identifier Hits Dataset title Publication 
PXD000561 153512 A draft map of the human proteome 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Kim et al., 
Nature,2014. 
PMID: 24870542 
PXD000851 111587 
Membrane proteomic analysis of 
colorectal cancer tissue 
Kume et al., MCP, 
2014. 
PMID:24687888 
PXD000865 51639 
Mass spectrometry based draft of 
the human proteome 
Wilhelm et al., 2014, 
Nature, 
PMID:24870543
Which are the most accessed datasets? 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Total Numbers
CompOmics Open Source Analysis Pipeline 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
Vaudel M, Burkhart J, Zahedi RP, Berven FS, Sickmann A, Martens L, 
Barsnes H: 
Nature Biotechnology (in press) 
13th HUPO World Congress 
Madrid, 7 October 2014 
Vaudel M, Barsnes H, Berven FS, Sickmann A, 
Martens L: 
Proteomics 2011;11(5):996-9. 
http://searchgui.googlecode.com http://peptide-shaker.googlecode.com
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Find the desired PRIDE project … 
… inspect the project details …. 
… and start re-analyzing the data! 
Reshake PRIDE data!
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Conclusions 
• Submission to ProteomeXchange via PRIDE is easy. 
• Decide between complete and partial submissions. 
• Different open source tools available to facilitate the process. 
• File transfer speed should not be a problem (Aspera support)
Acknowledgements 
Juan A. Vizcaíno 
juan@ebi.ac.uk 
PeptideAtlas Team (ISB, Seattle) 
Eric Deutsch 
Terry Farrah 
Zhi Sun 
Andrew R. Jones 
Lennart Martens 
Juan Pablo Albar 
Martin Eisenacher 
Gil Omenn 
And many other PX partners and 
stakeholders 
13th HUPO World Congress 
Madrid, 7 October 2014 
PRIDE Team 
Attila Csordas 
Rui Wang 
Florian Reisinger 
Jose A. Dianes 
Tobias Ternent 
Yasset Perez-Riverol 
Noemi del Toro 
Henning Hermjakob 
EU FP7 grant number 260558
Juan A. Vizcaíno 
juan@ebi.ac.uk 
13th HUPO World Congress 
Madrid, 7 October 2014 
Questions?

More Related Content

What's hot

Cgrp calcitonin gene related peptide
Cgrp calcitonin gene related peptideCgrp calcitonin gene related peptide
Cgrp calcitonin gene related peptide
Vijay Anand
 
HLA typing SSO method using xMAP Technology.
HLA typing SSO method using xMAP Technology.HLA typing SSO method using xMAP Technology.
HLA typing SSO method using xMAP Technology.
VijayDubey60
 
Recombinant dna technology
Recombinant dna technologyRecombinant dna technology
Recombinant dna technology
Abhishek Gaikar
 
2 d gel electrophoresis
2 d gel electrophoresis2 d gel electrophoresis
2 d gel electrophoresis
DR.HARI SINGH GOUR
 
Presentation1
Presentation1Presentation1
Presentation1nehasutar
 
Microarray
MicroarrayMicroarray
Microarray
Ankitha Hirematha
 
Protein and protein Dystrophin
Protein and protein DystrophinProtein and protein Dystrophin
Protein and protein Dystrophin
Hari Sharan Makaju
 
Tyrosine kinase linked receptors
Tyrosine kinase linked receptorsTyrosine kinase linked receptors
Tyrosine kinase linked receptors
Faidu Kalathingal
 
Peptides
PeptidesPeptides
Peptides
Abhay140195
 
Cloning a to z
Cloning a to zCloning a to z
Cloning a to z
Kalaiselvi Govindan
 
RIA PPT.pptx
RIA PPT.pptxRIA PPT.pptx
RIA PPT.pptx
SwapnilUgle
 
MALDI-TOF: Pricinple and Its Application in Biochemistry and Biotechnology
MALDI-TOF: Pricinple  and Its Application in Biochemistry and BiotechnologyMALDI-TOF: Pricinple  and Its Application in Biochemistry and Biotechnology
MALDI-TOF: Pricinple and Its Application in Biochemistry and Biotechnology
Devakumar Jain
 
Western blotting- Technique used in RDT
Western blotting- Technique used in RDTWestern blotting- Technique used in RDT
Western blotting- Technique used in RDT
BishwarupSarkar
 
disgenet2r: The DisGeNET R package
disgenet2r: The DisGeNET R packagedisgenet2r: The DisGeNET R package
disgenet2r: The DisGeNET R package
Núria Queralt Rosinach
 
Nucleotides nucleosides
Nucleotides nucleosidesNucleotides nucleosides
Nucleotides nucleosides
MariaIslam4
 
Automating the ACMG Guidelines with VSClinical
Automating the ACMG Guidelines with VSClinicalAutomating the ACMG Guidelines with VSClinical
Automating the ACMG Guidelines with VSClinical
Golden Helix
 
Hexokinase.pptx
Hexokinase.pptxHexokinase.pptx
Hexokinase.pptx
Hasan Noah
 
Recombinant DNA Technology and Drug Discovery
Recombinant DNA Technology and Drug DiscoveryRecombinant DNA Technology and Drug Discovery
Recombinant DNA Technology and Drug Discovery
Ashok Jangra
 
SiRNA & MiRNA.pptx
SiRNA & MiRNA.pptxSiRNA & MiRNA.pptx
SiRNA & MiRNA.pptx
SIRAJUDDIN MOLLA
 
Recombinant DNA Technology
Recombinant DNA TechnologyRecombinant DNA Technology
Recombinant DNA Technology
ShubhamMahajan117
 

What's hot (20)

Cgrp calcitonin gene related peptide
Cgrp calcitonin gene related peptideCgrp calcitonin gene related peptide
Cgrp calcitonin gene related peptide
 
HLA typing SSO method using xMAP Technology.
HLA typing SSO method using xMAP Technology.HLA typing SSO method using xMAP Technology.
HLA typing SSO method using xMAP Technology.
 
Recombinant dna technology
Recombinant dna technologyRecombinant dna technology
Recombinant dna technology
 
2 d gel electrophoresis
2 d gel electrophoresis2 d gel electrophoresis
2 d gel electrophoresis
 
Presentation1
Presentation1Presentation1
Presentation1
 
Microarray
MicroarrayMicroarray
Microarray
 
Protein and protein Dystrophin
Protein and protein DystrophinProtein and protein Dystrophin
Protein and protein Dystrophin
 
Tyrosine kinase linked receptors
Tyrosine kinase linked receptorsTyrosine kinase linked receptors
Tyrosine kinase linked receptors
 
Peptides
PeptidesPeptides
Peptides
 
Cloning a to z
Cloning a to zCloning a to z
Cloning a to z
 
RIA PPT.pptx
RIA PPT.pptxRIA PPT.pptx
RIA PPT.pptx
 
MALDI-TOF: Pricinple and Its Application in Biochemistry and Biotechnology
MALDI-TOF: Pricinple  and Its Application in Biochemistry and BiotechnologyMALDI-TOF: Pricinple  and Its Application in Biochemistry and Biotechnology
MALDI-TOF: Pricinple and Its Application in Biochemistry and Biotechnology
 
Western blotting- Technique used in RDT
Western blotting- Technique used in RDTWestern blotting- Technique used in RDT
Western blotting- Technique used in RDT
 
disgenet2r: The DisGeNET R package
disgenet2r: The DisGeNET R packagedisgenet2r: The DisGeNET R package
disgenet2r: The DisGeNET R package
 
Nucleotides nucleosides
Nucleotides nucleosidesNucleotides nucleosides
Nucleotides nucleosides
 
Automating the ACMG Guidelines with VSClinical
Automating the ACMG Guidelines with VSClinicalAutomating the ACMG Guidelines with VSClinical
Automating the ACMG Guidelines with VSClinical
 
Hexokinase.pptx
Hexokinase.pptxHexokinase.pptx
Hexokinase.pptx
 
Recombinant DNA Technology and Drug Discovery
Recombinant DNA Technology and Drug DiscoveryRecombinant DNA Technology and Drug Discovery
Recombinant DNA Technology and Drug Discovery
 
SiRNA & MiRNA.pptx
SiRNA & MiRNA.pptxSiRNA & MiRNA.pptx
SiRNA & MiRNA.pptx
 
Recombinant DNA Technology
Recombinant DNA TechnologyRecombinant DNA Technology
Recombinant DNA Technology
 

Viewers also liked

Integrative Analysis of Epigenomics and miRNA data in Immune System Models
Integrative Analysis of Epigenomics and miRNA data in Immune System ModelsIntegrative Analysis of Epigenomics and miRNA data in Immune System Models
Integrative Analysis of Epigenomics and miRNA data in Immune System ModelsCOST action BM1006
 
Analysis of Cations in Hydraulic Fracturing Flowback Water from the Marcellus...
Analysis of Cations in Hydraulic Fracturing Flowback Water from the Marcellus...Analysis of Cations in Hydraulic Fracturing Flowback Water from the Marcellus...
Analysis of Cations in Hydraulic Fracturing Flowback Water from the Marcellus...
Chromatography & Mass Spectrometry Solutions
 
from B-cell Biology to Data Integration
from B-cell Biology to Data Integrationfrom B-cell Biology to Data Integration
from B-cell Biology to Data IntegrationCOST action BM1006
 
New DIA Workflows for Ultimate Flexibility in LCMS Proteomics
New DIA Workflows for Ultimate Flexibility in LCMS ProteomicsNew DIA Workflows for Ultimate Flexibility in LCMS Proteomics
New DIA Workflows for Ultimate Flexibility in LCMS Proteomics
thermo_omics
 
Optimizing solid core_30955
Optimizing solid core_30955Optimizing solid core_30955
Data Independent Analysis on Thermo Scientific Orbitrap MS Systems
Data Independent Analysis on Thermo Scientific Orbitrap MS SystemsData Independent Analysis on Thermo Scientific Orbitrap MS Systems
Data Independent Analysis on Thermo Scientific Orbitrap MS Systems
Thermo Fisher Scientific
 
Chromatography: Analysis of Phosphate and Manganese degradation Products in A...
Chromatography: Analysis of Phosphate and Manganese degradation Products in A...Chromatography: Analysis of Phosphate and Manganese degradation Products in A...
Chromatography: Analysis of Phosphate and Manganese degradation Products in A...
Chromatography & Mass Spectrometry Solutions
 
Reverse-engineering techniques in Data Integration
Reverse-engineering techniques in Data IntegrationReverse-engineering techniques in Data Integration
Reverse-engineering techniques in Data IntegrationCOST action BM1006
 
Oligosaccharide Analysis Using High-Performance Anion-Exchange Chromatography...
Oligosaccharide Analysis Using High-Performance Anion-Exchange Chromatography...Oligosaccharide Analysis Using High-Performance Anion-Exchange Chromatography...
Oligosaccharide Analysis Using High-Performance Anion-Exchange Chromatography...
Chromatography & Mass Spectrometry Solutions
 
Next Generation Ultra High Pressure Liquid Chromatography (UHPLC) Technologie...
Next Generation Ultra High Pressure Liquid Chromatography (UHPLC) Technologie...Next Generation Ultra High Pressure Liquid Chromatography (UHPLC) Technologie...
Next Generation Ultra High Pressure Liquid Chromatography (UHPLC) Technologie...
Chromatography & Mass Spectrometry Solutions
 
What does USP Chapter 2232 on Elemental Contaminants in Dietary Supplements M...
What does USP Chapter 2232 on Elemental Contaminants in Dietary Supplements M...What does USP Chapter 2232 on Elemental Contaminants in Dietary Supplements M...
What does USP Chapter 2232 on Elemental Contaminants in Dietary Supplements M...
Chromatography & Mass Spectrometry Solutions
 
Experiences to learn from the MS proteomics field
Experiences to learn from the MS proteomics fieldExperiences to learn from the MS proteomics field
Experiences to learn from the MS proteomics field
Juan Antonio Vizcaino
 
High throughput, data independent acquisition for qualitative and quantitativ...
High throughput, data independent acquisition for qualitative and quantitativ...High throughput, data independent acquisition for qualitative and quantitativ...
High throughput, data independent acquisition for qualitative and quantitativ...SCIEX
 
Chromatography: Trends and Developments in MAb Screening and Characterization
Chromatography: Trends and Developments in MAb Screening and CharacterizationChromatography: Trends and Developments in MAb Screening and Characterization
Chromatography: Trends and Developments in MAb Screening and Characterization
Chromatography & Mass Spectrometry Solutions
 
Use of Solid Core Chromatography for the Analysis of Pharmaceutical Compounds
Use of Solid Core Chromatography for the Analysis of Pharmaceutical CompoundsUse of Solid Core Chromatography for the Analysis of Pharmaceutical Compounds
Use of Solid Core Chromatography for the Analysis of Pharmaceutical Compounds
Chromatography & Mass Spectrometry Solutions
 
New Innovations in Ultra High Performance Liquid Chromatography and Liquid Ch...
New Innovations in Ultra High Performance Liquid Chromatography and Liquid Ch...New Innovations in Ultra High Performance Liquid Chromatography and Liquid Ch...
New Innovations in Ultra High Performance Liquid Chromatography and Liquid Ch...
Chromatography & Mass Spectrometry Solutions
 
Determination of Carbohydrates in Various Matrices by Capillary High-Performa...
Determination of Carbohydrates in Various Matrices by Capillary High-Performa...Determination of Carbohydrates in Various Matrices by Capillary High-Performa...
Determination of Carbohydrates in Various Matrices by Capillary High-Performa...
Chromatography & Mass Spectrometry Solutions
 
Determination of Common Counterions and Impurity Anions in Pharmaceuticals Us...
Determination of Common Counterions and Impurity Anions in Pharmaceuticals Us...Determination of Common Counterions and Impurity Anions in Pharmaceuticals Us...
Determination of Common Counterions and Impurity Anions in Pharmaceuticals Us...
Chromatography & Mass Spectrometry Solutions
 
Stationary and mobile_phase_selection_m_ab_ph_gradient_analysis_33974
Stationary and mobile_phase_selection_m_ab_ph_gradient_analysis_33974Stationary and mobile_phase_selection_m_ab_ph_gradient_analysis_33974
Stationary and mobile_phase_selection_m_ab_ph_gradient_analysis_33974
Chromatography & Mass Spectrometry Solutions
 
Pesticide Residue Analysis Webinar Series: Tips and Tricks for the Whole Work...
Pesticide Residue Analysis Webinar Series: Tips and Tricks for the Whole Work...Pesticide Residue Analysis Webinar Series: Tips and Tricks for the Whole Work...
Pesticide Residue Analysis Webinar Series: Tips and Tricks for the Whole Work...
Chromatography & Mass Spectrometry Solutions
 

Viewers also liked (20)

Integrative Analysis of Epigenomics and miRNA data in Immune System Models
Integrative Analysis of Epigenomics and miRNA data in Immune System ModelsIntegrative Analysis of Epigenomics and miRNA data in Immune System Models
Integrative Analysis of Epigenomics and miRNA data in Immune System Models
 
Analysis of Cations in Hydraulic Fracturing Flowback Water from the Marcellus...
Analysis of Cations in Hydraulic Fracturing Flowback Water from the Marcellus...Analysis of Cations in Hydraulic Fracturing Flowback Water from the Marcellus...
Analysis of Cations in Hydraulic Fracturing Flowback Water from the Marcellus...
 
from B-cell Biology to Data Integration
from B-cell Biology to Data Integrationfrom B-cell Biology to Data Integration
from B-cell Biology to Data Integration
 
New DIA Workflows for Ultimate Flexibility in LCMS Proteomics
New DIA Workflows for Ultimate Flexibility in LCMS ProteomicsNew DIA Workflows for Ultimate Flexibility in LCMS Proteomics
New DIA Workflows for Ultimate Flexibility in LCMS Proteomics
 
Optimizing solid core_30955
Optimizing solid core_30955Optimizing solid core_30955
Optimizing solid core_30955
 
Data Independent Analysis on Thermo Scientific Orbitrap MS Systems
Data Independent Analysis on Thermo Scientific Orbitrap MS SystemsData Independent Analysis on Thermo Scientific Orbitrap MS Systems
Data Independent Analysis on Thermo Scientific Orbitrap MS Systems
 
Chromatography: Analysis of Phosphate and Manganese degradation Products in A...
Chromatography: Analysis of Phosphate and Manganese degradation Products in A...Chromatography: Analysis of Phosphate and Manganese degradation Products in A...
Chromatography: Analysis of Phosphate and Manganese degradation Products in A...
 
Reverse-engineering techniques in Data Integration
Reverse-engineering techniques in Data IntegrationReverse-engineering techniques in Data Integration
Reverse-engineering techniques in Data Integration
 
Oligosaccharide Analysis Using High-Performance Anion-Exchange Chromatography...
Oligosaccharide Analysis Using High-Performance Anion-Exchange Chromatography...Oligosaccharide Analysis Using High-Performance Anion-Exchange Chromatography...
Oligosaccharide Analysis Using High-Performance Anion-Exchange Chromatography...
 
Next Generation Ultra High Pressure Liquid Chromatography (UHPLC) Technologie...
Next Generation Ultra High Pressure Liquid Chromatography (UHPLC) Technologie...Next Generation Ultra High Pressure Liquid Chromatography (UHPLC) Technologie...
Next Generation Ultra High Pressure Liquid Chromatography (UHPLC) Technologie...
 
What does USP Chapter 2232 on Elemental Contaminants in Dietary Supplements M...
What does USP Chapter 2232 on Elemental Contaminants in Dietary Supplements M...What does USP Chapter 2232 on Elemental Contaminants in Dietary Supplements M...
What does USP Chapter 2232 on Elemental Contaminants in Dietary Supplements M...
 
Experiences to learn from the MS proteomics field
Experiences to learn from the MS proteomics fieldExperiences to learn from the MS proteomics field
Experiences to learn from the MS proteomics field
 
High throughput, data independent acquisition for qualitative and quantitativ...
High throughput, data independent acquisition for qualitative and quantitativ...High throughput, data independent acquisition for qualitative and quantitativ...
High throughput, data independent acquisition for qualitative and quantitativ...
 
Chromatography: Trends and Developments in MAb Screening and Characterization
Chromatography: Trends and Developments in MAb Screening and CharacterizationChromatography: Trends and Developments in MAb Screening and Characterization
Chromatography: Trends and Developments in MAb Screening and Characterization
 
Use of Solid Core Chromatography for the Analysis of Pharmaceutical Compounds
Use of Solid Core Chromatography for the Analysis of Pharmaceutical CompoundsUse of Solid Core Chromatography for the Analysis of Pharmaceutical Compounds
Use of Solid Core Chromatography for the Analysis of Pharmaceutical Compounds
 
New Innovations in Ultra High Performance Liquid Chromatography and Liquid Ch...
New Innovations in Ultra High Performance Liquid Chromatography and Liquid Ch...New Innovations in Ultra High Performance Liquid Chromatography and Liquid Ch...
New Innovations in Ultra High Performance Liquid Chromatography and Liquid Ch...
 
Determination of Carbohydrates in Various Matrices by Capillary High-Performa...
Determination of Carbohydrates in Various Matrices by Capillary High-Performa...Determination of Carbohydrates in Various Matrices by Capillary High-Performa...
Determination of Carbohydrates in Various Matrices by Capillary High-Performa...
 
Determination of Common Counterions and Impurity Anions in Pharmaceuticals Us...
Determination of Common Counterions and Impurity Anions in Pharmaceuticals Us...Determination of Common Counterions and Impurity Anions in Pharmaceuticals Us...
Determination of Common Counterions and Impurity Anions in Pharmaceuticals Us...
 
Stationary and mobile_phase_selection_m_ab_ph_gradient_analysis_33974
Stationary and mobile_phase_selection_m_ab_ph_gradient_analysis_33974Stationary and mobile_phase_selection_m_ab_ph_gradient_analysis_33974
Stationary and mobile_phase_selection_m_ab_ph_gradient_analysis_33974
 
Pesticide Residue Analysis Webinar Series: Tips and Tricks for the Whole Work...
Pesticide Residue Analysis Webinar Series: Tips and Tricks for the Whole Work...Pesticide Residue Analysis Webinar Series: Tips and Tricks for the Whole Work...
Pesticide Residue Analysis Webinar Series: Tips and Tricks for the Whole Work...
 

Similar to Submitting your data to ProteomeXchange – a mini tutorial

AHUPO_Vizcaino_remote_presentation_082014
AHUPO_Vizcaino_remote_presentation_082014AHUPO_Vizcaino_remote_presentation_082014
AHUPO_Vizcaino_remote_presentation_082014
Juan Antonio Vizcaino
 
ProteomeXchange Experience: PXD Identifiers and Release of Data on Acceptance...
ProteomeXchange Experience: PXD Identifiers and Release of Data on Acceptance...ProteomeXchange Experience: PXD Identifiers and Release of Data on Acceptance...
ProteomeXchange Experience: PXD Identifiers and Release of Data on Acceptance...
Juan Antonio Vizcaino
 
Pride and ProteomeXchange
Pride and ProteomeXchangePride and ProteomeXchange
Pride and ProteomeXchange
Juan Antonio Vizcaino
 
Human microbiome project
Human microbiome projectHuman microbiome project
Human microbiome project
Juan Antonio Vizcaino
 
PRIDE and ProteomeXchange
PRIDE and ProteomeXchangePRIDE and ProteomeXchange
PRIDE and ProteomeXchange
Juan Antonio Vizcaino
 
MS Imaging data in ProteomeXchange (HUPO 2014)
MS Imaging data in ProteomeXchange (HUPO 2014)MS Imaging data in ProteomeXchange (HUPO 2014)
MS Imaging data in ProteomeXchange (HUPO 2014)
Juan Antonio Vizcaino
 
ProteomeXchange: data deposition and data retrieval made easy
ProteomeXchange: data deposition and data retrieval made easyProteomeXchange: data deposition and data retrieval made easy
ProteomeXchange: data deposition and data retrieval made easy
Juan Antonio Vizcaino
 
ProteomeXchange_and_PRIDE_Semmeting_2015
ProteomeXchange_and_PRIDE_Semmeting_2015ProteomeXchange_and_PRIDE_Semmeting_2015
ProteomeXchange_and_PRIDE_Semmeting_2015
Juan Antonio Vizcaino
 
Proteomics public data resources: enabling "big data" analysis in proteomics
Proteomics public data resources: enabling "big data" analysis in proteomicsProteomics public data resources: enabling "big data" analysis in proteomics
Proteomics public data resources: enabling "big data" analysis in proteomics
Juan Antonio Vizcaino
 
Presentation PRIDE Cluster HUPO 2014
Presentation PRIDE Cluster HUPO 2014Presentation PRIDE Cluster HUPO 2014
Presentation PRIDE Cluster HUPO 2014
Juan Antonio Vizcaino
 
PRIDE and ProteomeXchange: Training webinar
PRIDE and ProteomeXchange: Training webinarPRIDE and ProteomeXchange: Training webinar
PRIDE and ProteomeXchange: Training webinar
Juan Antonio Vizcaino
 
PRIDE and ProteomeXchange – Making proteomics data accessible and reusable
PRIDE and ProteomeXchange – Making proteomics data accessible and reusablePRIDE and ProteomeXchange – Making proteomics data accessible and reusable
PRIDE and ProteomeXchange – Making proteomics data accessible and reusable
Yasset Perez-Riverol
 
PRIDE-ProteomeXchange
PRIDE-ProteomeXchangePRIDE-ProteomeXchange
PRIDE-ProteomeXchange
Juan Antonio Vizcaino
 
Mass Spectrometry Informatics formats in progress
Mass Spectrometry Informatics formats in progressMass Spectrometry Informatics formats in progress
Mass Spectrometry Informatics formats in progress
Juan Antonio Vizcaino
 
Data volumes in proteomics data resources: PRIDE and ProteomeXchange
Data volumes in proteomics data resources: PRIDE and ProteomeXchangeData volumes in proteomics data resources: PRIDE and ProteomeXchange
Data volumes in proteomics data resources: PRIDE and ProteomeXchange
Juan Antonio Vizcaino
 
Proteomics and the "big data" trend: challenges and new possibilitites (Talk ...
Proteomics and the "big data" trend: challenges and new possibilitites (Talk ...Proteomics and the "big data" trend: challenges and new possibilitites (Talk ...
Proteomics and the "big data" trend: challenges and new possibilitites (Talk ...
Juan Antonio Vizcaino
 
Mining the hidden proteome using hundreds of public proteomics datasets
Mining the hidden proteome using hundreds of public proteomics datasetsMining the hidden proteome using hundreds of public proteomics datasets
Mining the hidden proteome using hundreds of public proteomics datasets
Juan Antonio Vizcaino
 
The ProteomeXchange Consoritum: 2017 update
The ProteomeXchange Consoritum: 2017 updateThe ProteomeXchange Consoritum: 2017 update
The ProteomeXchange Consoritum: 2017 update
Juan Antonio Vizcaino
 
OpenAIRE: Science. Set Free, Iryna Kuchma, EIFL
OpenAIRE: Science. Set Free, Iryna Kuchma, EIFLOpenAIRE: Science. Set Free, Iryna Kuchma, EIFL
OpenAIRE: Science. Set Free, Iryna Kuchma, EIFL
Platforma Otwartej Nauki
 
PRIDE resources and ProteomeXchange
PRIDE resources and ProteomeXchangePRIDE resources and ProteomeXchange
PRIDE resources and ProteomeXchange
Juan Antonio Vizcaino
 

Similar to Submitting your data to ProteomeXchange – a mini tutorial (20)

AHUPO_Vizcaino_remote_presentation_082014
AHUPO_Vizcaino_remote_presentation_082014AHUPO_Vizcaino_remote_presentation_082014
AHUPO_Vizcaino_remote_presentation_082014
 
ProteomeXchange Experience: PXD Identifiers and Release of Data on Acceptance...
ProteomeXchange Experience: PXD Identifiers and Release of Data on Acceptance...ProteomeXchange Experience: PXD Identifiers and Release of Data on Acceptance...
ProteomeXchange Experience: PXD Identifiers and Release of Data on Acceptance...
 
Pride and ProteomeXchange
Pride and ProteomeXchangePride and ProteomeXchange
Pride and ProteomeXchange
 
Human microbiome project
Human microbiome projectHuman microbiome project
Human microbiome project
 
PRIDE and ProteomeXchange
PRIDE and ProteomeXchangePRIDE and ProteomeXchange
PRIDE and ProteomeXchange
 
MS Imaging data in ProteomeXchange (HUPO 2014)
MS Imaging data in ProteomeXchange (HUPO 2014)MS Imaging data in ProteomeXchange (HUPO 2014)
MS Imaging data in ProteomeXchange (HUPO 2014)
 
ProteomeXchange: data deposition and data retrieval made easy
ProteomeXchange: data deposition and data retrieval made easyProteomeXchange: data deposition and data retrieval made easy
ProteomeXchange: data deposition and data retrieval made easy
 
ProteomeXchange_and_PRIDE_Semmeting_2015
ProteomeXchange_and_PRIDE_Semmeting_2015ProteomeXchange_and_PRIDE_Semmeting_2015
ProteomeXchange_and_PRIDE_Semmeting_2015
 
Proteomics public data resources: enabling "big data" analysis in proteomics
Proteomics public data resources: enabling "big data" analysis in proteomicsProteomics public data resources: enabling "big data" analysis in proteomics
Proteomics public data resources: enabling "big data" analysis in proteomics
 
Presentation PRIDE Cluster HUPO 2014
Presentation PRIDE Cluster HUPO 2014Presentation PRIDE Cluster HUPO 2014
Presentation PRIDE Cluster HUPO 2014
 
PRIDE and ProteomeXchange: Training webinar
PRIDE and ProteomeXchange: Training webinarPRIDE and ProteomeXchange: Training webinar
PRIDE and ProteomeXchange: Training webinar
 
PRIDE and ProteomeXchange – Making proteomics data accessible and reusable
PRIDE and ProteomeXchange – Making proteomics data accessible and reusablePRIDE and ProteomeXchange – Making proteomics data accessible and reusable
PRIDE and ProteomeXchange – Making proteomics data accessible and reusable
 
PRIDE-ProteomeXchange
PRIDE-ProteomeXchangePRIDE-ProteomeXchange
PRIDE-ProteomeXchange
 
Mass Spectrometry Informatics formats in progress
Mass Spectrometry Informatics formats in progressMass Spectrometry Informatics formats in progress
Mass Spectrometry Informatics formats in progress
 
Data volumes in proteomics data resources: PRIDE and ProteomeXchange
Data volumes in proteomics data resources: PRIDE and ProteomeXchangeData volumes in proteomics data resources: PRIDE and ProteomeXchange
Data volumes in proteomics data resources: PRIDE and ProteomeXchange
 
Proteomics and the "big data" trend: challenges and new possibilitites (Talk ...
Proteomics and the "big data" trend: challenges and new possibilitites (Talk ...Proteomics and the "big data" trend: challenges and new possibilitites (Talk ...
Proteomics and the "big data" trend: challenges and new possibilitites (Talk ...
 
Mining the hidden proteome using hundreds of public proteomics datasets
Mining the hidden proteome using hundreds of public proteomics datasetsMining the hidden proteome using hundreds of public proteomics datasets
Mining the hidden proteome using hundreds of public proteomics datasets
 
The ProteomeXchange Consoritum: 2017 update
The ProteomeXchange Consoritum: 2017 updateThe ProteomeXchange Consoritum: 2017 update
The ProteomeXchange Consoritum: 2017 update
 
OpenAIRE: Science. Set Free, Iryna Kuchma, EIFL
OpenAIRE: Science. Set Free, Iryna Kuchma, EIFLOpenAIRE: Science. Set Free, Iryna Kuchma, EIFL
OpenAIRE: Science. Set Free, Iryna Kuchma, EIFL
 
PRIDE resources and ProteomeXchange
PRIDE resources and ProteomeXchangePRIDE resources and ProteomeXchange
PRIDE resources and ProteomeXchange
 

More from Juan Antonio Vizcaino

Reusing and integrating public proteomics data to improve our knowledge of th...
Reusing and integrating public proteomics data to improve our knowledge of th...Reusing and integrating public proteomics data to improve our knowledge of th...
Reusing and integrating public proteomics data to improve our knowledge of th...
Juan Antonio Vizcaino
 
Introduction to the PSI standard data formats
Introduction to the PSI standard data formatsIntroduction to the PSI standard data formats
Introduction to the PSI standard data formats
Juan Antonio Vizcaino
 
Reuse of public proteomics data
Reuse of public proteomics dataReuse of public proteomics data
Reuse of public proteomics data
Juan Antonio Vizcaino
 
Proteomics repositories
Proteomics repositoriesProteomics repositories
Proteomics repositories
Juan Antonio Vizcaino
 
Introduction to the Proteomics Bioinformatics Course 2018
Introduction to the Proteomics Bioinformatics Course 2018Introduction to the Proteomics Bioinformatics Course 2018
Introduction to the Proteomics Bioinformatics Course 2018
Juan Antonio Vizcaino
 
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
Juan Antonio Vizcaino
 
PSI-Proteome Informatics update
PSI-Proteome Informatics updatePSI-Proteome Informatics update
PSI-Proteome Informatics update
Juan Antonio Vizcaino
 
ProteomeXchange update
ProteomeXchange updateProteomeXchange update
ProteomeXchange update
Juan Antonio Vizcaino
 
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
Juan Antonio Vizcaino
 
The ELIXIR Proteomics community
The ELIXIR Proteomics community The ELIXIR Proteomics community
The ELIXIR Proteomics community
Juan Antonio Vizcaino
 
The ELIXIR Proteomics Community
The ELIXIR Proteomics CommunityThe ELIXIR Proteomics Community
The ELIXIR Proteomics Community
Juan Antonio Vizcaino
 
A proteomics data “gold mine” at your disposal: Now that the data is there, w...
A proteomics data “gold mine” at your disposal: Now that the data is there, w...A proteomics data “gold mine” at your disposal: Now that the data is there, w...
A proteomics data “gold mine” at your disposal: Now that the data is there, w...
Juan Antonio Vizcaino
 
Public proteomics data: a (mostly unexploited) gold mine for computational re...
Public proteomics data: a (mostly unexploited) gold mine for computational re...Public proteomics data: a (mostly unexploited) gold mine for computational re...
Public proteomics data: a (mostly unexploited) gold mine for computational re...
Juan Antonio Vizcaino
 
How to run and maintain a popular biological data repository?
How to run and maintain a popular biological data repository?How to run and maintain a popular biological data repository?
How to run and maintain a popular biological data repository?
Juan Antonio Vizcaino
 
Reuse of public proteomics data
Reuse of public proteomics dataReuse of public proteomics data
Reuse of public proteomics data
Juan Antonio Vizcaino
 
Proteomics repositories
Proteomics repositoriesProteomics repositories
Proteomics repositories
Juan Antonio Vizcaino
 
Proteomics data standards
Proteomics data standardsProteomics data standards
Proteomics data standards
Juan Antonio Vizcaino
 
Introduction to the Proteomics Bioinformatics Course 2017
Introduction to the Proteomics Bioinformatics Course 2017Introduction to the Proteomics Bioinformatics Course 2017
Introduction to the Proteomics Bioinformatics Course 2017
Juan Antonio Vizcaino
 
Is it feasible to identify novel biomarkers by mining public proteomics data?
Is it feasible to identify novel biomarkers by mining public proteomics data?Is it feasible to identify novel biomarkers by mining public proteomics data?
Is it feasible to identify novel biomarkers by mining public proteomics data?
Juan Antonio Vizcaino
 
PRIDE and ProteomeXchange: A golden age for working with public proteomics data
PRIDE and ProteomeXchange: A golden age for working with public proteomics dataPRIDE and ProteomeXchange: A golden age for working with public proteomics data
PRIDE and ProteomeXchange: A golden age for working with public proteomics data
Juan Antonio Vizcaino
 

More from Juan Antonio Vizcaino (20)

Reusing and integrating public proteomics data to improve our knowledge of th...
Reusing and integrating public proteomics data to improve our knowledge of th...Reusing and integrating public proteomics data to improve our knowledge of th...
Reusing and integrating public proteomics data to improve our knowledge of th...
 
Introduction to the PSI standard data formats
Introduction to the PSI standard data formatsIntroduction to the PSI standard data formats
Introduction to the PSI standard data formats
 
Reuse of public proteomics data
Reuse of public proteomics dataReuse of public proteomics data
Reuse of public proteomics data
 
Proteomics repositories
Proteomics repositoriesProteomics repositories
Proteomics repositories
 
Introduction to the Proteomics Bioinformatics Course 2018
Introduction to the Proteomics Bioinformatics Course 2018Introduction to the Proteomics Bioinformatics Course 2018
Introduction to the Proteomics Bioinformatics Course 2018
 
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
 
PSI-Proteome Informatics update
PSI-Proteome Informatics updatePSI-Proteome Informatics update
PSI-Proteome Informatics update
 
ProteomeXchange update
ProteomeXchange updateProteomeXchange update
ProteomeXchange update
 
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
 
The ELIXIR Proteomics community
The ELIXIR Proteomics community The ELIXIR Proteomics community
The ELIXIR Proteomics community
 
The ELIXIR Proteomics Community
The ELIXIR Proteomics CommunityThe ELIXIR Proteomics Community
The ELIXIR Proteomics Community
 
A proteomics data “gold mine” at your disposal: Now that the data is there, w...
A proteomics data “gold mine” at your disposal: Now that the data is there, w...A proteomics data “gold mine” at your disposal: Now that the data is there, w...
A proteomics data “gold mine” at your disposal: Now that the data is there, w...
 
Public proteomics data: a (mostly unexploited) gold mine for computational re...
Public proteomics data: a (mostly unexploited) gold mine for computational re...Public proteomics data: a (mostly unexploited) gold mine for computational re...
Public proteomics data: a (mostly unexploited) gold mine for computational re...
 
How to run and maintain a popular biological data repository?
How to run and maintain a popular biological data repository?How to run and maintain a popular biological data repository?
How to run and maintain a popular biological data repository?
 
Reuse of public proteomics data
Reuse of public proteomics dataReuse of public proteomics data
Reuse of public proteomics data
 
Proteomics repositories
Proteomics repositoriesProteomics repositories
Proteomics repositories
 
Proteomics data standards
Proteomics data standardsProteomics data standards
Proteomics data standards
 
Introduction to the Proteomics Bioinformatics Course 2017
Introduction to the Proteomics Bioinformatics Course 2017Introduction to the Proteomics Bioinformatics Course 2017
Introduction to the Proteomics Bioinformatics Course 2017
 
Is it feasible to identify novel biomarkers by mining public proteomics data?
Is it feasible to identify novel biomarkers by mining public proteomics data?Is it feasible to identify novel biomarkers by mining public proteomics data?
Is it feasible to identify novel biomarkers by mining public proteomics data?
 
PRIDE and ProteomeXchange: A golden age for working with public proteomics data
PRIDE and ProteomeXchange: A golden age for working with public proteomics dataPRIDE and ProteomeXchange: A golden age for working with public proteomics data
PRIDE and ProteomeXchange: A golden age for working with public proteomics data
 

Recently uploaded

mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
HongcNguyn6
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
SSR02
 
Red blood cells- genesis-maturation.pptx
Red blood cells- genesis-maturation.pptxRed blood cells- genesis-maturation.pptx
Red blood cells- genesis-maturation.pptx
muralinath2
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
muralinath2
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
University of Maribor
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdfDMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
fafyfskhan251kmf
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptxBREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
RASHMI M G
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
tonzsalvador2222
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
Abdul Wali Khan University Mardan,kP,Pakistan
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
pablovgd
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
RenuJangid3
 
Introduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptxIntroduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptx
zeex60
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
yqqaatn0
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
Lokesh Patil
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
Richard Gill
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
Gokturk Mehmet Dilci
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
RitabrataSarkar3
 

Recently uploaded (20)

mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
 
Red blood cells- genesis-maturation.pptx
Red blood cells- genesis-maturation.pptxRed blood cells- genesis-maturation.pptx
Red blood cells- genesis-maturation.pptx
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdfDMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptxBREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
Introduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptxIntroduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptx
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
 

Submitting your data to ProteomeXchange – a mini tutorial

  • 1. Submitting your data to ProteomeXchange – A Mini-Tutorial Dr. Juan Antonio Vizcaíno PRIDE Group Coordinator Proteomics Services Team EMBL-EBI Hinxton, Cambridge, UK
  • 2. Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Overview • The ProteomeXchange (PX) consortium • How to submit and access data in PX via PRIDE • How to access PX data • Submitting data triggers data reuse
  • 3. ProteomeXchange Consortium • Goal: Development of a framework to allow standard data submission and dissemination pipelines between the main existing proteomics repositories. • Includes PeptideAtlas (ISB, Seattle), PRIDE (Cambridge, UK) and (very recently) MassIVE (UCSD, San Diego). • Common identifier space (PXD identifiers) • Two supported data workflows: MS/MS and SRM. • Main objective: Make life easier for researchers http://www.proteomexchange.org Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014
  • 4. ProteomeXchange data workflow Results Raw Data* Juan A. Vizcaíno juan@ebi.ac.uk ProteomeCentral PRIDE (MS/MS data) 13th HUPO World Congress Madrid, 7 October 2014 Metadata / Manuscript Journals UniProt/ neXtProt Peptide Atlas Other DBs Receiving repositories PASSEL (SRM data) Other DBs GPMDB Researcher’s results Reprocessed results Raw data* Metadata MassIVE (MS/MS data) Vizcaíno et al., Nat Biotechnol, 2014
  • 5. ProteomeXchange data workflow Results Raw Data* Juan A. Vizcaíno juan@ebi.ac.uk ProteomeCentral PRIDE (MS/MS data) 13th HUPO World Congress Madrid, 7 October 2014 Metadata / Manuscript Journals UniProt/ neXtProt Peptide Atlas Other DBs Receiving repositories PASSEL (SRM data) Other DBs GPMDB Researcher’s results Reprocessed results Raw data* Metadata MassIVE (MS/MS data) Vizcaíno et al., Nat Biotechnol, 2014
  • 6. Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Overview • The ProteomeXchange (PX) consortium • How to submit and access data in PX via PRIDE • How to access PX data • Submitting data triggers data reuse
  • 7. PRIDE (PRoteomics IDEntifications) database http://www.ebi.ac.uk/pride Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 • Focused on MS/MS approaches • Other data types can also be stored as “Partial submissions”. Martens et al., Proteomics, 2005 Vizcaíno et al., NAR, 2013
  • 8. Manuscript just out detailing the process http://www.proteomexchange.org/submission Ternent et al., Proteomics, 2014 Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Example dataset: PXD000764 - Title: “Discovery of new CSF biomarkers for meningitis in children” - 12 runs: 4 controls and 8 infected samples - Identification and quantification data
  • 9. PX Data workflow for MS/MS data Juan A. Vizcaíno juan@ebi.ac.uk 1. Mass spectrometer output files: raw data (binary files) or peak list spectra in a standardized format (mzML, mzXML). 2. Result files: a. Complete submissions: Result files can be converted to PRIDE XML or the mzIdentML data standard. b. Partial submissions: For workflows not yet supported by PRIDE, search engine output files will be stored and provided in their original form. 3. Metadata: Sufficiently detailed description of sample origin, workflow, instrumentation, submitter. 4. Other files: Optional files: a. QUANT: Quantification related results e. FASTA b. PEAK: Peak list files f. SP_LIBRARY c. GEL: Gel images d. OTHER: Any other file type 13th HUPO World Congress Madrid, 7 October 2014 Published Raw Files Other files
  • 10. Complete vs Partial submissions: experimental metadata Complete Partial General experimental metadata about the projects is similar. However, at the assay level information in partial submissions is not so detailed Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014
  • 11. Complete vs Partial submissions: processed results For complete submissions, it is possible to connect the spectra with the identification processed results and they can be visualized. Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Complete Partial
  • 12. How to perform a complete PX submission to PRIDE • Decide between a complete/partial submission. • File conversion/export to PRIDE XML or mzIdentML • File check before submission (PRIDE Inspector) • Experimental annotation and actual file submission (PX Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 submission tool) • Post-submission steps
  • 13. How to perform a complete PX submission to PRIDE • Decide between a complete/partial submission. • File conversion/export to PRIDE XML or mzIdentML • File check before submission (PRIDE Inspector) • Experimental annotation and actual file submission (PX Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 submission tool) • Post-submission steps
  • 14. How to perform a complete PX submission to PRIDE • Complete submission: • MS/MS data. • Processed results can be converted to the PSI standard mzIdentML or PRIDE XML. Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 • Partial submission: • Any type of data (not SRM, which goes to PASSEL) • E.g. top down, data independent acquisition, MS Imaging (to come), etc. • Processed results cannot be converted to a data standard.
  • 15. How to perform a complete PX submission to PRIDE • Decide between a complete/partial submission. • File conversion/export to mzIdentML or PRIDE XML • File check before submission (PRIDE Inspector) • Experimental annotation and actual file submission (PX Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 submission tool) • Post-submission steps
  • 16. Juan A. Vizcaíno juan@ebi.ac.uk Complete submissions An increasing number of tools support export to mzIdentML 1.1 13th HUPO World Congress Madrid, 7 October 2014 Search Engine Results + MS files Search engines mzIdentML - Mascot - MSGF+ - Myrimatch and related tools from D. Tabb’s lab - OpenMS - PEAKS - ProCon (ProteomeDiscoverer, Sequest) - Scaffold - TPP via the idConvert tool (ProteoWizard) - ProteinPilot (planned by the end of 2014) - Others: library for X!Tandem conversion, lab internal pipelines, … - Referenced spectral files need to be submitted as well (all open formats are supported). Updated list: http://www.psidev.info/tools-implementing-mzIdentML#.
  • 17. Original data files ‘RESULT’ file generation Final ‘RESULT’ file Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Search output files Spectra files PRIDE XML ‘RESULT’ Before: file conversion File conversion PRIDE Converter
  • 18. Tools ‘RESULT’ file generation Final ‘RESULT’ file Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 mzIdentML ‘RESULT’ Now: native file export Spectra files Mascot ProteinPilo t Scaffold PEAKS MSGF+ Others Native File export
  • 19. How to perform a complete PX submission to PRIDE • Decide between a complete/partial submission. • File conversion/export to PRIDE XML or mzIdentML • File check before submission (PRIDE Inspector) • Experimental annotation and actual file submission (PX Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 submission tool) • Post-submission steps
  • 20. Available for complete submissions Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Wang et al., Nat. Biotechnology, 2012 PRIDE Inspector 2.0 PRIDE Inspector 2.0 supports: - PRIDE XML - mzIdentML + all types of spectra files - mzML - mzTab Ident (work in progress) http://code.google.com/p/pride-toolsuite/ wiki/PRIDEInspector
  • 21. How to perform a complete PX submission to PRIDE • Decide between a complete/partial submission. • File conversion/export to PRIDE XML or mzIdentML • File check before submission (PRIDE Inspector) • Experimental annotation and actual file submission (PX Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 submission tool) • Post-submission steps
  • 22. PX submission tool • Capture the mappings between the different types of files. • Make the file upload process straightforward to the submitter (It transfers all the files using Aspera or FTP). Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Published Raw Other files http://www.proteomexchange.org/submission PX submission tool • Command line alternative: some scripting is needed
  • 23. PX submission tool: step by step Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Step 1 Step 2
  • 24. PX submission tool: step by step Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Step 3 Step 4
  • 25. PX submission tool: step by step Juan A. Vizcaíno juan@ebi.ac.uk Step 5 Step 6 13th HUPO World Congress Madrid, 7 October 2014
  • 26. PX submission tool: step by step Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Step 7 Step 8
  • 27. PX submission tool: step by step Juan A. Vizcaíno juan@ebi.ac.uk Step 9 13th HUPO World Congress Madrid, 7 October 2014
  • 28. Fast file transfer with Aspera - Aspera is the default file transfer protocol to PRIDE: - Up to 50X faster than FTP Juan A. Vizcaíno juan@ebi.ac.uk - PX Submission tool - Command line File transfer speed should not be a problem!! 13th HUPO World Congress Madrid, 7 October 2014
  • 29. PX submission tool: HPP tags Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014
  • 30. Batch submissions on the command line • Generate on your own the PX summary file (generated by default by the PX submission tool). MTD first_name John Arthur MTD last_name Smith MTD email john.smith@cam.edu MTD affiliation University of Cambridge MTD title Human proteome MTD description An experiment about human proteome MTD keyword human, proteome MTD pubmed 12345 MTD px 10.1000/182 MTD pride_login pride-user FMH file_id file_type file_path file_mapping FME 1 result /path/to/pride/xml/files/pride-1.xml7,8,9 FME 2 result /path/to/pride/xml/files/pride-2.xml4 FME 3 result /path/to/mzidentml/files/mzidentml-1.xml 5,10 FME 4 raw /path/to/raw/files/raw-1.bin FME 5 raw /path/to/raw/files/raw-2.bin FME 6 raw /path/to/raw/files/raw-3.bin FME 7 raw ftp://some.url/at/some/place/raw-4.bin FME 8 search/path/to/search/engine/output/search-1.out FME 9 other /path/to/other/file/other-1.e FME 10 peak /path/to/peak/list/mzml-1.xml Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014
  • 31. Batch submissions on the command line • Generate on your own the PX summary file (generated by default by the PX submission tool). Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014
  • 32. Batch submissions on the command line (2) • Generate on your own the PX summary file (generated by default by the PX submission tool). • Put together all the files plus the PX summary file. • Ask PRIDE team for a specific upload directory (pride-support@ Juan A. Vizcaíno juan@ebi.ac.uk ebi.ac.uk) 13th HUPO World Congress Madrid, 7 October 2014
  • 33. How to perform a complete PX submission to PRIDE • Decide between a complete/partial submission. • File conversion/export to PRIDE XML or mzIdentML • File check before submission (PRIDE Inspector) • Experimental annotation and actual file submission (PX Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 submission tool) • Post-submission steps
  • 34. Post-processing steps • PRIDE curators will check the files • Files must be valid to the schema • All the required annotations must be there • Basic QC check (e.g. detect errors in PTM annotation) • If everything is correct, submission to PRIDE is done • The author receives a PXD identifier, a reviewer username and a password, and a DOI (for complete submissions). Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014
  • 35. Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Overview • The ProteomeXchange (PX) consortium • How to submit and access data in PX via PRIDE • How to access PX data • Submitting data triggers data reuse
  • 36. ProteomeXchange data workflow Results Raw Data* Juan A. Vizcaíno juan@ebi.ac.uk ProteomeCentral PRIDE (MS/MS data) 13th HUPO World Congress Madrid, 7 October 2014 Metadata / Manuscript Journals UniProt/ neXtProt Peptide Atlas Other DBs Receiving repositories PASSEL (SRM data) Other DBs GPMDB Researcher’s results Reprocessed results Raw data* Metadata MassIVE (MS/MS data) Vizcaíno et al., Nat Biotechnol, 2014
  • 37. ProteomeCentral: Portal for all PX datasets http://proteomecentral.proteomexchange.org/cgi/GetDataset Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014
  • 38. ProteomeXchange: 1,148 datasets up until August 2014 Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Origin: 235 USA 142 Germany 97 United Kingdom 67 Switzerland 64 Netherlands 62 China 60 France 48 Canada 43 Spain 36 Belgium 32 Sweden 29 Australia 26 Denmark 23 Japan 18 Taiwan 17 India 16 Ireland 14 Norway 14 Italy 12 Finland 11 Republic of Korea 10 Brazil 8 Austria 7 Israel 7 Singapore … Type: 386 PRIDE complete 687 PRIDE partial 51 PeptideAtlas/PASSEL complete 1 MassIVE 23 reprocessed Publicly Accessible: 544 datasets, 50% of all 90% PRIDE 10% PASSEL Top Species studied by at least 10 datasets: 510 Homo sapiens 142 Mus musculus 46 Saccharomyces cerevisiae 45 Arabidopsis thaliana 23 Rattus norvegicus 16 Escherichia coli 15 Bos taurus 15 Mycobacterium tuberculosis 13 Oryza sativa 12 Drosophila melanogaster 12 Glycine max ~ 265 species in total Data volume: Total: ~51 TB Number of all files: ~130,000 PXD000320-324: ~ 5 TB PXD000065: ~ 1.4TB Datasets/year: 2012: 102 2013: 527 2014: 519
  • 39. Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Overview • The ProteomeXchange (PX) consortium • How to submit and access data in PX via PRIDE • How to access PX data • Submitting data triggers data reuse
  • 40. Which are the most accessed datasets? PXD Identifier Hits Dataset title Publication PXD000561 153512 A draft map of the human proteome Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Kim et al., Nature,2014. PMID: 24870542 PXD000851 111587 Membrane proteomic analysis of colorectal cancer tissue Kume et al., MCP, 2014. PMID:24687888 PXD000865 51639 Mass spectrometry based draft of the human proteome Wilhelm et al., 2014, Nature, PMID:24870543
  • 41. Which are the most accessed datasets? Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Total Numbers
  • 42. CompOmics Open Source Analysis Pipeline Juan A. Vizcaíno juan@ebi.ac.uk Vaudel M, Burkhart J, Zahedi RP, Berven FS, Sickmann A, Martens L, Barsnes H: Nature Biotechnology (in press) 13th HUPO World Congress Madrid, 7 October 2014 Vaudel M, Barsnes H, Berven FS, Sickmann A, Martens L: Proteomics 2011;11(5):996-9. http://searchgui.googlecode.com http://peptide-shaker.googlecode.com
  • 43. Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Find the desired PRIDE project … … inspect the project details …. … and start re-analyzing the data! Reshake PRIDE data!
  • 44. Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Conclusions • Submission to ProteomeXchange via PRIDE is easy. • Decide between complete and partial submissions. • Different open source tools available to facilitate the process. • File transfer speed should not be a problem (Aspera support)
  • 45. Acknowledgements Juan A. Vizcaíno juan@ebi.ac.uk PeptideAtlas Team (ISB, Seattle) Eric Deutsch Terry Farrah Zhi Sun Andrew R. Jones Lennart Martens Juan Pablo Albar Martin Eisenacher Gil Omenn And many other PX partners and stakeholders 13th HUPO World Congress Madrid, 7 October 2014 PRIDE Team Attila Csordas Rui Wang Florian Reisinger Jose A. Dianes Tobias Ternent Yasset Perez-Riverol Noemi del Toro Henning Hermjakob EU FP7 grant number 260558
  • 46. Juan A. Vizcaíno juan@ebi.ac.uk 13th HUPO World Congress Madrid, 7 October 2014 Questions?