SlideShare a Scribd company logo
1 of 22
mzTab - Reporting MS-based Proteomics 
and Metabolomics Results 
Dr. Juan A. Vizcaíno on behalf of 
Dr. Johannes Griss 
Proteomics Services Team 
EMBL-EBI 
Hinxton, Cambridge, UK 
Division of Immunology, Allergy and 
Infectious Diseases 
Department of Dermatology 
Medical University of Vienna, Austria
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014 
Overview 
• Need for mzTab 
• Details about the data format (mzTab 1.0) 
• Existing software implementations 
• Extension of mzTab 1.0 for metabolomics
HUPO Proteomics Standards Initiative 
•Develops data format standards for proteomics. 
•Both data representation and annotation standards. 
•Involves data producers, database providers, software 
producers, publishers, … 
•Active Workgroups: MI, MS, PI, Mod, (Protein Separation). 
•Inter-group activities: MIAPE and Controlled Vocabularies. 
•Started in 2002, so some experience already… 
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014 
www.psidev.info
PSI-MS/PI Standard File Formats before mzTab 
Quantitation •mzQuantML 
Identification •mzIdentML 
MS data •mzML 
Johannes Griss 
jgriss@ebi.ac.uk 
SRM • TraML 
HUPO 2014
Reasons for an additional file format (mzTab) 
• mzIdentML and mzQuantML (necessary) focus on 
complete representation of proteomics results 
• Complex XML-based file formats 
• Specialised software required for visualisation 
• In-depth bioinformatics understanding required to create and 
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014 
use files 
• No simple method to communicate final results to non-proteomics 
experts 
• No simple method to utilise files through scripting 
languages and standard statistical software
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014 
mzTab – Aims 
• Store final results of MS-based experiment in a single file 
• Quantitation data 
• Identification data 
• Small Molecule data 
• Reduce complexity to make data accessible to non-proteomics 
/ bioinformatics experts 
• Be easily accessible using “standard” software
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014 
mzTab – Aims 
• What the format does NOT aim at: 
• Replace mzIdentML or mzQuantML for proteomics 
approaches 
• Contain the complete data of a MS based experiment 
• Provide fully detailed evidence for the data 
• Allow a researcher to recreate the process which led to the 
results
Why a tab-delimited file? 
• Using XML based formats requires sophisticated 
bioinformatics expertise 
• Many researchers are still used to use MS Excel to “look” 
at or exchange their data. 
• Standard tab-delimited file formats for transcriptomics 
(MAGE-TAB) and molecular interactions (MI-TAB) data 
were already successful 
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014 
mzTab format 
http://mztab.googlecode.com
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014 
mzTab - Sections 
• Basic information about experiment and sample 
• Key-Value pairs Metadata 
• Basic information about protein identifications 
• Table-based Protein 
• Information about quantified peptides 
• Table-based Peptide 
• Information about identified spectra 
• Table-based PSM 
• Basic information about identified small molecules 
• Table-based Small Molecule
Metadata section - Example 
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014
mzTab –Modes and Types 
• Modes (depending on the level of detail): 
• ‘Summary’: only the ‘final results’. 
• ‘Complete’: detailed information for each individual assay or 
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014 
replicate is provided. 
• Types: 
• ‘Identification’: Only identification results. 
• ‘Quantification’: They can also contain identification results. 
• Overall, 4 different files “flavors” are possible, so very 
flexible design.
Protein Section (label-free) 
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014
Protein Section (label-free) 
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014
Peptide Section (label-free) 
• Only used in “Quantification” files. 
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014
PSM section (identification data) 
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014
mzTab – Current implementations 
• jmzTab (Java API): Version 3.0 is now a stable version. Manuscript 
published in the journal Proteomics. 
• mzTab Validator, PRIDE XML to mzTab converter (PRIDE team). 
• mzIdentML and mzQuantML to mzTab converters (Andy Jones 
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014 
group). 
• MaxQuant: exporter in beta is available. 
• OpenMS (version 1.10). 
• R/Bioconductor package Msnbase (L. Gatto, Cambridge University). 
• LipidDataAnalyzer (J. Hartler, University of Graz, see next talk). 
• Metabolights (EBI).
mzTab – ongoing development 
• More detailed modelling of MS metabolomics data 
• Led by S. Neumann (COSMOS EU FP7 project). 
• Extension from one to three sections. 
Example file exists at 
https://github.com/sneumann/mtbls2/faahKO.mzTab 
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014 
http://www.cosmos-fp7.eu/
mzTab format related publications 
J. Griss et al., MCP, 2014 
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014 
http://code.google.com/p/mztab/ 
Q.W. Xu et al., Proteomics, 2014
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014 
mzTab format 
http://mztab.googlecode.com
Current PSI-MS/PI Standard File Formats 
Final Results • mzTab 
Quantitation • mzQuantML 
Identification • mzIdentML 
MS data • mzML 
Johannes Griss 
jgriss@ebi.ac.uk 
SRM • TraML 
HUPO 2014
Acknowledgements 
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014 
Johannes Griss 
Qing-Wei Xu 
Henning Hermjakob 
Timo Sachsenberg 
Mathias Walzer 
Oliver Kohlbacher 
http://mztab.googlecode.com 
Andy Jones 
S. Neumann and other COSMOS 
partners 
PSI editor and reviewers 
… and many others have 
also contributed 
BBSRC PROCESS grant 
BBSRC ProteoSuite grant

More Related Content

Similar to The mzTab data standard format for reporting MS-based peptide, protein and small molecule identification and quantification results

Similar to The mzTab data standard format for reporting MS-based peptide, protein and small molecule identification and quantification results (20)

PSI-Proteome Informatics update
PSI-Proteome Informatics updatePSI-Proteome Informatics update
PSI-Proteome Informatics update
 
Mass Spectrometry Informatics formats in progress
Mass Spectrometry Informatics formats in progressMass Spectrometry Informatics formats in progress
Mass Spectrometry Informatics formats in progress
 
Euro lipids 2014_graz
Euro lipids 2014_grazEuro lipids 2014_graz
Euro lipids 2014_graz
 
Proteomics data standards
Proteomics data standardsProteomics data standards
Proteomics data standards
 
Proteomics data standards
Proteomics data standardsProteomics data standards
Proteomics data standards
 
Introduction to the PSI standard data formats
Introduction to the PSI standard data formatsIntroduction to the PSI standard data formats
Introduction to the PSI standard data formats
 
Experiences to learn from the MS proteomics field
Experiences to learn from the MS proteomics fieldExperiences to learn from the MS proteomics field
Experiences to learn from the MS proteomics field
 
Proteomics data standards
Proteomics data standardsProteomics data standards
Proteomics data standards
 
Proteomics repositories
Proteomics repositoriesProteomics repositories
Proteomics repositories
 
Proteomics repositories
Proteomics repositoriesProteomics repositories
Proteomics repositories
 
Data volumes in proteomics data resources: PRIDE and ProteomeXchange
Data volumes in proteomics data resources: PRIDE and ProteomeXchangeData volumes in proteomics data resources: PRIDE and ProteomeXchange
Data volumes in proteomics data resources: PRIDE and ProteomeXchange
 
ProteomeXchange_and_PRIDE_Semmeting_2015
ProteomeXchange_and_PRIDE_Semmeting_2015ProteomeXchange_and_PRIDE_Semmeting_2015
ProteomeXchange_and_PRIDE_Semmeting_2015
 
Giab workshop update mar2019
Giab workshop update mar2019Giab workshop update mar2019
Giab workshop update mar2019
 
PRIDE and ProteomeXchange
PRIDE and ProteomeXchangePRIDE and ProteomeXchange
PRIDE and ProteomeXchange
 
Mik Black bioinformatics symposium
Mik Black bioinformatics symposiumMik Black bioinformatics symposium
Mik Black bioinformatics symposium
 
Mik Black bioinformatics symposium
Mik Black bioinformatics symposiumMik Black bioinformatics symposium
Mik Black bioinformatics symposium
 
PRIDE and ProteomeXchange: Training webinar
PRIDE and ProteomeXchange: Training webinarPRIDE and ProteomeXchange: Training webinar
PRIDE and ProteomeXchange: Training webinar
 
PRIDE-ProteomeXchange
PRIDE-ProteomeXchangePRIDE-ProteomeXchange
PRIDE-ProteomeXchange
 
EMBL-EBI Proteomics data resources and services
EMBL-EBI Proteomics data resources and servicesEMBL-EBI Proteomics data resources and services
EMBL-EBI Proteomics data resources and services
 
Proteomics repositories
Proteomics repositoriesProteomics repositories
Proteomics repositories
 

More from Juan Antonio Vizcaino

More from Juan Antonio Vizcaino (20)

Reusing and integrating public proteomics data to improve our knowledge of th...
Reusing and integrating public proteomics data to improve our knowledge of th...Reusing and integrating public proteomics data to improve our knowledge of th...
Reusing and integrating public proteomics data to improve our knowledge of th...
 
Reuse of public proteomics data
Reuse of public proteomics dataReuse of public proteomics data
Reuse of public proteomics data
 
PRIDE resources and ProteomeXchange
PRIDE resources and ProteomeXchangePRIDE resources and ProteomeXchange
PRIDE resources and ProteomeXchange
 
Proteomics repositories
Proteomics repositoriesProteomics repositories
Proteomics repositories
 
Introduction to the Proteomics Bioinformatics Course 2018
Introduction to the Proteomics Bioinformatics Course 2018Introduction to the Proteomics Bioinformatics Course 2018
Introduction to the Proteomics Bioinformatics Course 2018
 
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
 
ProteomeXchange update
ProteomeXchange updateProteomeXchange update
ProteomeXchange update
 
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
 
The ELIXIR Proteomics community
The ELIXIR Proteomics community The ELIXIR Proteomics community
The ELIXIR Proteomics community
 
The ELIXIR Proteomics Community
The ELIXIR Proteomics CommunityThe ELIXIR Proteomics Community
The ELIXIR Proteomics Community
 
A proteomics data “gold mine” at your disposal: Now that the data is there, w...
A proteomics data “gold mine” at your disposal: Now that the data is there, w...A proteomics data “gold mine” at your disposal: Now that the data is there, w...
A proteomics data “gold mine” at your disposal: Now that the data is there, w...
 
The ProteomeXchange Consoritum: 2017 update
The ProteomeXchange Consoritum: 2017 updateThe ProteomeXchange Consoritum: 2017 update
The ProteomeXchange Consoritum: 2017 update
 
Public proteomics data: a (mostly unexploited) gold mine for computational re...
Public proteomics data: a (mostly unexploited) gold mine for computational re...Public proteomics data: a (mostly unexploited) gold mine for computational re...
Public proteomics data: a (mostly unexploited) gold mine for computational re...
 
How to run and maintain a popular biological data repository?
How to run and maintain a popular biological data repository?How to run and maintain a popular biological data repository?
How to run and maintain a popular biological data repository?
 
Reuse of public proteomics data
Reuse of public proteomics dataReuse of public proteomics data
Reuse of public proteomics data
 
Introduction to the Proteomics Bioinformatics Course 2017
Introduction to the Proteomics Bioinformatics Course 2017Introduction to the Proteomics Bioinformatics Course 2017
Introduction to the Proteomics Bioinformatics Course 2017
 
Is it feasible to identify novel biomarkers by mining public proteomics data?
Is it feasible to identify novel biomarkers by mining public proteomics data?Is it feasible to identify novel biomarkers by mining public proteomics data?
Is it feasible to identify novel biomarkers by mining public proteomics data?
 
PRIDE and ProteomeXchange: A golden age for working with public proteomics data
PRIDE and ProteomeXchange: A golden age for working with public proteomics dataPRIDE and ProteomeXchange: A golden age for working with public proteomics data
PRIDE and ProteomeXchange: A golden age for working with public proteomics data
 
The spectra-cluster toolsuite: Enhancing proteomics analysis through spectrum...
The spectra-cluster toolsuite: Enhancing proteomics analysis through spectrum...The spectra-cluster toolsuite: Enhancing proteomics analysis through spectrum...
The spectra-cluster toolsuite: Enhancing proteomics analysis through spectrum...
 
ProteomeXchange update 2017
ProteomeXchange update 2017ProteomeXchange update 2017
ProteomeXchange update 2017
 

Recently uploaded

Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
Areesha Ahmad
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
1301aanya
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
RizalinePalanog2
 

Recently uploaded (20)

Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONSTS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
IDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicineIDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicine
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 

The mzTab data standard format for reporting MS-based peptide, protein and small molecule identification and quantification results

  • 1. mzTab - Reporting MS-based Proteomics and Metabolomics Results Dr. Juan A. Vizcaíno on behalf of Dr. Johannes Griss Proteomics Services Team EMBL-EBI Hinxton, Cambridge, UK Division of Immunology, Allergy and Infectious Diseases Department of Dermatology Medical University of Vienna, Austria
  • 2. Johannes Griss jgriss@ebi.ac.uk HUPO 2014 Overview • Need for mzTab • Details about the data format (mzTab 1.0) • Existing software implementations • Extension of mzTab 1.0 for metabolomics
  • 3. HUPO Proteomics Standards Initiative •Develops data format standards for proteomics. •Both data representation and annotation standards. •Involves data producers, database providers, software producers, publishers, … •Active Workgroups: MI, MS, PI, Mod, (Protein Separation). •Inter-group activities: MIAPE and Controlled Vocabularies. •Started in 2002, so some experience already… Johannes Griss jgriss@ebi.ac.uk HUPO 2014 www.psidev.info
  • 4. PSI-MS/PI Standard File Formats before mzTab Quantitation •mzQuantML Identification •mzIdentML MS data •mzML Johannes Griss jgriss@ebi.ac.uk SRM • TraML HUPO 2014
  • 5. Reasons for an additional file format (mzTab) • mzIdentML and mzQuantML (necessary) focus on complete representation of proteomics results • Complex XML-based file formats • Specialised software required for visualisation • In-depth bioinformatics understanding required to create and Johannes Griss jgriss@ebi.ac.uk HUPO 2014 use files • No simple method to communicate final results to non-proteomics experts • No simple method to utilise files through scripting languages and standard statistical software
  • 6. Johannes Griss jgriss@ebi.ac.uk HUPO 2014 mzTab – Aims • Store final results of MS-based experiment in a single file • Quantitation data • Identification data • Small Molecule data • Reduce complexity to make data accessible to non-proteomics / bioinformatics experts • Be easily accessible using “standard” software
  • 7. Johannes Griss jgriss@ebi.ac.uk HUPO 2014 mzTab – Aims • What the format does NOT aim at: • Replace mzIdentML or mzQuantML for proteomics approaches • Contain the complete data of a MS based experiment • Provide fully detailed evidence for the data • Allow a researcher to recreate the process which led to the results
  • 8. Why a tab-delimited file? • Using XML based formats requires sophisticated bioinformatics expertise • Many researchers are still used to use MS Excel to “look” at or exchange their data. • Standard tab-delimited file formats for transcriptomics (MAGE-TAB) and molecular interactions (MI-TAB) data were already successful Johannes Griss jgriss@ebi.ac.uk HUPO 2014
  • 9. Johannes Griss jgriss@ebi.ac.uk HUPO 2014 mzTab format http://mztab.googlecode.com
  • 10. Johannes Griss jgriss@ebi.ac.uk HUPO 2014 mzTab - Sections • Basic information about experiment and sample • Key-Value pairs Metadata • Basic information about protein identifications • Table-based Protein • Information about quantified peptides • Table-based Peptide • Information about identified spectra • Table-based PSM • Basic information about identified small molecules • Table-based Small Molecule
  • 11. Metadata section - Example Johannes Griss jgriss@ebi.ac.uk HUPO 2014
  • 12. mzTab –Modes and Types • Modes (depending on the level of detail): • ‘Summary’: only the ‘final results’. • ‘Complete’: detailed information for each individual assay or Johannes Griss jgriss@ebi.ac.uk HUPO 2014 replicate is provided. • Types: • ‘Identification’: Only identification results. • ‘Quantification’: They can also contain identification results. • Overall, 4 different files “flavors” are possible, so very flexible design.
  • 13. Protein Section (label-free) Johannes Griss jgriss@ebi.ac.uk HUPO 2014
  • 14. Protein Section (label-free) Johannes Griss jgriss@ebi.ac.uk HUPO 2014
  • 15. Peptide Section (label-free) • Only used in “Quantification” files. Johannes Griss jgriss@ebi.ac.uk HUPO 2014
  • 16. PSM section (identification data) Johannes Griss jgriss@ebi.ac.uk HUPO 2014
  • 17. mzTab – Current implementations • jmzTab (Java API): Version 3.0 is now a stable version. Manuscript published in the journal Proteomics. • mzTab Validator, PRIDE XML to mzTab converter (PRIDE team). • mzIdentML and mzQuantML to mzTab converters (Andy Jones Johannes Griss jgriss@ebi.ac.uk HUPO 2014 group). • MaxQuant: exporter in beta is available. • OpenMS (version 1.10). • R/Bioconductor package Msnbase (L. Gatto, Cambridge University). • LipidDataAnalyzer (J. Hartler, University of Graz, see next talk). • Metabolights (EBI).
  • 18. mzTab – ongoing development • More detailed modelling of MS metabolomics data • Led by S. Neumann (COSMOS EU FP7 project). • Extension from one to three sections. Example file exists at https://github.com/sneumann/mtbls2/faahKO.mzTab Johannes Griss jgriss@ebi.ac.uk HUPO 2014 http://www.cosmos-fp7.eu/
  • 19. mzTab format related publications J. Griss et al., MCP, 2014 Johannes Griss jgriss@ebi.ac.uk HUPO 2014 http://code.google.com/p/mztab/ Q.W. Xu et al., Proteomics, 2014
  • 20. Johannes Griss jgriss@ebi.ac.uk HUPO 2014 mzTab format http://mztab.googlecode.com
  • 21. Current PSI-MS/PI Standard File Formats Final Results • mzTab Quantitation • mzQuantML Identification • mzIdentML MS data • mzML Johannes Griss jgriss@ebi.ac.uk SRM • TraML HUPO 2014
  • 22. Acknowledgements Johannes Griss jgriss@ebi.ac.uk HUPO 2014 Johannes Griss Qing-Wei Xu Henning Hermjakob Timo Sachsenberg Mathias Walzer Oliver Kohlbacher http://mztab.googlecode.com Andy Jones S. Neumann and other COSMOS partners PSI editor and reviewers … and many others have also contributed BBSRC PROCESS grant BBSRC ProteoSuite grant