mzTab - Reporting MS-based Proteomics and Metabolomics Results
Dr. Juan A. Vizcaíno on behalf of Dr. Johannes Griss
Proteomics Services Team, EMBL-EBI, Hinxton, Cambridge, UK
Division of Immunology, Allergy and Infectious Diseases, Department of Dermatology, Medical University of Vienna, Austria
Johannes Griss 
jgriss@ebi.ac.uk 
HUPO 2014 
Overview 
• Need for mzTab 
• Details about the data format (mzTab 1.0) 
• Existing software implementations 
• Extension of mzTab 1.0 for metabolomics
HUPO Proteomics Standards Initiative 
• Develops data format standards for proteomics.
• Both data representation and annotation standards.
• Involves data producers, database providers, software producers, publishers, …
• Active Workgroups: MI, MS, PI, Mod, (Protein Separation).
• Inter-group activities: MIAPE and Controlled Vocabularies.
• Started in 2002, so some experience already…
www.psidev.info
PSI-MS/PI Standard File Formats before mzTab 
• Quantitation: mzQuantML
• Identification: mzIdentML
• MS data: mzML
• SRM: TraML
Reasons for an additional file format (mzTab) 
• mzIdentML and mzQuantML (necessarily) focus on the complete representation of proteomics results
• Complex XML-based file formats
• Specialised software required for visualisation
• In-depth bioinformatics understanding required to create and use files
• No simple method to communicate final results to non-proteomics experts
• No simple method to use the files from scripting languages and standard statistical software
mzTab – Aims 
• Store the final results of an MS-based experiment in a single file
• Quantitation data
• Identification data
• Small Molecule data
• Reduce complexity to make data accessible to non-proteomics / bioinformatics experts
• Be easily accessible using “standard” software
mzTab – Aims 
• What the format does NOT aim at:
• Replace mzIdentML or mzQuantML for proteomics approaches
• Contain the complete data of an MS-based experiment
• Provide fully detailed evidence for the data
• Allow a researcher to recreate the process which led to the results
Why a tab-delimited file? 
• Using XML-based formats requires sophisticated bioinformatics expertise
• Many researchers still use MS Excel to “look” at or exchange their data.
• Standard tab-delimited file formats for transcriptomics (MAGE-TAB) and molecular interactions (MI-TAB) data were already successful
mzTab format 
http://mztab.googlecode.com
mzTab - Sections 
• Metadata (key-value pairs): basic information about experiment and sample
• Protein (table-based): basic information about protein identifications
• Peptide (table-based): information about quantified peptides
• PSM (table-based): information about identified spectra
• Small Molecule (table-based): basic information about identified small molecules
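
The section layout can be explored with a few lines of scripting. The sketch below is a minimal illustration, not the reference parser. It assumes the three-letter line prefixes of mzTab 1.0 (MTD for metadata, PRH/PRT, PEH/PEP, PSH/PSM and SMH/SML for the header and data rows of the table-based sections, COM for comments). The function name `split_mztab` and the heavily truncated sample rows are invented for this example.

```python
import csv
import io
from collections import defaultdict

# Header prefix -> data-row prefix for the four table-based sections
# (assumed from the mzTab 1.0 line-prefix convention).
TABLE_SECTIONS = {"PRH": "PRT", "PEH": "PEP", "PSH": "PSM", "SMH": "SML"}


def split_mztab(text):
    """Return (metadata dict, {row prefix: list of row dicts}) for mzTab text."""
    metadata = {}
    headers = {}
    tables = defaultdict(list)
    reader = csv.reader(io.StringIO(text), delimiter="\t", quoting=csv.QUOTE_NONE)
    for fields in reader:
        if not fields or fields[0] == "COM":
            continue  # skip blank lines and comment lines
        prefix, values = fields[0], fields[1:]
        if prefix == "MTD":
            # Metadata lines are key-value pairs.
            metadata[values[0]] = values[1] if len(values) > 1 else ""
        elif prefix in TABLE_SECTIONS:
            # Remember the column names for the matching data rows.
            headers[TABLE_SECTIONS[prefix]] = values
        elif prefix in headers:
            tables[prefix].append(dict(zip(headers[prefix], values)))
    return metadata, dict(tables)


# Invented sample; real files carry many more columns per section.
SAMPLE = (
    "MTD\tmzTab-version\t1.0.0\n"
    "MTD\tmzTab-mode\tSummary\n"
    "MTD\tmzTab-type\tIdentification\n"
    "\n"
    "PRH\taccession\tdescription\n"
    "PRT\tP12345\tExample protein\n"
    "\n"
    "PSH\tsequence\taccession\n"
    "PSM\tPEPTIDEK\tP12345\n"
)

metadata, tables = split_mztab(SAMPLE)
print(metadata["mzTab-mode"])
print(tables["PRT"][0]["accession"])
```

Because every line announces its section in the first column, the same split can be done in a spreadsheet by filtering on that column, which is the kind of "standard software" access the format aims for.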
Metadata section - Example 
mzTab – Modes and Types
• Modes (depending on the level of detail):
• ‘Summary’: only the ‘final results’.
• ‘Complete’: detailed information for each individual assay or replicate is provided.
• Types:
• ‘Identification’: only identification results.
• ‘Quantification’: quantification results; these files can also contain identification results.
• Overall, 4 different file “flavors” are possible, so the design is very flexible.
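
The four "flavors" are simply every combination of mode and type. The sketch below enumerates them and reads the flavor of a file from its metadata lines. It assumes the mzTab 1.0 metadata keys `mzTab-mode` and `mzTab-type`; the helper name `file_flavor` and the example lines are invented for illustration.

```python
from itertools import product

MODES = ("Summary", "Complete")
TYPES = ("Identification", "Quantification")

# Every mode/type combination is one possible file "flavor".
FLAVORS = list(product(MODES, TYPES))


def file_flavor(lines):
    """Return (mode, type) read from the MTD lines of an mzTab file."""
    found = {}
    for line in lines:
        parts = line.rstrip("\n").split("\t")
        # Assumed metadata keys: mzTab-mode and mzTab-type.
        if len(parts) >= 3 and parts[0] == "MTD" and parts[1] in ("mzTab-mode", "mzTab-type"):
            found[parts[1]] = parts[2]
    flavor = (found.get("mzTab-mode"), found.get("mzTab-type"))
    if flavor not in FLAVORS:
        raise ValueError(f"unexpected mode/type combination: {flavor}")
    return flavor


# Invented example metadata lines.
example = [
    "MTD\tmzTab-version\t1.0.0\n",
    "MTD\tmzTab-mode\tComplete\n",
    "MTD\tmzTab-type\tQuantification\n",
]
print(len(FLAVORS))
print(file_flavor(example))
```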
Protein Section (label-free) 
Peptide Section (label-free) 
• Only used in “Quantification” files. 
PSM section (identification data) 
mzTab – Current implementations 
• jmzTab (Java API): version 3.0 is now stable. Manuscript published in the journal Proteomics.
• mzTab Validator, PRIDE XML to mzTab converter (PRIDE team).
• mzIdentML and mzQuantML to mzTab converters (Andy Jones group).
• MaxQuant: a beta exporter is available.
• OpenMS (version 1.10).
• R/Bioconductor package MSnbase (L. Gatto, Cambridge University).
• LipidDataAnalyzer (J. Hartler, University of Graz, see next talk).
• MetaboLights (EBI).
mzTab – ongoing development 
• More detailed modelling of MS metabolomics data 
• Led by S. Neumann (COSMOS EU FP7 project). 
• Extension from one to three sections. 
Example file exists at 
https://github.com/sneumann/mtbls2/faahKO.mzTab 
http://www.cosmos-fp7.eu/
mzTab format related publications 
J. Griss et al., MCP, 2014 
http://code.google.com/p/mztab/ 
Q.W. Xu et al., Proteomics, 2014
Current PSI-MS/PI Standard File Formats 
• Final Results: mzTab
• Quantitation: mzQuantML
• Identification: mzIdentML
• MS data: mzML
• SRM: TraML
Acknowledgements 
Johannes Griss 
Qing-Wei Xu 
Henning Hermjakob 
Timo Sachsenberg 
Mathias Walzer 
Oliver Kohlbacher 
http://mztab.googlecode.com 
Andy Jones 
S. Neumann and other COSMOS partners
PSI editor and reviewers
… and many others have also contributed
BBSRC PROCESS grant 
BBSRC ProteoSuite grant

The mzTab data standard format for reporting MS-based peptide, protein and small molecule identification and quantification results

  • 1. mzTab - Reporting MS-based Proteomics and Metabolomics Results Dr. Juan A. Vizcaíno on behalf of Dr. Johannes Griss Proteomics Services Team EMBL-EBI Hinxton, Cambridge, UK Division of Immunology, Allergy and Infectious Diseases Department of Dermatology Medical University of Vienna, Austria
  • 2. Johannes Griss jgriss@ebi.ac.uk HUPO 2014 Overview • Need for mzTab • Details about the data format (mzTab 1.0) • Existing software implementations • Extension of mzTab 1.0 for metabolomics
  • 3. HUPO Proteomics Standards Initiative •Develops data format standards for proteomics. •Both data representation and annotation standards. •Involves data producers, database providers, software producers, publishers, … •Active Workgroups: MI, MS, PI, Mod, (Protein Separation). •Inter-group activities: MIAPE and Controlled Vocabularies. •Started in 2002, so some experience already… Johannes Griss jgriss@ebi.ac.uk HUPO 2014 www.psidev.info
  • 4. PSI-MS/PI Standard File Formats before mzTab • Quantitation: mzQuantML • Identification: mzIdentML • MS data: mzML • SRM: TraML
  • 5. Reasons for an additional file format (mzTab) • mzIdentML and mzQuantML (necessarily) focus on complete representation of proteomics results • Complex XML-based file formats • Specialised software required for visualisation • In-depth bioinformatics understanding required to create and use files • No simple method to communicate final results to non-proteomics experts • No simple method to utilise files through scripting languages and standard statistical software
  • 6. mzTab – Aims • Store final results of an MS-based experiment in a single file • Quantitation data • Identification data • Small Molecule data • Reduce complexity to make data accessible to non-proteomics / bioinformatics experts • Be easily accessible using “standard” software
  • 7. mzTab – Aims • What the format does NOT aim at: • Replace mzIdentML or mzQuantML for proteomics approaches • Contain the complete data of an MS-based experiment • Provide fully detailed evidence for the data • Allow a researcher to recreate the process which led to the results
  • 8. Why a tab-delimited file? • Using XML-based formats requires sophisticated bioinformatics expertise • Many researchers are still accustomed to using MS Excel to “look” at or exchange their data • Standard tab-delimited file formats for transcriptomics (MAGE-TAB) and molecular interactions (MI-TAB) data were already successful
  • 9. mzTab format http://mztab.googlecode.com
  • 10. mzTab - Sections • Metadata: basic information about experiment and sample (key-value pairs) • Protein: basic information about protein identifications (table-based) • Peptide: information about quantified peptides (table-based) • PSM: information about identified spectra (table-based) • Small Molecule: basic information about identified small molecules (table-based)
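
The section layout on this slide lends itself to simple scripting. The following is a minimal sketch in Python, assuming the three-letter line prefixes defined in the mzTab 1.0 specification (MTD for metadata lines; PRH/PRT, PEH/PEP, PSH/PSM and SMH/SML for the header and row lines of the table sections; COM for comments). The prefixes are not shown on the slides, and the function name `split_mztab` and the sample content are purely illustrative.

```python
import csv
import io

# Header prefix -> row prefix for the four table-based sections (mzTab 1.0).
TABLE_SECTIONS = {"PRH": "PRT", "PEH": "PEP", "PSH": "PSM", "SMH": "SML"}


def split_mztab(text):
    """Split mzTab text into a metadata dict and per-section lists of rows."""
    metadata = {}
    headers = {}  # row prefix -> list of column names
    tables = {row: [] for row in TABLE_SECTIONS.values()}
    reader = csv.reader(io.StringIO(text), delimiter="\t", quoting=csv.QUOTE_NONE)
    for fields in reader:
        if not fields or fields[0] == "COM":
            continue  # skip blank lines and comment lines
        prefix, values = fields[0], fields[1:]
        if prefix == "MTD" and len(values) >= 2:
            metadata[values[0]] = values[1]  # key-value pair
        elif prefix in TABLE_SECTIONS:
            headers[TABLE_SECTIONS[prefix]] = values  # column names
        elif prefix in tables and prefix in headers:
            tables[prefix].append(dict(zip(headers[prefix], values)))
    return metadata, tables


# Hypothetical content for illustration, not taken from a real mzTab file.
EXAMPLE = (
    "MTD\tmzTab-version\t1.0.0\n"
    "MTD\tmzTab-mode\tSummary\n"
    "PRH\taccession\tdescription\n"
    "PRT\tP12345\tExample protein\n"
)

metadata, tables = split_mztab(EXAMPLE)
```

Only the standard library is used here, which is in keeping with the stated aim of making results usable from scripting languages without specialised software.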
  • 11. Metadata section - Example
  • 12. mzTab – Modes and Types • Modes (depending on the level of detail): • ‘Summary’: only the ‘final results’. • ‘Complete’: detailed information for each individual assay or replicate is provided. • Types: • ‘Identification’: only identification results. • ‘Quantification’: quantification results, which can also contain identification results. • Overall, four different file “flavors” are possible, which makes the design very flexible.
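
The four flavors are simply every combination of a mode with a type. A minimal sketch in Python, assuming the metadata keys `mzTab-mode` and `mzTab-type` from the mzTab 1.0 specification (the slides do not name them) and a plain dict of metadata key-value pairs as input; the function name `file_flavor` is hypothetical.

```python
from itertools import product

MODES = ("Summary", "Complete")
TYPES = ("Identification", "Quantification")

# Every mode can be combined with every type, giving the four flavors.
FLAVORS = list(product(MODES, TYPES))


def file_flavor(metadata):
    """Return (mode, type) from a metadata dict, or raise if not a known flavor."""
    mode = metadata.get("mzTab-mode")
    kind = metadata.get("mzTab-type")
    if (mode, kind) not in FLAVORS:
        raise ValueError(f"unsupported mzTab flavor: {mode!r}/{kind!r}")
    return mode, kind


flavor = file_flavor({"mzTab-mode": "Complete", "mzTab-type": "Quantification"})
```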
  • 13. Protein Section (label-free)
  • 14. Protein Section (label-free)
  • 15. Peptide Section (label-free) • Only used in “Quantification” files.
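
The rule on this slide can be checked mechanically. A rough sketch in Python, assuming the PEH/PEP line prefixes that the mzTab 1.0 specification uses for the peptide section; the function name `peptide_section_misused` and the sample lines are hypothetical, and this is not how the official validator is implemented.

```python
def peptide_section_misused(file_type, lines):
    """Return True if peptide lines appear in a file that is not 'Quantification'."""
    has_peptide_lines = any(
        line.split("\t", 1)[0] in ("PEH", "PEP") for line in lines
    )
    return has_peptide_lines and file_type != "Quantification"


# Hypothetical peptide section lines for illustration.
peptide_lines = ["PEH\tsequence\taccession", "PEP\tPEPTIDEK\tP12345"]
misused = peptide_section_misused("Identification", peptide_lines)
```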
  • 16. PSM section (identification data)
  • 17. mzTab – Current implementations • jmzTab (Java API): version 3.0 is now stable. Manuscript published in the journal Proteomics. • mzTab Validator, PRIDE XML to mzTab converter (PRIDE team). • mzIdentML and mzQuantML to mzTab converters (Andy Jones group). • MaxQuant: exporter in beta is available. • OpenMS (version 1.10). • R/Bioconductor package MSnbase (L. Gatto, Cambridge University). • LipidDataAnalyzer (J. Hartler, University of Graz, see next talk). • MetaboLights (EBI).
  • 18. mzTab – ongoing development • More detailed modelling of MS metabolomics data • Led by S. Neumann (COSMOS EU FP7 project, http://www.cosmos-fp7.eu/). • Extension from one to three sections. Example file exists at https://github.com/sneumann/mtbls2/faahKO.mzTab
  • 19. mzTab format related publications • J. Griss et al., MCP, 2014 • Q.W. Xu et al., Proteomics, 2014 • http://code.google.com/p/mztab/
  • 20. mzTab format http://mztab.googlecode.com
  • 21. Current PSI-MS/PI Standard File Formats • Final Results: mzTab • Quantitation: mzQuantML • Identification: mzIdentML • MS data: mzML • SRM: TraML
  • 22. Acknowledgements • Johannes Griss • Qing-Wei Xu • Henning Hermjakob • Timo Sachsenberg • Mathias Walzer • Oliver Kohlbacher • Andy Jones • S. Neumann and other COSMOS partners • PSI editor and reviewers • … and many others have also contributed • BBSRC PROCESS grant • BBSRC ProteoSuite grant • http://mztab.googlecode.com