ProteomeXchange update
Dr. Juan Antonio Vizcaíno
(on behalf of all ProteomeXchange partners)
EMBL-European Bioinformatics Institute
Hinxton, Cambridge, UK
Juan A. Vizcaíno
juan@ebi.ac.uk
PSI Meeting 2018
Heidelberg, 18 April 2018
Overview
• Introduction
• Some usage statistics
• Guidelines: Handling of reprocessed datasets
• New prospective member: Panorama Public
• Miscellaneous
ProteomeXchange: A Global, distributed proteomics database
http://www.proteomexchange.org
• Framework to allow standard data submission and dissemination pipelines between the main existing proteomics repositories.
• Member resources: PASSEL (SRM data), PRIDE (MS/MS data), MassIVE (MS/MS data), jPOST (MS/MS data)
• Data types: Raw, ID/Q, Meta
• Mandatory data deposition
Vizcaíno et al., Nat Biotechnol, 2014
Deutsch et al., NAR, 2017
New in 2017: iProX (MS/MS data)
iProX: the integrated proteome resources in China
http://www.iprox.org
Cloud platform architecture with high availability
• VIP
• Load balance servers 1 and 2: nginx, keepalived (CentOS)
• Application servers 1 and 2: SpringMVC, MyBatis, Tomcat, Java (CentOS)
• Database servers (master and slave): MySQL (CentOS)
• Data storage servers 1 and 3: nginx, keepalived, Aspera (CentOS)
• Data storage server 2: nginx (CentOS)
Deployment of iProX
Sites: Beijing, Hunan, Shanghai
• BPRC & NCPSB (Beijing): main deployment location and the only submission site
• Three offsite data backups:
  • CNIC (Beijing, north China)
  • SCBIT (Shanghai, east China)
  • NSCC (Hunan, south China)
• All four sites will provide download service simultaneously, coordinated by the load balancer.
• By the end of March 2018, 374 datasets had been submitted, totalling 60 TB.
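The idea of several sites serving downloads behind one load balancer can be illustrated with a minimal sketch. It assumes a plain round-robin rotation, which is only an illustration and not necessarily the policy iProX uses; the file names are hypothetical, and only the site names come from the slide.

```python
from itertools import cycle

# Site names from the slide; the rotation policy itself is an assumption.
SITES = [
    "BPRC & NCPSB (Beijing)",
    "CNIC (Beijing)",
    "SCBIT (Shanghai)",
    "NSCC (Hunan)",
]


def assign_downloads(requests, sites=SITES):
    """Pair each download request with a site in simple round-robin order."""
    rotation = cycle(sites)
    return [(request, next(rotation)) for request in requests]


# Hypothetical file names, for illustration only.
example_requests = ["file_a.raw", "file_b.raw", "file_c.raw", "file_d.raw", "file_e.raw"]
for request, site in assign_downloads(example_requests):
    print(f"{request} -> {site}")
```

With five requests and four sites, the fifth request wraps around to the first site again. A production balancer would typically also account for site health and load.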
jPOST status
https://jpostdb.org/
• The repository is going well.
• The database part has just opened.
• The re-analysis part is under development.
• Funding has just been renewed for the next 5 years!
ProteomeCentral: Portal for all PX datasets
http://proteomecentral.proteomexchange.org/cgi/GetDataset
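As a rough sketch of how a script might link to a single dataset in the portal, the snippet below builds a URL from the address above. Two things are assumptions rather than statements from the slides: that PXD accessions consist of "PXD" followed by six digits, and that the dataset is selected with an `ID` query parameter. The helper name `dataset_url` is hypothetical.

```python
import re
from urllib.parse import urlencode

# Base URL as given on the slide.
BASE_URL = "http://proteomecentral.proteomexchange.org/cgi/GetDataset"

# Assumption: PXD accessions are "PXD" followed by six digits.
PXD_PATTERN = re.compile(r"PXD\d{6}")


def dataset_url(accession: str) -> str:
    """Return a ProteomeCentral URL for one dataset accession."""
    if not PXD_PATTERN.fullmatch(accession):
        raise ValueError(f"not a PXD accession: {accession!r}")
    # Assumption: the dataset is selected with an "ID" query parameter.
    return f"{BASE_URL}?{urlencode({'ID': accession})}"


print(dataset_url("PXD000001"))
```

Validating the accession before building the URL keeps malformed identifiers from reaching the server.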
Public datasets from different omics: OmicsDI
http://www.omicsdi.org/
• Aims to integrate ‘omics’ datasets (proteomics,
transcriptomics, metabolomics and genomics at present).
PRIDE
MassIVE
jPOST
PASSEL
GPMDB
ArrayExpress
Expression Atlas
MetaboLights
Metabolomics Workbench
GNPS
EGA
…and others
Perez-Riverol et al., Nat Biotechnol, 2017
Data content per resource (PXD identifiers)
[Pie chart shares: 84.9%, 11.5%, 1.8%, 1.5%, 0.3%]
PRIDE data submissions and data growth
> 2,400 datasets submitted in 2017
In March 2018, monthly submissions reached 300 datasets for the first time.
[Figures: datasets submitted per month; datasets submitted per year]
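To put the two figures side by side, here is a small worked calculation. It assumes the 300 figure is a per-month count and treats 2,400 as a lower bound for 2017, so the results are bounds rather than exact values.

```python
datasets_2017 = 2400        # lower bound from the slide ("> 2,400")
datasets_march_2018 = 300   # assumed to be a single-month count

# Average monthly submissions in 2017 (a lower bound, since the total is one).
monthly_average_2017 = datasets_2017 / 12

# How March 2018 compares with that average (an upper bound on the ratio).
ratio = datasets_march_2018 / monthly_average_2017

print(f"2017 monthly average: at least {monthly_average_2017:.0f} datasets")
print(f"March 2018 vs. that average: at most {ratio:.1f}x")
```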
Data re-use in proteomics is increasing
Data download volume for PRIDE Archive in 2017: 295 TB
[Figure: downloads in TB per year, 2013 to 2017]
Guidelines developed
• Initial implementation in MassIVE
Other guidelines developed during the last year
• Retraction of datasets (“Re-calling”)
• Support for alternative location of datasets (alternative URLs)
• Try to get external datasets into PX (e.g. CPTAC)
Panorama Public
• Panorama Public is designed for sharing data generated
through Skyline-based targeted proteomics workflows such as
SRM and PRM or targeted DDA and DIA.
• Led by Brendan MacLean & Mike MacCoss group
• Processed results are stored in the Skyline XML format
• Interested in joining ProteomeXchange as a repository for targeted
proteomics workflows.
Panorama Public
https://panoramaweb.org/
PRIDE has become an ELIXIR core data resource
• ELIXIR coordinates, integrates and sustains bioinformatics
resources across Europe and enables users in academia and
industry to access services that are vital for their research
• First list of core resources announced in July 2017.
• PRIDE included in the initial list.
https://www.elixir-europe.org/platforms/data/core-data-resources
PRIDE as a “pillar” of the ELIXIR Proteomics Community
• The goal of the ELIXIR proteomics community is to develop and maintain sustainable proteomics tools and data resources
• An essential part of the development will also be the ‘FAIRification’ of the resources (i.e. making the resources FAIR)
• Integrate proteomics bioinformatics activities in ELIXIR
Main plans for meeting
• General update
• Panorama Public application to join PX
• Do we need more formalised guidelines for several topics?
• Short Report about GDPR guidelines
• Two related projects (NIH “data standards” grant):
• Universal Spectrum Identifier (USI)
• PROXI
Acknowledgements: People
Yasset Perez-Riverol
Attila Csordas
Tobias Ternent
Gerhard Mayer (de.NBI)
Andrew Jarnuczak
Mathias Walzer
Suresh Hewapathirana
Jingwen Bai
Former team members, especially:
Henning Hermjakob
Acknowledgements: All ProteomeXchange partners
Eric Deutsch
Zhi Sun
David Campbell
Nuno Bandeira
Mingxun Wang
Jeremy Carver
Yasushi Ishihama
Shin Kawano
Yunping Zhu
Masheng Li
All data submitters!
Follow new datasets @proteomexchange

zoogeography of pakistan.pptx fauna of Pakistanzohaibmir069
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Module 4: Mendelian Genetics and Punnett Square
Module 4:  Mendelian Genetics and Punnett SquareModule 4:  Mendelian Genetics and Punnett Square
Module 4: Mendelian Genetics and Punnett SquareIsiahStephanRadaza
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaDashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaPraksha3
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 

Recently uploaded (20)

Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 tr
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
zoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistanzoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistan
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Module 4: Mendelian Genetics and Punnett Square
Module 4:  Mendelian Genetics and Punnett SquareModule 4:  Mendelian Genetics and Punnett Square
Module 4: Mendelian Genetics and Punnett Square
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaDashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 

ProteomeXchange update

  • 1. ProteomeXchange update Dr. Juan Antonio Vizcaíno (on behalf of all ProteomeXchange partners) EMBL-European Bioinformatics Institute Hinxton, Cambridge, UK
  • 2. Juan A. Vizcaíno juan@ebi.ac.uk PSI Meeting 2018 Heidelberg, 18 April 2018 Overview • Introduction • Some usage statistics • Guidelines: Handling of reprocessed datasets • New prospective member: Panorama Public • Miscellaneous
  • 3. ProteomeXchange: A Global, distributed proteomics database • Framework to allow standard data submission and dissemination pipelines between the main existing proteomics repositories. • Repositories: PASSEL (SRM data), PRIDE (MS/MS data), MassIVE (MS/MS data), jPOST (MS/MS data) • Data types: Raw, ID/Q, Meta • Mandatory data deposition • http://www.proteomexchange.org • Vizcaíno et al., Nat Biotechnol, 2014; Deutsch et al., NAR, 2017
  • 4. ProteomeXchange: A Global, distributed proteomics database • Framework to allow standard data submission and dissemination pipelines between the main existing proteomics repositories. • Repositories: PASSEL (SRM data), PRIDE (MS/MS data), MassIVE (MS/MS data), jPOST (MS/MS data), iProX (MS/MS data; new in 2017) • Data types: Raw, ID/Q, Meta • Mandatory data deposition • http://www.proteomexchange.org • Vizcaíno et al., Nat Biotechnol, 2014; Deutsch et al., NAR, 2017
  • 5. iProX, the integrated proteome resources in China: cloud platform architecture with High Availability http://www.iprox.org • VIP • Load balance servers 1 and 2: nginx, keepalived, CentOS • Application servers 1 and 2: SpringMVC, MyBatis, Tomcat, Java, CentOS • Database servers (master and slave): MySQL, CentOS • Data storage servers 1 and 3: nginx, keepalived, Aspera, CentOS • Data storage server 2: nginx, CentOS
  • 6. Deployment of iProX (Beijing, Hunan, Shanghai) • BPRC & NCPSB (Beijing): main deployment location and the only submission site • Three offsite data backups: CNIC (Beijing, north China), SCBIT (Shanghai, east China), NSCC (Hunan, south China) • All four sites will provide download services at the same time, coordinated by the load balancer. • By the end of March 2018, 374 datasets had been submitted, totalling 60 TB
  • 7. ProteomeXchange: A Global, distributed proteomics database (same content as slide 4)
  • 8. jPOST status https://jpostdb.org/ • The repository is going well. • The database part has just opened. • The re-analysis part is under development. • Funding has just been renewed for the next 5 years!
  • 9. ProteomeCentral: Portal for all PX datasets http://proteomecentral.proteomexchange.org/cgi/GetDataset
  • 10. Public datasets from different omics: OmicsDI http://www.omicsdi.org/ • Aims to integrate ‘omics’ datasets (proteomics, transcriptomics, metabolomics and genomics at present). • Resources: PRIDE, MassIVE, jPOST, PASSEL, GPMDB, ArrayExpress, Expression Atlas, MetaboLights, Metabolomics Workbench, GNPS, EGA …and others • Perez-Riverol et al., Nat Biotechnol, 2017
  • 11. Overview • Introduction • Some usage statistics • Guidelines: Handling of reprocessed datasets • New prospective member: Panorama Public • Miscellaneous
  • 12. Data content per resource (PXD identifiers): 84.9%, 11.5%, 1.8%, 1.5%, 0.3%
  • 13. PRIDE data submissions and data growth • > 2,400 datasets submitted in 2017 • In March 2018, submissions reached 300 datasets in a single month for the first time [Charts: datasets submitted per month; datasets submitted per year]
  • 14. Data re-use in proteomics is increasing • Data download volume for PRIDE Archive in 2017: 295 TB [Chart: downloads in TB per year, 2013 to 2017]
  • 15. Overview • Introduction • Some usage statistics • Guidelines: Handling of reprocessed datasets • New prospective member: Panorama Public • Miscellaneous
  • 16. Guidelines developed
  • 17. Guidelines developed • Initial implementation in MassIVE
  • 18. Other guidelines developed during the last year • Retraction of datasets (“Re-calling”) • Support for alternative location of datasets (alternative URLs) • Try to get external datasets into PX (e.g. CPTAC)
  • 19. Overview • Introduction • Some usage statistics • Guidelines: Handling of reprocessed datasets • New prospective member: Panorama Public • Miscellaneous
  • 20. Panorama Public • Panorama Public is designed for sharing data generated through Skyline-based targeted proteomics workflows such as SRM and PRM, or targeted DDA and DIA. • Led by Brendan MacLean and the Mike MacCoss group • Processed results are stored in the Skyline XML format • Interested in joining ProteomeXchange as a repository for targeted proteomics workflows.
  • 21. Panorama Public https://panoramaweb.org/
  • 22. Overview • Introduction • Some usage statistics • Guidelines: Handling of reprocessed datasets • New prospective member: Panorama Public • Miscellaneous
  • 23. PRIDE has become an ELIXIR core data resource • ELIXIR coordinates, integrates and sustains bioinformatics resources across Europe and enables users in academia and industry to access services that are vital for their research • First list of core resources announced in July 2017. • PRIDE included in the initial list. https://www.elixir-europe.org/platforms/data/core-data-resources
  • 24. PRIDE as a “pillar” of the ELIXIR Proteomics Community • The goal of the ELIXIR proteomics community is to develop and maintain sustainable proteomics tools and data resources • An essential part of the development will also be the ‘FAIRification’ of the resources (i.e. making the resources FAIR) • Integrate proteomics bioinformatics activities in ELIXIR
  • 25. Main plans for meeting • General update • Panorama Public application to join PX • Do we need more formalised guidelines for several topics? • Short report about GDPR guidelines • Two related projects (NIH “data standards” grant): Universal Spectrum Identifier (USI) and PROXI
  • 26. Acknowledgements: People • Yasset Perez-Riverol, Attila Csordas, Tobias Ternent, Gerhard Mayer (de.NBI), Andrew Jarnuczak, Mathias Walzer, Suresh Hewapathirana, Jingwen Bai • Former team members, especially: Henning Hermjakob • Acknowledgements: All ProteomeXchange partners • All data submitters !!! • Eric Deutsch, Zhi Sun, David Campbell, Nuno Bandeira, Mingxun Wang, Jeremy Carver, Yasushi Ishihama, Shin Kawano, Yunping Zhu, Masheng Li • Follow new datasets @proteomexchange
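Slide 9 gives the ProteomeCentral `GetDataset` address as the portal for all PX datasets. A minimal sketch of how a link to a single dataset might be built from that address follows. It rests on two assumptions that the slides do not state: that accessions take the form `PXD` followed by six digits, and that the dataset is selected with an `ID` query parameter. The function name `dataset_url` is hypothetical, and the code makes no network request.

```python
import re
from urllib.parse import urlencode

# Base address exactly as shown on slide 9.
BASE_URL = "http://proteomecentral.proteomexchange.org/cgi/GetDataset"

# Assumption: a PXD accession is "PXD" followed by six digits.
PXD_PATTERN = re.compile(r"PXD\d{6}")


def dataset_url(accession: str) -> str:
    """Build a ProteomeCentral link for one dataset (hypothetical helper)."""
    if not PXD_PATTERN.fullmatch(accession):
        raise ValueError(f"not a PXD accession: {accession!r}")
    # Assumption: the dataset is selected with an `ID` query parameter.
    return f"{BASE_URL}?{urlencode({'ID': accession})}"


if __name__ == "__main__":
    print(dataset_url("PXD000001"))
```

Validating the accession before building the link keeps malformed identifiers from producing a URL that silently points nowhere.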
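The usage figures on slides 6, 13 and 14 can be turned into rough averages. The sketch below uses only numbers quoted on those slides; the decimal unit conversion (1 TB = 1000 GB) is an assumption, and because slide 13 gives "> 2,400" datasets, the monthly figure is a lower bound rather than an exact value.

```python
# Figures quoted on the slides.
IPROX_DATASETS = 374          # slide 6: datasets submitted by end of March 2018
IPROX_TOTAL_TB = 60           # slide 6: total data volume
PRIDE_DATASETS_2017 = 2400    # slide 13: "> 2,400", so a lower bound
PRIDE_DOWNLOADS_2017_TB = 295 # slide 14: download volume in 2017

GB_PER_TB = 1000  # assumption: decimal units

# Average size of an iProX dataset.
avg_dataset_gb = IPROX_TOTAL_TB * GB_PER_TB / IPROX_DATASETS

# Average PRIDE submissions per month in 2017 (lower bound).
pride_per_month = PRIDE_DATASETS_2017 / 12

# Average PRIDE Archive download volume per day in 2017.
downloads_per_day_gb = PRIDE_DOWNLOADS_2017_TB * GB_PER_TB / 365

if __name__ == "__main__":
    print(f"iProX: about {avg_dataset_gb:.0f} GB per dataset")
    print(f"PRIDE: more than {pride_per_month:.0f} datasets per month in 2017")
    print(f"PRIDE: about {downloads_per_day_gb:.0f} GB downloaded per day in 2017")
```

Under these assumptions, the monthly average for 2017 gives some context for the 300 datasets reported for March 2018 on slide 13.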