SlideShare a Scribd company logo
1 of 65
Community effort to provide molecular
interaction data
PSI-MI & PSICQUIC
Programmatic Access To Biological Databases (Perl)
1 – 4 October 2012
Marine Dumousseau, Noemi Del Toro Ayllon and Rafael C. Jimenez
marine@ebi.ac.uk, ntoro@ebi.ac.uk and rafael@ebi.ac.uk
PSI-MI
Data format
Data distribution
Control vocabulary
Reporting guidelines
Standard format
Tools
PSICQUIC
PSI-MI CV
MIMIx
PSI-MI XML
PSI-MITAB
XML Java API
MITAB Java API
XMLMakerFlattener
Semantic Validator
RPsiXML (Bioconductor)
PSI Excel Sheet submission
PSI Web Form submission
Servers
Registry
Clients
PSISCORE Servers
Registry
Clients
IMEx
TOC
Data distribution best practices
1
2
3
4
5
6
• Proteomics Standards Initiative
• Work group of the Human Proteome Organization
• Defines community standards for data in proteomics
• … facilitating data comparison, exchange and verification
PSI
3
PSI-MI
• Work group of the Proteomics Standards Initiative
• Community coordination to ensure deposition of
Molecular Interaction data in public repositories
• Concentrating on …
• Annotation and representation of published MI data
• Accessibility of MI data to the user community
http://www.psidev.info/MI
13/12/2018
5
http://psidev.info/
PSI-MI
Data format
Data distribution
Control vocabulary
Reporting guidelines
Standard format
Tools
PSICQUIC
PSI-MI CV
MIMIx
PSI-MI XML
PSI-MITAB
XML Java API
MITAB Java API
XMLMakerFlattener
Semantic Validator
RPsiXML (Bioconductor)
PSI Excel Sheet submission
PSI Web Form submission
Servers
Registry
Clients
PSISCORE Servers
Registry
Clients
IMEx
TOC
Data distribution best practices
PSI-MI
Data format/schema
Data distribution
Control vocabulary
MIAPE
Reporting guideline
PSI-MI XML
PSI-MITAB
PSICQUIC
MIMIx
IMEx
PSI-MI CV
http://www.psidev.info/MI
Scoring
PSISCORE
PSI-MI format
• Community standard for Molecular Interactions
• Jointly developed by major data providers: BIND,
CellZome, DIP, GSK, HPRD, Hybrigenics, IntAct, MINT, MIPS, Serono,
U. Bielefeld, U. Bordeaux, U. Cambridge, and others
• Collecting and combining data from different sources
has become easier
• Standardized annotation through PSI-MI ontologies
• Tools from different organizations can be chained, e.g.
IntAct data in Cytoscape.
8
psi-mi/xml25 psi-mi/tab25
9
PSI-MI 2.5 Standards
• Version 1.0 published in February 2004
The HUPO PSI Molecular Interaction Format - A community standard for the representation of
protein interaction data.Henning Hermjakob et al, Nature Biotechnology 2004, 22, 176-183.
• Version 2.5 published in October 2007
PSI-MI XML: A simplified view of the schema
Participant Participant Participant
Interactor Interactor
Feature
RangeRange
Feature
Experiment
Entry
EntrySet
Interaction Interaction
XML specification: http://code.google.com/p/psimi/wiki/PsimiXMLSpecifications
Schema view: http://psidev.sourceforge.net/molecular_interactions/rel25/doc/
Schema: http://psidev.sourceforge.net/mi/rel25/src/MIF25.xsd
General docuemntation: http://www.psidev.info/node/60#overlay-context=node/60
“Compact” format
Interaction
Interaction
experimentDescription id=“1”
experimentDescription id=“2”
experimentRef=“1”
interactor id=“3”
interactor id=“4”
Participant id=“5”
interactorRef=“3”
Participant id=“6”
interactorRef=“4”
Interaction id=“7”
Interaction id=“8”
“Expanded” formatPSI-MI XML
PSI-MI XML
“Compact” format
PSI-MI XML
“Expanded” format
PSI-MITAB
• Released at the same time that PSI-MI XML 2.5
• Aimed at users that are more comfortable with Excel
• Only provides binary interactions
• Latest version: PSI-MI TAB 2.7
14
psi-mi/tab27
PSI-MI TAB 2.5: http://code.google.com/p/psimi/wiki/PsimiTabFormat
PSI-MI TAB 2.7: http://code.google.com/p/psimi/wiki/PsimiTab27Format
General information: http://www.psidev.info/groups/molecular-interactions
15
Standard columns (15):
• ID(s) interactor A & B
• Alt. ID(s) interactor A & B
• Alias(es) interactor A & B
• Interaction detection method(s)
• Publication 1st author(s)
• Publication Identifier(s)
• Taxid interactor A & B
• Interaction type(s)
• Source database(s)
• Interaction identifier(s)
• Confidence value(s)
PSI-MITAB 2.5 columns
15
v2.5
16
Standard columns (15):
• ID(s) interactor A & B
• Alt. ID(s) interactor A & B
• Alias(es) interactor A & B
• Interaction detection method(s)
• Publication 1st author(s)
• Publication Identifier(s)
• Taxid interactor A & B
• Interaction type(s)
• Source database(s)
• Interaction identifier(s)
• Confidence value(s)
Standard columns (21):
• Complex expansion
• Biological role A & B
• Experimental role A & B
• Interactor type A & B
• Xrefs A, B & Int.
• Annotations A, B & Int.
• Host organism
• Parameters Int.
• Created
• Updated
• CheckSum A, B & Int.
• Negative
Standard columns (4):
• Binding feature A & B
• Stoichiometry A & B
v2.5 v2.6 v2.7
15 36 40
PSI-MITAB 2.7 columns
+ +
PSI-MI format: Tools
• XML Java API (PSI-MI XML 2.5 Java Parser)
• Parse “PSI-MI XML”
• Create “PSI-MI XML”
• MITAB Java API (PSI-MITAB 2.5 Java Parser)
• Parse “PSI-MITAB”
• Create “PSI-MITAB”
• XMLMakerFlattener
• “PSI MI XML” to “Tab-delimited format”
• “Tab-delimited format” to “PSI MI XML”
• XML Validator
• Semantic and syntactic consistency
• XML transformation:
• MIF25_view.xsl “XML” to “HTML”
• MIF25_compact.xsl PSI-MI XML “expanded” to “compact”
• MIF25_expand.xsl PSI-MI XML “compact” to “expanded”
17
More information: http://www.psidev.info/mif#tools
Data submission tools
PSI-MI XML files
PSI Excel Sheet
PSI Web Form
More information: http://www.psidev.info/mif#tools
PSI-MI
Data format
Data distribution
Control vocabulary
Reporting guidelines
Standard format
Tools
PSICQUIC
PSI-MI CV
MIMIx
PSI-MI XML
PSI-MITAB
XML Java API
MITAB Java API
XMLMakerFlattener
Semantic Validator
RPsiXML (Bioconductor)
PSI Excel Sheet submission
PSI Web Form submission
Servers
Registry
Clients
PSISCORE Servers
Registry
Clients
IMEx
TOC
Data distribution best practices
20
• OLS Makes available OBO controlled vocabularies
• Web site allows for searching and browsing their
hierarchy
• Programmatic access though SOAP web services
PSI-MI CV in the Ontology Lookup Service
Control vocabulary: PSI-MI CV
• Ontology browser: http://www.ebi.ac.uk/ontology-lookup
PSI-MI
Data format
Data distribution
Control vocabulary
Reporting guidelines
Standard format
Tools
PSICQUIC
PSI-MI CV
MIMIx
PSI-MI XML
PSI-MITAB
XML Java API
MITAB Java API
XMLMakerFlattener
Semantic Validator
RPsiXML (Bioconductor)
PSI Excel Sheet submission
PSI Web Form submission
Servers
Registry
Clients
PSISCORE Servers
Registry
Clients
IMEx
TOC
Data distribution best practices
MIMIx
• Minimum information guideline for reporting molecular interactions
• 1. Manuscript information
• 2. Experiment
• 3. Interaction
• 4. Confidence
http://www.nature.com/nbt/journal/v25/n8/full/nbt1324.html
IMEx
13/12/2018
24
http://onlinelibrary.wiley.com/doi/10.1002/pmic.200700286/abstract
Data distribution best practices
13/12/2018
25
http://code.google.com/p/psicquic/wiki/DataDistributionBestPractices
PSI-MI
Data format
Data distribution
Control vocabulary
Reporting guidelines
Standard format
Tools
PSICQUIC
PSI-MI CV
MIMIx
PSI-MI XML
PSI-MITAB
XML Java API
MITAB Java API
XMLMakerFlattener
Semantic Validator
RPsiXML (Bioconductor)
PSI Excel Sheet submission
PSI Web Form submission
Servers
Registry
Clients
PSISCORE Servers
Registry
Clients
IMEx
TOC
Data distribution best practices
Data distribution: PSICQUIC
• Proteomics Standards Initiative Common QUery InterfaCe.
• Community effort to standardise the way to access and retrieve data
from Molecular Interaction databases.
• Widely implemented by independent interaction data resources.
• Based on the PSI standard formats (PSI-MI XML and MITAB)
• Not limited to protein-protein interactions, also e.g.
• Drug-target interactions
• Simplified pathway data
• A registry listing resources implementing PSICQUIC
• Documentation: http://psicquic.googlecode.com
Published in 2011
http://www.ncbi.nlm.nih.gov/pubmed/21716279
APID
416,124
612,785
362,355
268,920
1,465,119
845
132,490
24,268
74,861
15,844
107,619
26,045,661
Around
150 million
binary interactions
available using
PSICQUIC*
208,55839,367
192,961
(*) As of February 2012
117
I2D
546
IntAct And PSICQUICBruno Aranda (baranda@ebi.ac.uk)
“The one Client to rule them all”
• Services
• IntAct
• Biogrid
• String
• ...
• Registry
• Clients
• PSICQUIC view
• Cytoscape
• …
13.12.201831
Service Oriented Architecture
query
PublishFind
PSICQUIC
Registry
DAS ClientsDAS ClientsPSICQUIC
Clients
Format
PSICQUIC
sources
PSICQUIC
sources
PSICQUIC
Services
MIQL
PSI-MI
PSIQUIC Proteomics Standard Initiative Common QUery InterfaCe
13/12/2018
32
PSICQUIC
Query Interactions
PSICQUIC
Registry
PSI-MIMIQL
Input Output
PSICQUIC
Service A
PSICQUIC
Service B
PSICQUIC
Service C
User View
Web Service System
application
PSICQUIC Registry
• It contains a list of the PSICQUIC services from different
providers.
• It is a web service itself, and it can be accessed remotely
using REST.
• Information can be found about the services, such as the
URLs to use, number of interactions provided, versioning,
tags, etc.
PSICQUIC Registry
• ~ 150 million interactions from 25 resources
• ~ 2.5 million interactions from 16 curated resources
http://www.ebi.ac.uk/Tools/webservices/psicquic/registry/registry?action=STATUS
(*) As of February 2012
Registry Tagging system
• The registry classifies the different services with tags.
PSICQUIC services
36
• PSICQUIC services are Web Services (REST & SOAP)
• Independent databases
• Adopting the same specification
• Implementing the same interface
• Same methods to query services
• Same format to provide results
PSICQUIC
serviceMIQL PSI-MI
input output
Query language Format
PSICQUIC REST specification
13/12/2018
37
http://code.google.com/p/psicquic/wiki/PsicquicSpec_1_2_Rest
PSICQUIC REST Query explained
http://www.ebi.ac.uk/Tools/webservices/psicquic/intact/webservices/current/search/query/species:human?firstResult=0&maxResults=100
Example:
PSICQUIC REST queries
Bruno Aranda (baranda@ebi.ac.uk)
http://mint.bio.uniroma2.it/mint/psicquic/webservices/current/search/query/p53
http://www.ebi.ac.uk/Tools/webservices/psicquic/intact/webservices/current/search/query/p53
http://www.ebi.ac.uk/Tools/webservices/psicquic/chembl/webservices/current/search/query/p53
1
2
3
MIQL: Molecular Interactions Query Language
• The Molecular Interactions Query Language (MIQL)
allows more powerful and flexible queries.
• It is the default query syntax for PSIQCUIC.
• Designed for fast and effective searches on PSI-MI
TAB files.
• All fields (columns) can be searched with specific
queries.
• MIQL is a consensus between the different
databases, so you should be able to use the same
query across different repositories.
MIQL
Bruno Aranda (baranda@ebi.ac.uk)
MIQL
…/query/specie:rat
…/query/brca AND rpa1
• Terms
• Fields
• Operands
PSICQUIC SOAP service
species:trypanosoma AND detmethod:’two hybrid’
SOAP Methods
• getByInteraction
Retrieves interactions by using an interaction AC.
• getByInteractionList
Retrieves interactions by using a list of interaction AC.
• getByInteractor
Retrieves interactions by using a participant identifier.
• getByInteractorList
Retrieves interactions by using a list of participant identifiers.
• getByQuery
Retrieves interactions by using a Molecular Interaction Query Language (MIQL)
query (full text searches)
SOAP Methods
Other metadata methods:
• getVersion
Returns the version of the web service implementation.
• getSupportedDbAcs
Returns the supported database identifiers
• getSupportedReturnTypes
Returns the list of available format types for the results.
• A limited number of interactions can be fetched. It is possible to retrieve
large datasets using pagination. Most methods have two additional
parameters:
•First result: Index for the first result to retrieve.
•Max results: Number of interactions returned per query.
Default formats
• The default formats are:
• PSI-MI XML 2.5.4 (psi-mi/xml25)
• PSI MITAB 2.5 (psi-mi/tab25)
• PSI MITAB 2.7 (psi-mi/tab27)
• Compressed MITAB (tab25-bin) – Only REST
• Count (count) – Only REST
• New formats will be included in the future (work in
progress):
• BioPAX (biopax)
• And other RDF formats (rdf-xml / rdf-n3 / rdf-n3-triple / rdf-turtle)
• (so it will be possible to use PSICQUIC in the semantic web!)
IntAct And PSICQUICBruno Aranda (baranda@ebi.ac.uk)
psi-mi/xml25
psi-mi/tab25
psi-mi/tab27
biopax
RDF/XML
RDF N3
PSICQUIC clients
13/12/2018
51
PSICQUIC view Cytoscape
www.ebi.ac.uk/Tools/webservices/psicquic/view
Web client
http://code.google.com/p/psicquic/wiki/CytoscapeClient
Desktop application
PSICQUIC view
• Simple and complex queries
• Link back to the original source for more details
• Clustering of queries across providers
• Visualization of graphical network
http://www.ebi.ac.uk/Tools/webservices/psicquic/view/
53
PSICQUIC View: table view
54
PSICQUIC View: clustering
PSICQUIC View: graphical network
Cytoscape PSICQUIC support
• Cytoscape 2.5
• Open “PSI-MI XML 1.0” files
• Cytoscape 2.7
• Open “PSI-MI XML 1.0” files
• Import interactions from PSICQUIC (plug-in)
• Cytoscape 3.0
• Open “PSI-MI TAB 2.5” files
• Import interactions from PSICQUIC (core)
• Merge/cluster PSICQUIC interactions
13/12/2018
56
Cytoscape 3.0 PSICQUIC import
13/12/2018
57
Thanks to ...
Keiichiro Ono
and the cytoscape team
0.90
0.380.25
—0.63
0.45
0.71
0.40
0.11
PSICQUIC PSISCORE
Query Interactions
PSI-MI
0.90
0.380.25
—0.63
0.45
0.71
0.40
0.11
PSICQUIC
Registry
Confidence Scores
PSI-MIMIQL
PSISCORE
Registry
client
Input Input
Output Output
PSICQUIC
Service A
PSICQUIC
Service B
PSISCORE
Service D
PSISCORE
Service E
PSICQUIC
Service C
User View
Web Service System
PSISCORE registry
• 4 sources
• 14 scoring methods
http://code.google.com/p/psiscore/
PSISCORE client
PSI-MI
Data format
Data distribution
Control vocabulary
Reporting guidelines
Standard format
Tools
PSICQUIC
PSI-MI CV
MIMIx
PSI-MI XML
PSI-MITAB
XML Java API
MITAB Java API
XMLMakerFlattener
Semantic Validator
RPsiXML (Bioconductor)
PSI Excel Sheet submission
PSI Web Form submission
Servers
Registry
Clients
PSISCORE Servers
Registry
Clients
IMEx
TOC
Data distribution best practices
IMEx: The International Molecular Exchange Consortium
• Group of major public interaction data providers sharing
curation effort: DIP, IntAct, MINT, MPact, MatrixDB, MPIDB and BioGRID
• Independent molecular interaction resources
• Common curation standards for detailed curation
• Common data formats (PSI-MI XML, PSI-MITAB, PSICQUIC)
• Common accession number space
• Coordinated & non-redundant curation
• In production mode since February 2010
• Since 3/2009 supported by the European Commission
under PSIMEx, contract number FP7-HEALTH-2007-223411, with additional partners Vital-IT, Nature,
Wiley, BiaCore (GE), U. Maryland, CSIC, TU Munich, MIPS, SCBIT (Shanghai)
http://www.imexconsortium.org
IMEx website http://www.imexconsortium.org
PSICQUIC client: PSICQUIC view in IMEx
http://www.imexconsortium.org
Thank you!
13/12/2018
65
ProteomicsServicesTeam

More Related Content

Similar to PSI-MI & PSICQUIC. Community effort to provide molecular interaction data.

PSI-MI standards and PSICQUIC
PSI-MI standards and PSICQUICPSI-MI standards and PSICQUIC
PSI-MI standards and PSICQUICRafael C. Jimenez
 
EMBL-EBI Proteomics data resources and services
EMBL-EBI Proteomics data resources and servicesEMBL-EBI Proteomics data resources and services
EMBL-EBI Proteomics data resources and servicesRafael C. Jimenez
 
Clustering and scoring molecular interactions relying on community standards
Clustering and scoring molecular interactions relying on community standardsClustering and scoring molecular interactions relying on community standards
Clustering and scoring molecular interactions relying on community standardsRafael C. Jimenez
 
Affinomics Bioinformatics Meeting
Affinomics Bioinformatics MeetingAffinomics Bioinformatics Meeting
Affinomics Bioinformatics MeetingRafael C. Jimenez
 
NetBioSIG2013-Talk Robin Haw
NetBioSIG2013-Talk Robin Haw NetBioSIG2013-Talk Robin Haw
NetBioSIG2013-Talk Robin Haw Alexander Pico
 
Project Focused Activity And Knowledge Tracker A Unified Data Analysis Collab...
Project Focused Activity And Knowledge Tracker A Unified Data Analysis Collab...Project Focused Activity And Knowledge Tracker A Unified Data Analysis Collab...
Project Focused Activity And Knowledge Tracker A Unified Data Analysis Collab...brosiusad
 
Cedar Overview
Cedar OverviewCedar Overview
Cedar Overviewjbgraybeal
 
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...Rafael C. Jimenez
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Anita de Waard
 
Presentation forpd bj_1
Presentation forpd bj_1Presentation forpd bj_1
Presentation forpd bj_1Maori Ito
 
RDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupRDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupAnita de Waard
 
ELIXIR . Technical Coordinator
ELIXIR. Technical CoordinatorELIXIR. Technical Coordinator
ELIXIR . Technical CoordinatorRafael C. Jimenez
 
A systematic review of network analyst - Pubrica
A systematic review of network analyst - PubricaA systematic review of network analyst - Pubrica
A systematic review of network analyst - PubricaPubrica
 
User-friendly bioinformatics (Monthly Informational workshop)
User-friendly bioinformatics (Monthly Informational workshop)User-friendly bioinformatics (Monthly Informational workshop)
User-friendly bioinformatics (Monthly Informational workshop)Elia Brodsky
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformSanjay Padhi, Ph.D
 
The Role of Metadata in Reproducible Computational Research
The Role of Metadata in Reproducible Computational ResearchThe Role of Metadata in Reproducible Computational Research
The Role of Metadata in Reproducible Computational ResearchJeremy Leipzig
 
2014 genome informatics Linked Data
2014 genome informatics Linked Data2014 genome informatics Linked Data
2014 genome informatics Linked DataENCODE-DCC
 
Jisc Research Data Shared Service Open Repositories 2018 Paper
Jisc Research Data Shared Service Open Repositories 2018 PaperJisc Research Data Shared Service Open Repositories 2018 Paper
Jisc Research Data Shared Service Open Repositories 2018 PaperJisc RDM
 

Similar to PSI-MI & PSICQUIC. Community effort to provide molecular interaction data. (20)

PSI-MI standards and PSICQUIC
PSI-MI standards and PSICQUICPSI-MI standards and PSICQUIC
PSI-MI standards and PSICQUIC
 
EMBL-EBI Proteomics data resources and services
EMBL-EBI Proteomics data resources and servicesEMBL-EBI Proteomics data resources and services
EMBL-EBI Proteomics data resources and services
 
Clustering and scoring molecular interactions relying on community standards
Clustering and scoring molecular interactions relying on community standardsClustering and scoring molecular interactions relying on community standards
Clustering and scoring molecular interactions relying on community standards
 
Affinomics Bioinformatics Meeting
Affinomics Bioinformatics MeetingAffinomics Bioinformatics Meeting
Affinomics Bioinformatics Meeting
 
NetBioSIG2013-Talk Robin Haw
NetBioSIG2013-Talk Robin Haw NetBioSIG2013-Talk Robin Haw
NetBioSIG2013-Talk Robin Haw
 
Project Focused Activity And Knowledge Tracker A Unified Data Analysis Collab...
Project Focused Activity And Knowledge Tracker A Unified Data Analysis Collab...Project Focused Activity And Knowledge Tracker A Unified Data Analysis Collab...
Project Focused Activity And Knowledge Tracker A Unified Data Analysis Collab...
 
Overview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data AnalysisOverview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data Analysis
 
Cedar Overview
Cedar OverviewCedar Overview
Cedar Overview
 
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
Presentation forpd bj_1
Presentation forpd bj_1Presentation forpd bj_1
Presentation forpd bj_1
 
RDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupRDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest Group
 
ELIXIR . Technical Coordinator
ELIXIR. Technical CoordinatorELIXIR. Technical Coordinator
ELIXIR . Technical Coordinator
 
A systematic review of network analyst - Pubrica
A systematic review of network analyst - PubricaA systematic review of network analyst - Pubrica
A systematic review of network analyst - Pubrica
 
User-friendly bioinformatics (Monthly Informational workshop)
User-friendly bioinformatics (Monthly Informational workshop)User-friendly bioinformatics (Monthly Informational workshop)
User-friendly bioinformatics (Monthly Informational workshop)
 
Standards
StandardsStandards
Standards
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
 
The Role of Metadata in Reproducible Computational Research
The Role of Metadata in Reproducible Computational ResearchThe Role of Metadata in Reproducible Computational Research
The Role of Metadata in Reproducible Computational Research
 
2014 genome informatics Linked Data
2014 genome informatics Linked Data2014 genome informatics Linked Data
2014 genome informatics Linked Data
 
Jisc Research Data Shared Service Open Repositories 2018 Paper
Jisc Research Data Shared Service Open Repositories 2018 PaperJisc Research Data Shared Service Open Repositories 2018 Paper
Jisc Research Data Shared Service Open Repositories 2018 Paper
 

More from Rafael C. Jimenez

BMB Resource Integration Workshop
BMB Resource Integration WorkshopBMB Resource Integration Workshop
BMB Resource Integration Workshop Rafael C. Jimenez
 
Proteomics repositories integration using EUDAT resources
Proteomics repositories integration using EUDAT resourcesProteomics repositories integration using EUDAT resources
Proteomics repositories integration using EUDAT resourcesRafael C. Jimenez
 
Summary of Technical Coordinators discussions
Summary of Technical Coordinators discussionsSummary of Technical Coordinators discussions
Summary of Technical Coordinators discussionsRafael C. Jimenez
 
The European life-science data infrastructure: Data, Computing and Services ...
The European life-science data infrastructure: Data, Computing and Services ...The European life-science data infrastructure: Data, Computing and Services ...
The European life-science data infrastructure: Data, Computing and Services ...Rafael C. Jimenez
 
Standardisation in BMS European infrastructures
Standardisation in BMS European infrastructuresStandardisation in BMS European infrastructures
Standardisation in BMS European infrastructuresRafael C. Jimenez
 
An introduction to programmatic access
An introduction to programmatic accessAn introduction to programmatic access
An introduction to programmatic accessRafael C. Jimenez
 
Life science requirements from e-infrastructure: initial results from a joint...
Life science requirements from e-infrastructure:initial results from a joint...Life science requirements from e-infrastructure:initial results from a joint...
Life science requirements from e-infrastructure: initial results from a joint...Rafael C. Jimenez
 
Technical activities in ELIXIR Europe
Technical activities in ELIXIR EuropeTechnical activities in ELIXIR Europe
Technical activities in ELIXIR EuropeRafael C. Jimenez
 
Challenges of big data. Summary day 1.
Challenges of big data. Summary day 1.Challenges of big data. Summary day 1.
Challenges of big data. Summary day 1.Rafael C. Jimenez
 
Challenges of big data. Aims of the workshop.
Challenges of big data. Aims of the workshop.Challenges of big data. Aims of the workshop.
Challenges of big data. Aims of the workshop.Rafael C. Jimenez
 
Data submissions and archiving raw data in life sciences. A pilot with Proteo...
Data submissions and archiving raw data in life sciences. A pilot with Proteo...Data submissions and archiving raw data in life sciences. A pilot with Proteo...
Data submissions and archiving raw data in life sciences. A pilot with Proteo...Rafael C. Jimenez
 
ELIXIR and data grand challenges in life sciences
ELIXIR and data grand challenges in life sciencesELIXIR and data grand challenges in life sciences
ELIXIR and data grand challenges in life sciencesRafael C. Jimenez
 
SASI, A lightweight standard for exchanging course information
SASI, A lightweight standard for exchanging course informationSASI, A lightweight standard for exchanging course information
SASI, A lightweight standard for exchanging course information Rafael C. Jimenez
 

More from Rafael C. Jimenez (20)

BMB Resource Integration Workshop
BMB Resource Integration WorkshopBMB Resource Integration Workshop
BMB Resource Integration Workshop
 
ELIXIR
ELIXIRELIXIR
ELIXIR
 
Proteomics repositories integration using EUDAT resources
Proteomics repositories integration using EUDAT resourcesProteomics repositories integration using EUDAT resources
Proteomics repositories integration using EUDAT resources
 
ELIXIR
ELIXIRELIXIR
ELIXIR
 
Summary of Technical Coordinators discussions
Summary of Technical Coordinators discussionsSummary of Technical Coordinators discussions
Summary of Technical Coordinators discussions
 
ELIXIR
ELIXIRELIXIR
ELIXIR
 
The European life-science data infrastructure: Data, Computing and Services ...
The European life-science data infrastructure: Data, Computing and Services ...The European life-science data infrastructure: Data, Computing and Services ...
The European life-science data infrastructure: Data, Computing and Services ...
 
Standardisation in BMS European infrastructures
Standardisation in BMS European infrastructuresStandardisation in BMS European infrastructures
Standardisation in BMS European infrastructures
 
ELIXIR
ELIXIRELIXIR
ELIXIR
 
ELIXIR
ELIXIRELIXIR
ELIXIR
 
ELIXIR TCG update
ELIXIR TCG updateELIXIR TCG update
ELIXIR TCG update
 
An introduction to programmatic access
An introduction to programmatic accessAn introduction to programmatic access
An introduction to programmatic access
 
Life science requirements from e-infrastructure: initial results from a joint...
Life science requirements from e-infrastructure:initial results from a joint...Life science requirements from e-infrastructure:initial results from a joint...
Life science requirements from e-infrastructure: initial results from a joint...
 
Technical activities in ELIXIR Europe
Technical activities in ELIXIR EuropeTechnical activities in ELIXIR Europe
Technical activities in ELIXIR Europe
 
Challenges of big data. Summary day 1.
Challenges of big data. Summary day 1.Challenges of big data. Summary day 1.
Challenges of big data. Summary day 1.
 
Challenges of big data. Aims of the workshop.
Challenges of big data. Aims of the workshop.Challenges of big data. Aims of the workshop.
Challenges of big data. Aims of the workshop.
 
Data submissions and archiving raw data in life sciences. A pilot with Proteo...
Data submissions and archiving raw data in life sciences. A pilot with Proteo...Data submissions and archiving raw data in life sciences. A pilot with Proteo...
Data submissions and archiving raw data in life sciences. A pilot with Proteo...
 
ELIXIR and data grand challenges in life sciences
ELIXIR and data grand challenges in life sciencesELIXIR and data grand challenges in life sciences
ELIXIR and data grand challenges in life sciences
 
SASI, A lightweight standard for exchanging course information
SASI, A lightweight standard for exchanging course informationSASI, A lightweight standard for exchanging course information
SASI, A lightweight standard for exchanging course information
 
ELIXIR
ELIXIRELIXIR
ELIXIR
 

Recently uploaded

Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...nirzagarg
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.pptibrahimabdi22
 
Top profile Call Girls In Rohtak [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Rohtak [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Rohtak [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Rohtak [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...HyderabadDolls
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...HyderabadDolls
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabiaahmedjiabur940
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...nirzagarg
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...gajnagarg
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...SOFTTECHHUB
 
Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?RemarkSemacio
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制vexqp
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRajesh Mondal
 
Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...gajnagarg
 
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...HyderabadDolls
 
Vastral Call Girls Book Now 7737669865 Top Class Escort Service Available
Vastral Call Girls Book Now 7737669865 Top Class Escort Service AvailableVastral Call Girls Book Now 7737669865 Top Class Escort Service Available
Vastral Call Girls Book Now 7737669865 Top Class Escort Service Availablegargpaaro
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...HyderabadDolls
 
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...vershagrag
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...gajnagarg
 

Recently uploaded (20)

Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt
 
Top profile Call Girls In Rohtak [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Rohtak [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Rohtak [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Rohtak [ 7014168258 ] Call Me For Genuine Models We...
 
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
 
Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...
 
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
 
Vastral Call Girls Book Now 7737669865 Top Class Escort Service Available
Vastral Call Girls Book Now 7737669865 Top Class Escort Service AvailableVastral Call Girls Book Now 7737669865 Top Class Escort Service Available
Vastral Call Girls Book Now 7737669865 Top Class Escort Service Available
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
 
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
 

PSI-MI & PSICQUIC. Community effort to provide molecular interaction data.

  • 1. Community effort to provide molecular interaction data PSI-MI & PSICQUIC Programmatic Access To Biological Databases (Perl) 1 – 4 October 2012 Marine Dumousseau, Noemi Del Toro Ayllon and Rafael C. Jimenez marine@ebi.ac.uk, ntoro@ebi.ac.uk and rafael@ebi.ac.uk
  • 2. PSI-MI Data format Data distribution Control vocabulary Reporting guidelines Standard format Tools PSICQUIC PSI-MI CV MIMIx PSI-MI XML PSI-MITAB XML Java API MITAB Java API XMLMakerFlattener Semantic Validator RPsiXML (Bioconductor) PSI Excel Sheet submission PSI Web Form submission Servers Registry Clients PSISCORE Servers Registry Clients IMEx TOC Data distribution best practices 1 2 3 4 5 6
  • 3. • Proteomics Standards Initiative • Work group of the Human Proteome Organization • Defines community standards for data in proteomics • … facilitating data comparison, exchange and verification PSI 3
  • 4. PSI-MI • Work group of the Proteomics Standards Initiative • Community coordination to ensure deposition of Molecular Interaction data in public repositories • Concentrating on … • Annotation and representation of published MI data • Accessibility of MI data to the user community http://www.psidev.info/MI
  • 6. PSI-MI Data format Data distribution Control vocabulary Reporting guidelines Standard format Tools PSICQUIC PSI-MI CV MIMIx PSI-MI XML PSI-MITAB XML Java API MITAB Java API XMLMakerFlattener Semantic Validator RPsiXML (Bioconductor) PSI Excel Sheet submission PSI Web Form submission Servers Registry Clients PSISCORE Servers Registry Clients IMEx TOC Data distribution best practices
  • 7. PSI-MI Data format/schema Data distribution Control vocabulary MIAPE Reporting guideline PSI-MI XML PSI-MITAB PSICQUIC MIMIx IMEx PSI-MI CV http://www.psidev.info/MI Scoring PSISCORE
  • 8. PSI-MI format • Community standard for Molecular Interactions • Jointly developed by major data providers: BIND, CellZome, DIP, GSK, HPRD, Hybrigenics, IntAct, MINT, MIPS, Serono, U. Bielefeld, U. Bordeaux, U. Cambridge, and others • Collecting and combining data from different sources has become easier • Standardized annotation through PSI-MI ontologies • Tools from different organizations can be chained, e.g. IntAct data in Cytoscape. 8 psi-mi/xml25 psi-mi/tab25
  • 9. 9 PSI-MI 2.5 Standards • Version 1.0 published in February 2004 The HUPO PSI Molecular Interaction Format - A community standard for the representation of protein interaction data.Henning Hermjakob et al, Nature Biotechnology 2004, 22, 176-183. • Version 2.5 published in October 2007
  • 10. PSI-MI XML: A simplified view of the schema Participant Participant Participant Interactor Interactor Feature RangeRange Feature Experiment Entry EntrySet Interaction Interaction XML specification: http://code.google.com/p/psimi/wiki/PsimiXMLSpecifications Schema view: http://psidev.sourceforge.net/molecular_interactions/rel25/doc/ Schema: http://psidev.sourceforge.net/mi/rel25/src/MIF25.xsd General docuemntation: http://www.psidev.info/node/60#overlay-context=node/60
  • 11. “Compact” format Interaction Interaction experimentDescription id=“1” experimentDescription id=“2” experimentRef=“1” interactor id=“3” interactor id=“4” Participant id=“5” interactorRef=“3” Participant id=“6” interactorRef=“4” Interaction id=“7” Interaction id=“8” “Expanded” formatPSI-MI XML
  • 14. PSI-MITAB • Released at the same time that PSI-MI XML 2.5 • Aimed at users that are more comfortable with Excel • Only provides binary interactions • Latest version: PSI-MI TAB 2.7 14 psi-mi/tab27 PSI-MI TAB 2.5: http://code.google.com/p/psimi/wiki/PsimiTabFormat PSI-MI TAB 2.7: http://code.google.com/p/psimi/wiki/PsimiTab27Format General information: http://www.psidev.info/groups/molecular-interactions
  • 15. 15 Standard columns (15): • ID(s) interactor A & B • Alt. ID(s) interactor A & B • Alias(es) interactor A & B • Interaction detection method(s) • Publication 1st author(s) • Publication Identifier(s) • Taxid interactor A & B • Interaction type(s) • Source database(s) • Interaction identifier(s) • Confidence value(s) PSI-MITAB 2.5 columns 15 v2.5
  • 16. 16 Standard columns (15): • ID(s) interactor A & B • Alt. ID(s) interactor A & B • Alias(es) interactor A & B • Interaction detection method(s) • Publication 1st author(s) • Publication Identifier(s) • Taxid interactor A & B • Interaction type(s) • Source database(s) • Interaction identifier(s) • Confidence value(s) Standard columns (21): • Complex expansion • Biological role A & B • Experimental role A & B • Interactor type A & B • Xrefs A, B & Int. • Annotations A, B & Int. • Host organism • Parameters Int. • Created • Updated • CheckSum A, B & Int. • Negative Standard columns (4): • Binding feature A & B • Stoichiometry A & B v2.5 v2.6 v2.7 15 36 40 PSI-MITAB 2.7 columns + +
  • 17. PSI-MI format: Tools • XML Java API (PSI-MI XML 2.5 Java Parser) • Parse “PSI-MI XML” • Create “PSI-MI XML” • MITAB Java API (PSI-MITAB 2.5 Java Parser) • Parse “PSI-MITAB” • Create “PSI-MITAB” • XMLMakerFlattener • “PSI MI XML” to “Tab-delimited format” • “Tab-delimited format” to “PSI MI XML” • XML Validator • Semantic and syntactic consistency • XML transformation: • MIF25_view.xsl “XML” to “HTML” • MIF25_compact.xsl PSI-MI XML “expanded” to “compact” • MIF25_expand.xsl PSI-MI XML “compact” to “expanded” 17 More information: http://www.psidev.info/mif#tools
  • 18. Data submission tools PSI-MI XML files PSI Excel Sheet PSI Web Form More information: http://www.psidev.info/mif#tools
  • 19. PSI-MI Data format Data distribution Control vocabulary Reporting guidelines Standard format Tools PSICQUIC PSI-MI CV MIMIx PSI-MI XML PSI-MITAB XML Java API MITAB Java API XMLMakerFlattener Semantic Validator RPsiXML (Bioconductor) PSI Excel Sheet submission PSI Web Form submission Servers Registry Clients PSISCORE Servers Registry Clients IMEx TOC Data distribution best practices
  • 20. 20 • OLS Makes available OBO controlled vocabularies • Web site allows for searching and browsing their hierarchy • Programmatic access though SOAP web services PSI-MI CV in the Ontology Lookup Service
  • 21. Control vocabulary: PSI-MI CV • Ontology browser: http://www.ebi.ac.uk/ontology-lookup
  • 22. PSI-MI Data format Data distribution Control vocabulary Reporting guidelines Standard format Tools PSICQUIC PSI-MI CV MIMIx PSI-MI XML PSI-MITAB XML Java API MITAB Java API XMLMakerFlattener Semantic Validator RPsiXML (Bioconductor) PSI Excel Sheet submission PSI Web Form submission Servers Registry Clients PSISCORE Servers Registry Clients IMEx TOC Data distribution best practices
  • 23. MIMIx • Minimum information guideline for reporting molecular interactions • 1. Manuscript information • 2. Experiment • 3. Interaction • 4. Confidence http://www.nature.com/nbt/journal/v25/n8/full/nbt1324.html
  • 25. Data distribution best practices 13/12/2018 25 http://code.google.com/p/psicquic/wiki/DataDistributionBestPractices
  • 26. PSI-MI Data format Data distribution Control vocabulary Reporting guidelines Standard format Tools PSICQUIC PSI-MI CV MIMIx PSI-MI XML PSI-MITAB XML Java API MITAB Java API XMLMakerFlattener Semantic Validator RPsiXML (Bioconductor) PSI Excel Sheet submission PSI Web Form submission Servers Registry Clients PSISCORE Servers Registry Clients IMEx TOC Data distribution best practices
  • 27. Data distribution: PSICQUIC • Proteomics Standards Initiative Common QUery InterfaCe. • Community effort to standardise the way to access and retrieve data from Molecular Interaction databases. • Widely implemented by independent interaction data resources. • Based on the PSI standard formats (PSI-MI XML and MITAB) • Not limited to protein-protein interactions, also e.g. • Drug-target interactions • Simplified pathway data • A registry listing resources implementing PSICQUIC • Documentation: http://psicquic.googlecode.com
  • 30. IntAct And PSICQUICBruno Aranda (baranda@ebi.ac.uk) “The one Client to rule them all”
  • 31. • Services • IntAct • Biogrid • String • ... • Registry • Clients • PSICQUIC view • Cytoscape • … 13.12.201831 Service Oriented Architecture query PublishFind PSICQUIC Registry DAS ClientsDAS ClientsPSICQUIC Clients Format PSICQUIC sources PSICQUIC sources PSICQUIC Services MIQL PSI-MI
  • 32. PSIQUIC Proteomics Standard Initiative Common QUery InterfaCe 13/12/2018 32 PSICQUIC Query Interactions PSICQUIC Registry PSI-MIMIQL Input Output PSICQUIC Service A PSICQUIC Service B PSICQUIC Service C User View Web Service System application
  • 33. PSICQUIC Registry • It contains a list of the PSICQUIC services from different providers. • It is a web service itself, and it can be accessed remotely using REST. • Information can be found about the services, such as the URLs to use, number of interactions provided, versioning, tags, etc.
  • 34. PSICQUIC Registry • ~ 150 million interactions from 25 resources • ~ 2.5 million interactions from 16 curated resources http://www.ebi.ac.uk/Tools/webservices/psicquic/registry/registry?action=STATUS (*) As of February 2012
  • 35. Registry Tagging system • The registry classifies the different services with tags.
  • 36. PSICQUIC services 36 • PSICQUIC services are Web Services (REST & SOAP) • Independent databases • Adopting the same specification • Implementing the same interface • Same methods to query services • Same format to provide results PSICQUIC serviceMIQL PSI-MI input output Query language Format
  • 38. PSICQUIC REST Query explained http://www.ebi.ac.uk/Tools/webservices/psicquic/intact/webservices/current/search/query/species:human?firstResult=0&maxResults=100 Example:
  • 39. PSICQUIC REST queries Bruno Aranda (baranda@ebi.ac.uk) http://mint.bio.uniroma2.it/mint/psicquic/webservices/current/search/query/p53 http://www.ebi.ac.uk/Tools/webservices/psicquic/intact/webservices/current/search/query/p53 http://www.ebi.ac.uk/Tools/webservices/psicquic/chembl/webservices/current/search/query/p53 1 2 3
  • 40. MIQL: Molecular Interactions Query Language • The Molecular Interactions Query Language (MIQL) allows more powerful and flexible queries. • It is the default query syntax for PSIQCUIC. • Designed for fast and effective searches on PSI-MI TAB files. • All fields (columns) can be searched with specific queries. • MIQL is a consensus between the different databases, so you should be able to use the same query across different repositories.
  • 43. PSICQUIC SOAP service species:trypanosoma AND detmethod:’two hybrid’
  • 44. SOAP Methods • getByInteraction Retrieves interactions by using an interaction AC. • getByInteractionList Retrieves interactions by using a list of interaction AC. • getByInteractor Retrieves interactions by using a participant identifier. • getByInteractorList Retrieves interactions by using a list of participant identifiers. • getByQuery Retrieves interactions by using a Molecular Interaction Query Language (MIQL) query (full text searches)
  • 45. SOAP Methods Other metadata methods: • getVersion Returns the version of the web service implementation. • getSupportedDbAcs Returns the supported database identifiers • getSupportedReturnTypes Returns the list of available format types for the results. • A limited number of interactions can be fetched. It is possible to retrieve large datasets using pagination. Most methods have two additional parameters: •First result: Index for the first result to retrieve. •Max results: Number of interactions returned per query.
  • 46. Default formats • The default formats are: • PSI-MI XML 2.5.4 (psi-mi/xml25) • PSI MITAB 2.5 (psi-mi/tab25) • PSI MITAB 2.7 (psi-mi/tab27) • Compressed MITAB (tab25-bin) – Only REST • Count (count) – Only REST • New formats will be included in the future (work in progress): • BioPAX (biopax) • And other RDF formats (rdf-xml / rdf-n3 / rdf-n3-triple / rdf-turtle) • (so it will be possible to use PSICQUIC in the semantic web!)
  • 47. IntAct And PSICQUICBruno Aranda (baranda@ebi.ac.uk) psi-mi/xml25 psi-mi/tab25 psi-mi/tab27
  • 51. PSICQUIC clients 13/12/2018 51 PSICQUIC view Cytoscape www.ebi.ac.uk/Tools/webservices/psicquic/view Web client http://code.google.com/p/psicquic/wiki/CytoscapeClient Desktop application
  • 52. PSICQUIC view • Simple and complex queries • Link back to the original source for more details • Clustering of queries across providers • Visualization of graphical network http://www.ebi.ac.uk/Tools/webservices/psicquic/view/
  • 56. Cytoscape PSICQUIC support • Cytoscape 2.5 • Open “PSI-MI XML 1.0” files • Cytoscape 2.7 • Open “PSI-MI XML 1.0” files • Import interactions from PSICQUIC (plug-in) • Cytoscape 3.0 • Open “PSI-MI TAB 2.5” files • Import interactions from PSICQUIC (core) • Merge/cluster PSICQUIC interactions 13/12/2018 56
  • 57. Cytoscape 3.0 PSICQUIC import 13/12/2018 57 Thanks to ... Keiichiro Ono and the cytoscape team
  • 58. 0.90 0.380.25 —0.63 0.45 0.71 0.40 0.11 PSICQUIC PSISCORE Query Interactions PSI-MI 0.90 0.380.25 —0.63 0.45 0.71 0.40 0.11 PSICQUIC Registry Confidence Scores PSI-MIMIQL PSISCORE Registry client Input Input Output Output PSICQUIC Service A PSICQUIC Service B PSISCORE Service D PSISCORE Service E PSICQUIC Service C User View Web Service System
  • 59. PSISCORE registry • 4 sources • 14 scoring methods http://code.google.com/p/psiscore/
  • 61. PSI-MI Data format Data distribution Control vocabulary Reporting guidelines Standard format Tools PSICQUIC PSI-MI CV MIMIx PSI-MI XML PSI-MITAB XML Java API MITAB Java API XMLMakerFlattener Semantic Validator RPsiXML (Bioconductor) PSI Excel Sheet submission PSI Web Form submission Servers Registry Clients PSISCORE Servers Registry Clients IMEx TOC Data distribution best practices
  • 62. IMEx: The International Molecular Exchange Consortium • Group of major public interaction data providers sharing curation effort: DIP, IntAct, MINT, MPact, MatrixDB, MPIDB and BioGRID • Independent molecular interaction resources • Common curation standards for detailed curation • Common data formats (PSI-MI XML, PSI-MITAB, PSICQUIC) • Common accession number space • Coordinated & non-redundant curation • In production mode since February 2010 • Since 3/2009 supported by the European Commission under PSIMEx, contract number FP7-HEALTH-2007-223411, with additional partners Vital-IT, Nature, Wiley, BiaCore (GE), U. Maryland, CSIC, TU Munich, MIPS, SCBIT (Shanghai) http://www.imexconsortium.org
  • 64. PSICQUIC client: PSICQUIC view in IMEx http://www.imexconsortium.org