Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
ELIXIR
1. European Life Sciences Infrastructure for Biological Information
www.elixir-europe.org
Rafael C Jimenez
ELIXIR Hub technical coordinator
ELIXIR
2. ELIXIR
• European life sciences research infrastructure for
biological information to facilitate research
• Safeguard data and build sustainable data
services
• Creating a robust infrastructure for biological
information is a bigger task than any individual
organisation or nation can take on alone
2
3. ELIXIR Members
3
ECA members
•Czech Rep
•Norway
•Denmark
•Portugal
•Estonia
•Sweden
•Israel
•Switzerland
•Netherlands
•UK
•EMBL-EBI
MoU countries
•Belgium
•Finland
•France
•Greece
•Italy
•Slovenia
•Spain
Participated by major bioinformatics service providers and
supported by 17 EU member states
4. A distributed infrastructure to scale with the
challenges
• ELIXIR deliver services
through national
ELIXIR Nodes
• ELIXIR Nodes build
local bioinformatics
capacity throughout
Europe
• ELIXIR Nodes build on
national strengths and
priorities
4
http://www.elixir-europe.org/about/elixir-nodes
6. Quality control
Service delivery plan
6
ELIXIR Technical
Strategy Metrics
Monitoring
Service delivery plan
Core resources Named services
Life cycle
Collaboration
agreements
Node proposals
7. Acceleration towards sustained operations
• Key objectives:
• In 2014: Build on success of Interim Phase, formal establishment of
Nodes
• In 2015: Start delivering ELIXIR services
• From 2016 onwards: sustained operations across ELIXIR
7
12. 7 | 62
Figure 2 Together, the biomedical science research infrastructuresaddresssocietal challenges
By establishing interoperability between data and services in the biological,
medical, translational and clinical domains, BioMedBridges links basic
BioMedBridges
Biomedical sciences research infrastructures
stronger through common links
• FP7-funded cluster project
• 21 project partners in 9
countries
• Computational ‘data and
service’ bridges between the
BMS RIs
• Interoperability between
data and services in the
biological, medical,
translational and clinical
domains
12
13. Infrastructure for Life Sciences
13
Services & connectors
to drive access and
exploitation
Integration and interoperability
of data and services
Sustain core data
resources
Access, Exchange & Compute
on sensitive data
Compute
Dat
a
Standards
Tools
Training
Professional skills for
managing and exploiting data
Access, Search, Analysis …
Integration, Optimization, Privacy, …
Storage, Network & Computing
Formats, Ontologies, Guidelines, …
Scientific & technical
14. Programme of work
14
Domain specific services
Data interoperability, vocabulary and ontology services
Tools interoperability and Service Registry
Data resources & services
Technical Services
Management and operations
Training
Compute
Dat
a
Standards
Tools
Training
15. • Working groups to coordinate the ELIXIR technical strategy
• Driven by national technical leads
• Plan, agree and implement technical strategies
• Represent ELIXIR on one specific technical topic
Task forces
15
Scientific
and funding
strategies
ELIXIR Programme of work
Task forces
Planning Agreement Implementation
Technical strategies
Pilot projects
HoNs
HoNs
HoNs
TCG
16. ELIXIRTask Forces
16
Task forces Programme of work
Cloud Technical services
Storage Technical services
Authentication and authorization Technical services
Service registry Tools Interoperability and Service Registry
Metrics, monitoring & quality control Data resources and services
Communication Management and Operations
Website Management and Operations
Training portal Training
e-Learning Training
17. European Life Sciences Infrastructure for Biological Information
www.elixir-europe.org
Rafael C Jimenez
ELIXIR Hub technical coordinator
Standards
18. Data integration issues
Many data sources
• Maintain and update
• New appearing
• Many vanishing*
Different query interfaces
data integration?
Variable results
• Syntax
• Semantics
• Minimum information
* Merali Z. et all. Databases in peril. Nature 2005.
Where to find them?
Redundant data?
19. Why so many databases?
• Diverse data types
• Many communities
• Different ways to structure data
• Control, reputation, easy to publish
• New funding normally just encourage to do something new
19
1 2 3 4
funding
time (years)
development
Are they sustainable?
* Merali Z. et all. Databases in peril. Nature 2005.
• Just 20% has a sustained future*
21. Improving Links Between distributed European
resources
ELIXIR pilot: Interoperability of protein expressions resources
The Human Protein Atlas portal is a publicly available database
with millions of high-resolution images showing the spatial
distribution of proteins in 46 different normal human tissues and
20 different cancer types, as well as 47 different human cell lines.
22. Communities organized per domain
• Produce technical standards intended to address the needs of
a community of users.
develop, coordinate, promulgate, revise, amend, reissue, interpret
22
23. ELIXIR role
• Support communities developing standards
• Establish links among communities and standards
• Promote the adoption of standards
• Help to find the gaps among standards
• Recommend standards best practices in data sharing
23
24. • Proteomics Standards Initiative
• Work group of the Human Proteome Organization
• Defines community standards for data in proteomics
• … facilitating data comparison, exchange and verification
PSI
24
http://www.psidev.info/
25. 25
PSI-MI
Data format
Data distribution
Control vocabulary
Data submission
Standard format
Tools
PSICQUIC
PSI-MI CV
Reporting guideline MIMIx
Tools
PSI-MI XML
PSI-MITAB
XML Java API
MITAB Java API
XMLMakerFlattener
XML Validator
MIF25_view.xsl
MIF25_compact.xsl
MIF25_expand.xsl
PSI-MI XML files
PSI Excel Sheet
PSI Web Form
Servers
Registry
Clients
35. Standards in data sharing
http://www.mrc.ac.uk/Utilities/Documentrecord/index.htm?d=MRC002552
36. Different formats for the same data
36
MI
Data
PSI-XML
PSI-MITAB
BioPax
RDF
Cytoscape
DAS • Comprehensive
• Simple
• Generic
• Domain specific
• Structured
37. BioMedBridges Knowledge ExchangeWorkshop
Tuesday 24 - Wednesday 25 June 2014.VUMC, Amsterdam,The Netherlands
Workshop organised byWP3 - Standards Description and
Harmonisation, to bring together BMB partners, biomedical
standards experts and representatives of external projects.
Best practice for identifiers
•Gap analysis of current identifiers
Development of the BMB standards registry
•Gap analysis for usage of the registry
•Integration of the registry with other tools
37
38. Identifiers Best Practice - purpose
• Recommendations for identifiers best practice
• Designing (format, re-use)
• Managing (creation, versioning, provenance, deprecation etc.)
• Using (resolving, mapping etc.)
• Publish a paper
• Introduction to identifier concepts
• Case Studies illustrating identifier usage in real-world scenarios
• Recommendations on best practice
• Show not tell
• Descriptive not normative
• fornon-experts/newcomers
• Gap analysis
• list the biological entities and identifiers type used by BMB partners
38
39. Identifiers Best Practice – topics 1/2
• Identifier formats
• syntax of database IDs, URI patterns
• Identifier management
• creation, versioning, provenance, deprecation
• Identifier resolution
• how to use an ID to get useful information about the entity
• services for this, e.g. Identifiers.org
• what info. should be given ?
39
40. Identifiers Best Practice – topics 2/2
• Identifier mapping / aggregation
• how to map IDs on entries in one resource to those in another, to
assign equivalence / make useful links
• e.g. IDs for equivalent protein sequences in different organisms
• e.g. probe->gene->pathway->function (GO)
• Identifier cataloguing
• compilations of types of identifiers (e.g. EDAM ontology) or specific
identifiers (e.g. Cell Line Ontology)
• what info. should be given?
• Case Studies
• use of IDs in a particular domain
40
41. Standards format registry - purpose
• Discovery, Agreement, Benchmarking, …
• Facilitate syntactic operability across research infrastructure
so samples and data can be integrated and analysed across
ESFRI BMS domains.
41
42. Standards format registry - topics
• Catalogue of standard
• Interoperability among registry
• identifiers.org, biosahring, EDAM, ELIXIR service registry
• Adaptat to user needs
• Community of users vs. community of producers
• Microstandards
• Standards mapping
• Access to exert knowledge
• Assess fit for purpose
• Rating/metrics
42
43. European Life Sciences Infrastructure for Biological Information
www.elixir-europe.org
Thank you
Belgium has not submitted a Node application, and can’t attend today, but it will be coordinated out of VIB, so will include plant sciences resources
Explain what ECA is
Creating a robust infrastructure for biological information is a bigger task than any individual organisation or nation can take on alone
These are issues of such complexity that no single institution or country can tackle alone
Norwegian bioinformatics platform – bringing together 5 norwegian universities / institutes
‘research infrastructures’ refers to facilities, resources and related services used by the scientific community
Physical facilities to conduct experiment or store samples
Repositories of data and analysis tools
‘research infrastructures’ refers to facilities, resources and related services used by the scientific community
Physical facilities to conduct experiment or store samples
Repositories of data and analysis tools
Previous example leads into BioMedBridges project that build bridges between the infrastructures and starting to develop data and service bridges to support research projects that of course will access and benefit from services involving several of these.
Programme of work
5 years plan
Work streams
Strategy focused on generic topic
Task forces
Define technical implementation of specific tasks
The HUPO Proteomics Standards Initiative (PSI) defines community standards for data representation in proteomics to facilitate data comparison, exchange and verification.
The PSI was founded at the HUPO meeting in Washington, April 28-29, 2002
MIAPE: The Minimum Information About a Proteomics Experiment .
Guidance document specifying the data and metadata that should be collected from proteomics experiments
Where samples came from and how analyses were performed
Data accompanied by context: 'metadata' ('data about the data')
HUPO
The Human Proteome Organisation (HUPO) is an international scientific organization representing and promoting proteomics through international cooperation and collaborations by fostering the development of new technologies, techniques and training.
PSI
The HUPO Proteomics Standards Initiative (PSI) defines community standards for data representation in proteomics to facilitate data comparison, exchange and verification.
PSI-Molecular Interactions
Work group of the Proteomics Standards Initiative focused on Molecular Interactions
IMEx
The IMEx consortium is a group of major public interaction data providers sharing curation effort.
IntAct
Open source database system and analysis tools for molecular interaction data including information derived from literature curation or direct user submissions.