Advertisement

BioSharing - EUDAT semantic workshop

Academic Lead for Research Practice; Professor of Data Readiness, Department of Engineering Science; Associate Director, Oxford e-Research Centre
Apr. 2, 2017
Advertisement

More Related Content

Slideshows for you(18)

Similar to BioSharing - EUDAT semantic workshop (20)

Advertisement

More from Susanna-Assunta Sansone(20)

Recently uploaded(20)

Advertisement

BioSharing - EUDAT semantic workshop

  1. n connecting standards, databases and data policies Susanna-Assunta Sansone Associate Director Oxford e-Research Centre, University of Oxford
  2. • Domain-level descriptors that are essential for interpretation, verification, reproducibility and reusability of datasets • The depth and breadth of descriptors vary according to the domain broadly covering the what, who, when, how and why Content standards
  3. Formats Terminologies Guidelines Content standards: three categories
  4. Minimum information reporting requirements, checklists o Report the same core, essential information o e.g. MIAME guidelines Controlled vocabularies, taxonomies, thesauri, ontologies etc. o Unambiguous identification and definition of concepts o e.g. Gene Ontology Conceptual model, schema, exchange formats etc o Define the structure and interrelation of information, and the transmission format o e.g. FASTA Formats Terminologies Guidelines Content standards: three categories
  5. Formats Terminologies Guidelines Community-driven initiatives de jure de facto grass-roots groups standard organizations Nanotechnology Working Group
  6. 883 -> ~1000 220+ 115+ 548 source source source Content standards in numbers Formats Terminologies Guidelines MIAME MIRIAM MIQASMIX MIGEN ARRIVE MIAPE MIASE MIQE MISFISHIE…. REMARK CONSORT SRAxml SOFT FASTA DICOM MzML SBRML SEDML… GELML ISA CML MITAB AAO CHEBIOBI PATO ENVO MOD BTO IDO… TEDDY PRO XAO DO VO MIAPPE Sample-Tab
  7. Content standards Data policies by funders, journals and other organizations Databases, tools and services Formats Terminologies Guidelines Mapping this evolving landscape
  8. Content standards Data policies by funders, journals and other organizations Databases, tools and services Formats Terminologies Guidelines a resource of the ELIXIR Interoperability Platform • A web-based, curated and searchable portal that monitors their development and evolution to inform and educate
  9. Not just quantity but quality: rich, curated and community vetted descriptions
  10. Indicators to describe the status of standards and databases Ready for use, implementation, or recommendation In development Status uncertain Deprecated as subsumed or superseded Manually curated and verified by the community behind each resource
  11. Tracking evolution, e.g.:
  12. Visualizing relations, e.g.: Data Policy List of their recommended databases and standards
  13. …to inform and educate on existing and new resources Data Policy
  14. Working with/for the community and our ‘adopters’, e.g.: Standard developing groups:Journal, publishers: Cross-links, data exchange: Societies and organisations: Institutional RDM services: Projects, programmes: 533 responders
  15. Progressively cross-linking with other ELIXIR resources Cross-links, data exchange: Societies and organisations: Standard developing groups:Journal, publishers: Institutional RDM services: Projects, programmes:
  16. • Increase discoverability (e.g. by search engines), aggregation (e.g. by indices) and analysis of content in different websites and services • use of schema.org structured semantic markup (for web pages’ content) by Google, Bing, Yahoo, Yandex • coordinate its extension, where needed, in the life science area Gaining traction and support by:
  17. Acknowledgements
Advertisement