Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

04 findable imming

20 views

Published on

F is for FAIR data: Findable
Sharing is Caring X Amsterdam 2019

Published in: Data & Analytics
  • Be the first to comment

  • Be the first to like this

04 findable imming

  1. 1. F is for FAIR data: @melimming DOI 10.5281/zenodo.3548811 Sharing is Caring, November 2019 Findable Melanie Imming, Imming Impact / SURF https://resolver.kb.nl/resolve?urn=urn:gvn:PRB01:158100123
  2. 2. To be Findable F1. (meta)data are assigned a globally unique and persistent identifier F2. data are described with rich metadata (defined by R1 below) F3. metadata clearly and explicitly include the identifier of the data it describes F4. (meta)data are registered or indexed in a searchable resource To be Accessible A1. (meta)data are retrievable by their identifier using a standardized communications protocol A1.1 the protocol is open, free, and universally implementable A1.2 the protocol allows for an authentication and authorization procedure, where necessary A2. metadata are accessible, even when the data are no longer available To be Interoperable I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation I2. (meta)data use vocabularies that follow FAIR principles I3. (meta)data include qualified references to other (meta)data To be Reusable R1. meta(data) are richly described with a plurality of accurate and relevant attributes R1.1. (meta)data are released with a clear and accessible data usage license R1.2. (meta)data are associated with detailed provenance R1.3. (meta)data meet domain-relevant community standards THE FAIR DATA GUIDING PRINCIPLES
  3. 3. To be Findable F1. (meta)data are assigned a globally unique and persistent identifier F2. data are described with rich metadata (defined by R1 below) F3. metadata clearly and explicitly include the identifier of the data it describes F4. (meta)data are registered or indexed in a searchable resource Findable: The first step THE FAIR DATA GUIDING PRINCIPLES
  4. 4. To be Findable F1. (meta)data are assigned a globally unique and persistent identifier F2. data are described with rich metadata (defined by R1 below) F3. metadata clearly and explicitly include the identifier of the data it describes F4. (meta)data are registered or indexed in a searchable resource Findable: The first step EASY? THE FAIR DATA GUIDING PRINCIPLES
  5. 5. FAIR DATA ADVANCED USE CASES: FROM PRINCIPLES TO PRACTICE IN THE NETHERLANDS Implementing FAIR is seen as a series of improvements ‘Going for FAIR’ is done one step at a time. Steps that focus on Findability and Accessibility are often taken up first. There are always steps ahead that can improve FAIR-ness even further. Machine readability is sometimes one of those next steps.http://doi.org/10.5281/zenodo.1250535
  6. 6. FAIR DATA ADVANCED USE CASES: FROM PRINCIPLES TO PRACTICE IN THE NETHERLANDS Implementing FAIR is seen as a series of improvements The importance of machine readable data is acknowledged in all use cases, but for less data driven communities there is a tendency to focus on human interoperability first. Full compliancy with the FAIR principles is not seen as easy to achieve, if possible at all. http://doi.org/10.5281/zenodo.1250535
  7. 7. To be Findable F1. (meta)data are assigned a globally unique and persistent identifier F2. data are described with rich metadata (defined by R1 below) F3. metadata clearly and explicitly include the identifier of the data it describes F4. (meta)data are registered or indexed in a searchable resource THE FAIR DATA GUIDING PRINCIPLES “It will be hard to achieve other aspects of FAIR without globally unique and persistent identifiers” https://www.go-fair.org/fair-principles/
  8. 8. • Identifiers can help other people understand exactly what you mean; • Identifiers allow computers to interpret your data in a meaningful way (i.e., computers that are searching for your data or trying to automatically integrate them); • In addition, identifiers will help others to properly cite your work when reusing your data. https://www.go-fair.org/fair-principles/
  9. 9. PID’S & POETRY: AN ANTHOLOGY: Hylke Koers, SURFsara
  10. 10. 7 PID PID’S & POETRY: AN ANTHOLOGY: Hylke Koers, SURFsara
  11. 11. 9 PID Persistent PID’S & POETRY: AN ANTHOLOGY: Hylke Koers, SURFsara
  12. 12. 9 PID Persistent Globally unique PID’S & POETRY: AN ANTHOLOGY: Hylke Koers, SURFsara
  13. 13. 10 PID Persistent Globally unique Actionable PID’S & POETRY: AN ANTHOLOGY: Hylke Koers, SURFsara
  14. 14. 14 Reference • Disambiguate • Share • Collaborate • Re-use Attribute • Cite • Recognize PID Identify Resolve • Dereference • Query, get more info PID’S & POETRY: AN ANTHOLOGY: Hylke Koers, SURFsara
  15. 15. There are multiple PID systems: • Archival Resource Keys (ARKs) • Digital Object Identifiers (DOIs) • Handle • OpenURL • Persistent Uniform Resource Locators (PURLs) • Uniform Resource Names (URNs) Each system has its own particular properties, strengths and weaknesses. Which system is most suited to your situation depends.
  16. 16. The PID Guide from the Digital Heritage Network’s Persistent Identifier project helps you learn and think about important PID subjects, and guides your first steps towards selecting a PID system.
  17. 17. https://www.slideshare.net/sjDCC/what-it-means-to-be-fair
  18. 18. PID hosting service @ SURF 23 140 120 100 80 60 40 20 0 #HANDLES(MILLION) https://www.surf.nl/en/data-persistent-identifier-data-always- findable-by-permanent-references PID’S & POETRY: AN ANTHOLOGY: Hylke Koers, SURFsara
  19. 19. PID hosting service @ SURF 25 Nationaal Archief Stadsarchief Delft CINECA DEVENTit Haags gemeente Archief Regionaal Archief Rivierenland Gemeente Nijmegen KNMI (Orfeus) Westfries Archief SURFsara Gemeente Den Bosch Regionaal Archief Tilburg Gemeentearchief Zaanstad Beeld en Geluid PID’S & POETRY: AN ANTHOLOGY: Hylke Koers, SURFsara
  20. 20. https://hdl.handle.net/10934/RM0001.COLLECT.325823 https://hdl.handle.net/10648/868c5ca8-e061-45ec-af1c- 4ea5c68b7205 26 PID hosting service @ SURF PID’S & POETRY: AN ANTHOLOGY: Hylke Koers, SURFsara
  21. 21. To be Findable F1. (meta)data are assigned a globally unique and persistent identifier F2. data are described with rich metadata (defined by R1 below) F3. metadata clearly and explicitly include the identifier of the data it describes F4. (meta)data are registered or indexed in a searchable resource THE FAIR DATA GUIDING PRINCIPLES “The rationale behind this principle is that someone should be able to find data based on the information provided by their metadata, even without the data’s identifier.” https://www.go-fair.org/fair-principles/
  22. 22. https://www.accelerateopenscience.nl/what-is-open-science/
  23. 23. To be Findable F1. (meta)data are assigned a globally unique and persistent identifier F2. data are described with rich metadata (defined by R1 below) F3. metadata clearly and explicitly include the identifier of the data it describes F4. (meta)data are registered or indexed in a searchable resource THE FAIR DATA GUIDING PRINCIPLES
  24. 24. To be Findable F1. (meta)data are assigned a globally unique and persistent identifier F2. data are described with rich metadata (defined by R1 below) F3. metadata clearly and explicitly include the identifier of the data it describes F4. (meta)data are registered or indexed in a searchable resource THE FAIR DATA GUIDING PRINCIPLES Object Object representation and metadata
  25. 25. To be Findable F1. (meta)data are assigned a globally unique and persistent identifier F2. data are described with rich metadata (defined by R1 below) F3. metadata clearly and explicitly include the identifier of the data it describes F4. (meta)data are registered or indexed in a searchable resource THE FAIR DATA GUIDING PRINCIPLES “If the availability of a digital resource such as a dataset, service or repository is not known, then nobody (and no machine) can discover it. There are many ways in which digital resources can be made discoverable, including indexing.” https://www.go-fair.org/fair-principles/
  26. 26. “It is not enough to make collections available through web based end user interfaces and provide download options for individual objects and metadata records. On the contrary: collection data and objects must be findable (..) for people and software in their entirety and in specific parts” FAIR Principles for Library, Archive and Museum Collections: A proposal for standards for reusable collections Lukas Koster, Saskia Woutersen-Windhouwer http://journal.code4lib.org/articles/13427, ISSN 1940-5758
  27. 27. Meet Robo-Researcher Machine-actionable | AI- ready |“The machine knows what I mean” • Has a PID • Wants to: • Get associated metadata • Parse metadata to determine if data is relevant • If yes: Download and process data PID’S & POETRY: AN ANTHOLOGY: Hylke Koers, SURFsara
  28. 28. 4TUDATA for research PID’s on a dataset level, plus metadata; They added certain scripts to made it possible for large search machines ( such as google) to index the content of their repository.
  29. 29. “It is still widespread common practice to use internal dedicated system identifiers which refer to both the metadata record and the object described. This type of internal identifier can be published on the web using a URL based on the particular system or domain, but it is not a global and persistent identifier, because it depends on the specific system or domain.” FAIR Principles for Library, Archive and Museum Collections: A proposal for standards for reusable collections Lukas Koster, Saskia Woutersen-Windhouwer http://journal.code4lib.org/articles/13427, ISSN 1940-5758
  30. 30. Wikidata is a free, collaborative, multilingual, secondary database, collecting structured data to provide support for Wikimedia and anyone in the world. • One of the most edited knowledge bases which contains structured data (RDF4 format); • Serves as the data source for many projects in the Wikimedia sphere and beyond; • Increasingly growing in term of both its community and its content; Wikidata can act as a point of convergence for data from different vocabularies and ontologies; • Lot’s of Wiki data research projects;
  31. 31. Further development of standards & technology and many others… PID’S & POETRY: AN ANTHOLOGY: Hylke Koers, SURFsara
  32. 32. When visiting scholarly portals, readers can easily figure out landing pages, links to bibliographic records, authorship, etc. But, because portals use different conventions to convey such patterns, machines have a hard time finding their way around. Image courtesy of Patrick Hochstenbach signposting.org There is very little interoperability among scholarly portals on the web. Most portals focus on access via the user interface.
  33. 33. As a portal administrator or operator of scholarly infrastructure, you can change that by implementing some of the Signposting patterns listed on this site. Doing so will allow machines to navigate scholarly portals in a uniform manner. Which will lead to applications that make things easier for readers too. signposting.org
  34. 34. To be Findable F1. (meta)data are assigned a globally unique and persistent identifier F2. data are described with rich metadata (defined by R1 below) F3. metadata clearly and explicitly include the identifier of the data it describes F4. (meta)data are registered or indexed in a searchable resource Findable: The first step EASY? THE FAIR DATA GUIDING PRINCIPLES
  35. 35. FAIR DATA ADVANCED USE CASES: FROM PRINCIPLES TO PRACTICE IN THE NETHERLANDS FAIR takes effort, but it is worth it It takes effort to get to a certain level of FAIR-ness, but some use cases show that once you have reached that level of FAIR, a whole world of possibilities opens. http://doi.org/10.5281/zenodo.1250535
  36. 36. F is for FAIR data: @melimming Findable Sharing is Caring, November 2019 Melanie Imming, Imming Impact / SURF Thank you!
  37. 37. F is for FAIR data: @melimming Findable FAIR DATA ADVANCED USE CASES: FROM PRINCIPLES TO PRACTICE IN THE NETHERLANDS http://doi.org/10.5281/zenodo.1250535 https://www.slideshare.net/sjDCC/what-it-means-to-be-fair https://www.go-fair.org/fair-principles/ https://www.go-fair.org/fair-principles/ https://www.pidwijzer.nl/en FAIR Principles for Library, Archive and Museum Collections: A proposal for standards for reusable collections Lukas Koster, Saskia Woutersen-Windhouwer http://journal.code4lib.org/articles/13427, ISSN 1940- 5758 https://www.geheugenvannederland.nl/en https://www.wikidata.org/wiki/Wikidata:WikiProject_sum_of_all_paintings signposting.org Sharing is Caring, November 2019 Melanie Imming, Imming Impact / SURF

×