Integrating event-core and other new and complex data types (data models) in GBIF. Data types inside Darwin Core not yet supported in GBIF. Introduction for starting the discussions at the 10th European GBIF Nodes Meeting in Tallinn Estonia 15th May 2018. See also: http://bit.ly/gbifEu2018_datatypes | http://bit.ly/gbifEu2018_new_datatypes | DOI: 10.13140/RG.2.2.16667.36641
Publishing Germplasm Vocabularies as Linked DataValeria Pesce
What has already been published?
What may still be needed?
How to do it?
This presentation is a part of the 3rd Session of the 1st International e-Conference on Germplasm Data Interoperability https://sites.google.com/site/germplasminteroperability/
This document summarizes Rodrigo Dias Arruda Senra's 2012 doctoral thesis defense at the University of Campinas. The thesis studied how to organize digital information for sharing across heterogeneous systems and proposed three main contributions: 1) SciFrame, a conceptual framework for scientific digital data processing; 2) database descriptors to enable loose coupling between applications and database management systems; and 3) organographs, a method for explicitly organizing information based on tasks.
This document summarizes a presentation about EZID, a service that provides persistent identifiers and supports data citation. It introduces DataCite as an international consortium that develops specifications for data citation. The presentation outlines new features being developed for EZID, including service replicas, URN support, suffix pass-through identifiers, and identifier status indicators. It also discusses the ARK community and governance.
The document discusses semantic web, ontology languages, and case-based reasoning. It provides an overview of semantic web and its motivations. It describes ontology languages like RDF, RDF Schema, OWL, and others. It then discusses how case-based reasoning can utilize semantic web technologies by applying an AI technique to retrieve metadata related to codes for earthquake science.
The document provides an overview of the work done at DERI Galway, including developing technologies like SIOC, ActiveRDF, and BrowseRDF to interconnect online communities and enable semantic applications. It also describes JeromeDL, a digital library system that uses semantic metadata and services to allow users to collaboratively browse and share knowledge.
The document provides an overview of semantic web technologies including:
- The history and core principles of the semantic web including RDF, ontologies, and vocabularies.
- Methods for publishing and consuming semantic data such as Linked Data, RDFa, microdata, and SPARQL querying.
- Some challenges with semantic web technologies including differences between object-oriented programming and description logics, and open world assumptions versus closed world assumptions from relational databases.
- Ways that semantic web technologies can help with interoperability, information retrieval, recommendations, and search engine optimization.
Persistent identifiers for digitized specimens (2013)Dag Endresen
Persistent identifiers (PID) for digitized museum collections. Presented at the European GBIF meeting at Digitarium in Jounsuu, Finland, 6 March 2013. A proposed model for assigning UUID PIDs using QR-codes during the imaging and digitization process.
Publishing Germplasm Vocabularies as Linked DataValeria Pesce
What has already been published?
What may still be needed?
How to do it?
This presentation is a part of the 3rd Session of the 1st International e-Conference on Germplasm Data Interoperability https://sites.google.com/site/germplasminteroperability/
This document summarizes Rodrigo Dias Arruda Senra's 2012 doctoral thesis defense at the University of Campinas. The thesis studied how to organize digital information for sharing across heterogeneous systems and proposed three main contributions: 1) SciFrame, a conceptual framework for scientific digital data processing; 2) database descriptors to enable loose coupling between applications and database management systems; and 3) organographs, a method for explicitly organizing information based on tasks.
This document summarizes a presentation about EZID, a service that provides persistent identifiers and supports data citation. It introduces DataCite as an international consortium that develops specifications for data citation. The presentation outlines new features being developed for EZID, including service replicas, URN support, suffix pass-through identifiers, and identifier status indicators. It also discusses the ARK community and governance.
The document discusses semantic web, ontology languages, and case-based reasoning. It provides an overview of semantic web and its motivations. It describes ontology languages like RDF, RDF Schema, OWL, and others. It then discusses how case-based reasoning can utilize semantic web technologies by applying an AI technique to retrieve metadata related to codes for earthquake science.
The document provides an overview of the work done at DERI Galway, including developing technologies like SIOC, ActiveRDF, and BrowseRDF to interconnect online communities and enable semantic applications. It also describes JeromeDL, a digital library system that uses semantic metadata and services to allow users to collaboratively browse and share knowledge.
The document provides an overview of semantic web technologies including:
- The history and core principles of the semantic web including RDF, ontologies, and vocabularies.
- Methods for publishing and consuming semantic data such as Linked Data, RDFa, microdata, and SPARQL querying.
- Some challenges with semantic web technologies including differences between object-oriented programming and description logics, and open world assumptions versus closed world assumptions from relational databases.
- Ways that semantic web technologies can help with interoperability, information retrieval, recommendations, and search engine optimization.
Persistent identifiers for digitized specimens (2013)Dag Endresen
Persistent identifiers (PID) for digitized museum collections. Presented at the European GBIF meeting at Digitarium in Jounsuu, Finland, 6 March 2013. A proposed model for assigning UUID PIDs using QR-codes during the imaging and digitization process.
GBIF BIFA mentoring, Day 2 Publish data, July 2016Dag Endresen
GBIF BIFA mentoring in Los Banos, Philippines for the South-East Asian ASEAN Biodiversity Heritage Parks. With Dr. Yu-Huang Wang, Dr. Po-Jen Chiang, and Guan-Shuo Mai from TaiBIF the GBIF node of Taiwan (Chinese Tapei); and the Biodiversity Informatics team at ASEAN Centre For Biodiversity. http://www.gbif.no/events/2016/gbif-bifa-mentoring.html
Data Equivalence
Mark Parsons, Lead Project Manager, Senior Associate Scientist, National Snow and Ice Data Center
Data citation, especially using persistent identifiers like Digital Object Identifiers (DOIs), is an increasingly accepted scientific practice. Recently, several, respected organizations have developed guidelines for data citation. The different guidelines are largely congruent in that they agree on the basic practice and elements of data citation, especially for relatively static, whole data collections. There is less agreement on the more subtle nuances of data citation that are sometimes necessary to ensure precise reference and scientific reproducibility--the core purpose of data citation. We need to be sure that if you follow a data reference you get to the precise data that were used or at least their scientific equivalent. Identifiers such as DOIs are necessary but not sufficient for the precise, detailed, references necessary. This talk discusses issues around data set versioning, micro-citation, and scientific equivalence. I propose some interim solutions and suggest research strategies for the future.
Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014Dag Endresen
Implementation of persistent and globally unique identifiers for specimens held in natural history collections worldwide will open up new opportunities for referring to these physical resources in an interlinked digital context such as the Internet. Here, we will describe the approach for persistent identification of collection specimens developed and implemented at the Natural History Museum in Oslo (NHM-UiO) by the the Norwegian participant node to the Global Biodiversity Information Facility (GBIF-Norway). The Norwegian university museums are invited to use our resolver service at "http://purl.org/gbifnorway/id/<uuid>" when publishing biodiversity data to GBIF. All occurrence records published through GBIF-Norway, with appropriate PURL-UUID identifiers mapped to the Darwin Core occurrenceID, will automatically be added to our resolver service and kept updated.
EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)Dag Endresen
Potential relationship and collaboration between EURISCO and GBIF - a distributed network of databases for the ECP/GR D&I network meeting at ZADI Bonn Germany 11th April 2005. Dag Endresen (Nordic Gene Bank). GBIF is a Global Biodiversity Information Facility for free and open access to biodiversity data.
The Role of Metadata in Reproducible Computational ResearchJeremy Leipzig
Reproducible computational research (RCR) provides the keystone to the scientific method, packaging the transformation of raw data to published results in a manner than can be communicated to others. Developing RCR standards has been a growing concern of statisticians, data scientists, and informatics professionals. Metadata provides context and provenance to raw data, and is essential to both discovery and validation RCR. This presentation will give an overview for emerging metadata standards in data, analysis, pipelines tools, and publications.
Access to knowledge is key to combating hunger and poverty, but complex information needs for agricultural development cannot be met by availability alone. Linked data principles and vocabularies can help integrate disconnected repositories and allow users to discover related information across databases. The CIARD RING aims to apply these approaches to provide value-added services that help users find information on technologies, countries, crops, and projects.
Using the Biological Collections Ontology to Advance Biodiversity Sciencerlwalls2008
advancing biodiversity science and addressing the many pressing questions in the field that require access to large-scale data and integration of data from many sources. The Biological Collections Ontology (BCO) provides a semantic framework for integrating and reasoning over biodiversity data from diverse sources, including Darwin Core Archives (DwCA), Minimum Information for any (x) Sequence (MIxS) compliant genomic/metagenomic datasets, and field-based surveys or sampling studies. While the ontology itself is a key component of the biodiversity informatics tool kit, we believe that it must be accompanied by user-friendly data conversion, management, and query tools in order to gain wide adoption and function effectively. Over the past year, several hackathons and workshops have focused on converting DwCA and tabular-formatted data into resource description format (RDF), so that it can be queried using the BCO. The result is a functioning proof-of-concept workflow and proposals to take parts of the workflow into production.
In this presentation, we will report on some of the latest developments in the BCO and related applications, including mappings from Darwin Core (DwC) and MIxS to BCO and tools for converting between tabular/DwCA data and RDF. We will conclude with a brief discussion of how the semantic approach adopted by the BCO project can contribute to research for understanding and sustaining biodiversity.
Doctoral Examination at the Karlsruhe Institute of Technology (08.07.2016)Dr.-Ing. Thomas Hartmann
In this thesis, a validation framework is introduced that enables to consistently execute RDF-based constraint languages on RDF data and to formulate constraints of any type. The framework reduces the representation of constraints to the absolute minimum, is based on formal logics, consists of a small lightweight vocabulary, and ensures consistency regarding validation results and enables constraint transformations for each constraint type across RDF-based constraint languages.
interoperability: the value of recombinant potentiallisld
Interoperability allows for the combining and reuse of resources and data across systems through standards and protocols. It provides economic value by maximizing the use of investments in metadata and content when they can be shared, reused, and recombined. For users, interoperability reduces the technical barriers to accessing and using resources, allowing them to focus on their work.
Data is an increasingly common term used on the assumption that its meaning is commonly understood. This presentation seeks to drill down into the very specifics of what data is all about.
Functional and Architectural Requirements for Metadata: Supporting Discovery...Jian Qin
The tremendous growth in digital data has led to an increase in metadata initiatives for different types of scientific data, as evident in Ball’s survey (2009). Although individual communities have specific needs, there are shared goals that need to be recognized if systems are to effectively support data sharing within and across all domains. This paper considers this need, and explores systems requirements that are essential for metadata supporting the discovery and management of scientific data. The paper begins with an introduction and a review of selected research specific to metadata modeling in the sciences. Next, the paper’s goals are stated, followed by the presentation of valuable systems requirements. The results include a base-model with three chief principles: principle of least effort, infrastructure service, and portability. The principles are intended to support “data user” tasks. Results also include a set of defined user tasks and functions, and applications scenarios.
The document discusses technologies and infrastructure for publishing biodiversity data from environmental impact assessments (EIA). It covers the types and formats of EIA biodiversity data, tools for data capture and digitization, platforms for data discovery and publishing, ensuring data quality, and hosting data centers to facilitate long-term archiving and publishing of EIA biodiversity data.
Using Architectures for Semantic Interoperability to Create Journal Clubs for...James Powell
This document describes a system for creating digital journal clubs for emergency response. The system harvests and semantically maps bibliographic metadata from various sources to expose focused collections. It augments the metadata with information on author relationships, georeferences and concepts. Tools enable exploration of collections through visualizations and maps. Social features allow users to tag, comment and collaborate, stored as semantic triples to enable interoperability. The system aims to provide responders with timely access to vetted information and collaboration tools to help address emergency situations.
Global Biodiversity Information Facility (GBIF) - 2012Dag Endresen
Presentation of the Global Biodiversity Information Facility (GBIF) and GBIF Norway for the Department of Technical and Scientific Conservation (CONSERV) at the Natural History Museum, University of Oslo. Tøyen, Oslo, 7 November 2012.
The document discusses technical issues and opportunities for improving the Global Biodiversity Information Facility's (GBIF) registry and portals for discovering biodiversity resources. It analyzes GBIF's past use of UDDI registry and data portal, and outlines challenges in developing a new graph-based registry model to better represent the network of institutions, collections, and relationships. The new registry aims to improve discoverability through associating automated and human-generated metadata, uniquely identifying resources, and defining services and vocabularies.
The document discusses lessons learned in transforming metadata from XML formats to RDF. It describes how libraries and cultural heritage institutions are working to express existing metadata standards like MODS and PBCore in RDF to take advantage of capabilities like linked data. Challenges include mapping XML schemas to RDF ontologies and ensuring RDF can meet identified use cases. Examples are provided of institutions that have transformed metadata to RDF to share across systems or publish as linked open data.
Harmonization of vocabularies for water dataSimon Cox
The document discusses challenges and approaches to harmonizing water quality vocabularies using semantic web technologies. It outlines issues with formalizing vocabularies using SKOS, RDFS, and OWL; handling collections and versioning of terms; and distributing and discovering vocabularies. While progress has been made, challenges remain regarding conceptualizing terms as classes or individuals, developing URI patterns, and tracking vocabulary versions. Suggested paths forward include continued work on formalization methods and publishing best practices.
The RDF Report Card: Beyond the Triple CountLeigh Dodds
My talk from the Semtech Biz conference in London.
I argued that it is time to move beyond discussing size of datasets and encourage a more nuanced view to understand quality and utility.
The RDF Report Card is offered as one simple, high-level visualization.
Joint GBIF Biodiversa+ symposium in Helsinki on 2024-04-16Dag Endresen
GBIF Norway contributed to a symposium organized jointly by Biodiversa+ and GBIF, to discuss the requirements for national biodiversity monitoring hubs in the context of proposals for a European Biodiversity Observation Coordination Centre.
GBIF BIFA mentoring, Day 2 Publish data, July 2016Dag Endresen
GBIF BIFA mentoring in Los Banos, Philippines for the South-East Asian ASEAN Biodiversity Heritage Parks. With Dr. Yu-Huang Wang, Dr. Po-Jen Chiang, and Guan-Shuo Mai from TaiBIF the GBIF node of Taiwan (Chinese Tapei); and the Biodiversity Informatics team at ASEAN Centre For Biodiversity. http://www.gbif.no/events/2016/gbif-bifa-mentoring.html
Data Equivalence
Mark Parsons, Lead Project Manager, Senior Associate Scientist, National Snow and Ice Data Center
Data citation, especially using persistent identifiers like Digital Object Identifiers (DOIs), is an increasingly accepted scientific practice. Recently, several, respected organizations have developed guidelines for data citation. The different guidelines are largely congruent in that they agree on the basic practice and elements of data citation, especially for relatively static, whole data collections. There is less agreement on the more subtle nuances of data citation that are sometimes necessary to ensure precise reference and scientific reproducibility--the core purpose of data citation. We need to be sure that if you follow a data reference you get to the precise data that were used or at least their scientific equivalent. Identifiers such as DOIs are necessary but not sufficient for the precise, detailed, references necessary. This talk discusses issues around data set versioning, micro-citation, and scientific equivalence. I propose some interim solutions and suggest research strategies for the future.
Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014Dag Endresen
Implementation of persistent and globally unique identifiers for specimens held in natural history collections worldwide will open up new opportunities for referring to these physical resources in an interlinked digital context such as the Internet. Here, we will describe the approach for persistent identification of collection specimens developed and implemented at the Natural History Museum in Oslo (NHM-UiO) by the the Norwegian participant node to the Global Biodiversity Information Facility (GBIF-Norway). The Norwegian university museums are invited to use our resolver service at "http://purl.org/gbifnorway/id/<uuid>" when publishing biodiversity data to GBIF. All occurrence records published through GBIF-Norway, with appropriate PURL-UUID identifiers mapped to the Darwin Core occurrenceID, will automatically be added to our resolver service and kept updated.
EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)Dag Endresen
Potential relationship and collaboration between EURISCO and GBIF - a distributed network of databases for the ECP/GR D&I network meeting at ZADI Bonn Germany 11th April 2005. Dag Endresen (Nordic Gene Bank). GBIF is a Global Biodiversity Information Facility for free and open access to biodiversity data.
The Role of Metadata in Reproducible Computational ResearchJeremy Leipzig
Reproducible computational research (RCR) provides the keystone to the scientific method, packaging the transformation of raw data to published results in a manner than can be communicated to others. Developing RCR standards has been a growing concern of statisticians, data scientists, and informatics professionals. Metadata provides context and provenance to raw data, and is essential to both discovery and validation RCR. This presentation will give an overview for emerging metadata standards in data, analysis, pipelines tools, and publications.
Access to knowledge is key to combating hunger and poverty, but complex information needs for agricultural development cannot be met by availability alone. Linked data principles and vocabularies can help integrate disconnected repositories and allow users to discover related information across databases. The CIARD RING aims to apply these approaches to provide value-added services that help users find information on technologies, countries, crops, and projects.
Using the Biological Collections Ontology to Advance Biodiversity Sciencerlwalls2008
advancing biodiversity science and addressing the many pressing questions in the field that require access to large-scale data and integration of data from many sources. The Biological Collections Ontology (BCO) provides a semantic framework for integrating and reasoning over biodiversity data from diverse sources, including Darwin Core Archives (DwCA), Minimum Information for any (x) Sequence (MIxS) compliant genomic/metagenomic datasets, and field-based surveys or sampling studies. While the ontology itself is a key component of the biodiversity informatics tool kit, we believe that it must be accompanied by user-friendly data conversion, management, and query tools in order to gain wide adoption and function effectively. Over the past year, several hackathons and workshops have focused on converting DwCA and tabular-formatted data into resource description format (RDF), so that it can be queried using the BCO. The result is a functioning proof-of-concept workflow and proposals to take parts of the workflow into production.
In this presentation, we will report on some of the latest developments in the BCO and related applications, including mappings from Darwin Core (DwC) and MIxS to BCO and tools for converting between tabular/DwCA data and RDF. We will conclude with a brief discussion of how the semantic approach adopted by the BCO project can contribute to research for understanding and sustaining biodiversity.
Doctoral Examination at the Karlsruhe Institute of Technology (08.07.2016)Dr.-Ing. Thomas Hartmann
In this thesis, a validation framework is introduced that enables to consistently execute RDF-based constraint languages on RDF data and to formulate constraints of any type. The framework reduces the representation of constraints to the absolute minimum, is based on formal logics, consists of a small lightweight vocabulary, and ensures consistency regarding validation results and enables constraint transformations for each constraint type across RDF-based constraint languages.
interoperability: the value of recombinant potentiallisld
Interoperability allows for the combining and reuse of resources and data across systems through standards and protocols. It provides economic value by maximizing the use of investments in metadata and content when they can be shared, reused, and recombined. For users, interoperability reduces the technical barriers to accessing and using resources, allowing them to focus on their work.
Data is an increasingly common term used on the assumption that its meaning is commonly understood. This presentation seeks to drill down into the very specifics of what data is all about.
Functional and Architectural Requirements for Metadata: Supporting Discovery...Jian Qin
The tremendous growth in digital data has led to an increase in metadata initiatives for different types of scientific data, as evident in Ball’s survey (2009). Although individual communities have specific needs, there are shared goals that need to be recognized if systems are to effectively support data sharing within and across all domains. This paper considers this need, and explores systems requirements that are essential for metadata supporting the discovery and management of scientific data. The paper begins with an introduction and a review of selected research specific to metadata modeling in the sciences. Next, the paper’s goals are stated, followed by the presentation of valuable systems requirements. The results include a base-model with three chief principles: principle of least effort, infrastructure service, and portability. The principles are intended to support “data user” tasks. Results also include a set of defined user tasks and functions, and applications scenarios.
The document discusses technologies and infrastructure for publishing biodiversity data from environmental impact assessments (EIA). It covers the types and formats of EIA biodiversity data, tools for data capture and digitization, platforms for data discovery and publishing, ensuring data quality, and hosting data centers to facilitate long-term archiving and publishing of EIA biodiversity data.
Using Architectures for Semantic Interoperability to Create Journal Clubs for...James Powell
This document describes a system for creating digital journal clubs for emergency response. The system harvests and semantically maps bibliographic metadata from various sources to expose focused collections. It augments the metadata with information on author relationships, georeferences and concepts. Tools enable exploration of collections through visualizations and maps. Social features allow users to tag, comment and collaborate, stored as semantic triples to enable interoperability. The system aims to provide responders with timely access to vetted information and collaboration tools to help address emergency situations.
Global Biodiversity Information Facility (GBIF) - 2012Dag Endresen
Presentation of the Global Biodiversity Information Facility (GBIF) and GBIF Norway for the Department of Technical and Scientific Conservation (CONSERV) at the Natural History Museum, University of Oslo. Tøyen, Oslo, 7 November 2012.
The document discusses technical issues and opportunities for improving the Global Biodiversity Information Facility's (GBIF) registry and portals for discovering biodiversity resources. It analyzes GBIF's past use of UDDI registry and data portal, and outlines challenges in developing a new graph-based registry model to better represent the network of institutions, collections, and relationships. The new registry aims to improve discoverability through associating automated and human-generated metadata, uniquely identifying resources, and defining services and vocabularies.
The document discusses lessons learned in transforming metadata from XML formats to RDF. It describes how libraries and cultural heritage institutions are working to express existing metadata standards like MODS and PBCore in RDF to take advantage of capabilities like linked data. Challenges include mapping XML schemas to RDF ontologies and ensuring RDF can meet identified use cases. Examples are provided of institutions that have transformed metadata to RDF to share across systems or publish as linked open data.
Harmonization of vocabularies for water dataSimon Cox
The document discusses challenges and approaches to harmonizing water quality vocabularies using semantic web technologies. It outlines issues with formalizing vocabularies using SKOS, RDFS, and OWL; handling collections and versioning of terms; and distributing and discovering vocabularies. While progress has been made, challenges remain regarding conceptualizing terms as classes or individuals, developing URI patterns, and tracking vocabulary versions. Suggested paths forward include continued work on formalization methods and publishing best practices.
The RDF Report Card: Beyond the Triple CountLeigh Dodds
My talk from the Semtech Biz conference in London.
I argued that it is time to move beyond discussing size of datasets and encourage a more nuanced view to understand quality and utility.
The RDF Report Card is offered as one simple, high-level visualization.
Similar to Event core and new datatypes in GBIF - 10th European GBIF Nodes Meeting in Tallinn Estonia May 2018 (20)
Joint GBIF Biodiversa+ symposium in Helsinki on 2024-04-16Dag Endresen
GBIF Norway contributed to a symposium organized jointly by Biodiversa+ and GBIF, to discuss the requirements for national biodiversity monitoring hubs in the context of proposals for a European Biodiversity Observation Coordination Centre.
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...Dag Endresen
TDWG 2023 Hobart, 2023-10-10.
Sabine von Mering, Paul Jean-Charles Braun, Robert W. N. Cubey, Quentin Groom, Elspeth M Haston, Annika Hendriksen, Rukaya Johaadien, Siobhan Leachman, Luke Marsden, Heimo Rainer, Joaquim Santos, Dag Endresen. https://doi.org/10.3897/biss.7.111427
See also https://www.wikidata.org/wiki/Wikidata:WikiProject_Research_expeditions
Ontologies for biodiversity informatics, UiO DSC June 2023Dag Endresen
GBIF Norway was invited to the UiO Digital Scholar Centre Data (DSC) Managers Network meeting on 2023-06-08 to present how we use biodiversity ontologies. https://www.gbif.no/news/2023/biodiversity-ontologies.html
The UiO Natural History Museum (GBIF Norway) presented the evacuation of the Kherson herbarium in Ukraine at the 2023 annual conference for the Norwegian Association of Archives. Plenary 2023-06-01.
More information at: https://www.gbif.no/news/2023/privatarkivkonferansen.html
Video at: https://www.gbif.no/news/2023/video/2023-06-kherson-herbarium.mp4
Scientific Advisory Committee (SAC) meeting May 2023 for the Global Information System (GLIS) of the Plant Treaty (ITPGRFA) of the United Nations Food and Agriculture Organization (FAO)
BioDT for the UiO Science section meeting 2023-03-24Dag Endresen
Presentation of the Biodiversity Digital Twin (BioDT) project for the University of Oslo (UiO) Natural History Museum (NHMO) Science department on 2023-03-24.
BioDATA final conference in Oslo, November 2022Dag Endresen
BioDATA – Biodiversity data management skills for students (2018-2022). BioDATA is an international project on developing skills in biodiversity data management and data publishing for undergraduate and postgraduate students from Armenia, Belarus, Norway, Tajikistan, and Ukraine. The project is coordinated by the University of Oslo (Norway) and supported by the Global Biodiversity Information Facility (GBIF). The project is funded by the Directorate for Higher Education and Skills (HKDir). The final closing symposium for all partner universities was organized at the University of Oslo Natural History Museum in Oslo from 11th to 12th November 2022.
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20Dag Endresen
Nansen Legacy (Arven etter Nansen, AeN) - Marine data publishing workshop. 3-day workshop to publish marine biodiversity data from the AeN project as Darwin Core Archives on September 20-22, 2022. With support from the Norwegian Global Biodiversity Information Facility (GBIF) node, and the Ocean Biodiversity Information System (OBIS, EurOBIS). https://www.gbif.no/events/2022/nansen-legacy-tromso.html
GBIF at Living Norway Open Science Lab 2022-03-03Dag Endresen
Presentation of GBIF at the Living Norway Open Science Lab on 2022-03-03. See program at
https://livingnorway.no/join-the-living-norway-ecological-data-network-through-our-open-science-lab/
https://livingnorway.no/2022/02/10/join-our-open-science-lab/
https://www.gbif.no/events/2022/open-science-lab-1.html
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...Dag Endresen
This document discusses digitalization efforts and open biodiversity data infrastructure. It provides an overview of GBIF (Global Biodiversity Information Facility), including its goals of providing open access to biodiversity data worldwide. It notes that over 1.9 billion species occurrence records have been published through GBIF from over 1,700 data publishers. The document encourages museums to engage in open science and digitalization to remain relevant and take advantage of new opportunities and funding. It discusses using identifiers like DOIs to cite biodiversity data and link it to publications and people.
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19Dag Endresen
[Råd 1] Norske forskningsdata bør publiseres i henhold til internasjonale data-standarder. Internasjonale data-standarder sikrer interoperabilitet og reelle muligheter for gjenbruk av data. Etablerte data-standarder innenfor et fagområde gir ofte best effekt for realisert gjenbruk, men kan hindre gjenbruk av data i nye og uforutsette tverrfaglige studier og sammenhenger. Norge bør derfor også bidra til tverrfaglig videreutvikling av interoperabilitet på tvers av data-standarder som er i anvendelse innenfor de enkelte fagområder.
[Råd 2] Måloppnåelse for økt deling av forskningsdata blir enklere med effektive insentiver. Vi tror at etablering av forskningsdata som siterbart vitenskapelig produkt slik som DORA (sfdora.org, 2012) og Force11 (force11.org, 2011) beskriver gir viktige retningslinjer som datainfrastrukturutvalget bør forsøke å integrere i nye Norske retningslinjer.
[Råd 3] Metrikk for å måle gjennomslag og innflytelse (impact) av forskning ("tellekanter") bør utvides til å inkludere metrikk for anerkjennelse av datakilde (data-publikasjon, data-sitering) for både forsker og institusjon. Publisering av forskningsdata bør fortrinnsvis utføres gjennom en profesjonell infrastruktur (slik som GBIF) der opphavsmann og de ulike bidragsytere til produksjon, innsamling, tilretteleggelse, håndtering, og bevaring av data kan registreres. Dataset bør tilordnes stabil digital identitet, gjennom løsninger slik som DOI (digital object identifier). Personer bør knyttes til stabil digital identitet gjennom løsninger slik som ORCID (Open Researcher and Contributor ID, orcid.org). Institusjoner bør knyttes til stabil digital identitet gjennom system løsninger slik som ROR (Research Organization Registry, ror.org).
[Råd 4] Etablering av infrastruktur for forskningsdata tar tid og behøver derfor kontinuitet og forutsigbare rammer, mandat, og langsiktig strategisk investering. Effektiv langsiktig investering i felles internasjonale løsninger krever ofte bedre kontinuitet enn det som er mulig innenfor handlingsrommet for basisfinansiering for enkelte forskningsinstitusjoner og universiteter. Samtidig som felles multi-nasjonal investering i fellesløsninger ofte har en betydelig lavere kostnad enn en alternativ mere fragmentert infrastruktur.
GBIF Norge (GBIF.no) er den norske deltagernoden i Global Biodiversity Information Facility (GBIF.org). GBIF er en internasjonal organisasjon som arbeider for fri og åpen tilgang til globalt dekkende informasjon om biologisk mangfold. GBIF ble etablert i 2001 etter en beslutning i OECDs Science Forum i 1999. Norge ble medlem av GBIF i 2004 og den norske deltagernoden, GBIF Norge, ble etablert med sekretariat ved Universitet i Oslo Naturhistorisk Museum i nært samarbeid med Artsdatabanken og med finansiering fra Forskningsrådet. GBIF Norges mandat omfatter nasjonal deltagelse i GBIF med internasjonal publisering av norske artsdata i henhold til internasjonale data-standarder som er forvaltet av GBIF.
The role of biodiversity informatics in GBIF, 2021-05-18Dag Endresen
The document discusses the role of biodiversity informatics and the Global Biodiversity Information Facility (GBIF) in making biodiversity data available through open access. GBIF provides free and open access to over 1.6 billion species occurrence records from over 1600 data publishers. The document highlights how digitizing natural history collections and integrating diverse biodiversity data sources can support research and policy goals. It emphasizes best practices like using common data standards, publishing datasets on GBIF to make them widely discoverable and reusable, and citing data with DOIs to incentivize open data sharing.
GBIF and Biodiversity informatics for museums, 15 March 2021Dag Endresen
This document discusses open data and open science practices in natural history museums. It summarizes that very few museum specimens have been digitized, with GBIF publishing around 1.6 billion records including 200 million specimens. This represents only about 10% of the estimated 1.2-3 billion total specimens. The document promotes open data practices and FAIR data principles. It outlines the role of GBIF in providing infrastructure for open data publishing and how this can enable new research opportunities while supporting policy goals. Museums are encouraged to adopt open science approaches to remain relevant in an era of open data and big biodiversity data.
Lecture for a course at NTNU, 27th January 2021
CC-BY 4.0 Dag Endresen https://orcid.org/0000-0002-2352-5497
See also http://bit.ly/biodiversityinformatics
https://www.gbif.no/events/2021/lecture-ntnu-gbif.html
GBIF is a global biodiversity data infrastructure that provides open access to over 1.6 billion species occurrence records. It connects over 1,600 data publishers through a voluntary network of participants and aims to facilitate research and policy related to biodiversity and sustainable development. Data shared through GBIF is cited with digital object identifiers to give credit to data publishers and encourage further data sharing. The presentation reviewed GBIF's role in open science and data citation principles, provided statistics on global and Norwegian contributions to the network, and explained how to publish and cite biodiversity data through GBIF.
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...Dag Endresen
BioDATA Biodiversity Data for Internationalization in Higher Education is funded by the Norwegian Agency for International Cooperation and Quality Enhancement in Higher Education (DIKU) -- and is based on reusing training materials from the GBIF Biodiversity Information for Development (BID) program funded by the European Commission.
Introduction to AI for Nonprofits with Tapp NetworkTechSoup
Dive into the world of AI! Experts Jon Hill and Tareq Monaur will guide you through AI's role in enhancing nonprofit websites and basic marketing strategies, making it easy to understand and apply.
This presentation includes basic of PCOS their pathology and treatment and also Ayurveda correlation of PCOS and Ayurvedic line of treatment mentioned in classics.
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...PECB
Denis is a dynamic and results-driven Chief Information Officer (CIO) with a distinguished career spanning information systems analysis and technical project management. With a proven track record of spearheading the design and delivery of cutting-edge Information Management solutions, he has consistently elevated business operations, streamlined reporting functions, and maximized process efficiency.
Certified as an ISO/IEC 27001: Information Security Management Systems (ISMS) Lead Implementer, Data Protection Officer, and Cyber Risks Analyst, Denis brings a heightened focus on data security, privacy, and cyber resilience to every endeavor.
His expertise extends across a diverse spectrum of reporting, database, and web development applications, underpinned by an exceptional grasp of data storage and virtualization technologies. His proficiency in application testing, database administration, and data cleansing ensures seamless execution of complex projects.
What sets Denis apart is his comprehensive understanding of Business and Systems Analysis technologies, honed through involvement in all phases of the Software Development Lifecycle (SDLC). From meticulous requirements gathering to precise analysis, innovative design, rigorous development, thorough testing, and successful implementation, he has consistently delivered exceptional results.
Throughout his career, he has taken on multifaceted roles, from leading technical project management teams to owning solutions that drive operational excellence. His conscientious and proactive approach is unwavering, whether he is working independently or collaboratively within a team. His ability to connect with colleagues on a personal level underscores his commitment to fostering a harmonious and productive workplace environment.
Date: May 29, 2024
Tags: Information Security, ISO/IEC 27001, ISO/IEC 42001, Artificial Intelligence, GDPR
-------------------------------------------------------------------------------
Find out more about ISO training and certification services
Training: ISO/IEC 27001 Information Security Management System - EN | PECB
ISO/IEC 42001 Artificial Intelligence Management System - EN | PECB
General Data Protection Regulation (GDPR) - Training Courses - EN | PECB
Webinars: https://pecb.com/webinars
Article: https://pecb.com/article
-------------------------------------------------------------------------------
For more information about PECB:
Website: https://pecb.com/
LinkedIn: https://www.linkedin.com/company/pecb/
Facebook: https://www.facebook.com/PECBInternational/
Slideshare: http://www.slideshare.net/PECBCERTIFICATION
This slide is special for master students (MIBS & MIFB) in UUM. Also useful for readers who are interested in the topic of contemporary Islamic banking.
বাংলাদেশের অর্থনৈতিক সমীক্ষা ২০২৪ [Bangladesh Economic Review 2024 Bangla.pdf] কম্পিউটার , ট্যাব ও স্মার্ট ফোন ভার্সন সহ সম্পূর্ণ বাংলা ই-বুক বা pdf বই " সুচিপত্র ...বুকমার্ক মেনু 🔖 ও হাইপার লিংক মেনু 📝👆 যুক্ত ..
আমাদের সবার জন্য খুব খুব গুরুত্বপূর্ণ একটি বই ..বিসিএস, ব্যাংক, ইউনিভার্সিটি ভর্তি ও যে কোন প্রতিযোগিতা মূলক পরীক্ষার জন্য এর খুব ইম্পরট্যান্ট একটি বিষয় ...তাছাড়া বাংলাদেশের সাম্প্রতিক যে কোন ডাটা বা তথ্য এই বইতে পাবেন ...
তাই একজন নাগরিক হিসাবে এই তথ্য গুলো আপনার জানা প্রয়োজন ...।
বিসিএস ও ব্যাংক এর লিখিত পরীক্ষা ...+এছাড়া মাধ্যমিক ও উচ্চমাধ্যমিকের স্টুডেন্টদের জন্য অনেক কাজে আসবে ...
A workshop hosted by the South African Journal of Science aimed at postgraduate students and early career researchers with little or no experience in writing and publishing journal articles.
How to Add Chatter in the odoo 17 ERP ModuleCeline George
In Odoo, the chatter is like a chat tool that helps you work together on records. You can leave notes and track things, making it easier to talk with your team and partners. Inside chatter, all communication history, activity, and changes will be displayed.
Event core and new datatypes in GBIF - 10th European GBIF Nodes Meeting in Tallinn Estonia May 2018
1. Event core and new data types in GBIF
European GBIF Nodes Meeting in Tallinn 14-16 May 2018
Slides, CC-BY Dag Endresen, GBIF.no
Slides available at: https://bit.ly/gbifEu2018_new_datatypes
2. New data types
• Multimedia (Audubon Core)
• Collections (NCD)
• Taxon concepts, taxon traits (Plinian Core)
• Molecular genotype data (GGBN)
• Experiment or fact, organism traits
• …
• New data types from Darwin Core not yet
supported in GBIF?
5. Simple Darwin Core
«The Simple Darwin Core is simple in that it
assumes (and allows) no structure beyond the
concept of rows and columns, which might be
thought of as attributes and their values, or fields
and records.»
In other words: de-normalized.
8. dwciri: namespace
The dwciri: namespace is intended for
use exclusively with non-literal objects.
http://rs.tdwg.org/dwc/iri/
Baskauf S, Wieczorek J, Deck J, Webb C, Morris PJ, and Schildhauer M (2015)
Darwin Core RDF Guide, Biodiversity Information Standards
http://rs.tdwg.org/dwc/terms/guides/rdf/
Baskauf SJ, Wieczorek J, Deck J, and Webb CO (2016) Lessons learned from
adapting the Darwin Core vocabulary standard for use in RDF. Semantic Web
7(6): 617-627, DOI: 10.3233/SW-150199
9. dwciri: and dwc: terms
“If a term in the• dwciri:
namespace has an analogue in
the dwc: namespace having the
same local name, the dwciri:
term will have the same
meaning as its dwc:
counterpart, but is intended to
have an IRI-reference object.”
11. Darwin Core RDF Guide
«A number of Darwin Core properties specify that
their values should be an identifier. There is
significant ambiguity in the use of these properties
because depending on the situation the value may
be an identifier for the subject resource itself or it
may be the identifier for a resource that is related
in some way to the subject resource.»
«Darwin Core contains a number of "ID" terms
intended to designate identifiers,
e.g., dwc:occurrenceID, dwc:identificationID,
dwc:locationID, etc. The "ID" terms provide two
functions, specifying the class of the resource and
indicating that value of the term is an identifier.»
http://rs.tdwg.org/dwc/terms/guides/rdf/index.htm
12. Unintended consequences of ID terms
• In its normative definition, each Darwin
Core ID term is declared to be
rdfs:subPropertyOf dcterms:identifier.
• In RDF, the purpose of a subproperty
declaration is to allow a client with
reasoning capability to infer a triple
containing a broader (and presumably
more well-known) property.
• (Applicable to RDF-serialization, less so to
text-or XML-serialization).
13. DwC RDF Guide ID term example 23
Example 23 : Turtle (incorrect usage)
<http://guid.mvz.org/identifications/23459>
a dwc:Identification; dwc:identifiedBy "Richard Sage";
dwc:dateIdentified "2000"^^xsd:gYear;
dwc:taxonID
"http://lsid.tdwg.org/urn:lsid:catalogueoflife.org:taxon:d79c11aa-
29c1-102b-9a4a-00304854f820:col20120721".
possible incorrect inference:
dcterms:identifier
"http://lsid.tdwg.org/urn:lsid:catalogueoflife.org:taxon:d79c11aa-
29c1-102b-9a4a-00304854f820:col20120721".
15. Event core and measurements
• Event core (since 2015) opens up new
functionality for data published in GBIF!
• Limitations of the star-schema is a frequently
mentioned obstacle – of event core…
– E.g. when connecting measurement or facts (MoF)
only core entities can be described.
16. DwC archive – application schema
DwC-A Event (since 2015)
– http://rs.gbif.org/core/dwc_event_2016_06_21.xml
DwC-A Occurrence
– http://rs.gbif.org/core/dwc_occurrence_2015-07-02.xml
DwC-A Measurement or fact (want link to event and occ!)
– http://rs.gbif.org/extension/dwc/measurements_or_facts.xml
OBIS Extended measurement or fact (proposed 2017)
– http://rs.gbif.org/extension/obis/extended_measurement_or_fact.xml
DwC-A Resource relationship (possible solution?)
– http://rs.gbif.org/extension/dwc/resource_relation.xml
18. Use case : Measurements for both events
and occurrences in the same dataset
OBIS expeditions are sampling more data than
only organism occurrences (e.g. environment).
--> development of a new extended measurement
and fact extension (eMoF for DwC-archives)
19. Integrated Publishing Toolkit
Star Schema (DwC-archive)
Occurrence core
MoF Extension
1
Event core
Occurrence
Extension
MoF Extension
1 1
MoF = Measurement or Fact
Here: MoF describes
only the Occurrences
Sample event is the DwC core
Here: MoF describes
only the Event
Slide by:
OBIS-ENV-DATA
International Oceanographic
Data and Information Exchange
21. eMoF propose minting three new terms
measurementType: The nature of the measurement or fact (MoF)
measurementTypeID (new): identifier for the measurementType (global unique identifier, URI)
http://rs.iobis.org/obis/terms/measurementTypeID
measurementValue: the value of the MoF
measurementValueID (new): identifier for facts reported as measurementValue (URI)
http://rs.iobis.org/obis/terms/measurementValueID
measurementUnit: measurement unit (SI unit; meter, gram, etc)
measurementUnitID (new): identifier for the measurementUnit (URI)
http://rs.iobis.org/obis/terms/measurementUnitID
Let's explore the Darwin Core RDF Guide first…
23. Darwin Core RDF guide and the dwciri: namespace term
already have alternatives for some of the new terms
minted by the OBIS eMoF.
emof = http://rs.iobis.org/obis/terms/
dwciri = http://rs.tdwg.org/dwc/iri/
emof:measurementTypeID = dwciri:measurementType
emof:measurementUnitID = dwciri:measurementUnit
emof:measurementValueID (no analogue in dwciri - useful addition)
see also:
dwciri:measurementDeterminedBy
dwciri:measurementMethod
dwciri:samplingProtocol
DwC RDF Guide: http://rs.tdwg.org/dwc/terms/guides/rdf/index.htm#2.9_MeasurementOrFact_instances
24. Linking MoF instances to resources
• "Darwin Core does not provide the object properties
that would be required to describe precisely how the
dwc:MeasurementOrFact instance was related to the
dwc:Occurrence instance, the dwc:Organism whose
occurrence was recorded, or the
dwc:PreservedSpecimen that was collected".
• Examples of the DwC RDF Guide use
dcterms:relation to link MoF instances to the
respective resources.
Baskauf et al. 2015, Darwin Core RDF Guide, chapter 2.9
25. DwC RDF Guide MoF example 31
Example 31 - Turtle
<http://mczbase.mcz.harvard.edu/guid/MCZ:Mamm:67231>
a dwc:Occurrence;
dcterms:relation [a dwc:MeasurementOrFact ;
dwc:measurementType "tail length" ;
dwciri:measurementType <http://purl.obolibrary.org/obo/VT_0002758> ;
dwc:measurementValue "25"^^xsd:int ; dwc:measurementUnit "mm" ;
dwciri:measurementUnit
<http://mimi.case.edu/ontologies/2009/1/UnitsOntology#millimeter> ;
dwc:measurementAccuracy "0.5"^^xsd:decimal ;
dwc:measurementDeterminedDate "2009-08-22"^^xsd:date ;
dwc:measurementDeterminedBy "Ryan B Stephens" ;
dwciri:measurementDeterminedBy
<http://scholar.google.com/citations?user=RAsUdjoAAAAJ> ;
dwc:measurementMethod "unspecified" ;
dwciri:measurementMethod <http://purl.obolibrary.org/obo/MMO_0000160> ;
dwc:measurementRemarks "Accuracy from significant digits."@en].
27. DwC Resource Relationship
Term Name: ResourceRelationship
Identifier: http://rs.tdwg.org/dwc/terms/ResourceRelationship
Definition: A relationship of one rdfs:Resource
(http://www.w3.org/2000/01/rdf-schema#Resource) to another.
Comment: Resources can be thought of as identifiable records or
instances of classes and may include, but need not be limited to
Occurrences, Organisms, MaterialSamples, Events, Locations,
GeologicalContexts, Identifications, or Taxa. Example: An instance of an
Organism is the mother of another instance of an Organism. For
discussion see http://terms.tdwg.org/wiki/dwc:ResourceRelationship
ResourceRelationship | resourceRelationshipID | resourceID |
relatedResourceID | relationshipOfResource | relationshipAccordingTo |
relationshipEstablishedDate | relationshipRemarks
30. Use case for Resource Rela-onship
CORE entity is dwc:Event, measurement is on a
dwc:Occurrence organized in a DwCA extension
• resourceRelationshipID = "relationship1"
• resourceID = "organism1"^^dwc:Occurrence
• relatedResourceID = "measurement1"
• relationshipOfResource =
"measurement_of_occurrence"
https://github.com/gbif/rs.gbif.org/pull/13github.com/gbif/rs.gbif.org/pull/13
31. Use case for Resource Relationship
CORE entity is dwc:Occurrence, but the measurement
is on one of the entities flattened into the simple
Darwin Core data record (Organism, Event, Location,
Multimedia, MaterialSample, ...)
• resourceRelationshipID = "relationship1"
• resourceID = "organism1"^^dwc:Organism |
"event1"^^dwc:Event | …
• relatedResourceID = "measurement1"
• relationshipOfResource =
"measurement_of_organism/event ..."
32. Use case for Resource Relationship
CORE en5ty is dwc:Event (measurement event, not the
collec5ng event), measurement is on a dwc:Occurrence
published in another dataset
resourceRela5onshipID• = "rela5onship1" |
hGp:urn:uuid:UUID^^resourceRela5onship
resourceID• = "hGp:urn:uuid:UUID"^^dwc:Occurrence
relatedResourceID• =
hGp:urn:uuid:UUID^^dwc:MeasurementOrFact
rela5onshipOfResource• =
"measurement_of_occurrence_in_another_dataset"
33. Proposing a new use case:
Measurement events as dwc:Event
Experiments with living specimens
at a location and date that is
different from the collecting event.
34. Collec&ng expedi&ons Genebank, collections Field/lab experiments
Seeds from
collecting trip
expeditions are
deposited in
genebank
collections. Living seeds from
genebank
collections are
used for field or
laboratory
experiments.
35. Germplasm material flow
Material samples from many different collec5ng•
missions are deposited in different genebanks.
Material from public genebanks are included in•
other genebank collec5ons (and breeders
collec5ons).
Material from different genebanks (and thus•
many different collec5ng events) are included in
each field trial experiment.
Collec5ng Event• ≠ Measurement Event (MoF)