FOSS4G Belgium 2018 - DCAT-AP Schema-plugin for GeoNetwork

Goedertier Stijn
Goedertier StijnManager at GIM
www.vlaanderen.be/informatievlaanderen
Geraldine Nolf, Dirk De Baere, Mathias De Schrijver, Bart Cosyn
Gustaaf Vandeboel, Mathieu Van Wambeke, Mathieu Chaussier,
An Heirman, Stijn Goedertier
GeoNetwork
DCAT-AP schema-plug-in
FOSS4G Belgium
25 Oktober2018
Information
?
2GeoNetwork-DCAT-APschema-plug-in-25 October2018
AS IS: METADATA STANDARDS, SYSTEMS, PORTALS
GEO
OPEN
LEVEL: Flanders Belgian European
Open Geo
Standards:
Standards:
Systems:
Systems:
Portals:
Portals:
3
Overview study
GeoNetwork-DCAT-APschema-plug-in-25 October2018 4
Geographic
data & services
ISO/INSPIRE
GeoNetwork
Geopunt
Data/Services
Open data
DCAT
CKAN/TDT
CKAN
Data
Documents
ISAD(G)-AP
EAD
PREMIS
(to be)
(to be)
Documents
…
…
…
…
…
Metadata Standard
Scope Information Catalogue
Metadata Management
System
Publication via
Portal / Catalogue
Data / Information
Study
> Scenario 0: Optimize & automate AS IS
> Scenario 1: Open via procedures Geo domain
> Scenario 2: Geo via procedures Open data domain
> Scenario 3: Mixed Scenario
Study: Scenarios
GeoNetwork-DCAT-APschema-plug-in-25 October2018 5
Resultson thestudy between Geo & Open data domains
GeoNetwork-DCAT-APschema-plug-in-25 October2018 6
“Informatiecatalogus”
GeoNetwork -DCAT-APschema-plug-in-2018 October3rd 7
7
Geografic
data &
services
ISO/INSPIRE
GeoNetwork
Geopunt
Data/Services
Open
data
DCAT
CKAN/TDT
GeoNetwork
CKAN
Data
Documents
EAD
(to be – GeoNetwork
?)
(to be)
Documents
Personal &
Company
data
(to be – DCAT ?)
(to be – GeoNetwork
?)
MAGDA-online
Data/Services
Statistic
data &
services
SDMX / StatDCAT ?
(to be –
GenoNetwork ?)
(to be)
Statistics/Services
API’s
(to be - DCAT ?)
(to be - Apigee ?)
…
Services
API’s
Standards
…
…
…
Standards
Codelists
…
…
…
Codelists
Images
…
…
…
Images
Infographics
Scope “Informatiecatalogus”
Metadata Standard
Scope1
Scope Information Catalogue
Metadata Management
System
Publication via
Portal / Catalogue
Data / Information
GeoNetwork -DCAT-APschema-plug-in-25 October2018 8
 W3C Recommendation since 16 January 2014.
 https://www.w3.org/TR/vocab-dcat/
 Makes useof terms defined in foundational vocabularies such as Dublin Core, FOAF,
etc. butalso defines its ownterms in a namespace: http://www.w3.org/ns/dcat#
DCAT: a W3C recommendation
GeoNetwork-DCAT-APschema-plug-in-25 October2018 9
DCAT: a W3C recommendation
10GeoNetwork-DCAT-APschema-plug-in-25 October2018
DCAT-AP v1.1: an application profile of DCAT for data
portals in Europe
 Conforms with DCAT
 https://joinup.ec.europa.eu/release/dcat-ap-v11
 Extends DCAT with additionalterms and properties
 Imposes mandatory, recommended, and optional classes and properties
 Imposes controlled vocabularies
GeoNetwork-DCAT-APschema-plug-in-25 October2018 11
> Via schema plugins GeoNetwork can be extended to
support other metadata standards
> Examples: ISO19139, ISO19115-3, ISO19110, SensorML,
… https://github.com/metadata101
> Plugins are an additional directory with:
 XML Schema
 XML configuration (forms)
 XSLT: forms, post-processing, index-fields, conversion,
etc
 schematron validation rules (optional)
 Java code: e.g. harvester (optional)
 Metadata template / sample-data (optional)
 …
GeoNetwork: schemaplugin architecture
GeoNetwork-DCAT-APschema-plug-in-25 October2018 13
> Based on DCAT-AP v1.1
> Funded and managed by Informatie Vlaanderen
> Beta environment:
http://beta.metadata.vlaanderen.be/geonetwork
> Will be released as open-source software (GPLv2 license)
https://github.com/metadata101/dcat-ap1.1
DCAT-AP Schemaplugin
GeoNetwork-DCAT-APschema-plug-in-25 October2018 14
DCAT-AP XML Schema: a “normalised”XML Schema for DCAT-
AP
 XML is a nested data structure that canbe queried with Xpath,Xquery,XSLT, …
 RDF is a graph-based data structure that can bequeried with SPARQL. The RDF/XML
serialisation of RDF can bearbitrarily structured.
 GeoNetwork can only process XML-based metadata.
 So, an XMLSchemamust bedefined.
DCAT-AP
XML
DCAT-AP
RDF
SPARQL
CONSTRUCTXSLT
DCAT-RDF
records
normalised
DCAT-XML
15GeoNetwork-DCAT-APschema-plug-in-25 October2018
More restrictive than DCAT-RDF
Any document that is valid according to DCAT-XML is valid according to
DCAT-RDF
> Sequence and nesting:
 dcat:Catalog
> dcat:Dataset,
 dcat:Distribution
> dct:LicenseDocument as sub-templates
> dct:language (dct:LinguisticDocument), dcat:theme (skos:Concept),
dct:accrualPeriodicity (dct: … as thesauri (skos:Concept)
> multilingualism
DCAT-AP XML Schema: design choices
GeoNetwork-DCAT-APschema-plug-in-25 October2018 16
> Simple and advanced form
> Multilingual
> Controlled vocabularies as
thesauri
> Reuse (with modification)
of existing controls (e.g.
spatial extent)
DCAT-AP editor
GeoNetwork-DCAT-APschema-plug-in-25 October2018 17
> The plugin will include the DCAT-AP v1.1 controlled vocabularies as
GeoNetwork thesauri (SKOS)
<dct:language>
<skos:Concept rdf:about="http://publications.europa.eu/resource/authority/language/NLD">
<rdf:type rdf:resource="dct:LinguisticSystem"/>
<skos:prefLabel xml:lang="nl">Nederlands</skos:prefLabel>
<skos:prefLabel xml:lang="en">Dutch</skos:prefLabel>
<skos:prefLabel xml:lang="fr">néerlandais</skos:prefLabel>
<skos:prefLabel xml:lang="de">Niederländisch</skos:prefLabel>
<skos:inScheme rdf:resource="http://publications.europa.eu/resource/authority/language"/>
</skos:Concept>
</dct:language>
Controlledvocabularies
GeoNetwork-DCAT-APschema-plug-in-25 October2018 18
> DCAT-AP integrity constraints have been transformed into
a three schematron rulesets:
 DCAT-AP v1.1 strict rules
 DCAT-AP v1.1 recommendations
 DCAT-AP-VL strict rules
Validation
GeoNetwork-DCAT-APschema-plug-in-25 October2018 19
> Standard GeoNetwork API
 Reuse of Lucene search parameters
> Follows recommendations by W3C LDP Paging
> Example:
http://beta.metadata.vlaanderen.be/geonetwork/srv/api/0.1/
records?from=1&hitsPerPage=10&any=beer
RDFendpoint
GeoNetwork-DCAT-APschema-plug-in-25 October2018 20
DCAT harvester
 XML is a nested data structure that canbe queried with Xpath, Xquery, XSLT, …
 RDF is a graph-based data structure that can bequeried with SPARQL. The RDF/XML
serialisation of RDF can bearbitrarily structured.
 GeoNetwork can only process XML-based metadata.
 Onimport, DCAT-RDF metadata mustbe “normalised”
DCAT-
XML
DCAT-
RDF
XSLT (trivial)
SPARQL SELECT +
XSLT
SPARQL
CONSTRUCTXSLT
DCAT-AP
records
normalised
DCAT-XML
21GeoNetwork-DCAT-APschema-plug-in-25 October2018
> Turtle, RDF/XML, N3, JSON-LD syntax
> No paging support so far (not needed)
DCAT harvester
GeoNetwork-DCAT-APschema-plug-in-25 October2018 22
> Usability of the forms
> Usability of validator (link with fields)
> Harvester with paging support
> Schema.org integration (JSON.LD on HTML pages)
> Look & feel
Room for improvement
GeoNetwork-DCAT-APschema-plug-in-25 October2018 23
www.vlaanderen.be/informatievlaanderen
Contact?
geraldine.nolf@kb.vlaanderen.be
stijn.goedertier@gim.be
1 of 23

Recommended

BigDataEurope @BDVA Summit2016 2: Societal Pilots by
BigDataEurope @BDVA Summit2016 2: Societal PilotsBigDataEurope @BDVA Summit2016 2: Societal Pilots
BigDataEurope @BDVA Summit2016 2: Societal PilotsBigData_Europe
496 views43 slides
Big Data Europe Transport Pilot case, Luigi Selmi by
Big Data Europe Transport Pilot case, Luigi SelmiBig Data Europe Transport Pilot case, Luigi Selmi
Big Data Europe Transport Pilot case, Luigi SelmiBigData_Europe
1.8K views22 slides
BDE-BDVA Webinar: BDE Technical Overview by
BDE-BDVA Webinar: BDE Technical OverviewBDE-BDVA Webinar: BDE Technical Overview
BDE-BDVA Webinar: BDE Technical OverviewBigData_Europe
660 views26 slides
Release webinar: Sansa and Ontario by
Release webinar: Sansa and OntarioRelease webinar: Sansa and Ontario
Release webinar: Sansa and OntarioBigData_Europe
550 views25 slides
Apache Big_Data Europe event: "Integrators at work! Real-life applications of... by
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...BigData_Europe
676 views9 slides
BDE SC3.3 Workshop - BDE Platform: Technical overview by
 BDE SC3.3 Workshop -  BDE Platform: Technical overview BDE SC3.3 Workshop -  BDE Platform: Technical overview
BDE SC3.3 Workshop - BDE Platform: Technical overviewBigData_Europe
276 views45 slides

More Related Content

Similar to FOSS4G Belgium 2018 - DCAT-AP Schema-plugin for GeoNetwork

Best practices and lessons learnt from Running Apache NiFi at Renault by
Best practices and lessons learnt from Running Apache NiFi at RenaultBest practices and lessons learnt from Running Apache NiFi at Renault
Best practices and lessons learnt from Running Apache NiFi at RenaultDataWorks Summit
31.9K views50 slides
Data access and data extraction services within the Land Imagery Portal by
Data access and data extraction services within the Land Imagery PortalData access and data extraction services within the Land Imagery Portal
Data access and data extraction services within the Land Imagery PortalGasperi Jerome
805 views34 slides
apidays LIVE Helsinki & North 2022_Apps without APIs by
apidays LIVE Helsinki & North 2022_Apps without APIsapidays LIVE Helsinki & North 2022_Apps without APIs
apidays LIVE Helsinki & North 2022_Apps without APIsapidays
310 views16 slides
[PLCUG] Splunk - complete Citrix environment monitoring by
[PLCUG] Splunk - complete Citrix environment monitoring[PLCUG] Splunk - complete Citrix environment monitoring
[PLCUG] Splunk - complete Citrix environment monitoringJaroslaw Sobel
5.4K views40 slides
Is 12 Factor App Right About Logging by
Is 12 Factor App Right About LoggingIs 12 Factor App Right About Logging
Is 12 Factor App Right About LoggingPhil Wilkins
355 views30 slides
Tim Hall [InfluxData] | InfluxDB Roadmap | InfluxDays Virtual Experience Lond... by
Tim Hall [InfluxData] | InfluxDB Roadmap | InfluxDays Virtual Experience Lond...Tim Hall [InfluxData] | InfluxDB Roadmap | InfluxDays Virtual Experience Lond...
Tim Hall [InfluxData] | InfluxDB Roadmap | InfluxDays Virtual Experience Lond...InfluxData
230 views33 slides

Similar to FOSS4G Belgium 2018 - DCAT-AP Schema-plugin for GeoNetwork(20)

Best practices and lessons learnt from Running Apache NiFi at Renault by DataWorks Summit
Best practices and lessons learnt from Running Apache NiFi at RenaultBest practices and lessons learnt from Running Apache NiFi at Renault
Best practices and lessons learnt from Running Apache NiFi at Renault
DataWorks Summit31.9K views
Data access and data extraction services within the Land Imagery Portal by Gasperi Jerome
Data access and data extraction services within the Land Imagery PortalData access and data extraction services within the Land Imagery Portal
Data access and data extraction services within the Land Imagery Portal
Gasperi Jerome805 views
apidays LIVE Helsinki & North 2022_Apps without APIs by apidays
apidays LIVE Helsinki & North 2022_Apps without APIsapidays LIVE Helsinki & North 2022_Apps without APIs
apidays LIVE Helsinki & North 2022_Apps without APIs
apidays310 views
[PLCUG] Splunk - complete Citrix environment monitoring by Jaroslaw Sobel
[PLCUG] Splunk - complete Citrix environment monitoring[PLCUG] Splunk - complete Citrix environment monitoring
[PLCUG] Splunk - complete Citrix environment monitoring
Jaroslaw Sobel5.4K views
Is 12 Factor App Right About Logging by Phil Wilkins
Is 12 Factor App Right About LoggingIs 12 Factor App Right About Logging
Is 12 Factor App Right About Logging
Phil Wilkins355 views
Tim Hall [InfluxData] | InfluxDB Roadmap | InfluxDays Virtual Experience Lond... by InfluxData
Tim Hall [InfluxData] | InfluxDB Roadmap | InfluxDays Virtual Experience Lond...Tim Hall [InfluxData] | InfluxDB Roadmap | InfluxDays Virtual Experience Lond...
Tim Hall [InfluxData] | InfluxDB Roadmap | InfluxDays Virtual Experience Lond...
InfluxData230 views
DEEP general presentation by EUDAT
DEEP general presentationDEEP general presentation
DEEP general presentation
EUDAT39 views
Enterprise guide to building a Data Mesh by Sion Smith
Enterprise guide to building a Data MeshEnterprise guide to building a Data Mesh
Enterprise guide to building a Data Mesh
Sion Smith70 views
DSD-INT 2015 - RSS Sentinel Toolbox - J. Manuel Delgado Blasco by Deltares
DSD-INT 2015 - RSS Sentinel Toolbox - J. Manuel Delgado BlascoDSD-INT 2015 - RSS Sentinel Toolbox - J. Manuel Delgado Blasco
DSD-INT 2015 - RSS Sentinel Toolbox - J. Manuel Delgado Blasco
Deltares1.5K views
SC4 Workshop 2: Soren Auer BDE project Overview by BigData_Europe
SC4 Workshop 2: Soren Auer BDE project OverviewSC4 Workshop 2: Soren Auer BDE project Overview
SC4 Workshop 2: Soren Auer BDE project Overview
BigData_Europe273 views
The new CIARD RING , a machine-readable directory of datasets for agriculture by Valeria Pesce
The new CIARD RING, a machine-readable directory of datasets for agricultureThe new CIARD RING, a machine-readable directory of datasets for agriculture
The new CIARD RING , a machine-readable directory of datasets for agriculture
Valeria Pesce606 views
Data Processing and Analysis by EUDAT
Data Processing and AnalysisData Processing and Analysis
Data Processing and Analysis
EUDAT776 views
Working together with SURF Raymond Oonk Annette Langedijk SURF by CommunicatieSURF
Working together with SURF Raymond Oonk Annette Langedijk SURFWorking together with SURF Raymond Oonk Annette Langedijk SURF
Working together with SURF Raymond Oonk Annette Langedijk SURF
CommunicatieSURF188 views
Getting Access to ALCF Resources and Services by davidemartin
Getting Access to ALCF Resources and ServicesGetting Access to ALCF Resources and Services
Getting Access to ALCF Resources and Services
davidemartin297 views
DECK36 - Log everything! and Realtime Datastream Analytics with Storm by Mike Lohmann
DECK36 - Log everything! and Realtime Datastream Analytics with StormDECK36 - Log everything! and Realtime Datastream Analytics with Storm
DECK36 - Log everything! and Realtime Datastream Analytics with Storm
Mike Lohmann2.3K views
Sector - Presentation at Cloud Computing & Its Applications 2009 by Robert Grossman
Sector - Presentation at Cloud Computing & Its Applications 2009Sector - Presentation at Cloud Computing & Its Applications 2009
Sector - Presentation at Cloud Computing & Its Applications 2009
Robert Grossman527 views

Recently uploaded

Network Source of Truth and Infrastructure as Code revisited by
Network Source of Truth and Infrastructure as Code revisitedNetwork Source of Truth and Infrastructure as Code revisited
Network Source of Truth and Infrastructure as Code revisitedNetwork Automation Forum
52 views45 slides
Confidence in CloudStack - Aron Wagner, Nathan Gleason - Americ by
Confidence in CloudStack - Aron Wagner, Nathan Gleason - AmericConfidence in CloudStack - Aron Wagner, Nathan Gleason - Americ
Confidence in CloudStack - Aron Wagner, Nathan Gleason - AmericShapeBlue
88 views9 slides
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha... by
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...ShapeBlue
138 views18 slides
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online by
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineKVM Security Groups Under the Hood - Wido den Hollander - Your.Online
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineShapeBlue
181 views19 slides
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue by
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlueWhat’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlueShapeBlue
222 views23 slides
20231123_Camunda Meetup Vienna.pdf by
20231123_Camunda Meetup Vienna.pdf20231123_Camunda Meetup Vienna.pdf
20231123_Camunda Meetup Vienna.pdfPhactum Softwareentwicklung GmbH
50 views73 slides

Recently uploaded(20)

Confidence in CloudStack - Aron Wagner, Nathan Gleason - Americ by ShapeBlue
Confidence in CloudStack - Aron Wagner, Nathan Gleason - AmericConfidence in CloudStack - Aron Wagner, Nathan Gleason - Americ
Confidence in CloudStack - Aron Wagner, Nathan Gleason - Americ
ShapeBlue88 views
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha... by ShapeBlue
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
ShapeBlue138 views
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online by ShapeBlue
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineKVM Security Groups Under the Hood - Wido den Hollander - Your.Online
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online
ShapeBlue181 views
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue by ShapeBlue
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlueWhat’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue
ShapeBlue222 views
Igniting Next Level Productivity with AI-Infused Data Integration Workflows by Safe Software
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Safe Software385 views
Extending KVM Host HA for Non-NFS Storage - Alex Ivanov - StorPool by ShapeBlue
Extending KVM Host HA for Non-NFS Storage -  Alex Ivanov - StorPoolExtending KVM Host HA for Non-NFS Storage -  Alex Ivanov - StorPool
Extending KVM Host HA for Non-NFS Storage - Alex Ivanov - StorPool
ShapeBlue84 views
Digital Personal Data Protection (DPDP) Practical Approach For CISOs by Priyanka Aash
Digital Personal Data Protection (DPDP) Practical Approach For CISOsDigital Personal Data Protection (DPDP) Practical Approach For CISOs
Digital Personal Data Protection (DPDP) Practical Approach For CISOs
Priyanka Aash153 views
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas... by Bernd Ruecker
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
Bernd Ruecker50 views
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ... by ShapeBlue
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...
ShapeBlue123 views
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ... by ShapeBlue
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...
ShapeBlue85 views
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R... by ShapeBlue
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
ShapeBlue132 views
Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ... by ShapeBlue
Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ...Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ...
Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ...
ShapeBlue146 views
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f... by TrustArc
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc160 views
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or... by ShapeBlue
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
ShapeBlue158 views
Future of AR - Facebook Presentation by Rob McCarty
Future of AR - Facebook PresentationFuture of AR - Facebook Presentation
Future of AR - Facebook Presentation
Rob McCarty62 views

FOSS4G Belgium 2018 - DCAT-AP Schema-plugin for GeoNetwork

  • 1. www.vlaanderen.be/informatievlaanderen Geraldine Nolf, Dirk De Baere, Mathias De Schrijver, Bart Cosyn Gustaaf Vandeboel, Mathieu Van Wambeke, Mathieu Chaussier, An Heirman, Stijn Goedertier GeoNetwork DCAT-AP schema-plug-in FOSS4G Belgium 25 Oktober2018
  • 3. AS IS: METADATA STANDARDS, SYSTEMS, PORTALS GEO OPEN LEVEL: Flanders Belgian European Open Geo Standards: Standards: Systems: Systems: Portals: Portals: 3
  • 4. Overview study GeoNetwork-DCAT-APschema-plug-in-25 October2018 4 Geographic data & services ISO/INSPIRE GeoNetwork Geopunt Data/Services Open data DCAT CKAN/TDT CKAN Data Documents ISAD(G)-AP EAD PREMIS (to be) (to be) Documents … … … … … Metadata Standard Scope Information Catalogue Metadata Management System Publication via Portal / Catalogue Data / Information Study
  • 5. > Scenario 0: Optimize & automate AS IS > Scenario 1: Open via procedures Geo domain > Scenario 2: Geo via procedures Open data domain > Scenario 3: Mixed Scenario Study: Scenarios GeoNetwork-DCAT-APschema-plug-in-25 October2018 5
  • 6. Resultson thestudy between Geo & Open data domains GeoNetwork-DCAT-APschema-plug-in-25 October2018 6
  • 8. Geografic data & services ISO/INSPIRE GeoNetwork Geopunt Data/Services Open data DCAT CKAN/TDT GeoNetwork CKAN Data Documents EAD (to be – GeoNetwork ?) (to be) Documents Personal & Company data (to be – DCAT ?) (to be – GeoNetwork ?) MAGDA-online Data/Services Statistic data & services SDMX / StatDCAT ? (to be – GenoNetwork ?) (to be) Statistics/Services API’s (to be - DCAT ?) (to be - Apigee ?) … Services API’s Standards … … … Standards Codelists … … … Codelists Images … … … Images Infographics Scope “Informatiecatalogus” Metadata Standard Scope1 Scope Information Catalogue Metadata Management System Publication via Portal / Catalogue Data / Information GeoNetwork -DCAT-APschema-plug-in-25 October2018 8
  • 9.  W3C Recommendation since 16 January 2014.  https://www.w3.org/TR/vocab-dcat/  Makes useof terms defined in foundational vocabularies such as Dublin Core, FOAF, etc. butalso defines its ownterms in a namespace: http://www.w3.org/ns/dcat# DCAT: a W3C recommendation GeoNetwork-DCAT-APschema-plug-in-25 October2018 9
  • 10. DCAT: a W3C recommendation 10GeoNetwork-DCAT-APschema-plug-in-25 October2018
  • 11. DCAT-AP v1.1: an application profile of DCAT for data portals in Europe  Conforms with DCAT  https://joinup.ec.europa.eu/release/dcat-ap-v11  Extends DCAT with additionalterms and properties  Imposes mandatory, recommended, and optional classes and properties  Imposes controlled vocabularies GeoNetwork-DCAT-APschema-plug-in-25 October2018 11
  • 12. > Via schema plugins GeoNetwork can be extended to support other metadata standards > Examples: ISO19139, ISO19115-3, ISO19110, SensorML, … https://github.com/metadata101 > Plugins are an additional directory with:  XML Schema  XML configuration (forms)  XSLT: forms, post-processing, index-fields, conversion, etc  schematron validation rules (optional)  Java code: e.g. harvester (optional)  Metadata template / sample-data (optional)  … GeoNetwork: schemaplugin architecture GeoNetwork-DCAT-APschema-plug-in-25 October2018 13
  • 13. > Based on DCAT-AP v1.1 > Funded and managed by Informatie Vlaanderen > Beta environment: http://beta.metadata.vlaanderen.be/geonetwork > Will be released as open-source software (GPLv2 license) https://github.com/metadata101/dcat-ap1.1 DCAT-AP Schemaplugin GeoNetwork-DCAT-APschema-plug-in-25 October2018 14
  • 14. DCAT-AP XML Schema: a “normalised”XML Schema for DCAT- AP  XML is a nested data structure that canbe queried with Xpath,Xquery,XSLT, …  RDF is a graph-based data structure that can bequeried with SPARQL. The RDF/XML serialisation of RDF can bearbitrarily structured.  GeoNetwork can only process XML-based metadata.  So, an XMLSchemamust bedefined. DCAT-AP XML DCAT-AP RDF SPARQL CONSTRUCTXSLT DCAT-RDF records normalised DCAT-XML 15GeoNetwork-DCAT-APschema-plug-in-25 October2018
  • 15. More restrictive than DCAT-RDF Any document that is valid according to DCAT-XML is valid according to DCAT-RDF > Sequence and nesting:  dcat:Catalog > dcat:Dataset,  dcat:Distribution > dct:LicenseDocument as sub-templates > dct:language (dct:LinguisticDocument), dcat:theme (skos:Concept), dct:accrualPeriodicity (dct: … as thesauri (skos:Concept) > multilingualism DCAT-AP XML Schema: design choices GeoNetwork-DCAT-APschema-plug-in-25 October2018 16
  • 16. > Simple and advanced form > Multilingual > Controlled vocabularies as thesauri > Reuse (with modification) of existing controls (e.g. spatial extent) DCAT-AP editor GeoNetwork-DCAT-APschema-plug-in-25 October2018 17
  • 17. > The plugin will include the DCAT-AP v1.1 controlled vocabularies as GeoNetwork thesauri (SKOS) <dct:language> <skos:Concept rdf:about="http://publications.europa.eu/resource/authority/language/NLD"> <rdf:type rdf:resource="dct:LinguisticSystem"/> <skos:prefLabel xml:lang="nl">Nederlands</skos:prefLabel> <skos:prefLabel xml:lang="en">Dutch</skos:prefLabel> <skos:prefLabel xml:lang="fr">néerlandais</skos:prefLabel> <skos:prefLabel xml:lang="de">Niederländisch</skos:prefLabel> <skos:inScheme rdf:resource="http://publications.europa.eu/resource/authority/language"/> </skos:Concept> </dct:language> Controlledvocabularies GeoNetwork-DCAT-APschema-plug-in-25 October2018 18
  • 18. > DCAT-AP integrity constraints have been transformed into a three schematron rulesets:  DCAT-AP v1.1 strict rules  DCAT-AP v1.1 recommendations  DCAT-AP-VL strict rules Validation GeoNetwork-DCAT-APschema-plug-in-25 October2018 19
  • 19. > Standard GeoNetwork API  Reuse of Lucene search parameters > Follows recommendations by W3C LDP Paging > Example: http://beta.metadata.vlaanderen.be/geonetwork/srv/api/0.1/ records?from=1&hitsPerPage=10&any=beer RDFendpoint GeoNetwork-DCAT-APschema-plug-in-25 October2018 20
  • 20. DCAT harvester  XML is a nested data structure that canbe queried with Xpath, Xquery, XSLT, …  RDF is a graph-based data structure that can bequeried with SPARQL. The RDF/XML serialisation of RDF can bearbitrarily structured.  GeoNetwork can only process XML-based metadata.  Onimport, DCAT-RDF metadata mustbe “normalised” DCAT- XML DCAT- RDF XSLT (trivial) SPARQL SELECT + XSLT SPARQL CONSTRUCTXSLT DCAT-AP records normalised DCAT-XML 21GeoNetwork-DCAT-APschema-plug-in-25 October2018
  • 21. > Turtle, RDF/XML, N3, JSON-LD syntax > No paging support so far (not needed) DCAT harvester GeoNetwork-DCAT-APschema-plug-in-25 October2018 22
  • 22. > Usability of the forms > Usability of validator (link with fields) > Harvester with paging support > Schema.org integration (JSON.LD on HTML pages) > Look & feel Room for improvement GeoNetwork-DCAT-APschema-plug-in-25 October2018 23