The document describes Linghub, a repository for linguistic resources and metadata. It lists several tools available in Linghub for discovering, transforming, and managing data, including tools for search and discovery, transforming data to RDF standards, composing license information, linking datasets, and using workflows with containerization. It also describes the functionalities of the Linghub portal such as browsing, searching with filters, using a SPARQL endpoint, performing quality tests on metadata, converting between metadata standards, and checking for duplicates and broken links.
2. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
2
PAL Tools
Discovery
Transform
Data
Manager
Link
Workflows
Search and discovery of datasets
Transformation to RDF standards
Licenses compositions
Dataset linking
Workflows using containerization
3. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
Old Linghub Portal
8
4. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
Linghub Portal
9
5. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
DSpace-based
Open source software for building Open Digital Repositories
https://duraspace.org/dspace/
10
from
6. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
Functionalities
11
7. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
Linked Data-based
13
8. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
Search / Filters
14
9. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
Browsing
15
10. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
SPARQL endpoint
16
11. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
Quality tests
17
12. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
Metadata standards
18
If schemas belong to {dc, dcterms, skos, owl, dcat, odrl, rdf, rdfs, foaf,ms }
13. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
Schema conversion
Language
19
14. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
Schema conversion
License
20
15. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
Schema conversion
License
21
16. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
ODRL form
22
17. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
Teanga Compatibility
23
18. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
URL check
24
19. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
Duplicate detection
25
20. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
Sources
26
21. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182
Repositories
27
Teanga services
iLOD Annohub
LRE-Map
CLARIN old.datahub
Total:
851,034 resources
2,997 languages covered
OLAC
META-SHARE
Editor's Notes
Renovation, rejuvenation, robust
Open source
Good documentation & support channels
Supports Linked Data through RDF export, SPARQL and OAI
Batch import
REST API (need authentication)
Unique identifier generation
Complete User Interface (UI)
Source code for Linghub on Github
Cron task (regular check) on fields:
dcat.accessURL
dcat.endpointURL
dcat.downloadURL
dcat.landingPage
Based on dc.identifier and dc.identifier.uri -> only displays one item in search ( Priority on duplicates: 1/ META-SHARE 2/ CLARIN 3/ Annohub 4/ old.datahub 5/ OLAC 6/ LRE Map
Metashare: dump provided by ELDA in June 2021ELRA included in OLACOld.datahub: collected from website in June 2021Annohub: dump provided
iLOD: dump provided in May 2021
LRE-Map old dump from 2014 (not possible to crawl current website)
P2P decentralized storage infrastructure using the InterPlanetary File System (IPFS)