CRIS and OAR entities as Linked Data in scholarly communication – a vision scenario
Jochen Schirrwagen
Bielefeld University Library, Germany

About OpenAIRE - Motivation
• Implementation of a support infrastructure for the European Open Access pilot (2009-2012)
• Research Assessment
  – Identification, Capture, Measurement of EC funded FP7 project results (Special Clause 39)
• Addressing of Interoperability aspects regarding
  – European Commission (EC) Tools
  – Current Research Information Systems (CRIS)
      JISC Research Excellence Framework
  – Open Access Repositories (OAR)
London, 14th of July 2011 – Linked Data and Libraries

OpenAIRE facts
• Itself an EC funded FP7 project
• 38 OpenAIRE partners across Europe
• 27 National Open Access Liaison Offices
• 6887 projects in FP7
  – Using CORDA as authoritative source
• Some 10,000 publications estimated
• Data sources (striving for compliance with the OpenAIRE Guidelines)
  – Institutional repositories
      Using OpenDOAR as authoritative source of ~850 repositories
  – Coverage of subject-based repositories planned
  – Coverage of OA journals planned

Interoperability Scenario
[Diagram: data flows into OpenAIRE]
• CRIS: project data at institution (KE-CRIS-OAR; PMH; ORE)
• CORDA: project data at EC
• OA-Repositories: bibliographic data (DC; OAI-PMH)
• Author: deposit/claiming of publications related to project
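
The repository side of this scenario exchanges Dublin Core records over OAI-PMH. As a rough illustration only, the sketch below builds a `ListRecords` request URL. The base URL is a hypothetical placeholder, and the set name `ec_fundedresources` is an assumption about how a guideline-compliant repository might expose EC-funded publications; neither is taken from the slides.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None, from_date=None):
    """Build an OAI-PMH ListRecords request URL for a repository endpoint."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec      # selective harvesting by set
    if from_date:
        params["from"] = from_date    # selective harvesting by datestamp
    return base_url + "?" + urlencode(params)


# Hypothetical endpoint and set name, for illustration only.
url = list_records_url(
    "https://repository.example.org/oai",
    set_spec="ec_fundedresources",
    from_date="2011-01-01",
)
print(url)
```

A harvester would fetch this URL and follow any `resumptionToken` in the response to page through the result list.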

Candidates for Entities & Vocabularies

Challenges – Data and Interoperability
Capturing research output from different domains involves:
• Different responsibilities and tasks
• Different metadata formats used
• Different metadata exchange interfaces and protocols
• Different metadata granularity
  – In CRIS -> fine
  – In OAR -> coarse

Challenges – Data and Interoperability
In the CRIS domain
  – Covers the research process
  – Run by the administrative department
  – Broader view on research information
  – Diverse data models and formats: CERIF (-like) models, DDF-MXD, METIS, PURE
In the OAR domain
  – Covers research publications
  – Run by the library department
  – Focus on bibliographic quality
  – Diverse metadata formats: DC, DIDL/MODS, EPrints

Issues addressed by KE CRIS-OAR
• Working group within the quadrilateral Knowledge Exchange initiative (KE: SURF-NL, JISC-UK, DFG-DE, DEFF-DK)
• Aiming to increase interoperability between CRIS and OAR domains
  – Increasing metadata quality and re-use
  – Increasing level of interface standards
  – By taking existing formats into account:
      Defining a metadata exchange format
      With a corresponding set of common vocabularies

Publication entity as the center of interest
[Diagram: Publication linked to Organisation, Person, Event and Project]

How could “Linked Data” help?
• Common way of linking content from distinct domains
  – Use of native web technologies
• Controlled vocabularies may help to tame semantic variability
  – URIfying named entities
• Data publishers keep control of their data
• Avoids context loss compared to interchange formats
• May avoid double input and thus redundant data in each domain
• May reduce the identifier problem by assigning persistent URIs to the entities
  – Person (Author) identifier (DAI, ORCID)
  – Publication identifier (DOI, URN, …)
  – Project identifier (?)
  – Event identifier (?)
  – Organisation identifier (?)
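
To make the idea of assigning URIs to identified entities concrete, here is a minimal sketch that turns a few of the identifier types named above into URIs. The URI patterns for DOI and ORCID follow the public resolvers; the `info:eu-repo/dai/nl/` prefix for DAI and all sample values are assumptions for illustration, and entity types without an agreed identifier scheme (project, event, organisation) are deliberately left unsupported.

```python
from urllib.parse import quote

# Identifier scheme -> URI template. The DAI pattern is an assumption.
URI_TEMPLATES = {
    "doi": "https://doi.org/{}",
    "orcid": "https://orcid.org/{}",
    "dai": "info:eu-repo/dai/nl/{}",
}


def to_uri(scheme, value):
    """Return a URI for an identifier, or raise ValueError for unknown schemes."""
    template = URI_TEMPLATES.get(scheme.lower())
    if template is None:
        raise ValueError("no URI pattern known for scheme: " + scheme)
    # Percent-encode unsafe characters but keep '/' and ':' readable.
    return template.format(quote(value.strip(), safe="/:"))


# Sample values are placeholders, not real records.
print(to_uri("doi", "10.1000/182"))
print(to_uri("orcid", "0000-0002-1825-0097"))
```

Raising an error for unknown schemes mirrors the open question marks on the slide: without an agreed identifier, there is nothing stable to build a URI from.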

Aggregation of Interlinked Data
Task to be addressed:
  – “bulk import/export” of publication and project data -> new wording “exposure”
  – Representing different views on the information packages, e.g.:
      Publication as an Aggregation of Person, Organisation, Project and Event entities
      Nested aggregation as a collection of all publications, where each publication is itself an aggregation of related CRIS-OAR entities
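
The nested aggregation described above can be sketched with the OAI-ORE vocabulary (`ore:Aggregation`, `ore:aggregates`). The example below emits N-Triples using only the standard library. All `data.example.org` URIs are hypothetical, and this is an illustrative sketch rather than the serialization shown in the talk.

```python
ORE = "http://www.openarchives.org/ore/terms/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def aggregation_triples(aggregation_uri, member_uris):
    """Type a resource as ore:Aggregation and link it to its members."""
    triples = [(aggregation_uri, RDF_TYPE, ORE + "Aggregation")]
    triples += [(aggregation_uri, ORE + "aggregates", m) for m in member_uris]
    return triples


def to_ntriples(triples):
    """Serialize (subject, predicate, object) URI triples as N-Triples."""
    return "\n".join("<{}> <{}> <{}> .".format(s, p, o) for s, p, o in triples)


# Hypothetical URIs for one publication and its related entities.
BASE = "https://data.example.org/"
publication = BASE + "publication/1"
members = [BASE + "person/1", BASE + "organisation/1",
           BASE + "project/1", BASE + "event/1"]
collection = BASE + "publications"

# Outer aggregation: the collection of publications.
# Inner aggregation: the publication with its CRIS-OAR entities.
triples = (aggregation_triples(collection, [publication])
           + aggregation_triples(publication, members))
print(to_ntriples(triples))
```

Because the publication is both a member of the collection and an aggregation in its own right, a consumer can start at either level and follow `ore:aggregates` links to the view it needs.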

Extract of an ORE RDF Serialization

Next Steps
• Adopting KE CRIS-OAR model and vocabulary
• Addressing “Linked Data” in OpenAIREplus starting in 12/2011
  – Linkage of publications and research data
  – Linkage of CRIS and OAR domains
• Further scenarios may include linked data as citations or statistical data

Further Links
• Portal and project home: www.openaire.eu
• EC pilot: ec.europa.eu/research/science-society/open_access
• CERIF: www.eurocris.org
• KE CRIS-OAR: http://knowledge-exchange.info/Default.aspx?ID=340
• CRIS-OAR schema and vocabulary: https://infoshare.dtv.dk/twiki/bin/view/KeCrisOar/KeCrisOarFormat
Jochen Schirrwagen firstname.lastname@example.org
Wolfram Horstmann email@example.com