
Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014


Evidence shows that all forms of research output have a role in increasing the impact and value of research. Data is particularly valuable, which is why research funders are placing so much emphasis on its retention, management and discoverability. However, few universities have data collections large enough to make their data globally visible, and few have the resources to connect data held locally with data in international data centres.

Jisc’s data registry service aims to solve this problem cost-effectively for universities, while also giving them and their researchers feedback on how to increase the impact of their research data. This session will explain the goals and approach of the pilot, relate it to lessons from other countries and from government open data, and explain how Jisc and the community can work together to drive future developments in data discovery.

Published in: Education

  1. because good research needs good data. Increasing research impact: the national data registry. Alex Ball, DCC/UKOLN Informatics, University of Bath, 11 March 2014. Except where otherwise stated, this work is licensed under the Creative Commons Attribution 4.0 International licence: http://creativecommons.org/licenses/by/4.0/ Supported by Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr
  2. UK Research Data (Metadata) Registry Pilot Project. Project team: Kevin Ashley, DCC (Edinburgh); Alex Ball, DCC (Bath); Patrick McCann, DCC (Glasgow); Laura Molloy, DCC (Glasgow); Veerle Van den Eynden, UKDA. Funded by Jisc.
  3. Outline: Motivation; Project overview; Architecture; Collaborators; Metadata; Evaluation; Future.
  4. UK data landscape
  5. Motivation. Not just specialist data centres any more: institutional data repositories; generalist repositories related to journals. Interdisciplinary and multidisciplinary research requires data drawn from diverse sources. Data as a first-class research output: funder impact; Research Excellence Framework.
  6. Brainstorming the national data registry. Research data registry. Related services: Gateway to Research; Equipment.data; Data Citation Index; DMPs. Metadata scheme: interoperability; useful for discovery. Harvest from: institutional data repositories; CRISes; data centres.
  7. Research Data Australia. Navigation: About; Collections; Parties; Activities; Services; Themes. What’s in Research Data Australia: Collections (92633): research datasets or collections of research materials. Parties (25467): researchers or research organisations that create or maintain research datasets or collections. Activities (40674): projects or programs that create research datasets or collections. Services (184): services that support the creation or use of research datasets or collections. Spotlight on research data: N.C.W. Beadle Herbarium. The N.C.W. Beadle Herbarium (NE) at University of New England contains around 90,000 pressed, dried, incorporated and databased plant specimens. The collection includes more than 150 TYPE specimens that anchor scientific names as cited in the original publication of those names. This rich resource contains many collections that are of great interest to local and international researchers. The specimen sheet collection of the N.C.W. Beadle Herbarium is databased and available to registered users for online data entry and data query. Explore the N.C.W. Beadle Herbarium Collection through Research Data Australia. Browse by Subject Area; Browse by Map Coverage; Advanced Search.
  8. Research Data UK? Attractions of the Research Data Australia software: familiar to project team; proven technology; plays nicely with search engines; displays sample citations and access/rights information up front. Challenges of using the software in the UK: not used before outside Australia; uses uncommon metadata standard (RIF-CS) internally; original implementation only harvests in RIF-CS; no UK data centre can output RIF-CS metadata.
  9. Project overview. (1) Implement a working instance of the ANDS software. (2) Assemble a group of contributors and establish how their metadata will be harvested. (3) Write crosswalks for transforming contributed metadata into RIF-CS. (4) Harvest metadata from contributors. (5) Reports on: using the Research Data Australia software; how harvesting from data centres went; how harvesting from university repositories went; the value of continuing to develop the registry.
  10. Architecture. Platform: CentOS Linux; MS Azure. UKRDR components: access management; front end; metadata registry; OAI-PMH harvester; indexer (Apache Solr); CMS editor; ID manager. Sources: collections without OAI-PMH support (HTTP); collections with OAI-PMH support (OAI-PMH).
  11. Collaborators. Data centres: UK Data Archive; NERC Data Catalogue Service; BADC; BODC; EIDC; NEODC; NGDC; PDC; UKSSDC; ADS. Universities: Edinburgh; Glasgow; Hull; Lincoln; Leeds; Oxford; Oxford Brookes; St Andrews; Southampton.
  12. Metadata crosswalks. DDI Codebook 2.5: UK Data Archive. DataCite 3: Edinburgh (TBC); Oxford (TBC); Hull (TBC). OAI-PMH Dublin Core: Oxford Brookes (TBC). UK Gemini 2.2: NERC Data Catalogue Service. EPrints 3: Glasgow; Leeds; Lincoln (TBC); Southampton.
  13. DDI Codebook 2.5 Crosswalk. Mapping from DDI to RIF-CS. The following table provides a mapping to populate a RIF-CS Collection record from a UKDA DDI record. The value of the UKDA ID is recorded in the DDI record at codeBook > stdyDscr > citation > titlStmt > IDNo[AGENCY=UKDA].

      | RIF-CS element | Source using DDI record |
      | --- | --- |
      | collection[dateAccessioned] | codeBook > stdyDscr > citation > distStmt > depDate[date] |
      | identifier[type=doi] | codeBook > stdyDscr > citation > titlStmt > IDNo[agency=datacite] |
      | identifier[type=local] | codeBook > stdyDscr > citation > titlStmt > IDNo[agency] content |
      | name[type=primary] > namePart | codeBook > stdyDscr > citation > titlStmt > titl |
      | name[type=alternative] > namePart | codeBook > stdyDscr > citation > titlStmt > altTitl |
      | dates[type=dc.available, dc.issued] > date[type=dateFrom] | codeBook > stdyDscr > citation > distStmt > distDate |
      | dates[type=dc.dateSubmitted] > date[type=dateFrom] | codeBook > stdyDscr > citation > distStmt > depDate[date] |
      | location > address > electronic[type=url] > value | codeBook > stdyDscr > citation > holdings[URI] |
      | subject[type=hasset] | codeBook > stdyDscr > stdyInfo > subject > keyword[vocab=S] |
      | subject[termIdentifier] | codeBook > stdyDscr > stdyInfo > subject > keyword[vocab=S vocabURI] |
      | subject[type=ukdasc] | codeBook > stdyDscr > stdyInfo > subject > topClas |
      | description[type=full] | codeBook > stdyDscr > stdyInfo > abstract |
      | coverage > temporal > date[type=dateFrom] | codeBook > stdyDscr > stdyInfo > sumDscr > collDate[event=start, single date], timePrd[event=start, single date] |
      | coverage > temporal > date[type=dateTo] | codeBook > stdyDscr > stdyInfo > sumDscr > collDate[event=end date], timePrd[event=end date] |
      | coverage > spatial[type=text] | codeBook > stdyDscr > stdyInfo > sumDscr > geogCover, geogUnit, nation; codeBook > stdyDscr > stdyInfo > subject > keyword[vocab=G] |
      | relatedInfo[type=metadata] > identifier[type=uri] | ‘http://esds.ac.uk/DDI25/’ + UKDA ID + ‘.xml’ |

      Ddi2p5ToRifcs.php (06/03/2014):

          <?php
          class Ddi2p5ToRifcs extends Crosswalk {

              private $oaipmh = null;
              private $rifcs = null;
              private $ddiProviders = array(
                  "http://oai.ukdataservice.ac.uk/oai/provider" => "UK Data Archive",
              );

              function __construct(){
                  require_once(REGISTRY_APP_PATH . "core/crosswalks/_crosswalk_helper.php");
                  $this->rifcs = simplexml_load_string(CrosswalkHelper::RIFCS_WRAPPER);
              }

              public function identify(){
                  return "DDI v2.5 to RIF-CS (Experimental)";
              }

              public function metadataFormat(){
                  return "ddi_2.5";
              }

              public function payloadToRIFCS($payload){
                  $this->load_payload($payload);
                  foreach ($this->oaipmh->ListRecords->children() as $record){
                      if ($record->getName() != "record") {
                          continue;
                      }
                      $reg_obj = $this->rifcs->addChild("registryObject");
                      if (array_key_exists((string) $this->oaipmh->request, $this->ddiProviders)) {
                          $reg_obj->addAttribute("group", $this->ddiProviders[(string) $this->oaipmh->request]);
                      }
                      $key = $reg_obj->addChild("key", $record->header->identifier);
                      $originatingSource = $reg_obj->addChild("originatingSource", $this->oaipmh->request);
                      $coll = $reg_obj->addChild("collection");
                      $coll->addAttribute("type", "dataset");
                      $coll->addAttribute("dateModified", date(DATE_W3C));
                      $citation = $coll->addChild("citationInfo");
                      $citation_metadata = $citation->addChild("citationMetadata");
                      $coverage = $coll->addChild("coverage");
                      $rights = $coll->addChild("rights");
                      foreach ($record->metadata->codeBook->stdyDscr->children() as $node){
                          foreach ($node->children() as $subnode) {
                              $func = "process_".$subnode->getName();
                              if (is_callable(array($this, $func))){
                                  call_user_func(
                                      array($this, $func),
                                      $subnode,
                                      array(
                                          "registry_object" => $reg_obj,
                                          "key" => $key,
                                          "collection" => $coll,
                                          "citation_metadata" => $citation_metadata,
                                          "coverage" => $coverage,
                                          "rights" => $rights
                                      )
                                  );
                              }
                          }
                      }
                  }
                  return $this->rifcs->asXML();
              }

              public function validate($payload){
                  $this->load_payload($payload);
                  if (!$this->oaipmh){
                      return false;
                  }
                  if ($this->oaipmh->getName() != "OAI-PMH") {
                      return false;
  14. From UKDA to UKRDR.

      UK Data Service data catalogue record for: Attitudes of Students at the London School of Economics, February 1980. Tabs: Catalogue; Documentation; Related Studies; Download/Order; Get full DDI XML. TITLE DETAILS: SN: 1354. Title: Attitudes of Students at the London School of Economics, February 1980. Persistent identifier: 10.5255/UKDA-SN-1354-1. Series: Attitudes of Students at the London School of Economics, 1980-. Depositor: Husbands, C., London School of Economics and Political Science. Department of Sociology. Principal investigator(s): Husbands, C., London School of Economics and Political Science. Department of Sociology. SUBJECT CATEGORIES: Higher and further. ABSTRACT: To conduct a course exercise that collects questionnaire-based information each year from a sample of students at the London School of Economics. The studies focus on background characteristics relevant to a student population, on attitudes to selected political and social issues, and on participation in various activities at LSE. Questions vary somewhat from year to year. COVERAGE, UNIVERSE, METHODOLOGY: Dates of fieldwork: 6 February 1980 - 22 February 1980. Country: England. Geography: London. Observation units: Individuals; Groups. Universe: Subnational; Students; A sample of registered part-time and full-time students at London School of Economics and Political Science each year between 1980-1992. Time dimensions: Repeated cross-sectional study; surveys conducted annually. Sampling procedures: Quota sample based on sex, undergraduate/graduate status, domestic/overseas status, and department. Number of units: 288 (target); 280 (obtained). Method of data collection: Face-to-face interview. Weighting: No information recorded. KEYWORDS: ABORTION (INDUCED); ALCOHOL CONSUMPTION; ATTITUDES; EDUCATIONAL FEES; EDUCATIONAL FINANCE; EDUCATIONAL GRANTS; FAMILY INFLUENCE; FOREIGN STUDENTS; GENDER; GREATER LONDON; NARCOTIC DRUGS; OCCUPATIONS.

      UKRDR record (Home / UK Data Archive / Collection): Attitudes of Students at the London School of Economics, February 1980. Description: To conduct a course exercise that collects questionnaire-based information each year from a sample of students at the London School of Economics. The studies focus on background characteristics relevant to a student population, on attitudes to selected political and social issues, and on participation in various activities at LSE. Questions vary somewhat from year to year. Identifiers: Local: sn1354; DOI: 10.5255/UKDA-SN-1354-1. Additional Metadata: URI: http://esds.ac.uk/DDI25/1354.xml. Spatial Coverage: text: GREATER LONDON; text: England; text: London. Temporal Coverage: From 1980-02-06 to 1980-02-22. Access: Access rights: The depositor has specified that registration is required and standard conditions of use apply. The depositor may be informed about usage. See for further information. How to Cite this Collection: Citation (Metadata): Husbands, C. ( 1 Ja,1 Ja,1 Ja,1 Ja ): Attitudes of Students at the London School of Economics, February 1980. UK Data Service. DOI: 10.5255/UKDA-SN-1354-1. http://dx.doi.org/10.5255/UKDA-SN-1354-1. Subjects: Keywords: ABORTION (INDUCED); ALCOHOL CONSUMPTION; ATTITUDES; EDUCATIONAL FEES; EDUCATIONAL FINANCE; EDUCATIONAL GRANTS; FAMILY INFLUENCE; FOREIGN STUDENTS; GENDER; NARCOTIC DRUGS; OCCUPATIONS; PARENTS; PART-TIME COURSES; POLITICAL PARTICIPATION; PORNOGRAPHY; SEXUAL BEHAVIOUR; SMOKING; SOCIAL ACTIVITIES (LEISURE); SOCIAL CLASS; SOCIAL PROTEST; STUDENT HOUSING; STUDENT LEISURE; STUDENT PARTICIPATION; STUDENTS; UNIVERSITY COURSES; Higher and further. Connections: People: C. Husbands (PI). Suggested Links: Internal Records with matching subjects (258 records); External Records from DataCite (1 records).
  15. Evaluation questions. Does the software work as intended? Do the harvested records look useful and accurate? Is the system straightforward to use? What might be improved? What additional functions would be desirable?
  16. Future work. Formal evaluation: ROAMEF = Rationale, Objectives, Appraisal, Monitoring, Evaluation, Feedback. Questions to consider: Would another platform suit us better? Would another internal metadata scheme suit us better than RIF-CS? What use cases should the registry target? How can we add value to the registry’s records? Could the registry add value to other systems?
  17. because good research needs good data. Thank you for your attention. DCC Website: http://www.dcc.ac.uk/ Alex Ball: http://alexball.me.uk/ UKRDR Pilot Project: http://www.dcc.ac.uk/projects/research-data-registry-pilot
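
The architecture slide shows an OAI-PMH harvester feeding the metadata registry. As a rough illustration of what such a harvester does, the Python sketch below builds `ListRecords` request URLs and reads the record identifiers and resumption token from one response page. The function names, the sample response and the use of `ddi_2.5` as the `metadataPrefix` are assumptions made for illustration; only the provider URL is taken from the crosswalk listing, and this is not the ANDS harvester's own code.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespace used by OAI-PMH 2.0 responses.
OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}

# Hypothetical one-record response page, used only to exercise the parser.
SAMPLE_PAGE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <request verb="ListRecords">http://oai.ukdataservice.ac.uk/oai/provider</request>
  <ListRecords>
    <record>
      <header><identifier>1354</identifier></header>
    </record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""


def list_records_url(base_url, metadata_prefix=None, resumption_token=None):
    """Build a ListRecords URL; a resumption token replaces metadataPrefix."""
    if resumption_token:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def parse_page(xml_text):
    """Return (record identifiers, next resumption token or None)."""
    root = ET.fromstring(xml_text)
    path = "oai:ListRecords/oai:record/oai:header/oai:identifier"
    identifiers = [el.text for el in root.findall(path, OAI_NS)]
    token = root.findtext(
        "oai:ListRecords/oai:resumptionToken", default="", namespaces=OAI_NS
    )
    return identifiers, (token.strip() or None)


if __name__ == "__main__":
    # The metadataPrefix value here is an assumption, not confirmed by the slides.
    print(list_records_url("http://oai.ukdataservice.ac.uk/oai/provider", "ddi_2.5"))
    print(parse_page(SAMPLE_PAGE))
```

A real harvester would fetch each URL, pass every page to the appropriate crosswalk, and keep requesting with the returned token until none is supplied.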
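
The DDI Codebook 2.5 crosswalk on slide 13 is essentially a table of source paths and target elements. The Python sketch below shows that idea for three of the rows (title, DataCite DOI and abstract). It is a simplified illustration, not a port of `Ddi2p5ToRifcs.php`: the rule table, the function name and the sample record are hypothetical, the sample omits the XML namespaces a real DDI record would carry, and the abstract text is a placeholder.

```python
import xml.etree.ElementTree as ET

# Hypothetical, namespace-free DDI fragment; title and DOI are from slide 14.
SAMPLE_DDI = """<codeBook>
  <stdyDscr>
    <citation>
      <titlStmt>
        <titl>Attitudes of Students at the London School of Economics, February 1980</titl>
        <IDNo agency="datacite">10.5255/UKDA-SN-1354-1</IDNo>
      </titlStmt>
    </citation>
    <stdyInfo>
      <abstract>Example abstract.</abstract>
    </stdyInfo>
  </stdyDscr>
</codeBook>"""

# (DDI path relative to codeBook, RIF-CS element, type attribute, wrapper child)
RULES = [
    ("stdyDscr/citation/titlStmt/titl", "name", "primary", "namePart"),
    ("stdyDscr/citation/titlStmt/IDNo[@agency='datacite']", "identifier", "doi", None),
    ("stdyDscr/stdyInfo/abstract", "description", "full", None),
]


def ddi_to_collection(ddi_xml):
    """Build a RIF-CS-style collection element from a DDI codeBook string."""
    code_book = ET.fromstring(ddi_xml)
    collection = ET.Element("collection", {"type": "dataset"})
    for path, tag, type_attr, child_tag in RULES:
        for source in code_book.findall(path):
            target = ET.SubElement(collection, tag, {"type": type_attr})
            if child_tag:
                # Some RIF-CS elements hold their text in a nested child.
                target = ET.SubElement(target, child_tag)
            target.text = (source.text or "").strip()
    return collection


if __name__ == "__main__":
    print(ET.tostring(ddi_to_collection(SAMPLE_DDI), encoding="unicode"))
```

Keeping the mapping in a data table rather than in per-element handler methods makes each row easy to check against the slide, at the cost of the flexibility that the PHP listing gets from its `process_*` callbacks.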
