RJ Broker: Automating Delivery of Research Output to Repositories


Published on

Presentation by Muriel Mewissen at the RSP event: Increasing the full-text deposits in your institutional repository, in London on 12 June 2013.

Published in: Technology, Business
  • Be the first to comment

  • Be the first to like this

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

RJ Broker: Automating Delivery of Research Output to Repositories

  1. 1. RJ Broker:Automating Delivery of ResearchOutput to RepositoriesMuriel Mewissen – EDINARSP Event - London - 12 June 20131
  2. 2. Overview• Need for a broker• Development of the RJ Broker• Publisher & Subject Repository Trials• Future• ConclusionRSP Event - London - 12 June 20132
  3. 3. The need for a BrokerRSP Event - London - 12 June 20133
  4. 4. Focus• To increase the number of deposits to UK repositories• To minimise effort by depositors and IR managersSupport Open Access & Funder mandatesInstitutional RepoSubject repoPublishersAuthor 2 /RepresentativeFundersPublisher SystemIRIRIRAuthor/PIRSP Event - London - 12 June 20134Broker
  5. 5. Development of the RJ BrokerRSP Event - London - 12 June 20135
  6. 6. RJ Broker Project• Repository based projects at EDINA since 2006http://edina.ac.uk/projects/• RJ Broker– April 2012 to March 2013, extended to July 2013– Component of the RepNet infrastructure• RJ Broker is a delivery service for researchoutput– Deposit: parcel or letter– Metadata: address– Notification: postcard– Vision:http://oarepojunction.wordpress.com/2013/01/10/rj-broker-a-research-output-delivery-service/RSP Event - London - 12 June 20136
  7. 7. RJ Broker Middleware ToolRSP Event - London - 12 June 201371. Accept deposit of research articlesNLM DTD, bespoke format, SWORD, Eprints2. Process the deposits into a common formatRJ Broker code3. Identify target repositories from metadataOrganisation and Repository Identification (ORI)http://ori.edina.ac.uk/4. Handle deposition to registered repositoriesSWORD, plugins (Eprints, DSpace, Fedora)5. Provide tracking ID to content supplierURIs6. Notify other repositories with relevant contentMonthly email7. Allow browsing, search and downloadGUI & APIs (Eprints)“View” is useful for non SWORD systems (CRIS) & individuals1234567
  8. 8. Publisher & Subject RepositoryTrials with the Pilot RJ BrokerRSP Event - London - 12 June 20138
  9. 9. Pilot RJ Broker• Demonstrate the functionality• Real data• Test the scalability• Publisher: Nature Publishing Group (NPG)• Subject Repository: Europe PubMed CentralRSP Event - London - 12 June 20139
  10. 10. Publisher: NPG• Record includes– Metadata: rich, embargo, funder, multiple authors,ORCID in the future…– Content: Multi-part publication (some content may beembargoed) full text author final copy (post-print)• Development work:– Agree format for the record (NLM DTD based)– EDINA developed an importer for the data– Transfer using SWORD 1.3– NPG added new stream in their publication workflowthat send data to the RJ BrokerRSP Event - London - 12 June 201310
  11. 11. NPG• Legal agreements to respect embargo periods– Between NPG & EDINA– Between EDINA & IRs• MIT signed the IR agreement– Working on data importer for DSpace• Worth considering to receive:– Quality: Full text publication & rich metadata– Timely: Straight from the publisher during thepublication process even if embargoed• Template agreement on requestRSP Event - London - 12 June 201311
  12. 12. NPG• Set up took several months– Time difference– Relies on voluntary participation– Requires small amount of development work– Legal framework• Successful data transfer trial between NPG& RJ Broker in February 2013• Transfer to test IRs• NPG ready to start continuous data feed– A couple of journals first to increase with take upRSP Event - London - 12 June 201312
  13. 13. Subject Repository: Europe PMC• Use case supported by Jisc, RepNet & WellcomeTrust– UK focus– Support funders mandate• Record includes:– New publication or Update to existing publication– Metadata only: funders, grant numbers, first authoronly, DOI to full text…– No restrictions to redistributionRSP Event - London - 12 June 201313
  14. 14. Europe PMC• Development work:– Agree format for the record (bespoke)– EDINA developed an importer for the data– MIMAS/EBI get regular data feed from PMC– Push data from their regular feed to the RJ Broker– Transfer using SWORD 1.3• Set up took a few weeks• Successful data transfer trial betweenEurope PMC & RJ Broker in February 2013• Transfer to test IRs• Ready to start continuous data feed– Average 160,000 records per monthRSP Event - London - 12 June 201314
  15. 15. Europe PMC Trial in Numbers~67,000~60,000~58,500~22,500~14,5001,665RSP Event - London - 12 June 20131567,000 records in the trial dataset(~12 days based on an average 160,000 per month)7,000 no affiliation 60,000 sent to RJBroker1,500 errors (bad format) 58,500 successfullyreceived by RJ Broker36,000 with no identifiableorganisationRJ Broker identifiesorganisation for 22,5008,000 no repositories 14,500 haverepositories13,000 worldwide (not UK) 1,665 in the UK
  16. 16. Europe PMC Trial in NumbersRSP Event - London - 12 June 201316
  17. 17. Europe PMC Trial in NumbersRSP Event - London - 12 June 201317Number ofassociatedrepositories forrecords with oneorganisationidentified
  18. 18. Europe PMC Trial in NumbersRSP Event - London - 12 June 201318CountryCodeCountry Number ofrecordsus USA 5934gb UnitedKingdom1665ca Canada 1099jp Japan 722au Australia 655se Sweden 313es Spain 304nl Netherlands 299de Germany 239tw Taiwan 181fr France 180br Brazil 179it Italy 176be Belgium 174th Thailand 168za South Africa 160sd Sudan 15555 other countries withless than 1% of recordseach1836
  19. 19. Top UK Institutions Destination Number ofrecordsUniversity of Oxford 170University of Cambridge 139University College London 119Imperial College 103University of Edinburgh 88University of Manchester 63University of Bristol 61University of Nottingham, University of Newcastle Upon Tyne 56Liverpool 55University of Glasgow 52RSP Event - London - 12 June 201319Europe PMC Trial in Numbers78 UK Institutions in total
  20. 20. RJ Broker Trial InstallationRSP Event - London - 12 June 201320GUI preview accessOA records fromtrials are availablefor browsing &downloading– Check what wehave for yourinstitution!– http://devel.edina.ac.uk:1203/– !!! It is only trial &developmentinstallation– !!! Not a service yet
  21. 21. RJ Broker TrialDemonstrate features:• Importing records from different suppliers• Storing & Processing (~2s per record)• Repository Identification• Delivery• Browsing & DownloadMore end-to-end use cases with external IRsRSP Event - London - 12 June 201321
  22. 22. The FutureRSP Event - London - 12 June 201322
  23. 23. Immediate Future• Project extension (31 July 2013)• Prepare transition to service– Service installation– Add functionality• Email notification to all (non-registered) IRs• Improve support for different repository platforms• Bulk transfer of data backlog• Support RIOXX metadata export– Early adopters• IRs• Data suppliers to establish data feeds– Start building data store• Content kept for 1 year to start withRSP Event - London - 12 June 201323
  24. 24. Future (after July 2013)• Transition to Service– SLD (RepNet/Jisc)– Roadmap for adding further functionality• Open for recruitment– Info „pack‟, template, sandbox, help & support– IR Registration process:• provides SWORD endpoint credentials to RJ Broker• IR is configured to accept RJ Broker data• Option to opt-in to receive embargo content requires to sign alegal agreement– Data supplier Registration process:• RJ Broker to provide SWORD access• Agree format & develop importer• Enable regular data feed into RJ BrokerRSP Event - London - 12 June 201324
  25. 25. ConclusionRSP Event - London - 12 June 201325
  26. 26. Conclusion• Effective solution to content dissemination• Benefits all– Increase deposit to IRs– Support OA (Gold & Green)– Help with reporting– Support promotion of research output– Saves time & effort (money)• Appeal of service will grow• Small amount of development work neededlocally but it is worth it!RSP Event - London - 12 June 201326
  27. 27. Thanks• You• RSP• EDINA Team– Ian Stuart - Cesare Bellini– Muriel Mewissen - Christine Rees– Peter Burnhill - Theo Andrews• NPG, MIT, Europe PMC, MIMAS, EBI, WellcomeTrust• UK RepositoryNet+• JiscRSP Event - London - 12 June 201327