New Initiatives - Geoffrey Bilder - London LIVE 2017

Presentation by Geoffrey Bilder at Crossref London LIVE, 26th September 2017. New initiatives at Crossref including organisational and grant identifiers.

  1. @gbilder http://orcid.org/0000-0003-1315-5960
  2. New Initiatives. Crossref LIVE 2017, London. Geoffrey Bilder, Director of Strategic Initiatives, @gbilder
  3. Organisational Identifiers
  4. Background
  5. Identifier coverage
      ● Content identifiers: DataCite, Crossref
      ● Contributor identifiers: ORCID
      ● Organisation identifiers: Gaps
  6. What has been done?
  7. Harvard University
  8. Data
  9. 75k
  10. Test Data Preparation
      ● Normalise each dataset to a standard JSON format
        – Identifier, name, country (& code)
      ● Collect sample affiliation data
        – Use affiliation data from Crossref & ORCID
        – Manually match affiliations against candidate datasets (benchmark dataset)
      ● Benchmark dataset contains 100 affiliations
        – Removed erroneous data, some repeats
  11. Matching
      ● Take benchmark dataset of 100 affiliations
      ● Match each affiliation against each dataset
        – Search against a simple Elasticsearch index
        – Use 2 approaches: “basic” and “institution”
      ● Generate reports for each dataset
        – Details of best match for each affiliation
      ● Generate overall summary across datasets
      ● Finally, produce weighted score
        – % coverage × % successful matches
        – Normalises for current dataset coverage
  12. Testing
  13. Anticipating Growth & Churn
  14. Summary
  15. Findings
      • Good candidate seed datasets
      • Small data set
      • Relatively stable
  16. Grant Identifiers
  17. The first rule of grant IDs is that they should not be called “grant IDs.”
  18. Types of support
      • grants
      • endowments
      • secondments
      • loans
      • use of premises/equipment
      • crowd-funding
  19. https://en.wikipedia.org/wiki/Payment_card_number#Credit_card_numbering
  20. Conference & Project Identifiers
  21. API improvements
  22. gbilder@crossref.org Thank You @gbilder
  23. Credits
      • “Payment Icons” - Stuart Colville, https://github.com/muffinresearch/payment-icons
      • DataCite, THOR, and ORCID logos via their respective organisations.
      • All other images via a paid subscription to The Noun Project
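
The weighted score on the Matching slide (% coverage × % successful matches) could be sketched roughly as follows. This is an illustration only, not the code used for the talk: the function names, the tuple layout, and the dataset names are hypothetical, and it assumes both inputs are percentages between 0 and 100 and that the score is reported on the same 0 to 100 scale.

```python
def weighted_score(coverage_pct, match_pct):
    """Combine coverage and match success (both 0-100) into one 0-100 score."""
    for label, value in (("coverage_pct", coverage_pct), ("match_pct", match_pct)):
        if not 0 <= value <= 100:
            raise ValueError(f"{label} must be between 0 and 100, got {value}")
    # Multiplying the two percentages penalises a dataset that matches well
    # but covers only a small share of the benchmark affiliations.
    return coverage_pct * match_pct / 100


def rank_datasets(results):
    """Sort (name, coverage_pct, match_pct) tuples by weighted score, best first."""
    scored = [(name, weighted_score(cov, match)) for name, cov, match in results]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Illustrative numbers only; these are NOT results from the talk.
example_results = [
    ("dataset_a", 80, 50),  # hypothetical: broad coverage, weaker matching
    ("dataset_b", 60, 90),  # hypothetical: narrower coverage, stronger matching
]
ranking = rank_datasets(example_results)
print(ranking)
```

With these made-up inputs, the narrower but better-matching dataset ranks first, which shows how the product trades coverage off against match quality.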
