Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

System Update (2011 CrossRef Annual Meeting)

2,969 views

Published on

  • Video recording of this presentation is now available on River Valley TV: http://river-valley.tv/system-update-2011/
       Reply 
    Are you sure you want to  Yes  No
    Your message goes here
  • Be the first to like this

System Update (2011 CrossRef Annual Meeting)

  1. 1. System Update 2011  Rewrite  System Status  2012 Chuck Koscher Director of Technology ckoscher@crossref.org
  2. 2. System Re-write  Query System (QS) fully deployed, stable and improving  Deposit System (DS), early in 2011 we switched to a 2 phase project plan  Phase 1: port existing code to new framework  Phase 2: redesign deposit database schema and new code base  Moderate pace of development  balance costs Vs. timetable Vs. rich design 2011 2012 QS Deploy QS Shakedown QS Stable QS Query DB DS P1 Develop DS Test DS P1 DS P2 Development DS Data Migration DS P2
  3. 3. System Status www.alertra.com 12 Months Deposits OpenURL Queries
  4. 4. System Status DOI Clicks 60,000,000 50,000,000 40,000,000 30,000,000 20,000,000 10,000,000 0
  5. 5. System Status
  6. 6. System Status  Query volume has doubled, but matching rate has fallen
  7. 7. System Status Queries # of Queries Matches80,000,00075,000,00070,000,00065,000,00060,000,00055,000,00050,000,00045,000,00040,000,00035,000,00030,000,00025,000,00020,000,00015,000,00010,000,000 5,000,000 0 Month
  8. 8. System Status70  Deposit times, a lot of large re-deposit activity60 Feb50 Jan40 Mar Apr30 May June20 July Aug10 Sep 0 >24hr 18-24hr 12-18hr 6-12hr 1-6hr 5min-1hr <5min
  9. 9. System Status DOI Deposits Backfile Articles Current Articles1,000,000 900,000 800,000 700,000 600,000 500,000 400,000 300,000 200,000 100,000 0 Month
  10. 10. System Status Cited-By linking (depositing references)  Most large publishers and many smaller to mid-size members are in # of CrossRef Cited-by Publishers 300 248 200 100 0 2007 2008 2009 2010 2011 present  Over 225 million cited-by relationships  We expect another 50+ million when Elsevier data is finished  Unstructured citations not yet being processed, but will be soon  Changes being made to improve data retrieval  Forward link alert emails being aggregated (two emails a day) Forward links by prefix on a given day (often still too much data!)  Use OAI-PMH to retrieve by date, by prefix or by publication title
  11. 11. 2012  Deposit rewrite & Query system improvements Improve deposit logging (messaging Vs. email)  Analysis of our now robust logs to improve performance  New member/user/prefix/title/ownership model  Production implementation of Crossmark  Rework conflict process Cited-By Focus on OAI-PMH for data harvesting Implement messaging service for alerts (replace email)  Unstructured reference processing  Finish simple text query API  Processing of PDF files (deposit metadata & extract references)  Simplify usage of overlapping technologies
  12. 12. Go to support.crossref.org Subscribe to the forums!

×