Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Using OpenURL Activity Data - Activity Data Online Exchange Event

1,031 views

Published on

Presented by Sheila Fraser at Activity Data Online Exchange Event, an online event, 2 June 2011.

Published in: Education, Technology
  • Be the first to comment

  • Be the first to like this

Using OpenURL Activity Data - Activity Data Online Exchange Event

  1. 1. Using OpenURL Activity Data Activity Data Online Exchange Event Sheila Fraser 2 nd June 2011
  2. 2. What are the OpenURL Router Data? <ul><li>Learners, teachers and researchers in UK HE institutions seek journals and papers for academic use </li></ul><ul><li>A paper may be available from many different service providers - so which is the “appropriate copy” for a user? </li></ul><ul><li>The OpenURL Router directs the request to the appropriate institutional resolver </li></ul><ul><li>Each redirect request is logged, providing a record of the article the user was attempting to find </li></ul><ul><li>The Router also logs non-bibliographic metadata: “lookup” requests (registry searches) and preferred button image requests </li></ul>Existing process … Institutional Resolvers OpenURL Router Institutional Resolvers Institutional Resolvers Institutional Resolvers Request Redirect request Log request Level 0 Data (Log)
  3. 3. What are we doing in this project? Aim 3: Explore including other institutions’ data in aggregation Existing process … Institutional Resolvers OpenURL Router Institutional Resolvers Institutional Resolvers Institutional Resolvers Request Redirect request Log request Level 0 Data (Log) Survey institutions to enable opt-out Level 1 Data <ul><li>Process to Level 1: </li></ul><ul><li>Exclude data from opted-out institutions </li></ul><ul><li>Anonymise IP addresses </li></ul><ul><li>Anonymise institution & remove button & lookup data that would identify institution </li></ul><ul><li>Parse OpenURL request into constituent parts </li></ul><ul><li>Process to Level 2: </li></ul><ul><li>Include only redirect to resolver requests </li></ul>Level 2 Data Use for prototypes & services Aim 1: Make this data available under open licence Aim 2: Develop prototype service using this activity data
  4. 4. What’s in the data set? <ul><li>Log-specific data (based on OpenURL Router log entries): </li></ul><ul><ul><li>logDate (Date the record was logged) </li></ul></ul><ul><ul><li>logTime (Time the record was logged in format HH:MM:SS) </li></ul></ul><ul><ul><li>encryptedUserIP (Anonymised IP address/session identifier) </li></ul></ul><ul><li>Request-specific data (based on the OpenURL standard): </li></ul><ul><ul><li>institutionResolverID (Anonymised institutional identifier) </li></ul></ul><ul><ul><li>routerRedirectIdentifier (Redirect identifier passed as part of the URL) </li></ul></ul><ul><ul><li>aulast (Last author) </li></ul></ul><ul><ul><li>aufirst (First author) </li></ul></ul><ul><ul><li>auinit (First author's first and middle initials) </li></ul></ul><ul><ul><li>auinit1 (First author's first initial) </li></ul></ul><ul><ul><li>auinitm (First author's middle initial) </li></ul></ul><ul><ul><li>au (Full name of a single author) </li></ul></ul><ul><ul><li>aucorp (Organization or corporation that is the author or creator of the document) </li></ul></ul><ul><ul><li>atitle (Article title) </li></ul></ul><ul><ul><li>title (Journal title, for compatibility with version 0.1) </li></ul></ul><ul><ul><li>jtitle (Journal title) </li></ul></ul><ul><ul><li>stitle (Short journal title) </li></ul></ul><ul><ul><li>date (Date of publication) </li></ul></ul><ul><ul><li>ssn (Season (chronology). Legitimate values are spring, summer, fall, winter) </li></ul></ul><ul><ul><li>quarter (Quarter (chronology). Legitimate values are 1, 2, 3, 4.) </li></ul></ul><ul><ul><li>volume (Volume designation, usually expressed as a number but could be non-numeric) </li></ul></ul><ul><ul><li>part (Part can be a special subdivision of a volume or it can be the highest level division of the journal. Parts are often designated with letters or names) </li></ul></ul><ul><ul><li>issue (Designation of published issue of a journal) </li></ul></ul><ul><ul><li>spage (First page number. Pages are not always numeric) </li></ul></ul><ul><ul><li>epage (Second (ending) page number) </li></ul></ul><ul><ul><li>pages (Start and end pages, e.g. 53-58) </li></ul></ul><ul><ul><li>artnum (Article number assigned by the publisher) </li></ul></ul><ul><ul><li>issn (International Standard Serials Number) </li></ul></ul><ul><ul><li>eissn (ISSN for electronic version of the journal) </li></ul></ul><ul><ul><li>isbn (International Standard Book Number) </li></ul></ul><ul><ul><li>coden (Alphanumeric bibliographic code) </li></ul></ul><ul><ul><li>sici (Serial Item Contribution Identifier) </li></ul></ul><ul><ul><li>genre </li></ul></ul><ul><ul><li>btitle (The title of the book - can also be expressed as title) </li></ul></ul><ul><ul><li>place (International Standard Book Number) </li></ul></ul><ul><ul><li>pub (Publisher name) </li></ul></ul><ul><ul><li>edition (Statement of the edition of the book) </li></ul></ul><ul><ul><li>tpages (Total pages) </li></ul></ul><ul><ul><li>series (The title of a series in which the book or document was issued) </li></ul></ul><ul><ul><li>doi (Digital Object Identifier) </li></ul></ul><ul><ul><li>sid (Service ID, the item(journal, article etc) provider) </li></ul></ul>Further details: http://openurl.ac.uk/doc/data/data.html
  5. 5. How might the data be used? <ul><li>Article/journal recommendations </li></ul><ul><li>Student analysis </li></ul><ul><li>Research thesis </li></ul><ul><li>Publishers comparing listings with texts sought </li></ul><ul><li>Identifying priorities for eJournal preservation </li></ul><ul><li>Innovative services to meet your users’ needs </li></ul><ul><li>Other, unanticipated uses </li></ul>
  6. 6. Explore including other institutions’ data <ul><li>Can it be aggregated? </li></ul><ul><ul><li>Data compatibility (OpenURL standard) </li></ul></ul><ul><li>What are the issues? </li></ul><ul><ul><li>Legal </li></ul></ul><ul><ul><ul><li>DPA: cannot share personal data without permission </li></ul></ul></ul><ul><ul><li>Technical </li></ul></ul><ul><ul><ul><li>Can we / how do we extract resolver data at the same level of detail? </li></ul></ul></ul><ul><ul><ul><li>How to identify duplicates? </li></ul></ul></ul><ul><ul><ul><li>Regular sharing & maintainability? </li></ul></ul></ul><ul><ul><li>Financial </li></ul></ul><ul><ul><ul><li>What potential effort could be involved? </li></ul></ul></ul><ul><li>What other issues are there? </li></ul>

×