Successfully reported this slideshow.
Ringgold Webinar Series: Session 2
29 January 2014
Today’s Agenda
1. Unique identifiers at the core of good data health

2. Identifiers in scholarly publishing
3. Embedding ...
At the core of good data health
Regardless of the state of your data’s
health, it can be improved by the
addition of unique identifiers
What are standard identifiers?
 Numeric or alpha-numeric persistent designations associated with a single entity
 Entiti...
…and what do they do, exactly?
1. Disambiguate, aka enforce

uniqueness
2. Enable linking, aka data
integration
In other w...
Enforcing Uniqueness Means:
Disambiguating things that have the same name,
but are actually different:
 UCL:
 University...
….. And consolidating the things that have different names
but are actually the same
 University of Oxford
 Univ. Oxford...
Why is disambiguation important?
 Uniquely identify institutions within records
 Eradicate duplication of data
 Ensure ...
Data integration, or linking
Identifiers are a single data element that provides an
unambiguous “hook” into a record
What can you do with linked data?
 Using Institutional

Identifiers to link internal
systems:

CRM

Electronic
document
s...
Linking author and institution IDs
 When authors and their affiliations are linked correctly,

publishers gain:
 Market ...
The supply chain using identifiers

Consortium
People, places, publications…..
What do we need to identify?
 People

 Institutions

 Authors

 Subscribers / customers

 Members

 Funders

 Edito...
Personal Identifiers
 International Standard Name

Identifier (ISNI) www.isni.org
 Open Researcher and Contributor ID
(O...
ISNI
ISNI Number

ISNI Number

Party ID 1

Party ID 2

Proprietary
Information and/or
Metadata

Proprietary
Information an...
ISNI – Personal Record
ISNI – Institutional Record
Institutional Identifiers
 JISC and CASRAI (Consortia Advancing

Standards in Research Administration
Information) report...
CASRAI report findings
 Disambiguating organizational information from multiple

sources typically described as “a nightm...
●
●
●
●

●

●

●

●
●

●
●

●

●

Publishers

●
●

Funders

Companies

●

HEIs

●

●

●
●

Curated

Regulated

●
●
●

●

H...
Ringgold ID:
Covers institutions in the scholarly supply chain
FundRef
Content-related Identifiers
 ISSN, eISSN
 ISBN
 DOI
 LCCN
Where & When to Include IDs
 Adding them to existing records
 Embedding IDs as new records are created – make them a

re...
In-House Options
 Use internal resources & personnel to join existing records to

IDs or an authority file
 Build custom...
Outsourcing Considerations
 Mapping data elements in your

records to standard identifiers vs.
data normalization service...
How to build a linked supply chain
 Urge your vendors and partners to

adopt identifiers
 Request dedicated data fields ...
Identify
Auditing
Validate
Use Cases
 Identify can act as an authority file of institutions in any

number of systems: editorial, MSS submissions, C...
The world of institutions from a publisher’s point of view
Identify Database:
Catalogs & classifies institutions in the scholarly publishing supply chain…..
…organizes them into hierarchies (aka “family trees”)…
…and spans all industries, market segments, and
regions.
Academia
Medical
Not-for-profit
Public libraries
Corporate
Govern...
Delivery & Access
 Access is enterprise wide: All divisions may utilize complete

array of Identify features and data
 W...
Licensing terms
 Annual subscription: provides ongoing access to the Identify

database. Upon cancellation Ringgold Numbe...
Mapping your accounts to Identify
Audit Service
Turn your customer records
from this…..

…..into this.
Auditing is……
 Manual process, ideal for high-value records such as

institutional subscribers
 Conducted by our team of...
Audit Process
Receive files from
client

Normalise data
(de-duplication
and automatching)

Researcher
checks and
matches t...
Deliverables & Fees
Audit Files for Systems:
 Intended for sequential upload into multiple data systems
Audit Files for H...
Audit Data
Consortia
Member
Parent RIN

Consortia Member Parent
Ringgold ID
Inst Name

Ringgold Inst Name

Customer
ID

Cu...
Beta Affiliation Matching Service
 Matches institutional affiliations in personal records to

Identify
 Combines machine...
Instant creation of new Ringgold Identifiers
Validate
 Validate enables Ringgold’s Identify customers to obtain

Ringgold IDs for institutions which are not currently...
How Validate works
User searches for an
institution in Identify

Cannot find institution

Adds institution in
Validate wit...
Enabling effective supply chain linking
ProtoView
 A service that creates and disseminates book and e-book

metadata on behalf of scholarly publishers
 Develope...
ProtoView in the supply chain
Upcoming Webinars
Session 3: Lean and Mean: Publication
Metadata to Enhance
Discovery, Purchase and Use of Your
Content
We...
Jay Henry
Chief Marketing Officer

www.ringgold.com

Christine Orr
Sales Director
Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Foundation of Healthy Data and the Basis for Linki...
Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Foundation of Healthy Data and the Basis for Linki...
Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Foundation of Healthy Data and the Basis for Linki...
Upcoming SlideShare
Loading in …5
×

Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Foundation of Healthy Data and the Basis for Linking Your Supply Chain

1,300 views

Published on

The second session took place on Wednesday January 29 and discussed Ringgold IDs - what they are and what other identifiers can do for your business. We addressed:
- The current landscape of standard identifiers applicable to scholarly publishing including Ringgold IDs, ISNI, and ORCID. What are they, and why are they important?
- How & why to incorporate them into your internal data silos and into your supply chain activities
- Ringgold Identifiers and the Identify database: Service overview & typical use cases

Published in: Marketing, Technology, Business
  • Be the first to comment

  • Be the first to like this

Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Foundation of Healthy Data and the Basis for Linking Your Supply Chain

  1. 1. Ringgold Webinar Series: Session 2 29 January 2014
  2. 2. Today’s Agenda 1. Unique identifiers at the core of good data health 2. Identifiers in scholarly publishing 3. Embedding identifiers into your records 4. Related Ringgold services
  3. 3. At the core of good data health
  4. 4. Regardless of the state of your data’s health, it can be improved by the addition of unique identifiers
  5. 5. What are standard identifiers?  Numeric or alpha-numeric persistent designations associated with a single entity  Entities can be an institution, person, or piece of content
  6. 6. …and what do they do, exactly? 1. Disambiguate, aka enforce uniqueness 2. Enable linking, aka data integration In other words, they provide a simple basis for data governance
  7. 7. Enforcing Uniqueness Means: Disambiguating things that have the same name, but are actually different:  UCL:  University College London (UK)  Université Catholique de Louvain (Belgium)  Universidad Cristiana Latinoamericana (Ecuador)  University College Lillebælt (Denmark)  Centro Universitario Celso Lisboa (Brazil)  Union County Library (USA)  NPL:  National Physical Laboratory (UK)  National Physical Laboratory (India)  York University  University of York (UK)  York University (Canada)  Northeastern University:  Northeastern University (Boston, USA)  Northeastern University (Shenyang, China)
  8. 8. ….. And consolidating the things that have different names but are actually the same  University of Oxford  Univ. Oxford  Oxford University  Library, Oxford Univ.  Radcliffe Science Library  Bodleian Library  Bodleian, Oxford  Oxford, University of  University of Northampton  Northampton Business School  School of Education  School of Health  School of Science and Technology     Division of Computing Division of Engineering Environmental & Geographical Sciences Institute for Creative Leather Technologies  School of Social Sciences  School of The Arts
  9. 9. Why is disambiguation important?  Uniquely identify institutions within records  Eradicate duplication of data  Ensure correct delivery, entitlements and access rights  Better understand your customer base and relationships with institutions  Improve “trust” in data  Map institutions into their hierarchy
  10. 10. Data integration, or linking Identifiers are a single data element that provides an unambiguous “hook” into a record
  11. 11. What can you do with linked data?  Using Institutional Identifiers to link internal systems: CRM Electronic document storage Financial System  Break down silos  Keep data up-to-date and systems synchronised  Enable staff to use data more effectively  Simplify data transmission  Improve overall data quality Authentication Institutional Identifiers Membership system Usage statistics Author Database Fulfilment system
  12. 12. Linking author and institution IDs  When authors and their affiliations are linked correctly, publishers gain:  Market intelligence about authors and institutions  Author and subscriber information mapped together  Knowledge of where research funding is concentrated  Reduction in time taken calculating open access charges (APCs)  Institutions gain information about their overall research output  Funders gain information about where authors reside and publish
  13. 13. The supply chain using identifiers Consortium
  14. 14. People, places, publications…..
  15. 15. What do we need to identify?  People  Institutions  Authors  Subscribers / customers  Members  Funders  Editors & other contributors  Publishers / licensors  Customers / subscribers  Aggregators  Content  Books & ebooks  Journals  Articles  Sales & subscription agents
  16. 16. Personal Identifiers  International Standard Name Identifier (ISNI) www.isni.org  Open Researcher and Contributor ID (ORCID) www.orcid.org  Scopus Author ID www.elsevier.com/onlinetools/scopus  ResearcherID http://wokinfo.com/researcherid/ And many other proprietary system IDs: Mendeley, Microsoft Academic, Google Scholar, etc….
  17. 17. ISNI ISNI Number ISNI Number Party ID 1 Party ID 2 Proprietary Information and/or Metadata Proprietary Information and/or Metadata  ISO Standard 27729  ISNI is designed to be a “bridge identifier”  Covers any type of entity
  18. 18. ISNI – Personal Record
  19. 19. ISNI – Institutional Record
  20. 20. Institutional Identifiers  JISC and CASRAI (Consortia Advancing Standards in Research Administration Information) report on Organisation IDs: http://repository.jisc.ac.uk/5381/1/CC549D 001-1.0_org_ID_landscape_study.pdf  Examined the landscape of organizational identifiers in the UK and identified 23 different IDs  Lots of detail on use cases for publishing, funders, and institutions
  21. 21. CASRAI report findings  Disambiguating organizational information from multiple sources typically described as “a nightmare”  Benefits from effective unique identifiers are truly realized when data is shared  Key aspects of identifiers that support the widest range of uses:  Governance  Trust  Transparency  Temporal  Appropriate metadata
  22. 22. ● ● ● ● ● ● ● ● ● ● ● ● ● Publishers ● ● Funders Companies ● HEIs ● ● ● ● Curated Regulated ● ● ● ● Historic Global Global Global Global Global Global Global Global Global Mainly used for linking Identifier Name Dun & Bradstreet FundRef ISNI ORCID Ringgold's Identify MACE & UK Federation VIAF Research Analytics Coverage Global Identifiers ● ● ● ● ● ● ● ● ● ● ●
  23. 23. Ringgold ID: Covers institutions in the scholarly supply chain
  24. 24. FundRef
  25. 25. Content-related Identifiers  ISSN, eISSN  ISBN  DOI  LCCN
  26. 26. Where & When to Include IDs  Adding them to existing records  Embedding IDs as new records are created – make them a required data field  Priority record sets?  Existing workflows?  Which IDs do you need?  Create dedicated fields for selected IDs
  27. 27. In-House Options  Use internal resources & personnel to join existing records to IDs or an authority file  Build customized solutions mapping systems together ; i.e. data loaders and transformation tools  Improve data capture to require an ID upon record creation  Manual vs. programmatic  ORCID tools: http://support.orcid.org/
  28. 28. Outsourcing Considerations  Mapping data elements in your records to standard identifiers vs. data normalization services  Normalizing against a standard taxonomy  Computer mapping vs manual process
  29. 29. How to build a linked supply chain  Urge your vendors and partners to adopt identifiers  Request dedicated data fields in any systems implementations  Embed IDs in data exchange processes with your vendors and partners (e.g. subscription agents)  Encourage authors and contributors to register with ORCID
  30. 30. Identify Auditing Validate
  31. 31. Use Cases  Identify can act as an authority file of institutions in any number of systems: editorial, MSS submissions, CRMs, financial, fulfillment, etc.  Understand & analyze your customer base  Analyze the wider market for opportunities  Disambiguate institutions & find duplicate accounts  Reveal institutional relationships with hierarchies  Enhance customer records with Identify metadata  Support pricing decisions & policies
  32. 32. The world of institutions from a publisher’s point of view
  33. 33. Identify Database: Catalogs & classifies institutions in the scholarly publishing supply chain…..
  34. 34. …organizes them into hierarchies (aka “family trees”)…
  35. 35. …and spans all industries, market segments, and regions. Academia Medical Not-for-profit Public libraries Corporate Government Publishers Funding bodies Intermediaries More than 370,000 institutions and growing
  36. 36. Delivery & Access  Access is enterprise wide: All divisions may utilize complete array of Identify features and data  Weekly data feed: Direct feed of complete Identify database for incorporation into your own data warehouse or systems  Identify Online: Ringgold’s own web interface; may be accessed via UN/PW and IP addresses  API: Webservice permits calls to Identify and returns selected data elements
  37. 37. Licensing terms  Annual subscription: provides ongoing access to the Identify database. Upon cancellation Ringgold Numbers and Ringgold Names may be retained; Ringgold will require deletion of all other Ringgold data from the customer’s systems.  Perpetual-use licence: provides ownership of all of the data provided by Ringgold in the Identify database at time of purchase and archival rights to the data supplied. The annual maintenance fee covers the supply of a continuing data feed and ownership of the data held within. Upon cancellation, Ringgold will cease to provide the data feed.
  38. 38. Mapping your accounts to Identify
  39. 39. Audit Service Turn your customer records from this….. …..into this.
  40. 40. Auditing is……  Manual process, ideal for high-value records such as institutional subscribers  Conducted by our team of 40 researchers, speaking more than 30 languages and expert in their assigned regions  Delivers the following for each unique institution:  Unique Ringgold Identifier  Institutional hierarchy  Additional metadata
  41. 41. Audit Process Receive files from client Normalise data (de-duplication and automatching) Researcher checks and matches to Ringgold IDs, hierarchy etc. Researcher creates new IDs for unidentified organizations Data split into countries Data assigned to appropriate country expert Data uploaded to Identify system Client sent encrypted file via FTP with IDs and metadata
  42. 42. Deliverables & Fees Audit Files for Systems:  Intended for sequential upload into multiple data systems Audit Files for Humans:  Excel files for direct analysis by any member of staff Identify Online incorporation:  With Identify subscription, you can see your accounts in a custom, secure view of Identify Online. View your accounts vs the wider market for prospecting, penetration analysis, etc.  Per-record fees apply
  43. 43. Audit Data Consortia Member Parent RIN Consortia Member Parent Ringgold ID Inst Name Ringgold Inst Name Customer ID Customer Name Product Price Format 27003 Universite de Caen BasseNormandie 27003 Universite de Caen BasseNormandie 1008564 Bibliotheque Univ. de Caen Advances in Warp $ Speed Engine Efficiency 27003 Universite de Caen BasseNormandie 56820 Universite de Caen Faculte de Medecine 1151389 UFR De Medecine De Caen Enterprise-Wide Alls Package $ 10,100 Print + Online 27015 Universite Joseph Fourier 27015 Universite Joseph Fourier 58596 U Joe Fourier Bibliotheque Enterprise-Wide Alls Package $ 10,100 Print + Online 27015 Universite Joseph Fourier 72758 Universite Joseph Fourier Faculte de Medecine de Grenoble Journal of Interspecies Bioengineering $ 4,500 Print + Online 27092 Universite Francois-Rabelais de Tours 27092 Universite Francois-Rabelais de Tours Enterprise-Wide Alls Package $ 10,100 Print + Online 27092 Universite Francois-Rabelais de Tours 56555 Universite de Tours Faculte de Medecine de Tours Annals of Mind Meld Research $ 1,600 Print + Online 27092 Universite Francois-Rabelais de Tours 56555 Universite de Tours Faculte de Medecine de Tours 484855 Medical School, Univ of Tours Journal of Interspecies Bioengineering $ 3,995 Online 128791 Aix-Marseille Universite 128791 Aix-Marseille Universite 128791 Aix-Marseille Universite 128791 Aix-Marseille Universite 1216879 BU Gren1 Med Lot 2 332568 Tour Univ Library 1303611 Service De Documentation (I894) 1037952 Bibl. Univ Med Odontolgie 889965 Med Biblio - Aix LeaderSHIP Quarterly $ Journal of Interspecies Bioengineering $ 1,230 Print + Online 970 Print + Online 3,995 Online
  44. 44. Beta Affiliation Matching Service  Matches institutional affiliations in personal records to Identify  Combines machine matching with manual processes; ideal for datasets such as members, authors, reviewers, etc.  Fees are levied on a per-record basis
  45. 45. Instant creation of new Ringgold Identifiers
  46. 46. Validate  Validate enables Ringgold’s Identify customers to obtain Ringgold IDs for institutions which are not currently held in the Identify database with immediate effect.  Users search for an institution, if the institution does not appear to be in Identify, the institution can be added and the Ringgold number obtained immediately.  Ringgold’s staff and researchers manually check all entries made in the Validate system.
  47. 47. How Validate works User searches for an institution in Identify Cannot find institution Adds institution in Validate with required location information Obtains new and unique Ringgold ID instantly Researcher checks entry for duplication or mistakes Researcher adds metadata for new records Report sent back to publisher next day Duplicate Ringgold IDs deleted
  48. 48. Enabling effective supply chain linking
  49. 49. ProtoView  A service that creates and disseminates book and e-book metadata on behalf of scholarly publishers  Developed from a successful model as the next generation of services to meet the needs of an evolving market  Guided by industry best practices and standards Built on the Book News, Inc. foundation and its 35 years of experience in providing promotional services for publishers
  50. 50. ProtoView in the supply chain
  51. 51. Upcoming Webinars Session 3: Lean and Mean: Publication Metadata to Enhance Discovery, Purchase and Use of Your Content Wednesday, February 12. 60 minutes. Session 4: 30-Minute Workout: Quick Tips for Better Customer Data Health Wednesday February 26. 30 minutes. Visit www.ringgold.com to see full descriptions & to register.
  52. 52. Jay Henry Chief Marketing Officer www.ringgold.com Christine Orr Sales Director

×