Christine Orr, North American Sales Director for Ringgold, spoke at the CESSE 2015 annual meeting session 'Adding Value to Your Process: Supporting Researchers and Data Requirements'.
2. Today’s Agenda
1. Ringgold Introduction
2. Entity Management: People, Places, and
Things
3. Identifiers in the Scholarly Space
4. Open Access Case Study
5. Recommendations & Best Practices
3. Provide structured, authoritative
data about two entity types
Institutions
• Identify Database
• Identify Audit Service: data
normalization
• Consortia Directory Online
Scholarly Works
Books & ebooks: Abstracts and
bibliographic data to drive
discovery, purchase, and use
4. Ringgold’s Vision
A scholarly supply chain where
information and data about
subscribers, authors, readers,
and content is able to flow easily
and without confusion. Trust in
data increases, and more
confident decisions can be made.
Founded 2005
5. Identify & Ringgold ID:
Broad Adoption by Publishers &
Intermediaries…
• Copyright Clearance Center – RightsLink for Open Access
• Aries Editorial Manager
• ScholarOne Manuscripts (coming soon)
• ORCID – Identify used for institutional affiliation module
• Advantage Computing Systems – multiple client integrations
• SalesForce – multiple client integrations
• 50+ scholarly publishers such as Elsevier, OUP, CUP, ProQuest,
PLOS, Sage, T&F, Wolters Kluwer, Kudos…
6. …and by Associations
• AAAS
• American Academy of Pediatrics
• American Chemical Society
• American College of Physicians
• American Institute of Aeronautics and
Astronautics
• American Institute of Physics
• American Medical Association
• American Psychological Association
• American Society for Microbiology
• American Society of Hematology
• American Thoracic Society
• ASTM International
• British Editorial Society of Bone &
Joint Surgery
• Institute of Engineering and
Technology (IET)
• IOP Publishing
• Massachusetts Medical Society (NEJM)
• Radiological Society of North America
• RCN Publishing
• The Royal Society
• Royal Society of Chemistry
• SAO/NASA Astrophysical Data System
• Society for Exploration Geophysicists
• Society for Neuroscience
8. What do
we need
to identify
&
describe?
Authors, Members,
Editors, Readers,
Researchers
Licensees,
Publishers,
Funders,
Intermediaries,
Affiliations
Books,
Journals,
Articles,
Grants,
Citations
9. What are standard identifiers?
• Numeric or alpha-numeric persistent designations associated with a
single entity
• Entities can be an institution, person, or piece of content
10. Disambiguate aka enforce
uniqueness
Institutions that have the same name, but are actually different…
Ringgold ID 1848
Northeastern University (Boston, USA)
Ringgold ID 12434
Northeastern University (Shenyang, China)
11. *This means data that can be linked together through
unambiguous identification and exchanged with others
Governed
Trusted
Transparent
And link to appropriate metadata
In order to be effective, identifiers must be:
12. What can high
quality data
help us to
achieve?
Tactical
support for
day to day
operations
Strategic
planning
Develop
new
information
14. Personal Identifiers
International Standard Name Identifier (ISNI)
www.isni.org
Open Researcher and Contributor ID (ORCID)
www.orcid.org
And many proprietary IDs:
Scopus Author ID - Elsevier
ResearcherID - Thomson Reuters
16. ISNI: International Standard Name
Identifier
• ISO Standard 27729
• ISNI is designed to be
a “bridge identifier”
to join other data
sets.
• Covers any type of
entity that might
appear in a library
catalogue record:
person or institution.
ISNI Number
Party ID 2Ringgold ID
Identify
Metadata &
Hierarchies
Proprietary
Information and/or
Metadata
20. Ringgold Identifier:
Institutions currently active in the scholarly
supply chain
• Numerical ID applied to each record
in the Identify Database
• Format is 4-6 digits
• Created by Ringgold’s team of
researchers (manual process)
• Global coverage
• All sectors: .edu, .gov., .com, .org
• All roles: licensees, publishers,
funders, intermediaries
• 415K institutions and growing
27. Challenge: Correct Application
of APC Discounts / Waivers
• Multiple systems & data sources involved
• Lack of standard references
• Complex criteria + complex institutional
relationships
Solution: Get everyone speaking the same
language
28. RightsLink for Open Access: Typical Workflow
Publisher
•Provides CCC a list of eligible institutions with Ringgold IDs,
per Ringgold Identify Audit or CCC mapping
Author
•Chooses affiliation from Ringgold-enabled list via
•MSS submission (Aries, ScholarOne) OR
•CCC RightsLink for Open Access
CCC
•Captures author affiliation and Ringgold ID
•Compares affiliation to list of eligible institutions using
Ringgold IDs and hierarchies
•Calculates proper APC
30. Discounts can be triggered via Identify hierarchy
or other metadata elements
31. • Correct APC
discounts &
waivers applied
• Publisher’s
business rules
supported
• Records enhanced
with key metadata
and Ringgold IDs
• Reporting fueled
by accurate data
32. Elements of Best Practice at Work
Data integration & linking
Free text entries for
affiliations are minimized in
webforms
Single institutional
authority file
Hierarchical links provide
relational authority
Powers business intelligence
33. Individual + Institution IDs
=
Benefits for Stakeholders
• Funders – Want to track areas of interest, identify worthwhile pursuits, and see where
their money goes
• Institutions – Demonstrate research output more accurately and precisely describe the
institution’s contribution and who is affiliated with that work
• Publishers – Facilitate transactions of all types from content discovery to delivery of
author royalties. Improved market analysis and targeted advertising. Accurately deliver
content, calculate APCs
• Societies – Understand their membership, total constituency, and institutional impact
35. Recommendations
• Create the most complete metadata
possible – for all entity types & roles
• Adhere to standards and identifiers
which are broadly adopted (e.g.
ORCID, ISBN, ISSN, DOIs, Ringgold ID,
ISNI)
• Require authors & members to
establish an ORCID profile
• Minimize manual entry of data
• Improve data capture to require an ID
upon record creation
36. Considerations
• Which records & entities require IDs?
– Old vs new
• How to apply them?
– In-house resources required to join existing records to IDs or authority
file
– Outside partners to help w metadata development, application of IDs,
data strategy
• How to leverage them?
– New reporting & analysis tools to leverage newly linked datasets
37.
38. THANK YOU
Christine Orr
Sales Director, North America
christine.orr@ringgold.com
Tel: 540.359.6620
http://orcid.org/0000-0003-1362-3330
www.ringgold.com @ringgoldinc
Clean Data. Confident Decisions.
Editor's Notes
And very often, the same person might play different roles – and therefore be found in multiple data sets. SLA example.
Also: there are plenty of institutions which share the same acronym, or are known by different official names – think different languages. This happens more often than you think……
So, you cannot disambiguate on name alone – you’d need a second data element, location…and that’s inefficient, and makes for poor data quality control & management.
You’ll note that I’ve used the term “Standard Identifiers” as opposed to just “Standards”… I’ll be focusing on using standard identifiers as the main data hooks that will allow us to aggregate information for the purpose of synthesizing knowledge.
Interoperability implies communication; how we communicate something is very different than how we describe things.
I should take a moment to clarify that the Ringgold ID is not a standard – not an ISO Certified standard, in any case, but in many cases, our data has become a defacto standard through application; some of you might be wondering what then constitutes a big “S” Standards – if any system uses a predefined taxonomy as an authority file to validate data (thereby achieving identify data entries for each and every instance it is needed) then a standard has been achieved.
How data is exchanged is quite different than the data itself, and of course, standards may be applied to both. For my part, I’m going to talk about the data itself, not how it is exchanged– So, in terms of the data itself, what are we trying to standardize? Descriptions – the wrapper around highly unique content. More importantly, as an industry—as a species, really—we are creating data elements that can be interpreted by machines – I should say, easily interpreted by machines – I’ll come back to this topic near the end of my presentation.
Regardless of our business model, open access or traditional subscriptions, we need to make decisions upon a solid foundation, interoperate with all manner of systems, and facilitate sharing of data throughout the virtuous cycle of research, from researcher to reader.
Tactical: customer service, APC calculation & application, helping different systems to interoperate & share data.
Strategic: move away from “anecdata” make business decisions based on facts and trends. Analysis – how’s that OA journal doing?
Develop new information: Create data overlays that may tell us new stories, and help us develop new services. Answer questions we’ve never been able to confidently state before: where have we got authors and no subscriptions? Overlay authors & reviewers to ensure there is no conflict of interest. Reveal new interdisciplinary collaborations.
More than 1.4M and counting,
International Standard Name Identifier: Scope = entities that may be found in library catalogs, ie, published works. Creators, publishers, subjects. Covers people (past, present, real, fictional, groups) and institutions such as publishers.
Of most interest in the OA discussion…..
ID applied to each record in a Ringgold Identify Database record; here is a sample.
What it is:
Funders currently active. This is a registry run by CrossRef, populated via data which is sourced from Elsevier.
Most publishers offer some level of discount or waiver based on affiliation: specific institutions, classes of institutions (per country or org type).
Authors don’t need to know their RIN, multiple orgs using the same standards & vocabulary.
Prevent duplicate account / record creation
Breaks down silos by enabling systems to be compared and linked
Keep data up-to-date and systems synchronised
Improve accuracy in reporting
34
35
Identifiers become more meaningful, more powerful, more effective, when they are used broadly and frequently.