This has been our mission since day 1 – enabling information to be exchanged more smoothly between all players in the supply chain. But what we are really talking about is TRUST. Trust that the data/info you are passing around your organization, and to partners, is correct, complete, and especially that we are all talking about the same entity. Clean data is trustworthy data, and we want to help publishers and intermediaries achieve that.
First, a bit of background: We’ve been growing the Identify database since 2003, and the company was founded in 2005. What this shows is we are eager to partner with others in the supply chain, such as publishers, intermediaries, and others working with clean standardized data, and that we are always growing our core service, the Identify database
If folks know anything about us, it’s the Ringgold Identifier, the number. What we really do is help publishers and intermediaries get their data healthier, so it can be used to support better business intelligence, improve decision making, and more easily connect data/records both internally (we all have lots of data silos) and with their external partners like ScholarOne. And where we really live is in helping to provide standardized, tidy information about 2 legs of the stool: institutions (places), and content (things). What we’re going to focus on today are the institutions.
We’ve seen from Laure’s presentation that one person can be represented many ways in the scholarly supply chain. So it is the same with institutions.
Standard identifiers are better than normalized names of institutions – getting rid of free text is only one part of the solution. The entities we care about – people , places (institutions), things (bits of content). Gerry Grenier spoke yesterday about the power of the DOI – for years it’s been important in linking readers to the right content, every time they click on something – even if that URL has changed, the journal has been transferred. Now with the proliferation of data, and the importance of data-driven decisions, having trust in our data – and that the person and institution are properly identified – unique identifiers have become more prominent.
Data governance – for managing your data, records, and information as the valuable business asset it can be. There are two things that IDs do that other data elements can’t reliably achieve:
So it’s easy to see why the UCL abbreviation on the left is not helpful – you point to too many institutions. But look at the right hand side – there really are 2 Northeastern Universities in the world, so the standard name only gets you so far.
And many institutions have legitimate variant names – former names, the names of the library, or – different parts of the same institution have different names.
It’s important so you can trust that when you see the authors or members or subscriptions from Northeastern University in China, you know you’re not confusing them with the one in Boston.
The other thing that a unique numerical ID can do for you is allow you to join up systems both internally and externally.
Trust that when you pass a record from the CRM to finance, that you’re on the same page. Tell ASP story:
Linked correctly and in all possible records & data silos, powerful things can happen: When I was on the publisher side, I worked in institutional sales & marketing – it was always my dream to know where I had authors and no subscribers, which subscribers had a large society member presence, etc. We’ve become even more data driven since: We need this info to make sure we charge the right author the right APC To craft a fully-informed license proposal to a university system To allow our readers to know that they’ve found all the papers output by a specific institution in their field of study It’s great for us, and for our constituents.
This is my ORCID record – note that we are very pleased to partner with ORCID, and they are using the Identify database to power their institutional affiliaion module. So you don’t see my university represented as UW Madison, Univ Wisc Madison, etc. The name has been normalized in the display, and there is even more authoritative data hiding behind the scenes…….so let’s talk about what Identify is…..
Example of type-ahead search – major disclaimer! This is just an example of how ORCID has used Identify as an authority file, S1 will be exploring how best to implement this for you all.
So, I’d like to leave you with a couple of concepts – first, our vision of what the supply chain really should look like – by “should” I mean, how can we structure scholarly communications in a way that creates less friction and more efficiency between all stakeholders within our industry? I’ve always like the expression, “a confused mind always says ‘no’”, well confusion is what we’re trying to eliminate.
Here’s what we have today--- well, it is actually getting better.
As more points in the supply chain begin using standard identifiers, sharing information, and TURSTING that information becomes easiers.
So, we begin moving to a point where Identifiers are at the heart of communications
This is my ‘happy supply chain flower’ graphic... metadata combined with standard identifiers CHANGE the supply chain...merging at an ever increasing rate and the flow of information across systems will be key to exposing content and realizing sales and use of works.
An authority file that is designed just for the institutions that matter to us in scholarly publishing.
Connecting people, places and things
Council of Science Editors
May 5, 2014
What We Will Cover
1.Who is Ringgold?
2.The case for institutional identifiers
What they are
How they help
Background and History
Ringgold acquires Frontline GMS
Ltd and Consortium Directory
Online. Laura Cox joins as V.P.
Sales & Marketing.
Identify reaches 200,000 records.
created via project
Ringgold Inc and
3,500 records in
Ringgold joins forces with British
Library, Swets, HighWire, OUP and
Rockefeller University Press in pilot
project to apply identifiers to
subscribing institutions in UK.
Identify database grows to 15,000
Ringgold begins licensing
institutional IP data from
Ingenta's global access control
60,000 records in Identify.
Ringgold creates strategic
partnership with DataSalon
to combine Identify database
with MasterVision platform.
100,000 records in Identify.
Ringgold becomes first contracted
ISNI Registration Agency for
Identify database reaches 345,000
Ringgold acquires Book News Inc.
What We Do
discovery, purchase, and use
of scholarly content with
How identifiers help
Proper understanding of the customer, whether author,
reader or institution
Provides a simple basis for wider data governance:
Data governance, as defined at Ringgold, is the processes,
policies, standards, organization, and technologies required
to manage and ensure the availability, accessibility, quality,
consistency, auditability, and security of data.
What are standard identifiers?
Numeric or alpha-numeric persistent designations associated with a single entity
Entities can be an institution, person, or piece of content
…and what do they do, exactly?
Disambiguate, aka enforce uniqueness
Enable linking, aka data integration
In other words, they provide a
simple basis for data governance
Enforcing Uniqueness Means:
Disambiguating things that have the same name,
but are actually different…
University College London (UK)
Université Catholique de Louvain
University College Lillebælt
Centro Universitario Celso Lisboa
Union County Library (USA)
National Physical Laboratory (UK)
National Physical Laboratory (India)
University of York (UK)
York University (Canada)
Northeastern University (Boston,
Northeastern University (Shenyang,
….. And consolidating the things that have different names
but are actually the same
University of Oxford
Library, Oxford Univ.
Radcliffe Science Library
Oxford, University of
University of Northampton
Northampton Business School
School of Education
School of Health
School of Science and Technology
Division of Computing
Division of Engineering
Environmental & Geographical Sciences
Institute for Creative Leather
School of Social Sciences
School of The Arts
Why is disambiguation important?
Uniquely identify institutions within records
Eradicate duplication of data
Ensure correct delivery, entitlements, access rights, fees
Better understand your customer base and relationships with
Map institutions into their hierarchy or “family tree”
So that you can trust your data
Data integration, or linking
Identifiers are a single data element that provides an
unambiguous “hook” into a record
What can you do with linked data?
Identifiers to link internal
Break down silos
Keep data up-to-date and
Enable staff to use data
Simplify data transmission
Improve overall data quality
Linking author and institution IDs
When authors and their affiliations are linked correctly, publishers
Market intelligence about authors and institutions
Author and subscriber information mapped together
Knowledge of where research funding is concentrated
Reduction in time taken calculating open access charges (APCs)
Institutions gain information about their overall research output
Funders gain information about where authors reside and publish
Readers benefit from improved search and discovery
The supply chain
The supply chain
Chief Marketing Officer
ProtoView: Jean Brodahl
Publisher Relations - ProtoView
Identify: Christine Orr
North American Sales Director