SlideShare a Scribd company logo
1 of 47
ALPSP Seminar: Data the universe and everything
22nd January 2014
Laura Cox
Contents
1.
2.
3.
4.
5.
6.
7.
8.
9.
10.

Healthy data
Why now?
Institutional Identifiers – what and why?
Common data problems
Data governance
Data integration (linking your data together)
Institutional Identifiers in the supply chain
Institutional Identifiers – which?
ISNI IDs
What you can do now?
Why is healthy data important?
Good quality, healthy data can be
utilized to gain insight into customers,
business relationships and to support
strategic planning, decision making,
and ongoing business operations.
But when it’s unhealthy….
Poor data has real consequences
 Hard to get a true picture of relationships with institutions
 Lack of quality author (and affiliation) data

 Inability to see overlap between authors, members and

customers
 Inaccurate holdings and revenue reports
 Protracted time and effort taken to analyse data
Everything becomes more difficult, and less accurate
Healthy records are:
 Complete
 Accurate
 Free of duplicates
 Current
 Consistent

 Conform to standards
Unique Identifiers
What are they? How can they help?
 Numeric or alpha-numeric designations which are associated with







a single entity
Entities can be an institution, person, or piece of content
Enable the disambiguation of each entity
Proper understanding of the customer, author, reader or
institution
Proper identification of content object, article, product, or
package
Can be used internally or in conjunction with external partners
Why we should worry about data now?
 Number of researchers increasing by 3% per annum*
 Number of articles increasing by 3% per annum, current

output is 1.8-1.9 million per year*
 Number of journals increasing by 3.5% per annum*
 Growth in China has been in double digits for over 15 years*
 Increased demand for anytime/anywhere access
 Library budgets are frozen or being cut, less money for more
content means we have to work smarter
* Ware, M and Mabe, M, The STM Report, 2012
What are Institutional Identifiers for?
Disambiguating:
 UCL:
 University College London (UK)
 Université Catholique de Louvain
(Belgium)
 Universidad Cristiana
Latinoamericana (Ecuador)
 University College Lillebælt
(Denmark)
 Centro Universitario Celso Lisboa
(Brazil)
 Union County Library (USA)

 NPL:
 National Physical Laboratory (UK)
 National Physical Laboratory
(India)
 York University
 University of York (UK)
 York University (Canada)
 Northeastern University:
 Northeastern University
(Boston, USA)
 Northeastern University
(Shenyang, China)
What are Institutional Identifiers for?
Consolidating:

Hierarchy View:

 University of Oxford

 University of Northampton

 Univ. Oxford

 Northampton Business School

 Oxford University

 School of Education

 Library, Oxford Univ.
 Radcliffe Science Library

 School of Health

 School of Science and Technology


 Bodleian Library



 Bodleian, Oxford



 Oxford, University of



Division of Computing
Division of Engineering
Environmental & Geographical Sciences
Institute for Creative Leather
Technologies

 School of Social Sciences
 School of The Arts
Use cases – the why
Identifiers enforce uniqueness
 Disambiguate institutional records
 Eradicate duplication of data
 Ensure correct delivery, entitlements and access rights
 Better understand your customer base and relationships with

institutions
 Improve “trust” in data
 Map institutions into their hierarchy
Common data problems
 Most publishers have problems with data:
 Multiple accounts for each customer

 Multiple internal IT systems for different purposes
 Data entry without standard names or ID numbers
 Lack of hierarchy information
 No formal manner to track customers across systems
The challenge: Data Sources
 Multiple data sources – ‘system’ data silos
 Multiple locations – ‘geographic’ data silos

 Data entered by different people for different purposes
 Data from third parties in the supply chain
 Data from bought-in sources
The challenge: Data Sources
Typical publisher systems:

Data can be entered by:

 Financial system

 Organisation staff

 CRM/Sales database

 Authors

 Authentication system

 Society members

 Fulfilment

 Agents in the supply chain

 Usage statistics
 Submissions system
 Author database
 Document Storage (contracts and

licences)
 …..

 3rd party organisations
 …..
Implementing a data governance plan
 Important considerations:
 What data is held, where it is held and how it is accessed?

 How can the data be used to further benefit different





departments, processes or activities?
Could the use of current or planned systems be expanded for
further benefit?
Is data highly accurate and consolidated or in need of cleansing?
Are there applications of data that have not been explored?
What requirements are there for additional data?
Improve data capture
 If you can – use web forms
 Implement required fields

 Data validation – at a minimum use naming conventions
 Address validation – postcode lookup
 Institution validation – institution lookup

 Web form consistency across systems
 Avoid free-text fields
 Make institutional identifiers a requirement
Implementing Institutional IDs
Turn your records from this…..

…..into this.
Data integration
CRM

 Using Institutional Identifiers

to link internal systems:

Electronic
document
storage

Financial
System

 Prevent duplicate account








creation
Break down silos
Keep data up-to-date and
systems synchronised
Enable staff to use data more
effectively
Simplify data transmission
Improve overall data quality

Authentication

Institutional
Identifiers

Membership
system

Usage
statistics

Author
Database
Fulfilment
system
Linking author and institution IDs
 When authors and their affiliations are linked

correctly, publishers gain:
 Market intelligence about authors and institutions
 Author and subscriber information mapped together
 Knowledge of where research funding is concentrated
 Reduction in time taken calculating open access charges (APCs)

 Institutions gain information about their overall research

output
 Funders gain information about where authors reside and
publish
The scholarly supply chain
 Purpose:
 Serving the author and reader

 Disseminate content as widely as possible
 Ensure content is easily discoverable
 Provide information in an efficient and trouble-free manner

regardless of:




Content type
User requirements
Desired methods of access
The supply chain (simple version)
Author

Funders
Submission
and Peer
Review
System

End User

Discovery
Service

Consortium
Consortium

Data
Providers and
Systems
(multiple)

Publisher

Online Host
or
Technology
Partner

Library

Fulfilment
House or
System

Subscription
Agent or
Sales Agent

Societies
Supply-chain spaghetti
Author

Funders
Submission
and Peer
Review
System

End User

Discovery
Service

Consortium
Consortium

Data
Providers and
Systems
(multiple)

Publisher

Online Host
or
Technology
Partner

Library

Fulfilment
House or
System

Subscription
Agent or
Sales Agent

Societies
What could possibly go wrong?
 Records are unconnected through the supply chain, links fail:
 Between entities
 Between internal systems
 Between external systems









Renewals are mishandled
Journal transfers are mishandled
Access and authentication is mishandled
Authors and individuals are not linked to their institution
Open access fees have to be checked manually
Authors are not linked to their research
Funders are not linked to the research they fund
Where stronger links are needed
 Finding a path to using standardized data, which:
 Eradicates duplicate records within and between systems

 Enables seamless communication between organizations
 Smoothes the supply chain, removing ambiguity or lack of

information for any party
 Enables higher quality of service
 Increases understanding of customer base and enables better
decision making for everyone involved
Supply-chain spaghetti
Author

Funders
Submission
and Peer
Review
System

End User

Discovery
Service

Consortium
Consortium

Data
Providers and
Systems
(multiple)

Publisher

Online Host
or
Technology
Partner

Library

Fulfilment
House or
System

Subscription
Agent or
Sales Agent

Societies
…becomes organised, with accurate
data and information flow

Consortium
The vision
In an ideal world we would be able to utilise, provide and
obtain data that is accurate, complete and easily joined
together:
 Reducing problems and errors
 Providing better overall service
 Creating seamless processes
 Providing a better understanding of customers and our own
businesses
External linking – in the supply chain
 Using Identifiers will:
 Ensure accuracy of information

 Speed up data transactions
 Reduce queries
 Reduce costs
 Open data up to new uses
 Ensures that authors receive credit for the work they produce
 Ensures that end users receive uninterrupted access to the

content they need
A truly linked supply chain

Identifiers
Institutional Identifiers – which ones?
 JISC and CASRAI (Consortia Advancing Standards in Research

Administration Information) report on Organisation IDs:
http://repository.jisc.ac.uk/5381/1/CC549D0011.0_org_ID_landscape_study.pdf
 Examined the landscape of organisational identifiers in the UK
and identified 23 different IDs
 Based on interviews with key individuals
 Lots of detail on use cases for publishing, funders, and
institutions
CASRAI report
 Disambiguating organisational information from multiple

sources typically described as “a nightmare”
 Benefits from effective unique identifiers are truly realised
when data is shared
 Key aspects of identifiers that support the widest range of
uses:
 Governance
 Trust
 Transparency
 Temporal
 Appropriate metadata
●
●
●

●

●
●
●

●
●
●

●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●

●

●

●

●

●
●

●
●

●
●
●

●
●
●

●
●
●
●
●
●
●
●
●
●

Publishers

●
●

Funders

Companies

●

●

●
●

Curated

Regulated

Mainly
used for
linking

●
●
●

●

HEIs

Global
Global
Global
Global
Global
Global
Global
Global
Global
UK
UK
UK
UK
UK
UK
UK
UK
UK
UK
UK
UK
England
EU

Historic

Please note that ORCID had not
released the institutional
affiliation at the point at which
this report was published.

Identifier
Name
Dun & Bradstreet
FundRef
ISNI
ORCID
Ringgold's Identify
MACE & UK Federation
VIAF
Research Analytics
Companies House
Gateway to Research
Government bodies
HESA
IDBR
Janet
Je-S/CDR OrgID
Research Fish
RCUK
ROS
UCAS
UKPRN
HEFCE
PIC

Coverage

Identifiers identified

●
●
●
●

●
●

●

●
●

●
●
●

●

●

●
●
●

●

●
●

●

●

●
●

●
●
●
●
●

●

●

●

●
●

●
●

●

●

Publishers

●
●

Funders

Companies

●

HEIs

●

●

●
●

Curated

Regulated

●
●
●

●

Historic

Global
Global
Global
Global
Global
Global
Global
Global
Global

Mainly
used for
linking

Identifier
Name
Dun & Bradstreet
FundRef
ISNI
ORCID
Ringgold's Identify
MACE & UK Federation
VIAF
Research Analytics

Coverage

Global Identifiers
●
●
●
●

●
●

●

●
●

●

●
ISNI
ISNI Number

ISNI Number

Party ID 1

Party ID 2

Proprietary
Information and/or
Metadata

Proprietary
Information and/or
Metadata

 ISO Standard 27729
 ISNI is designed to be a

“bridge identifier”
 Covers any type of entity
ISNI IDs
 Ringgold is an ISNI Registration Agency for institutions
 Unique ISNI Institutional ID number can connect any data

and any systems
 ISNI IDs should be used by publishers and across the
scholarly supply chain to:
 Link systems using the ID numbers
 Link data sets which contain proprietary metadata
 Provide clean data transmission
ISNI spans all industries, market segments, and regions
Academia
Medical
Corporate
Government
Not-for-profit
Public libraries
Schools

Publishers
Funding bodies
Intermediaries
Distributors

http://isni.org/
What can YOU do now?
 Engage with the problems you have with data
 Find some resources – think about time not just money

 Consider how data could better serve your organisation
 Appoint a data champion and document everything
 Generate a data governance policy

 Create some basic rules for data entry
 Utilise universal identifiers to clean and link your data
 Work with suppliers and customers to utilise institutional

identifiers to strengthen the supply chain
Laura Cox
President, Chief Financial and Operating Officer
Ringgold Inc.
laura.cox@ringgold.com

More Related Content

What's hot

Mattocks Ont Pragebx Rr 2004 12 08
Mattocks Ont Pragebx Rr 2004 12 08Mattocks Ont Pragebx Rr 2004 12 08
Mattocks Ont Pragebx Rr 2004 12 08
Dr. Cupid Lucid
 

What's hot (20)

Preparing Data for Sharing: The FAIR Principles
Preparing Data for Sharing: The FAIR PrinciplesPreparing Data for Sharing: The FAIR Principles
Preparing Data for Sharing: The FAIR Principles
 
Role of metadata in transportation agency data programs
Role of metadata in transportation agency data programsRole of metadata in transportation agency data programs
Role of metadata in transportation agency data programs
 
Slawek Korea
Slawek KoreaSlawek Korea
Slawek Korea
 
FAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to PracticeFAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to Practice
 
FAIR data overview
FAIR data overviewFAIR data overview
FAIR data overview
 
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”
 
BioPharma and FAIR Data, a Collaborative Advantage
BioPharma and FAIR Data, a Collaborative AdvantageBioPharma and FAIR Data, a Collaborative Advantage
BioPharma and FAIR Data, a Collaborative Advantage
 
Federated Access Management: the Business Case
Federated Access Management: the Business CaseFederated Access Management: the Business Case
Federated Access Management: the Business Case
 
Semantic Applications for Financial Services
Semantic Applications for Financial ServicesSemantic Applications for Financial Services
Semantic Applications for Financial Services
 
Metadata and Analytics
Metadata and AnalyticsMetadata and Analytics
Metadata and Analytics
 
I T Evolution
I T  EvolutionI T  Evolution
I T Evolution
 
Dataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataDataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* Data
 
Metadata strategies for transportation agencies: An information management pe...
Metadata strategies for transportation agencies: An information management pe...Metadata strategies for transportation agencies: An information management pe...
Metadata strategies for transportation agencies: An information management pe...
 
Mendeley Data FAIR hackathon
Mendeley Data FAIR hackathonMendeley Data FAIR hackathon
Mendeley Data FAIR hackathon
 
THOR Workshop - Persistent Identifier Linking
THOR Workshop - Persistent Identifier LinkingTHOR Workshop - Persistent Identifier Linking
THOR Workshop - Persistent Identifier Linking
 
Semantics and linked data at astra zeneca
Semantics and linked data at astra zenecaSemantics and linked data at astra zeneca
Semantics and linked data at astra zeneca
 
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
 
Linked Data for Biopharma
Linked Data for BiopharmaLinked Data for Biopharma
Linked Data for Biopharma
 
The Pistoia Alliance Biology Domain Strategy April 2011
The Pistoia Alliance Biology Domain Strategy April 2011The Pistoia Alliance Biology Domain Strategy April 2011
The Pistoia Alliance Biology Domain Strategy April 2011
 
Mattocks Ont Pragebx Rr 2004 12 08
Mattocks Ont Pragebx Rr 2004 12 08Mattocks Ont Pragebx Rr 2004 12 08
Mattocks Ont Pragebx Rr 2004 12 08
 

Viewers also liked

Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016
Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016
Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016
Ringgold Inc
 

Viewers also liked (16)

Metadata Standards: A Golden Age Arrives? - Christine Orr at STM
Metadata Standards: A Golden Age Arrives? - Christine Orr at STMMetadata Standards: A Golden Age Arrives? - Christine Orr at STM
Metadata Standards: A Golden Age Arrives? - Christine Orr at STM
 
Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henr...
Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henr...Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henr...
Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henr...
 
Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016
Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016
Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016
 
Unique Identifiers for Business Partners: progress with ISNI, the Ringgold ID...
Unique Identifiers for Business Partners: progress with ISNI, the Ringgold ID...Unique Identifiers for Business Partners: progress with ISNI, the Ringgold ID...
Unique Identifiers for Business Partners: progress with ISNI, the Ringgold ID...
 
Nicolson
NicolsonNicolson
Nicolson
 
Using Data to Drive Discovery of New Scholarly Works
Using Data to Drive Discovery of New Scholarly WorksUsing Data to Drive Discovery of New Scholarly Works
Using Data to Drive Discovery of New Scholarly Works
 
Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...
Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...
Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...
 
Institutional Identifiers in Practice: Christine Orr at CESSE 2015
Institutional Identifiers in Practice: Christine Orr at CESSE 2015Institutional Identifiers in Practice: Christine Orr at CESSE 2015
Institutional Identifiers in Practice: Christine Orr at CESSE 2015
 
Emerging Standards: Data and Data Exchange in Scholarly Publishing
Emerging Standards: Data and Data Exchange in Scholarly PublishingEmerging Standards: Data and Data Exchange in Scholarly Publishing
Emerging Standards: Data and Data Exchange in Scholarly Publishing
 
Metadata & Standards in Scholarly Communication
Metadata & Standards in Scholarly CommunicationMetadata & Standards in Scholarly Communication
Metadata & Standards in Scholarly Communication
 
Connecting people, places and things
Connecting people, places and things Connecting people, places and things
Connecting people, places and things
 
Institutional Identifiers - Phil Nicolson at ALPSP 'Setting The Standard' 2015
Institutional Identifiers - Phil Nicolson at ALPSP 'Setting The Standard' 2015Institutional Identifiers - Phil Nicolson at ALPSP 'Setting The Standard' 2015
Institutional Identifiers - Phil Nicolson at ALPSP 'Setting The Standard' 2015
 
Spring Cleaning: Easy Ways to Tidy Your Customer Data
Spring Cleaning: Easy Ways to Tidy Your Customer DataSpring Cleaning: Easy Ways to Tidy Your Customer Data
Spring Cleaning: Easy Ways to Tidy Your Customer Data
 
Persistent Identifiers - The 5 Things You Need To Know
Persistent Identifiers - The 5 Things You Need To KnowPersistent Identifiers - The 5 Things You Need To Know
Persistent Identifiers - The 5 Things You Need To Know
 
Small Data, Big Benefits - Christine Orr at SSP 2016
Small Data, Big Benefits - Christine Orr at SSP 2016Small Data, Big Benefits - Christine Orr at SSP 2016
Small Data, Big Benefits - Christine Orr at SSP 2016
 
Ringgold User Group Meeting 2016 (USA)
Ringgold User Group Meeting 2016 (USA)Ringgold User Group Meeting 2016 (USA)
Ringgold User Group Meeting 2016 (USA)
 

Similar to Institutional Identifiers internally and throughout the supply chain

Getting the Most Out of Your E-Resources: Measuring Success
Getting the Most Out of Your E-Resources: Measuring SuccessGetting the Most Out of Your E-Resources: Measuring Success
Getting the Most Out of Your E-Resources: Measuring Success
kramsey
 
Managing Electronic Resources for Public Libraries: Part 2
Managing Electronic Resources for Public Libraries: Part 2Managing Electronic Resources for Public Libraries: Part 2
Managing Electronic Resources for Public Libraries: Part 2
ALATechSource
 

Similar to Institutional Identifiers internally and throughout the supply chain (20)

Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Found...
Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Found...Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Found...
Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Found...
 
Rubbish in Rubbish out: applying good data governance techniques to gain maxi...
Rubbish in Rubbish out: applying good data governance techniques to gain maxi...Rubbish in Rubbish out: applying good data governance techniques to gain maxi...
Rubbish in Rubbish out: applying good data governance techniques to gain maxi...
 
Ringgold Webinar Series: 1. Taking Stock – Commitment to Healthy Data
Ringgold Webinar Series: 1. Taking Stock – Commitment to Healthy DataRinggold Webinar Series: 1. Taking Stock – Commitment to Healthy Data
Ringgold Webinar Series: 1. Taking Stock – Commitment to Healthy Data
 
Crossref LIVE: The Benefits of Open Infrastructure (APAC time zones) - 29th O...
Crossref LIVE: The Benefits of Open Infrastructure (APAC time zones) - 29th O...Crossref LIVE: The Benefits of Open Infrastructure (APAC time zones) - 29th O...
Crossref LIVE: The Benefits of Open Infrastructure (APAC time zones) - 29th O...
 
Jonathan Breeze, Symplectic
Jonathan Breeze, SymplecticJonathan Breeze, Symplectic
Jonathan Breeze, Symplectic
 
BLC & Digital Science: Jonathan Breeze, Symplectic
BLC & Digital Science: Jonathan Breeze, SymplecticBLC & Digital Science: Jonathan Breeze, Symplectic
BLC & Digital Science: Jonathan Breeze, Symplectic
 
Using Machine Learning to Capture Data Meaning and Wrangle it to Liberate its...
Using Machine Learning to Capture Data Meaning and Wrangle it to Liberate its...Using Machine Learning to Capture Data Meaning and Wrangle it to Liberate its...
Using Machine Learning to Capture Data Meaning and Wrangle it to Liberate its...
 
Expertise and Resource Portals for University-Industry Engagement - Jeff Horon
Expertise and Resource Portals for University-Industry Engagement - Jeff HoronExpertise and Resource Portals for University-Industry Engagement - Jeff Horon
Expertise and Resource Portals for University-Industry Engagement - Jeff Horon
 
Managing Electronic Resources for Public Libraries, Part 1
Managing Electronic Resources for Public Libraries, Part 1Managing Electronic Resources for Public Libraries, Part 1
Managing Electronic Resources for Public Libraries, Part 1
 
ORCID Use Cases from the CDL
ORCID Use Cases from the CDLORCID Use Cases from the CDL
ORCID Use Cases from the CDL
 
What ARE we thinking? Collections decisions in an Academic Library
What ARE we thinking? Collections decisions in an Academic LibraryWhat ARE we thinking? Collections decisions in an Academic Library
What ARE we thinking? Collections decisions in an Academic Library
 
Too Difficult
Too DifficultToo Difficult
Too Difficult
 
Getting the Most Out of Your E-Resources: Measuring Success
Getting the Most Out of Your E-Resources: Measuring SuccessGetting the Most Out of Your E-Resources: Measuring Success
Getting the Most Out of Your E-Resources: Measuring Success
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data Management
 
The FAIR Principles and FAIRsharing
The FAIR Principles and FAIRsharingThe FAIR Principles and FAIRsharing
The FAIR Principles and FAIRsharing
 
ERM: how it all fits together
ERM: how it all fits togetherERM: how it all fits together
ERM: how it all fits together
 
Dart net intro 2015 tw
Dart net intro 2015 twDart net intro 2015 tw
Dart net intro 2015 tw
 
Building Communities of “Trust”
 Building Communities of “Trust” Building Communities of “Trust”
Building Communities of “Trust”
 
The future of scholarly publishing under digital transformation data, ai an...
The future of scholarly publishing under digital transformation   data, ai an...The future of scholarly publishing under digital transformation   data, ai an...
The future of scholarly publishing under digital transformation data, ai an...
 
Managing Electronic Resources for Public Libraries: Part 2
Managing Electronic Resources for Public Libraries: Part 2Managing Electronic Resources for Public Libraries: Part 2
Managing Electronic Resources for Public Libraries: Part 2
 

More from Ringgold Inc

More from Ringgold Inc (6)

Identify Database User Group Meeting 2017 UK
Identify Database User Group Meeting 2017 UKIdentify Database User Group Meeting 2017 UK
Identify Database User Group Meeting 2017 UK
 
Using your Data to Drive Revenue – Laura Cox at London Book Fair 2018
Using your Data to Drive Revenue – Laura Cox at London Book Fair 2018 Using your Data to Drive Revenue – Laura Cox at London Book Fair 2018
Using your Data to Drive Revenue – Laura Cox at London Book Fair 2018
 
Ringgold Webinar Series: 4. 30-Minute Workout - Quick Tips for Better Custome...
Ringgold Webinar Series: 4. 30-Minute Workout - Quick Tips for Better Custome...Ringgold Webinar Series: 4. 30-Minute Workout - Quick Tips for Better Custome...
Ringgold Webinar Series: 4. 30-Minute Workout - Quick Tips for Better Custome...
 
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
 
Identify database
Identify databaseIdentify database
Identify database
 
CDO
CDOCDO
CDO
 

Recently uploaded

Recently uploaded (20)

Instant Digital Issuance: An Overview With Critical First Touch Best Practices
Instant Digital Issuance: An Overview With Critical First Touch Best PracticesInstant Digital Issuance: An Overview With Critical First Touch Best Practices
Instant Digital Issuance: An Overview With Critical First Touch Best Practices
 
Distribution Ad Platform_ The Role of Distribution Ad Network.pdf
Distribution Ad Platform_ The Role of  Distribution Ad Network.pdfDistribution Ad Platform_ The Role of  Distribution Ad Network.pdf
Distribution Ad Platform_ The Role of Distribution Ad Network.pdf
 
2024 Social Trends Report V4 from Later.com
2024 Social Trends Report V4 from Later.com2024 Social Trends Report V4 from Later.com
2024 Social Trends Report V4 from Later.com
 
Aiizennxqc Digital Marketing | SEO & SMM
Aiizennxqc Digital Marketing | SEO & SMMAiizennxqc Digital Marketing | SEO & SMM
Aiizennxqc Digital Marketing | SEO & SMM
 
HOW TO HANDLE SALES OBJECTIONS | SELLING AND NEGOTIATION
HOW TO HANDLE SALES OBJECTIONS | SELLING AND NEGOTIATIONHOW TO HANDLE SALES OBJECTIONS | SELLING AND NEGOTIATION
HOW TO HANDLE SALES OBJECTIONS | SELLING AND NEGOTIATION
 
The Impact Of Social Media Advertising.pdf
The Impact Of Social Media Advertising.pdfThe Impact Of Social Media Advertising.pdf
The Impact Of Social Media Advertising.pdf
 
Aligarh Hire 💕 8250092165 Young and Hot Call Girls Service Agency Escorts
Aligarh Hire 💕 8250092165 Young and Hot Call Girls Service Agency EscortsAligarh Hire 💕 8250092165 Young and Hot Call Girls Service Agency Escorts
Aligarh Hire 💕 8250092165 Young and Hot Call Girls Service Agency Escorts
 
Gain potential customers through Lead Generation
Gain potential customers through Lead GenerationGain potential customers through Lead Generation
Gain potential customers through Lead Generation
 
Elevating Your Digital Presence by Evitha.pdf
Elevating Your Digital Presence by Evitha.pdfElevating Your Digital Presence by Evitha.pdf
Elevating Your Digital Presence by Evitha.pdf
 
[Expert Panel] New Google Shopping Ads Strategies Uncovered
[Expert Panel] New Google Shopping Ads Strategies Uncovered[Expert Panel] New Google Shopping Ads Strategies Uncovered
[Expert Panel] New Google Shopping Ads Strategies Uncovered
 
The Art of sales from fictional characters.
The Art of sales from fictional characters.The Art of sales from fictional characters.
The Art of sales from fictional characters.
 
Social Media Marketing Portfolio - Maharsh Benday
Social Media Marketing Portfolio - Maharsh BendaySocial Media Marketing Portfolio - Maharsh Benday
Social Media Marketing Portfolio - Maharsh Benday
 
The 9th May Incident in Pakistan A Turning Point in History.pptx
The 9th May Incident in Pakistan A Turning Point in History.pptxThe 9th May Incident in Pakistan A Turning Point in History.pptx
The 9th May Incident in Pakistan A Turning Point in History.pptx
 
W.H.Bender Quote 61 -Influential restaurant and food service industry network...
W.H.Bender Quote 61 -Influential restaurant and food service industry network...W.H.Bender Quote 61 -Influential restaurant and food service industry network...
W.H.Bender Quote 61 -Influential restaurant and food service industry network...
 
Unveiling the Legacy of the Rosetta stone A Key to Ancient Knowledge.pptx
Unveiling the Legacy of the Rosetta stone A Key to Ancient Knowledge.pptxUnveiling the Legacy of the Rosetta stone A Key to Ancient Knowledge.pptx
Unveiling the Legacy of the Rosetta stone A Key to Ancient Knowledge.pptx
 
Social Media Marketing Portfolio - Maharsh Benday
Social Media Marketing Portfolio - Maharsh BendaySocial Media Marketing Portfolio - Maharsh Benday
Social Media Marketing Portfolio - Maharsh Benday
 
Alpha Media March 2024 Buyers Guide.pptx
Alpha Media March 2024 Buyers Guide.pptxAlpha Media March 2024 Buyers Guide.pptx
Alpha Media March 2024 Buyers Guide.pptx
 
TAM_AdEx-Cross_Media_Report-Banking_Finance_Investment_(BFSI)_2023.pdf
TAM_AdEx-Cross_Media_Report-Banking_Finance_Investment_(BFSI)_2023.pdfTAM_AdEx-Cross_Media_Report-Banking_Finance_Investment_(BFSI)_2023.pdf
TAM_AdEx-Cross_Media_Report-Banking_Finance_Investment_(BFSI)_2023.pdf
 
VIP Call Girls Dongri WhatsApp +91-9833363713, Full Night Service
VIP Call Girls Dongri WhatsApp +91-9833363713, Full Night ServiceVIP Call Girls Dongri WhatsApp +91-9833363713, Full Night Service
VIP Call Girls Dongri WhatsApp +91-9833363713, Full Night Service
 
Crypto Quantum Leap - Digital - membership area
Crypto Quantum Leap -  Digital - membership areaCrypto Quantum Leap -  Digital - membership area
Crypto Quantum Leap - Digital - membership area
 

Institutional Identifiers internally and throughout the supply chain

  • 1. ALPSP Seminar: Data the universe and everything 22nd January 2014 Laura Cox
  • 2. Contents 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. Healthy data Why now? Institutional Identifiers – what and why? Common data problems Data governance Data integration (linking your data together) Institutional Identifiers in the supply chain Institutional Identifiers – which? ISNI IDs What you can do now?
  • 3.
  • 4. Why is healthy data important? Good quality, healthy data can be utilized to gain insight into customers, business relationships and to support strategic planning, decision making, and ongoing business operations. But when it’s unhealthy….
  • 5. Poor data has real consequences  Hard to get a true picture of relationships with institutions  Lack of quality author (and affiliation) data  Inability to see overlap between authors, members and customers  Inaccurate holdings and revenue reports  Protracted time and effort taken to analyse data Everything becomes more difficult, and less accurate
  • 6. Healthy records are:  Complete  Accurate  Free of duplicates  Current  Consistent  Conform to standards
  • 7. Unique Identifiers What are they? How can they help?  Numeric or alpha-numeric designations which are associated with      a single entity Entities can be an institution, person, or piece of content Enable the disambiguation of each entity Proper understanding of the customer, author, reader or institution Proper identification of content object, article, product, or package Can be used internally or in conjunction with external partners
  • 8.
  • 9. Why we should worry about data now?  Number of researchers increasing by 3% per annum*  Number of articles increasing by 3% per annum, current output is 1.8-1.9 million per year*  Number of journals increasing by 3.5% per annum*  Growth in China has been in double digits for over 15 years*  Increased demand for anytime/anywhere access  Library budgets are frozen or being cut, less money for more content means we have to work smarter * Ware, M and Mabe, M, The STM Report, 2012
  • 10.
  • 11. What are Institutional Identifiers for? Disambiguating:  UCL:  University College London (UK)  Université Catholique de Louvain (Belgium)  Universidad Cristiana Latinoamericana (Ecuador)  University College Lillebælt (Denmark)  Centro Universitario Celso Lisboa (Brazil)  Union County Library (USA)  NPL:  National Physical Laboratory (UK)  National Physical Laboratory (India)  York University  University of York (UK)  York University (Canada)  Northeastern University:  Northeastern University (Boston, USA)  Northeastern University (Shenyang, China)
  • 12. What are Institutional Identifiers for? Consolidating: Hierarchy View:  University of Oxford  University of Northampton  Univ. Oxford  Northampton Business School  Oxford University  School of Education  Library, Oxford Univ.  Radcliffe Science Library  School of Health  School of Science and Technology   Bodleian Library   Bodleian, Oxford   Oxford, University of  Division of Computing Division of Engineering Environmental & Geographical Sciences Institute for Creative Leather Technologies  School of Social Sciences  School of The Arts
  • 13. Use cases – the why Identifiers enforce uniqueness  Disambiguate institutional records  Eradicate duplication of data  Ensure correct delivery, entitlements and access rights  Better understand your customer base and relationships with institutions  Improve “trust” in data  Map institutions into their hierarchy
  • 14.
  • 15. Common data problems  Most publishers have problems with data:  Multiple accounts for each customer  Multiple internal IT systems for different purposes  Data entry without standard names or ID numbers  Lack of hierarchy information  No formal manner to track customers across systems
  • 16. The challenge: Data Sources  Multiple data sources – ‘system’ data silos  Multiple locations – ‘geographic’ data silos  Data entered by different people for different purposes  Data from third parties in the supply chain  Data from bought-in sources
  • 17. The challenge: Data Sources Typical publisher systems: Data can be entered by:  Financial system  Organisation staff  CRM/Sales database  Authors  Authentication system  Society members  Fulfilment  Agents in the supply chain  Usage statistics  Submissions system  Author database  Document Storage (contracts and licences)  …..  3rd party organisations  …..
  • 18.
  • 19. Implementing a data governance plan  Important considerations:  What data is held, where it is held and how it is accessed?  How can the data be used to further benefit different     departments, processes or activities? Could the use of current or planned systems be expanded for further benefit? Is data highly accurate and consolidated or in need of cleansing? Are there applications of data that have not been explored? What requirements are there for additional data?
  • 20. Improve data capture  If you can – use web forms  Implement required fields  Data validation – at a minimum use naming conventions  Address validation – postcode lookup  Institution validation – institution lookup  Web form consistency across systems  Avoid free-text fields  Make institutional identifiers a requirement
  • 21. Implementing Institutional IDs Turn your records from this….. …..into this.
  • 22.
  • 23. Data integration CRM  Using Institutional Identifiers to link internal systems: Electronic document storage Financial System  Prevent duplicate account      creation Break down silos Keep data up-to-date and systems synchronised Enable staff to use data more effectively Simplify data transmission Improve overall data quality Authentication Institutional Identifiers Membership system Usage statistics Author Database Fulfilment system
  • 24. Linking author and institution IDs  When authors and their affiliations are linked correctly, publishers gain:  Market intelligence about authors and institutions  Author and subscriber information mapped together  Knowledge of where research funding is concentrated  Reduction in time taken calculating open access charges (APCs)  Institutions gain information about their overall research output  Funders gain information about where authors reside and publish
  • 25.
  • 26. The scholarly supply chain  Purpose:  Serving the author and reader  Disseminate content as widely as possible  Ensure content is easily discoverable  Provide information in an efficient and trouble-free manner regardless of:    Content type User requirements Desired methods of access
  • 27. The supply chain (simple version) Author Funders Submission and Peer Review System End User Discovery Service Consortium Consortium Data Providers and Systems (multiple) Publisher Online Host or Technology Partner Library Fulfilment House or System Subscription Agent or Sales Agent Societies
  • 28. Supply-chain spaghetti Author Funders Submission and Peer Review System End User Discovery Service Consortium Consortium Data Providers and Systems (multiple) Publisher Online Host or Technology Partner Library Fulfilment House or System Subscription Agent or Sales Agent Societies
  • 29. What could possibly go wrong?  Records are unconnected through the supply chain, links fail:  Between entities  Between internal systems  Between external systems        Renewals are mishandled Journal transfers are mishandled Access and authentication is mishandled Authors and individuals are not linked to their institution Open access fees have to be checked manually Authors are not linked to their research Funders are not linked to the research they fund
  • 30. Where stronger links are needed  Finding a path to using standardized data, which:  Eradicates duplicate records within and between systems  Enables seamless communication between organizations  Smoothes the supply chain, removing ambiguity or lack of information for any party  Enables higher quality of service  Increases understanding of customer base and enables better decision making for everyone involved
  • 31. Supply-chain spaghetti Author Funders Submission and Peer Review System End User Discovery Service Consortium Consortium Data Providers and Systems (multiple) Publisher Online Host or Technology Partner Library Fulfilment House or System Subscription Agent or Sales Agent Societies
  • 32. …becomes organised, with accurate data and information flow Consortium
  • 33. The vision In an ideal world we would be able to utilise, provide and obtain data that is accurate, complete and easily joined together:  Reducing problems and errors  Providing better overall service  Creating seamless processes  Providing a better understanding of customers and our own businesses
  • 34. External linking – in the supply chain  Using Identifiers will:  Ensure accuracy of information  Speed up data transactions  Reduce queries  Reduce costs  Open data up to new uses  Ensures that authors receive credit for the work they produce  Ensures that end users receive uninterrupted access to the content they need
  • 35. A truly linked supply chain Identifiers
  • 36.
  • 37. Institutional Identifiers – which ones?  JISC and CASRAI (Consortia Advancing Standards in Research Administration Information) report on Organisation IDs: http://repository.jisc.ac.uk/5381/1/CC549D0011.0_org_ID_landscape_study.pdf  Examined the landscape of organisational identifiers in the UK and identified 23 different IDs  Based on interviews with key individuals  Lots of detail on use cases for publishing, funders, and institutions
  • 38. CASRAI report  Disambiguating organisational information from multiple sources typically described as “a nightmare”  Benefits from effective unique identifiers are truly realised when data is shared  Key aspects of identifiers that support the widest range of uses:  Governance  Trust  Transparency  Temporal  Appropriate metadata
  • 39. ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● Publishers ● ● Funders Companies ● ● ● ● Curated Regulated Mainly used for linking ● ● ● ● HEIs Global Global Global Global Global Global Global Global Global UK UK UK UK UK UK UK UK UK UK UK UK England EU Historic Please note that ORCID had not released the institutional affiliation at the point at which this report was published. Identifier Name Dun & Bradstreet FundRef ISNI ORCID Ringgold's Identify MACE & UK Federation VIAF Research Analytics Companies House Gateway to Research Government bodies HESA IDBR Janet Je-S/CDR OrgID Research Fish RCUK ROS UCAS UKPRN HEFCE PIC Coverage Identifiers identified ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●
  • 41.
  • 42. ISNI ISNI Number ISNI Number Party ID 1 Party ID 2 Proprietary Information and/or Metadata Proprietary Information and/or Metadata  ISO Standard 27729  ISNI is designed to be a “bridge identifier”  Covers any type of entity
  • 43. ISNI IDs  Ringgold is an ISNI Registration Agency for institutions  Unique ISNI Institutional ID number can connect any data and any systems  ISNI IDs should be used by publishers and across the scholarly supply chain to:  Link systems using the ID numbers  Link data sets which contain proprietary metadata  Provide clean data transmission
  • 44. ISNI spans all industries, market segments, and regions Academia Medical Corporate Government Not-for-profit Public libraries Schools Publishers Funding bodies Intermediaries Distributors http://isni.org/
  • 45.
  • 46. What can YOU do now?  Engage with the problems you have with data  Find some resources – think about time not just money  Consider how data could better serve your organisation  Appoint a data champion and document everything  Generate a data governance policy  Create some basic rules for data entry  Utilise universal identifiers to clean and link your data  Work with suppliers and customers to utilise institutional identifiers to strengthen the supply chain
  • 47. Laura Cox President, Chief Financial and Operating Officer Ringgold Inc. laura.cox@ringgold.com