Slides for the webinar: Access the world’s research outputs through the CORE API, 13th January 2022.
Link to the webinar video: https://youtu.be/acRLJNpq4W4
In this webinar, we present our new CORE APIv3.
Presenters Petr Knoth and Matteo Cancellieri walk you through the new features.
At a glance, the new APIv3 offers:
- An extended model of the CORE resources to link different versions of a paper.
- Support for collecting medium-size datasets.
- Improved analytical tools.
- User management made easier.
- Better documentation.
- A gallery to kick start your journey with the API.
The webinar also contains a quick demo showing the API features and tries to answer the question "Did research stop during COVID?"
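For a taste of how such a question can be approached with the API, here is a minimal sketch in Python (not the presenters' actual demo code; the endpoint, header, and response field names are assumptions based on the APIv3 documentation) that counts indexed works per publication year:

    import requests

    API_KEY = "YOUR_API_KEY"  # placeholder: register at https://core.ac.uk/services/api
    BASE_URL = "https://api.core.ac.uk/v3"  # assumed APIv3 base URL

    def works_in_year(year):
        # Ask for a single result; we only need the total hit count.
        response = requests.get(
            f"{BASE_URL}/search/works",
            headers={"Authorization": f"Bearer {API_KEY}"},
            params={"q": f"yearPublished:{year}", "limit": 1},
        )
        response.raise_for_status()
        return response.json()["totalHits"]  # assumed response field name

    for year in range(2017, 2022):
        print(year, works_in_year(year))

Comparing the counts for 2019-2021 gives a first, crude answer to the webinar's question.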
Introduction to Data Science and Analytics (Srinath Perera)
This webinar serves as an introduction to WSO2 Summer School. It discusses how to build an analytics pipeline for your organization and, for each use case, the technology and tooling choices that need to be made.
This session will explore analytics under four themes:
Hindsight (what happened)
Oversight (what is happening)
Insight (why is it happening)
Foresight (what will happen)
Recording http://t.co/WcMFEAJHok
Getting real-time analytics for device, application, and business monitoring from trillions of events and petabytes of data, as companies such as Netflix, Uber, Alibaba, PayPal, eBay, and Metamarkets do.
Big data today is a new challenge to be managed, not a barrier to business growth. Data storage is relatively inexpensive, and with more transactions generated from social media, machines, and sensors, data has grown piece by piece into petabytes.
These slides explain the challenges of Big Data (Volume, Velocity, and Variety) and offer solutions for managing them.
Many tools can help solve these problems, but the main focus of these slides is Apache Hadoop.
DataOps is the transformation of data processing from a craft with manual processes to an automated data factory. Lean principles, which have proven successful in manufacturing, are equally applicable for data factories. We will describe how lean principles can be applied in practice for successful data processing.
Drug and Vaccine Discovery: Knowledge Graph + Apache Spark (Databricks)
RDF, Knowledge Graphs, and ontologies enable companies to produce and consume graph data that is interoperable, sharable, and self-describing. GSK has set out to build the world's largest medical knowledge graph, both to give its scientists access to the world's medical knowledge and to enable machine learning to infer links between facts.
These inferred links are at the heart of gene-to-disease mapping and are the future of discovering new treatments and vaccines. To power RDF sub-graphing, GSK has developed a set of open-source libraries codenamed "Project Bellman" that enable SPARQL queries over partitioned RDF data in Apache Spark.
These tools scale up to SPARQL querying over trillions of RDF triples, provide point-in-time queries, and provide incremental data updates to downstream consumer applications. They are used both by GSK's AI/ML team to discover gene-to-disease mappings and by GSK's scientists to query over the world's medical knowledge.
3 pillars of big data: structured data, semi structured data and unstructure... (PROWEBSCRAPER)
There are 3 pillars of Big Data:
1. Structured data
2. Unstructured data
3. Semi-structured data
Businesses worldwide construct their empires on these three pillars and capitalize on their limitless potential.
HPC + AI: Machine Learning Models in Scientific Computing (inside-BigData.com)
In this video from the 2019 Stanford HPC Conference, Steve Oberlin from NVIDIA presents: HPC + AI: Machine Learning Models in Scientific Computing.
"Most AI researchers and industry pioneers agree that the wide availability and low cost of highly-efficient and powerful GPUs and accelerated computing parallel programming tools (originally developed to benefit HPC applications) catalyzed the modern revolution in AI/deep learning. Clearly, AI has benefited greatly from HPC. Now, AI methods and tools are starting to be applied to HPC applications to great effect. This talk will describe an emerging workflow that uses traditional numeric simulation codes to generate synthetic data sets to train machine learning algorithms, then employs the resulting AI models to predict the computed results, often with dramatic gains in efficiency, performance, and even accuracy. Some compelling success stories will be shared, and the implications of this new HPC + AI workflow on HPC applications and system architecture in a post-Moore’s Law world considered."
Watch the video: https://youtu.be/SV3cnWf39kc
Learn more: https://nvidia.com
and
http://hpcadvisorycouncil.com/events/2019/stanford-workshop/
Sign up for our insideHPC Newsletter: http://insidehpc.com/newsletter
The right architecture is key for any IT project. This is especially the case for big data projects, where there are no standard architectures that have proven their suitability over the years. This session discusses the different big data architectures that have evolved over time, including the traditional Big Data Architecture, the Streaming Analytics architecture, and the Lambda and Kappa architectures, and presents the mapping of components from both open source and the Oracle stack onto these architectures.
Data Mining For Supermarket Sale Analysis Using Association Rule (ijtsrd)
Data mining is the technology of discovering important information in data repositories and is widely used in almost all fields. Mining of databases has recently become essential because of the growing amount of data, and it has wide applicability in retail industries for improving marketing strategies. Analysis of past transaction data can provide very valuable information on customer behavior and business decisions. The amount of data stored grows twice as fast as the speed of the fastest processor available to analyze it. The main purpose is to find association relationships among the large number of database items, which are used to describe the patterns of customer purchases in the supermarket. This is presented in this paper. Rajeshri Shelke, "Data Mining For Supermarket Sale Analysis Using Association Rule", published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-1, Issue-4, June 2017. URL: http://www.ijtsrd.com/papers/ijtsrd94.pdf http://www.ijtsrd.com/engineering/computer-engineering/94/data-mining-for-supermarket-sale-analysis-using-association-rule/rajeshri-shelke
Archives work is messy -- in many cases archivists have to organize and make accessible large amounts of mixed data in a variety of formats, both physical and digital. Thankfully, there are a variety of technology tools available to help solve the messiness problem and make collections more accessible. In this session, audience members will learn about current and emerging archival technology tools, the pros and cons of the major tools, and resources for further education.
Apache Hive is a data warehousing system for large volumes of data stored in Hadoop. However, the data is useless unless you can use it to add value to your company. Hive provides a SQL-based query language that dramatically simplifies the process of querying your large data sets. That is especially important while your data scientists are developing and refining their queries to improve their understanding of the data. In many companies, such as Facebook, Hive accounts for a large percentage of the total MapReduce queries that are run on the system. Although Hive makes writing large data queries easier for the user, there are many performance traps for the unwary. Many of them are artifacts of the way Hive has evolved over the years and the requirement that the default behavior must be safe for all users. This talk will present examples of how Hive users have made mistakes that made their queries run much much longer than necessary. It will also present guidelines for how to get better performance for your queries and how to look at the query plan to understand what Hive is doing.
In this one day workshop, we will introduce Spark at a high level context. Spark is fundamentally different than writing MapReduce jobs so no prior Hadoop experience is needed. You will learn how to interact with Spark on the command line and conduct rapid in-memory data analyses. We will then work on writing Spark applications to perform large cluster-based analyses including SQL-like aggregations, machine learning applications, and graph algorithms. The course will be conducted in Python using PySpark.
Presentation of the CORE APIv3 which provides seamless programmable access to the metadata and content from across the global repositories network delivered at Open Repositories 2022.
OpenAIRE Content Providers Community Call, July 1st, 2020
This call focused on data repositories, namely the OpenAIRE Research Graph and Data Repositories, the OpenAIRE Content Acquisition Policy, and the Guidelines for Data Archive Managers.
It was also an opportunity to share the most recent updates and novelties in the OpenAIRE Content Provider Dashboard, and to get feedback from the community.
Follow the Community activities at https://www.openaire.eu/provide-community-calls
Conference "Opening Science to Meet Future Challenges", Warsaw, March 11, 2014, organized by the Interdisciplinary Centre for Mathematical and Computational Modelling, University of Warsaw.
How serendipitous is discovery for users? Like many a teenager, OpenURL linking can behave inappropriately. What can we do to smooth out the bumps on the road and what other tools are available? This breakout session will walk swiftly through linking to discovery targets, from OpenURL 0.1/1.0, to Index-Enhanced Direct Linking, Link 2.0 and beyond …
Presented by Michael Victor, Abenet Yabowork, Jane Poole, Harrison Njamba, Erick Rutto and Peter Ballantyne at the ILRI open access week workshop, ILRI, Nairobi, 23-25 October 2019
Enabling better science - Results and vision of the OpenAIRE infrastructure a... (Paolo Manghi)
Enabling better science: presentation on the results and vision of the OpenAIRE infrastructure and RDA Publishing Data Services Working Group in this direction.
This presentation was provided by Karen Hawkins of IEEE during the NISO event "Next Generation Discovery Tools: New Tools, Aging Standards," held March 27 - March 28, 2008.
UK e-Infrastructure: Widening Access, Increasing Participation (Neil Chue Hong)
A talk given at the ICHEC Annual Seminar by Neil Chue Hong, reflecting on the rise of Grid and Web 2.0, and how this might enable increased participation and use of computing infrastructure for e-Science and research.
Access the world’s research outputs through the CORE API
1. Access the world’s research outputs
through the CORE API
Petr Knoth, Matteo Cancellieri, Knowledge Media Institute, The Open University
https://core.ac.uk
https://core.ac.uk/services/api
https://bit.ly/core-apiv3
@oacore
2. Outline
• What can you do with the CORE API?
• Lessons learned from v2 and new features in v3
• Live tutorial: Did research stop during COVID?
Questions? https://bit.ly/core-apiv3
3. CORE's mission
CORE's mission is to aggregate all open access research worldwide and deliver unrestricted access for all.
In doing so, we:
● enrich scholarly data using state-of-the-art text and data mining technologies to aid discoverability,
● enable others to develop new tools and use cases on top of the CORE platform,
● support the network of open access repositories and journals with innovative technical solutions, and
● facilitate a scalable, cost-effective route for the delivery of open scholarship.
Questions? https://bit.ly/core-apiv3
4. Metadata records: 218,808,331
Full texts hosted directly by CORE: 28,468,748
Free to read links to full text papers: ~97 million
Data providers: 10,372
Countries: > 90
Languages: 250
Questions? https://bit.ly/core-apiv3
5. CORE services
Content discovery: Search, Discovery, Recommender
Raw data services: API, Dataset, FastSync
Managing content: Repository Dashboard, Repository Edition
Questions? https://bit.ly/core-apiv3
6. What's new on the CORE API
● An extended model of the CORE resources to link different versions of a paper.
● Support for collecting medium-size datasets.
● Improved analytical tools.
● User management made easier.
● Better documentation.
● A gallery to kick start your journey with the API.
Questions? https://bit.ly/core-apiv3
7. CORE API: where are we?
🖋 Documentation in Swagger
🖋 PHP + Symfony implementation
🚀 Elasticsearch
API clients:
• Java https://github.com/oacore/oacore4j
• Python https://github.com/oacore/pyoacore
• R https://github.com/ropensci/rcoreoa
> 2,500 registered users, 252 active users (in the last two months)
Questions? https://bit.ly/core-apiv3
8. How CORE sees the world
Works
A deduplicated and polished item, built from the best metadata we can use from multiple articles from different sources; it includes enrichments. A work links 1...n versions.
Article (old name) / Output (new name)
Data coming directly from the data providers, mostly via OAI-PMH but also from other kinds of data providers. The data is made uniform, so all the different data providers lead to a single metadata format.
Data provider
Contains repositories (institutional and disciplinary), preprint servers, journals and publishers. Data providers and journals each contain outputs.
Journal
This dataset contains all journal titles included in the CORE collection.
Questions? https://bit.ly/core-apiv3
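As a rough illustration of this model, a sketch in Python (the field names below are illustrative assumptions, not the official schema):

    from dataclasses import dataclass, field
    from typing import List

    @dataclass
    class Output:
        """One version of a paper as delivered by a single data provider
        (an "article" in the old naming)."""
        id: str
        title: str
        data_provider: str

    @dataclass
    class Work:
        """A deduplicated, enriched paper: links 1...n outputs (versions)
        and carries the best metadata drawn from them."""
        id: str
        title: str
        versions: List[Output] = field(default_factory=list)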
9. Improved search queries
(("Neural networks" AND yearPublished<=2018) OR (title:"deep learning" AND yearPublished>2019)) AND _exists_:doi
+ better sorting
+ better filtering
Questions? https://bit.ly/core-apiv3
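The query on this slide can be sent as an ordinary q parameter to the search endpoint; a hedged sketch (endpoint and response field names are assumptions based on the APIv3 documentation):

    import requests

    query = (
        '(("Neural networks" AND yearPublished<=2018) '
        'OR (title:"deep learning" AND yearPublished>2019)) '
        'AND _exists_:doi'
    )
    response = requests.get(
        "https://api.core.ac.uk/v3/search/works",  # assumed endpoint
        headers={"Authorization": "Bearer YOUR_API_KEY"},  # placeholder key
        params={"q": query, "limit": 10},
    )
    response.raise_for_status()
    for work in response.json()["results"]:  # assumed response field name
        print(work.get("yearPublished"), work.get("title"))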
10. Large dataset access
The API now supports querying for medium-size datasets (1,000-100,000 records) through the scroll parameter. For large datasets (>100,000 records), consider the CORE dataset.
Questions? https://bit.ly/core-apiv3
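A sketch of what scrolling might look like in practice (the scroll and scrollId names follow the slide's mention of a scroll parameter plus common Elasticsearch-style conventions; treat them as assumptions and check the API reference):

    import requests

    HEADERS = {"Authorization": "Bearer YOUR_API_KEY"}  # placeholder key
    BASE_URL = "https://api.core.ac.uk/v3"  # assumed APIv3 base URL

    # The first request opens a scroll cursor; later requests pass it back
    # instead of re-running the query, which keeps paging stable.
    params = {"q": "covid", "limit": 1000, "scroll": "true"}
    records = []
    while True:
        data = requests.get(f"{BASE_URL}/search/works",
                            headers=HEADERS, params=params).json()
        results = data.get("results", [])
        records.extend(results)
        scroll_id = data.get("scrollId")  # assumed cursor field
        if not results or not scroll_id:
            break
        params = {"scrollId": scroll_id, "limit": 1000}

    print(len(records), "records collected")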
11. Better analytical tools (coming soon): CORE Analytics
Meaningful statistics for all the entities in CORE
Search aggregation to help you orientate while searching
Questions? https://bit.ly/core-apiv3
15. Feedback
Please cite CORE https://core.ac.uk/about/research-outputs
Show us how you are using the API
Let us know what you think
Questions? https://bit.ly/core-apiv3
Not-for-profit service run by The Open University with the support of Jisc. CORE aggregates outputs from around the world, but also acts as the UK's national aggregator for research outputs.
Focus on the languages and language detection.
Open, comprehensive, free, seamless
Jointly funded service between The Open University and Jisc
Global aggregator of full text and metadata (over 200 million metadata records from 10k repositories and 40 million active users)