1. VTT, Espoo (Finland) 11-12 May 2015
YEAR Annual Conference 2015: Open Science in Horizon 2020
This work is licensed under a Creative Commons Attribution 4.0
Dr. Nancy Pontika
Connecting Repositories (CORE)
Knowledge Media Institute
Open University
Twitter: @oacore
Putting Open Access into Practice
2. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Outline
* Introduction to Open Access (OA)
* The need for aggregating open access content
* The CORE system
3. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Outline
* Introduction to Open Access (OA)
* The need for aggregating open access content
* The CORE system
4. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Research Cycle
Research
Idea
Receives
funding
Research
Conduction
Publication
5. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Who does research?
6. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
How much does it cost to access it?
7. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Who has access?
8. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
What is Open Access (OA)?
By “open access” to [peer-reviewed research
literature], we mean its free availability on the
public internet, permitting any users to read,
download, copy, distribute, print, search, or link to
the full texts of these articles, crawl them for
indexing, pass them as data to software, or use
them for any other lawful purpose, without
financial, legal, or technical barriers other than
those inseparable from gaining access to the
internet itself.
[BOAI, 2002]
9. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
OA Routes : open access repositories
* Do NOT perform
peer-review
* Pre-prints, post-
prints, final version
* Standardized: OAI-
PMH compatible
Repositories
Green Route
Institutional Subject
Open Research
Online (ORO)
arXiv.org
10. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
OA Routes : open access journals
Gold route
* Open Access Journals offer peer-reviewed research, pure open
access journals. Sometimes they charge an Article Processing Charge
(APC), sometimes they do not.
* Subscription based journals that offer an open route- hybrid
journals- always charge Article Processing Charges (APCs)
11. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Creative Commons Licenses
12. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
OA
Growth
13. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Growth of items in OA repositories
14. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Records stored across all OARs
164,259,752 records across 2,531
repositories as estimated by OpenDOAR
[December, 2013 -http://www.opendoar.org/]
15. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
HEFCE OA policy
Higher market share for OA content imminent!
16. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Access + reuse = Open Access
http://www.phdcomics.com/comics.php?
f=1533
17. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Outline
* Introduction to Open Access (OA)
* The need for aggregating open access content
* The CORE system
18. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
COAR: About harvesting and aggregations
“Each individual repository is of limited value for
research: the real power of Open Access lies in the
possibility of connecting and tying together
repositories, which is why we need interoperability.
In order to create a seamless layer of content
through connected repositories from around the
world, Open Access relies on interoperability, the
ability for systems to communicate with each other
and pass information back and forth in a usable
format. Interoperability allows us to exploit today's
computational power so that we can aggregate,
data mine, create new tools and services, and
generate new knowledge from repository content.’’
[COAR manifesto]
19. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
What is an aggregation?
Data providers
Aggregator
3. Standardised communication
1. Data standardisation
2. Data collection, harmonisation & enrichment
Users
20. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
What is an aggregation?
Aggregators are intermediaries between providers and users
that collect resources from many sources and add value by
improving access to them.
Physical world:
* libraries
* book stores
* museums
* art galleries
*supermarkets
Digital world:
* digital libraries (e.g. PubMed)
* collections (e.g. The European Library)
* search engines (e.g. Google cache)
* newspaper aggregators (e.g. Google News)
* online retailers (e.g. Amazon)
* travel aggregators (e.g. Kayak)
* insurance aggregators (GoCompare)
* aggregators of research papers (e.g. CORE)
21. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Successful online aggregators and value
* Reduce the time accessing information
* Standardise and harmonise content from many
providers
* Enrich content with new information
* Provide harmonised access to users
* Enable the discovery of new information
22. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Few aggregators provide unrestricted
access to data
Data providers
Aggregator
Services
Human user
Machine user
Typically little or no support
23. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Outline
* Introduction to Open Access (OA)
* The need for aggregating open access content
* The CORE system
24. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
UK need for aggregation
Bringing the UK’s open
access research outputs
together:
• Feasibility study
commissioned by Jisc,
published June 2014
• Referred to as “Open
Mirror”
http://repository.jisc.ac.uk/5570/1/JISC_REPORT_open_mirror_090514_FINAL_WEB.pdf
25. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
UK need for aggregations - conclusion
Jisc should support CORE and seek international support for it.
“CORE should focus on [a]ggregating materials from UK IRs
and from publishers and subject repositories of outputs with
UK-based authors to ensure that UK resources are well
represented in CORE”
l
26. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
CORE's mission
Aggregate all open access content
distributed across different systems
worldwide, enrich this content and provide
access to it through a set of services …
27. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Genesis of the CORE project family
Oct '10 First steps of CORE
Feb '11 (6M) CORE – Jisc
Nov '11 (9M) ServiceCORE
Jan '12 (28M) DiggiCORE, Jisc – ESRC - NWO
Feb '13 (36M) Europeana Cloud
Apr '13 (4M) CORE HEIF - HEFCE
Jul '13 CORE selected UK national aggregator
Feb '14 (4M) UK Aggregation
Jul '14 (3M) UK Aggregation 2
Summer '15 More to come...
28. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
The aggregation process
29. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Processing pipeline
* Metadata download, extraction and cleaning
* Full-text harvesting
* Text-extraction
* Language detection
* Extraction of citation references from text
* Detection of citation reference targets
* Identification of related content
* Detection of duplicate items
* Parsing of author names
* Indexing
30. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
CORE supports a 3 level access architecture
* Programmable (raw) data access
- As downloadable files or through API
* Transaction information access
- Explore content released through the use of a web
portal and its search
* Analytical information access
- Access to statistical information at the collection
level through the use of tables or charts.
http://www.dlib.org/dlib/november12/knoth/11knoth.html
31. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
CORE supports a 3 level access architecture
* Programmable (raw) data access.
- Developers, DLs, DL researchers, companies
* Transaction information access.
- Researchers, students, life-long learners …
* Analytical information access.
- Funders, government, bussiness intelligence …
http://www.dlib.org/dlib/november12/knoth/11knoth.html
32. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
CORE supports a 3 level access architecture
* Programmable (raw) data access.
- Apps: CORE API, CORE Data Dumps
* Transaction information access.
- Apps: CORE Portal, CORE Mobile, CORE
(recommendation) Plugin
* Analytical information access.
- Apps: Repository Analytics, CORE Policy
Compliance Analytics, Repositories Dashboard
(implementation phase)
http://www.dlib.org/dlib/november12/knoth/11knoth.html
33. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Exposing the aggregated content
34. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
CORE applications (1)
CORE Portal – Allows searching and navigating scientific
publications aggregated from Open Access repositories
35. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
CORE applications (2)
CORE Mobile – Allows searching and navigating scientific
publications aggregated from Open Access repositories
36. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
CORE applications (3)
37. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
CORE applications (4a)
CORE Plugin – A plugin to system that
recommendations for related items.
38. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
CORE applications (4b)
CORE Plugin – A plugin to system that
recommendations for related items.
39. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Build on top of CORE API...
CORE Plugin – A cross-repository recommendation system
integrated into OJS.
40. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
CORE Applications (5)
Repository Analytics – is an analytical tool supporting providers of
open access content (in particular repository managers)
41. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
CORE Applications (6)
Repository Dashboard (under development) – Tool to support the
implementation and monitoring of the UK HEFCE OA policy.
42. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
The definition of OA for post-2014 REF
Consultation on OA in the post-2014 Research
Excellence Framework, paragraph 25 says that:
- Accessible through a UK HEI repository (immediately upon
acceptance or publication).
- Made available as the final peer-reviewed text (full-text) after
a (reasonable) embargo period specified by the publisher.
- Harvestable using automated tools.
- In a machine readable form to allow text-mining
- Unambiguously identifiable in the institutional repository,
including items available through a link to another website.
43. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
The developed tool
44. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
CORE statistics – current state (1)
* Content: 24M+ records, 670+ repositories, 1.8M+ full-texts
* The world’s largest full-text open access dataset and still
growing
* The UK national aggregator (part of Repositories Shared
Services project - Jisc)
* Full-text aggregator (not just metadata)
* Placed among Top 10 search engines for research that go
beyond Google [Jisc, 2013]
* Listed among Top 100 Thesis and Dissertation Resources
* Part of Jisc’s Repositories Shared Services Project
* Exploring a partnership of Jisc and OU to deliver CORE
service
45. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
CORE statistics – current state (2)
Used by many researchers and organisations, including:
* the European Library
* UNESCO
* ResearchResearch.com
* Open Access Button
* OARR project (Nottingham University, Cottage Labs)
* HyberLink (Los Alamos National Laboratory, University of
Edinburgh)
* Georgetown University researchers
* Bauhaus University Weimar researchers
* The repository community (OARR, RIOXX, IRUS-UK)
46. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Conclusion
* Open Access outputs available online on the rise
* OA infrastructure, (repositories, aggregators) must enable
efficient use
* CORE provides single access point to this knowledge and
enables its mining
* Opportunities for innovative applications and research
* There are challenges making aggregators hard to operate
and maintain
* OA infrastructure should be available for the benefit of all
and should not be owned by the publishing lobby
47. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Have some fun with Open Access
* Open Access Explained!
http://www.phdcomics.com/comics.php?f=1533
* Open Access quiz
http://nile.lub.lu.se/loDownload/68/quiz_08.htm
48. YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015
Young European
Associated Researchers
Thank you!
Dr. Nancy Pontika
Email: nancy.pontika@open.ac.uk
Twitter: @nancypontika