Do Authors Deposit on Time? Tracking Open Access Policy Compliance

Dasha Herrmannova
Dasha HerrmannovaResearch Scientist at Oak Ridge National Laboratory
Do Authors Deposit on Time?
Tracking Open Access Policy Compliance
Drahomira Herrmannova
Nancy Pontika
Petr Knoth
June 4, 2019 – JCDL 2019, Urbana-Champaign, IL
Big Scientific Data and Text Analytics Group
Knowledge Media Institute, The Open University
Introduction
• Why we want Open Access (OA)
• Taxpayers should be able to read publicly funded research
• Help researchers at poorer institutions without access to
subscriptions
• Institutions suffer from rising journal subscription prices
• Funders introduce policies to encourage OA
• Notable examples:
• U.S. Public Access Plan
• U.S. NIH Public Access Policy
• UK REF 2021 Open Access Policy
• EC H2020 Open Access Policy
1/22
Growing number of OA policies
Source: http://roarmap.eprints.org/
Currently close to 1
thousand funder and
institutional OA policies
2/22
OA policies
• Provide criteria for making papers OA
• Requirements, such as:
• Where should papers be made available (publication or deposit)
• When should papers be deposited
• What version should be deposited (e.g. pre-print vs. post-print)
• Allowed embargo periods
• Etc.
3/22
Research questions
What effect do OA policies have?
4/22
Research questions
• Piwowar et al. (2018): At least 28% of all research papers are OA
• Lariviere and Sugimoto (2018): More than two thirds of papers
from selected funders (with an OA policy) were OA
• Gargouri et al. (2012): OA growth often due to retroactive self-
archiving (often years after publication)
5/22
Research questions
When do author deposit?
6/22
Research questions
When do author deposit?
Do they deposit in accordance with policies?
6/22
Deposit time lag
• What is deposit time lag?
• The difference between date of publication and date of deposit in a
repository expressed in days
• We study deposit time lag across
• Country
• Time
• Repository
• Discipline
7/22
Data
8/22
Data
8/22
Data
8/22
Data
8/22
Deposit time lag calculation
• Deposit time lag = deposit
date – publication date
• The difference was expressed
in days
• Positive values: article
deposited after publication
• Negative values: article
deposited prior to
publication
• Best: as low value as possible
9/22
Dataset
• 2013-2018 publications
• Metadata from Crossref and CORE
Publications 808,984
Repositories 728
Countries 70
Final dataset size Year of publication distribution
10/22
Results: Deposit time lag per country
11/22
Results: Deposit time lag per country/year
• How has deposit time lag changed over time?
• Average deposit time lag per year of publication
?
12/22
Results: Deposit time lag per country/year
• Two options:
1. Use all data
13/22
Results: Deposit time lag per country/year
• Two options:
1. Use all data
2011 2012 2013 2014 2015 2016 2017 2018
Yearly deposits – toy example
2013 publications 2017 publications
13/22
Results: Deposit time lag per country/year
• Two options:
1. Use all data
2011 2012 2013 2014 2015 2016 2017 2018
Yearly deposits – toy example
2013 publications 2017 publications
Data for 2013 publications
13/22
Results: Deposit time lag per country/year
• Two options:
1. Use all data
2011 2012 2013 2014 2015 2016 2017 2018
Yearly deposits – toy example
2013 publications 2017 publications
Data for 2017 publications …?
13/22
Results: Deposit time lag per country/year
• Two options:
1. Use all data
• Underestimates deposit time
lag for all, but especially for
newer publications
2011 2012 2013 2014 2015 2016 2017 2018
Yearly deposits – toy example
2013 publications 2017 publications
13/22
Results: Deposit time lag per country/year
• Two options:
1. Use all data
• Underestimates deposit time
lag for all, but especially for
newer publications
2. Put a maximum limit on
deposit time lag for the
analysis (for comparability)
• E.g. deposit at most a year later
2011 2012 2013 2014 2015 2016 2017 2018
Yearly deposits – toy example
2013 publications 2017 publications
Data for 2013 publications
13/22
Results: Deposit time lag per country/year
• Two options:
1. Use all data
• Underestimates deposit time
lag for all, but especially for
newer publications
2. Put a maximum limit on
deposit time lag for the
analysis (for comparability)
• E.g. deposit at most a year later
2011 2012 2013 2014 2015 2016 2017 2018
Yearly deposits – toy example
2013 publications 2017 publications
Data for 2017 publications
13/22
Results: Deposit time lag per country/year
• Two options:
1. Use all data
• Underestimates deposit time
lag for all, but especially for
newer publications
2. Put a maximum limit on
deposit time lag for the
analysis (for comparability)
• E.g. deposit at most a year later
• Underestimates deposit time
lag for all, but especially for
older publications
2011 2012 2013 2014 2015 2016 2017 2018
Yearly deposits – toy example
2013 publications 2017 publications
13/22
Results: Deposit time lag per year/country
Option 1: All data Option 2: Max deposit time lag limit (1 yr)
14/22
Results: Deposit time lag per subject
Bars are not
stacked, but
overlayed
15/22
REF 2021 OA Policy
• In 2014, the UK introduced an OA Policy for its next research
assessment exercise (REF)
• Requirements
• Deposit final manuscript in an OA repository
• Deposit on publication/acceptance or within 3 months from it
• Papers published since April 2016
• Sanction
• The OA requirement is linked to performance review
• Did the introduction of this mandatory policy affect deposit
time lag in the UK compared to other countries?
16/22
Single vs any repository deposit time lag
1. Single repository deposit time lag
• Deposit time lag with respect to the publications’ deposit date in a
given repository
2. Any repository deposit time lag
• Deposit time lag with respect to the publications’ deposit date in any
repository
Repository 1 Repository 2
05/2017 09/2017
Single repository deposit
time lag for Repository 1 =
05/2017 – publication date
Any repository deposit
time lag for Repository 1 =
min(05/2017, 09/2017) –
publication date
17/22
Results: UK REF compliance per year
Any repository deposit time lag
18/22
Results: Deposit time lag per repository
Full lines: Single repository deposit time lag
Dashed lines: Any repository deposit time lag 19/22
Results: Deposit time lag per year/country
Option 1: All data Option 2: Max deposit time lag limit
2014: UK introduces REF 2021 OA policy 20/22
Discussion
• Study assumption: if metadata deposited, then the full text is
also deposited
• Validation of full text deposits complicated due to the way the OAI-
PMH works
21/22
Discussion
• Study assumption: if metadata deposited, then the full text is
also deposited
• Validation of full text deposits complicated due to the way the OAI-
PMH works
• Our study excludes publications that were never deposited
• To quantify missing deposits we would have to correctly match all
CORE publications to their Crossref metadata
• Focus on deposit time lag rather than the proportion of missing
deposits
21/22
Discussion
• Study assumption: if metadata deposited, then the full text is
also deposited
• Validation of full text deposits complicated due to the way the OAI-
PMH works
• Our study excludes publications that were never deposited
• To quantify missing deposits we would have to correctly match all
CORE publications to their Crossref metadata
• Focus on deposit time lag rather than the proportion of missing
deposits
• Matching between Crossref and CORE was done using
metadata (titles, authors, publication years)
• Strict approach, results in high accuracy (~95.27%) but lower recall
21/22
Conclusions
• Time between publication and deposit has decreased
significantly in the 2013-2017 period globally
• By 472 days per country on average across all countries in our dataset
22/22
Conclusions
• Time between publication and deposit has decreased
significantly in the 2013-2017 period globally
• By 472 days per country on average across all countries in our dataset
• After introduction of the UK REF 2021 OA Policy this decrease in
the UK has accelerated
• As of early 2018, UK publications are deposited immediately upon
publication or even slightly before
22/22
Conclusions
• Time between publication and deposit has decreased
significantly in the 2013-2017 period globally
• By 472 days per country on average across all countries in our dataset
• After introduction of the UK REF 2021 OA Policy this decrease in
the UK has accelerated
• As of early 2018, UK publications are deposited immediately upon
publication or even slightly before
• Key messages:
• Our observations support the argument for the inclusion of time
limited deposit requirement in OA policies
• Institutional practices an important role in supporting OA policy
adoption
22/22
Thank you!
Code: https://github.com/oacore/jcdl_2019
Data: https://doi.org/10.5281/zenodo.2605408
1 of 39

Recommended

Assessing Compliance with the UK REF 2021 Open Access Policy by
Assessing Compliance with the UK REF 2021 Open Access PolicyAssessing Compliance with the UK REF 2021 Open Access Policy
Assessing Compliance with the UK REF 2021 Open Access Policypetrknoth
3.8K views40 slides
The Danish Open Access Indicator by
The Danish Open Access IndicatorThe Danish Open Access Indicator
The Danish Open Access IndicatorMikael Elbæk
1.1K views24 slides
Open Access Barometer to Open Access Indicator: lessons learned from the jour... by
Open Access Barometer to Open Access Indicator: lessons learned from the jour...Open Access Barometer to Open Access Indicator: lessons learned from the jour...
Open Access Barometer to Open Access Indicator: lessons learned from the jour...Mikael Elbæk
4.4K views61 slides
UKSG Conference 2017 Breakout - Licensing for additional users and partner or... by
UKSG Conference 2017 Breakout - Licensing for additional users and partner or...UKSG Conference 2017 Breakout - Licensing for additional users and partner or...
UKSG Conference 2017 Breakout - Licensing for additional users and partner or...UKSG: connecting the knowledge community
237 views11 slides
Tracking compliance of the REF2021 policy with the CORE Repository Dashboard by
Tracking compliance of the REF2021 policy with the CORE Repository DashboardTracking compliance of the REF2021 policy with the CORE Repository Dashboard
Tracking compliance of the REF2021 policy with the CORE Repository Dashboardpetrknoth
416 views38 slides
UKSG Conference 2017 Breakout - Licensing for additional users and partner or... by
UKSG Conference 2017 Breakout - Licensing for additional users and partner or...UKSG Conference 2017 Breakout - Licensing for additional users and partner or...
UKSG Conference 2017 Breakout - Licensing for additional users and partner or...UKSG: connecting the knowledge community
348 views13 slides

More Related Content

Similar to Do Authors Deposit on Time? Tracking Open Access Policy Compliance

UKCORR members day 2019: Retaining choice constraining costs in a Plan S worl... by
UKCORR members day 2019: Retaining choice constraining costs in a Plan S worl...UKCORR members day 2019: Retaining choice constraining costs in a Plan S worl...
UKCORR members day 2019: Retaining choice constraining costs in a Plan S worl...ukcorr
44 views32 slides
Cn mo11 2_alt_status_and_planning_final_hessel by
Cn mo11 2_alt_status_and_planning_final_hesselCn mo11 2_alt_status_and_planning_final_hessel
Cn mo11 2_alt_status_and_planning_final_hesselErik van den Elsen
219 views42 slides
Evidence-Based eBook Purchasing: Results and Implications from a Consortia-Pu... by
Evidence-Based eBook Purchasing: Results and Implications from a Consortia-Pu...Evidence-Based eBook Purchasing: Results and Implications from a Consortia-Pu...
Evidence-Based eBook Purchasing: Results and Implications from a Consortia-Pu...Charleston Conference
571 views23 slides
Assessing the value of OA agreements by
Assessing the value of OA agreementsAssessing the value of OA agreements
Assessing the value of OA agreementsJUSPSTATS
9 views21 slides
OA policies – Where we are and what we know about effectiveness, Lars Bjørnsh... by
OA policies – Where we are and what we know about effectiveness, Lars Bjørnsh...OA policies – Where we are and what we know about effectiveness, Lars Bjørnsh...
OA policies – Where we are and what we know about effectiveness, Lars Bjørnsh...SPARC Europe
2.2K views34 slides
Implementation of the Smooth Transition Model by
Implementation of the Smooth Transition ModelImplementation of the Smooth Transition Model
Implementation of the Smooth Transition ModelUKSG: connecting the knowledge community
17 views16 slides

Similar to Do Authors Deposit on Time? Tracking Open Access Policy Compliance(20)

UKCORR members day 2019: Retaining choice constraining costs in a Plan S worl... by ukcorr
UKCORR members day 2019: Retaining choice constraining costs in a Plan S worl...UKCORR members day 2019: Retaining choice constraining costs in a Plan S worl...
UKCORR members day 2019: Retaining choice constraining costs in a Plan S worl...
ukcorr44 views
Cn mo11 2_alt_status_and_planning_final_hessel by Erik van den Elsen
Cn mo11 2_alt_status_and_planning_final_hesselCn mo11 2_alt_status_and_planning_final_hessel
Cn mo11 2_alt_status_and_planning_final_hessel
Erik van den Elsen219 views
Evidence-Based eBook Purchasing: Results and Implications from a Consortia-Pu... by Charleston Conference
Evidence-Based eBook Purchasing: Results and Implications from a Consortia-Pu...Evidence-Based eBook Purchasing: Results and Implications from a Consortia-Pu...
Evidence-Based eBook Purchasing: Results and Implications from a Consortia-Pu...
Assessing the value of OA agreements by JUSPSTATS
Assessing the value of OA agreementsAssessing the value of OA agreements
Assessing the value of OA agreements
JUSPSTATS9 views
OA policies – Where we are and what we know about effectiveness, Lars Bjørnsh... by SPARC Europe
OA policies – Where we are and what we know about effectiveness, Lars Bjørnsh...OA policies – Where we are and what we know about effectiveness, Lars Bjørnsh...
OA policies – Where we are and what we know about effectiveness, Lars Bjørnsh...
SPARC Europe2.2K views
ORCID - UK PIDs for Open Access - progress update by Jisc
ORCID - UK PIDs for Open Access - progress updateORCID - UK PIDs for Open Access - progress update
ORCID - UK PIDs for Open Access - progress update
Jisc139 views
United Kingdom Scholarly Communications model policy and Licence - UK-SCL - u... by Chris Banks
United Kingdom Scholarly Communications model policy and Licence - UK-SCL - u...United Kingdom Scholarly Communications model policy and Licence - UK-SCL - u...
United Kingdom Scholarly Communications model policy and Licence - UK-SCL - u...
Chris Banks808 views
Social sciences directory liber conference (26.06.2013) by SocSciDir
Social sciences directory   liber conference (26.06.2013)Social sciences directory   liber conference (26.06.2013)
Social sciences directory liber conference (26.06.2013)
SocSciDir1K views
Uk Research Infrastructure Workshop E-infrastructure Juan Bicarregui by Innovate UK
Uk Research Infrastructure Workshop E-infrastructure Juan BicarreguiUk Research Infrastructure Workshop E-infrastructure Juan Bicarregui
Uk Research Infrastructure Workshop E-infrastructure Juan Bicarregui
Innovate UK509 views
Open Access in the UK - challenges of compliance with funder mandates by Chris Banks
Open Access in the UK - challenges of compliance with funder mandatesOpen Access in the UK - challenges of compliance with funder mandates
Open Access in the UK - challenges of compliance with funder mandates
Chris Banks1.1K views
Levine-Clark, Michael, “Going Beyond COUNTER: Strategies for Analyzing Data t... by Michael Levine-Clark
Levine-Clark, Michael, “Going Beyond COUNTER: Strategies for Analyzing Data t...Levine-Clark, Michael, “Going Beyond COUNTER: Strategies for Analyzing Data t...
Levine-Clark, Michael, “Going Beyond COUNTER: Strategies for Analyzing Data t...
Green or gold: What will Open Access mean for the LSE? by Jane Tinkler
Green or gold: What will Open Access mean for the LSE?Green or gold: What will Open Access mean for the LSE?
Green or gold: What will Open Access mean for the LSE?
Jane Tinkler634 views
Open Access, Plan S and New Models for Academic Publishing by CILIPScotland
Open Access, Plan S and New Models for Academic PublishingOpen Access, Plan S and New Models for Academic Publishing
Open Access, Plan S and New Models for Academic Publishing
CILIPScotland139 views
Evaluating the Big Deal: Usage Statistics for Decision Making by Selena Killick
Evaluating the Big Deal: Usage Statistics for Decision MakingEvaluating the Big Deal: Usage Statistics for Decision Making
Evaluating the Big Deal: Usage Statistics for Decision Making
Selena Killick1.6K views
Open Access Week 2017: Research data management and data management plans (Fl... by OpenAIRE
Open Access Week 2017: Research data management and data management plans (Fl...Open Access Week 2017: Research data management and data management plans (Fl...
Open Access Week 2017: Research data management and data management plans (Fl...
OpenAIRE866 views

More from Dasha Herrmannova

Machine Learning for Data Extraction by
Machine Learning for Data ExtractionMachine Learning for Data Extraction
Machine Learning for Data ExtractionDasha Herrmannova
92 views61 slides
Semantometrics: Text Analysis in Research Evaluation by
Semantometrics: Text Analysis in Research Evaluation Semantometrics: Text Analysis in Research Evaluation
Semantometrics: Text Analysis in Research Evaluation Dasha Herrmannova
135 views18 slides
Do Citations and Readership Predict Excellent Publications? by
Do Citations and Readership Predict Excellent Publications?Do Citations and Readership Predict Excellent Publications?
Do Citations and Readership Predict Excellent Publications?Dasha Herrmannova
171 views12 slides
An Analysis of the Microsoft Academic Graph by
An Analysis of the Microsoft Academic GraphAn Analysis of the Microsoft Academic Graph
An Analysis of the Microsoft Academic GraphDasha Herrmannova
512 views32 slides
Visual Search for Supporting Content Exploration in Large Document Collections by
Visual Search for Supporting Content Exploration in Large Document CollectionsVisual Search for Supporting Content Exploration in Large Document Collections
Visual Search for Supporting Content Exploration in Large Document CollectionsDasha Herrmannova
250 views48 slides
Unsupervised Identification of Study Descriptors in Toxicology Research: An E... by
Unsupervised Identification of Study Descriptors in Toxicology Research: An E...Unsupervised Identification of Study Descriptors in Toxicology Research: An E...
Unsupervised Identification of Study Descriptors in Toxicology Research: An E...Dasha Herrmannova
186 views1 slide

More from Dasha Herrmannova(10)

Semantometrics: Text Analysis in Research Evaluation by Dasha Herrmannova
Semantometrics: Text Analysis in Research Evaluation Semantometrics: Text Analysis in Research Evaluation
Semantometrics: Text Analysis in Research Evaluation
Dasha Herrmannova135 views
Do Citations and Readership Predict Excellent Publications? by Dasha Herrmannova
Do Citations and Readership Predict Excellent Publications?Do Citations and Readership Predict Excellent Publications?
Do Citations and Readership Predict Excellent Publications?
Dasha Herrmannova171 views
An Analysis of the Microsoft Academic Graph by Dasha Herrmannova
An Analysis of the Microsoft Academic GraphAn Analysis of the Microsoft Academic Graph
An Analysis of the Microsoft Academic Graph
Dasha Herrmannova512 views
Visual Search for Supporting Content Exploration in Large Document Collections by Dasha Herrmannova
Visual Search for Supporting Content Exploration in Large Document CollectionsVisual Search for Supporting Content Exploration in Large Document Collections
Visual Search for Supporting Content Exploration in Large Document Collections
Dasha Herrmannova250 views
Unsupervised Identification of Study Descriptors in Toxicology Research: An E... by Dasha Herrmannova
Unsupervised Identification of Study Descriptors in Toxicology Research: An E...Unsupervised Identification of Study Descriptors in Toxicology Research: An E...
Unsupervised Identification of Study Descriptors in Toxicology Research: An E...
Dasha Herrmannova186 views
Simple Yet Effective Methods for Large-Scale Scholarly Publication Ranking by Dasha Herrmannova
Simple Yet Effective Methods for Large-Scale Scholarly Publication RankingSimple Yet Effective Methods for Large-Scale Scholarly Publication Ranking
Simple Yet Effective Methods for Large-Scale Scholarly Publication Ranking
Dasha Herrmannova655 views
Semantometrics in Coauthorship Networks: Fulltext-based Approach for Analysin... by Dasha Herrmannova
Semantometrics in Coauthorship Networks: Fulltext-based Approach for Analysin...Semantometrics in Coauthorship Networks: Fulltext-based Approach for Analysin...
Semantometrics in Coauthorship Networks: Fulltext-based Approach for Analysin...
Dasha Herrmannova1.1K views
Towards Semantometrics: A New Semantic Similarity Based Measure for Assessing... by Dasha Herrmannova
Towards Semantometrics: A New Semantic Similarity Based Measure for Assessing...Towards Semantometrics: A New Semantic Similarity Based Measure for Assessing...
Towards Semantometrics: A New Semantic Similarity Based Measure for Assessing...
Dasha Herrmannova567 views
Mining Research Publication Networks for Impact -- KMi Internal Seminar by Dasha Herrmannova
Mining Research Publication Networks for Impact -- KMi Internal SeminarMining Research Publication Networks for Impact -- KMi Internal Seminar
Mining Research Publication Networks for Impact -- KMi Internal Seminar
Dasha Herrmannova2.4K views

Recently uploaded

Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue by
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlueElevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlueShapeBlue
149 views7 slides
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue by
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlueShapeBlue
75 views23 slides
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or... by
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...ShapeBlue
128 views20 slides
Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O... by
Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O...Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O...
Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O...ShapeBlue
59 views13 slides
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue by
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlueWhat’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlueShapeBlue
191 views23 slides
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti... by
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...ShapeBlue
69 views29 slides

Recently uploaded(20)

Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue by ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlueElevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
ShapeBlue149 views
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue by ShapeBlue
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue
ShapeBlue75 views
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or... by ShapeBlue
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
ShapeBlue128 views
Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O... by ShapeBlue
Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O...Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O...
Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O...
ShapeBlue59 views
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue by ShapeBlue
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlueWhat’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue
ShapeBlue191 views
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti... by ShapeBlue
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
ShapeBlue69 views
DRBD Deep Dive - Philipp Reisner - LINBIT by ShapeBlue
DRBD Deep Dive - Philipp Reisner - LINBITDRBD Deep Dive - Philipp Reisner - LINBIT
DRBD Deep Dive - Philipp Reisner - LINBIT
ShapeBlue110 views
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue by ShapeBlue
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlueVNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue
ShapeBlue134 views
Keynote Talk: Open Source is Not Dead - Charles Schulz - Vates by ShapeBlue
Keynote Talk: Open Source is Not Dead - Charles Schulz - VatesKeynote Talk: Open Source is Not Dead - Charles Schulz - Vates
Keynote Talk: Open Source is Not Dead - Charles Schulz - Vates
ShapeBlue178 views
Confidence in CloudStack - Aron Wagner, Nathan Gleason - Americ by ShapeBlue
Confidence in CloudStack - Aron Wagner, Nathan Gleason - AmericConfidence in CloudStack - Aron Wagner, Nathan Gleason - Americ
Confidence in CloudStack - Aron Wagner, Nathan Gleason - Americ
ShapeBlue58 views
CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&T by ShapeBlue
CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&TCloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&T
CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&T
ShapeBlue81 views
The Power of Heat Decarbonisation Plans in the Built Environment by IES VE
The Power of Heat Decarbonisation Plans in the Built EnvironmentThe Power of Heat Decarbonisation Plans in the Built Environment
The Power of Heat Decarbonisation Plans in the Built Environment
IES VE67 views
The Role of Patterns in the Era of Large Language Models by Yunyao Li
The Role of Patterns in the Era of Large Language ModelsThe Role of Patterns in the Era of Large Language Models
The Role of Patterns in the Era of Large Language Models
Yunyao Li74 views
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas... by Bernd Ruecker
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
Bernd Ruecker50 views
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit... by ShapeBlue
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...
ShapeBlue86 views
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ... by ShapeBlue
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...
ShapeBlue97 views
Data Integrity for Banking and Financial Services by Precisely
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial Services
Precisely76 views
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f... by TrustArc
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc130 views
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue by ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlueCloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
ShapeBlue63 views
Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P... by ShapeBlue
Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...
Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...
ShapeBlue120 views

Do Authors Deposit on Time? Tracking Open Access Policy Compliance

  • 1. Do Authors Deposit on Time? Tracking Open Access Policy Compliance Drahomira Herrmannova Nancy Pontika Petr Knoth June 4, 2019 – JCDL 2019, Urbana-Champaign, IL Big Scientific Data and Text Analytics Group Knowledge Media Institute, The Open University
  • 2. Introduction • Why we want Open Access (OA) • Taxpayers should be able to read publicly funded research • Help researchers at poorer institutions without access to subscriptions • Institutions suffer from rising journal subscription prices • Funders introduce policies to encourage OA • Notable examples: • U.S. Public Access Plan • U.S. NIH Public Access Policy • UK REF 2021 Open Access Policy • EC H2020 Open Access Policy 1/22
  • 3. Growing number of OA policies Source: http://roarmap.eprints.org/ Currently close to 1 thousand funder and institutional OA policies 2/22
  • 4. OA policies • Provide criteria for making papers OA • Requirements, such as: • Where should papers be made available (publication or deposit) • When should papers be deposited • What version should be deposited (e.g. pre-print vs. post-print) • Allowed embargo periods • Etc. 3/22
  • 5. Research questions What effect do OA policies have? 4/22
  • 6. Research questions • Piwowar et al. (2018): At least 28% of all research papers are OA • Lariviere and Sugimoto (2018): More than two thirds of papers from selected funders (with an OA policy) were OA • Gargouri et al. (2012): OA growth often due to retroactive self- archiving (often years after publication) 5/22
  • 7. Research questions When do author deposit? 6/22
  • 8. Research questions When do author deposit? Do they deposit in accordance with policies? 6/22
  • 9. Deposit time lag • What is deposit time lag? • The difference between date of publication and date of deposit in a repository expressed in days • We study deposit time lag across • Country • Time • Repository • Discipline 7/22
  • 14. Deposit time lag calculation • Deposit time lag = deposit date – publication date • The difference was expressed in days • Positive values: article deposited after publication • Negative values: article deposited prior to publication • Best: as low value as possible 9/22
  • 15. Dataset • 2013-2018 publications • Metadata from Crossref and CORE Publications 808,984 Repositories 728 Countries 70 Final dataset size Year of publication distribution 10/22
  • 16. Results: Deposit time lag per country 11/22
  • 17. Results: Deposit time lag per country/year • How has deposit time lag changed over time? • Average deposit time lag per year of publication ? 12/22
  • 18. Results: Deposit time lag per country/year • Two options: 1. Use all data 13/22
  • 19. Results: Deposit time lag per country/year • Two options: 1. Use all data 2011 2012 2013 2014 2015 2016 2017 2018 Yearly deposits – toy example 2013 publications 2017 publications 13/22
  • 20. Results: Deposit time lag per country/year • Two options: 1. Use all data 2011 2012 2013 2014 2015 2016 2017 2018 Yearly deposits – toy example 2013 publications 2017 publications Data for 2013 publications 13/22
  • 21. Results: Deposit time lag per country/year • Two options: 1. Use all data 2011 2012 2013 2014 2015 2016 2017 2018 Yearly deposits – toy example 2013 publications 2017 publications Data for 2017 publications …? 13/22
  • 22. Results: Deposit time lag per country/year • Two options: 1. Use all data • Underestimates deposit time lag for all, but especially for newer publications 2011 2012 2013 2014 2015 2016 2017 2018 Yearly deposits – toy example 2013 publications 2017 publications 13/22
  • 23. Results: Deposit time lag per country/year • Two options: 1. Use all data • Underestimates deposit time lag for all, but especially for newer publications 2. Put a maximum limit on deposit time lag for the analysis (for comparability) • E.g. deposit at most a year later 2011 2012 2013 2014 2015 2016 2017 2018 Yearly deposits – toy example 2013 publications 2017 publications Data for 2013 publications 13/22
  • 24. Results: Deposit time lag per country/year • Two options: 1. Use all data • Underestimates deposit time lag for all, but especially for newer publications 2. Put a maximum limit on deposit time lag for the analysis (for comparability) • E.g. deposit at most a year later 2011 2012 2013 2014 2015 2016 2017 2018 Yearly deposits – toy example 2013 publications 2017 publications Data for 2017 publications 13/22
  • 25. Results: Deposit time lag per country/year • Two options: 1. Use all data • Underestimates deposit time lag for all, but especially for newer publications 2. Put a maximum limit on deposit time lag for the analysis (for comparability) • E.g. deposit at most a year later • Underestimates deposit time lag for all, but especially for older publications 2011 2012 2013 2014 2015 2016 2017 2018 Yearly deposits – toy example 2013 publications 2017 publications 13/22
  • 26. Results: Deposit time lag per year/country Option 1: All data Option 2: Max deposit time lag limit (1 yr) 14/22
  • 27. Results: Deposit time lag per subject Bars are not stacked, but overlayed 15/22
  • 28. REF 2021 OA Policy • In 2014, the UK introduced an OA Policy for its next research assessment exercise (REF) • Requirements • Deposit final manuscript in an OA repository • Deposit on publication/acceptance or within 3 months from it • Papers published since April 2016 • Sanction • The OA requirement is linked to performance review • Did the introduction of this mandatory policy affect deposit time lag in the UK compared to other countries? 16/22
  • 29. Single vs any repository deposit time lag 1. Single repository deposit time lag • Deposit time lag with respect to the publications’ deposit date in a given repository 2. Any repository deposit time lag • Deposit time lag with respect to the publications’ deposit date in any repository Repository 1 Repository 2 05/2017 09/2017 Single repository deposit time lag for Repository 1 = 05/2017 – publication date Any repository deposit time lag for Repository 1 = min(05/2017, 09/2017) – publication date 17/22
  • 30. Results: UK REF compliance per year Any repository deposit time lag 18/22
  • 31. Results: Deposit time lag per repository Full lines: Single repository deposit time lag Dashed lines: Any repository deposit time lag 19/22
  • 32. Results: Deposit time lag per year/country Option 1: All data Option 2: Max deposit time lag limit 2014: UK introduces REF 2021 OA policy 20/22
  • 33. Discussion • Study assumption: if metadata deposited, then the full text is also deposited • Validation of full text deposits complicated due to the way the OAI- PMH works 21/22
  • 34. Discussion • Study assumption: if metadata deposited, then the full text is also deposited • Validation of full text deposits complicated due to the way the OAI- PMH works • Our study excludes publications that were never deposited • To quantify missing deposits we would have to correctly match all CORE publications to their Crossref metadata • Focus on deposit time lag rather than the proportion of missing deposits 21/22
  • 35. Discussion • Study assumption: if metadata deposited, then the full text is also deposited • Validation of full text deposits complicated due to the way the OAI- PMH works • Our study excludes publications that were never deposited • To quantify missing deposits we would have to correctly match all CORE publications to their Crossref metadata • Focus on deposit time lag rather than the proportion of missing deposits • Matching between Crossref and CORE was done using metadata (titles, authors, publication years) • Strict approach, results in high accuracy (~95.27%) but lower recall 21/22
  • 36. Conclusions • Time between publication and deposit has decreased significantly in the 2013-2017 period globally • By 472 days per country on average across all countries in our dataset 22/22
  • 37. Conclusions • Time between publication and deposit has decreased significantly in the 2013-2017 period globally • By 472 days per country on average across all countries in our dataset • After introduction of the UK REF 2021 OA Policy this decrease in the UK has accelerated • As of early 2018, UK publications are deposited immediately upon publication or even slightly before 22/22
  • 38. Conclusions • Time between publication and deposit has decreased significantly in the 2013-2017 period globally • By 472 days per country on average across all countries in our dataset • After introduction of the UK REF 2021 OA Policy this decrease in the UK has accelerated • As of early 2018, UK publications are deposited immediately upon publication or even slightly before • Key messages: • Our observations support the argument for the inclusion of time limited deposit requirement in OA policies • Institutional practices an important role in supporting OA policy adoption 22/22
  • 39. Thank you! Code: https://github.com/oacore/jcdl_2019 Data: https://doi.org/10.5281/zenodo.2605408