a centre of expertise in data curation and preservation




Moving the Repository upstream


                       Chris Rusbridge
                    ARROW Repositories Day
                       14 October 2008
  This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 2.5
  UK: Scotland License. To view a copy of this license, (a) visit
  http://creativecommons.org/licenses/by-nc-sa/2.5/scotland/ ; or, (b) send a letter to Creative Commons, 543 Howard
  Street, 5th Floor, San Francisco, California, 94105, USA.




                   Contents
    •   The resistant scholar
    •   Researcher work flow
    •   On negative clicks…
    •   Can the repository help rather than hinder?
    •   Towards a Research Repository System?








        The resistant scholar
    • Edinburgh Research Archive has 1100+
      (publicly accessible) items
    • Edinburgh scholarly output?
      • Wet finger in the air: annual output ~ number of
        academics? (RAE every 4 years wants 4 papers)
      • I.e. ~2500 papers per year!
    • So after 4 years we have <10% of output
    • A common story everywhere!






               Why is this so?
    • Uncertainty, risk?
        • About copyright
        • About Ingelfinger rule
    •   Change
    •   Too busy
    •   Doesn’t fit in the way they do things now
    •   Not well motivated by advantages to others
    •   Little in it for them!





       Researcher work flow?
    • Many projects/tasks in parallel
    • All different stages
    • Teaching (several), research (several), writing
      up research, writing grant proposals,
      reviewing papers, administrative tasks,
      University governance, etc








         Researcher work flow?
    •   Think up research idea
    •   Write grant proposal with colleagues
    •   Submit, wait, refine/revise, resubmit
    •   Hire/assign staff, plan project
    •   Gather data, analyse data, refine hypothesis
    •   Refine methods, more data etc
    •   Write draft paper with colleagues
    •   Refine, revise, submit paper; repeat until successful
    •   New directions, more data, new paper
    •   Conference presentations, discussion, new research
        ideas…




    Who are you working with?
    •   Your group
    •   Your department
    •   Other departments in your university
    •   Colleagues elsewhere worldwide
        • Often more of the latter than the former!
        • Wide variety of IT environments








         Write paper work flow?
    • PI and co-PIs outline structure
    • PI assigns sections to colleagues
    • Gather sections, edit, circulate
    • Identify weaknesses, gather more data
    • Select & organise citations, images, tables, graphs,
      supplementary data
    • Comment, revise, circulate, repeat until deadline
    • Submit, wait
    • Revise following review, circulate, resubmit
        • By now working on other research!
    • It’s published! Add to bibliography…





       When do you submit to the repository?
    • There’s no obvious point
    • It’s always extra work
    • Any doubt, uncertainty or distraction is enough to
      put it off
    • “The library” wants you to do it? Sure, RSN (Real Soon Now)

    • The repository doesn’t help, it hinders






            On negative clicks
     • Research in Glasgow for Effective Records
       Management project (JISC-funded)
       • Currall, Johnson, Johnston, Moss, Richmond
     • “How many extra clicks are you willing to
       make to ensure preservation of the records
       you are creating?”
       • Answer: zero
     • Design goal follows: reduce work for clerks
       (fewer clicks) AND ensure preservation






     How could a repository help?
     •   Support the research
     •   Support the researchers
     •   Support the writing
     •   Support the publishing
     •   Be a natural part of the work flow…

     • How could we do that?






      Negative click repository?
     • Could a repository reduce the workload of
       researchers?
       • Perhaps not on its own…
     • What would be needed to make things
       easier?
       • Maybe a system and services with the repository
         embedded?
       • Research Repository System (RRS)







 Some known research issues
     • Extended teams across institutions, even legal
       jurisdictions
        • Varying technology: Windows/Mac/Linux versions, MS Office,
          OpenOffice, LaTeX, EndNote, BibTeX, etc
        • Localised, segregated identity management
        • Informal extranets
     • Distrust of anyone who (ever!) adds complexity or
       difficulty, even if only apparent
        • University, IT dept, Library, School…
     • Individualist local IT management
        •   Backup, version control, security, patching
        •   Data quantity, quality, provenance, metadata, version, sharing
        •   Analytic software version
        •   Lab notebooks…




           How could “we” help?
     • Who are “we”?
       •   Repository managers
       •   In/with the Library
       •   And IT Services
       •   Backed by the administration








              Maybe we could…
     • Help with publisher liaison
     • Support multiple authoring across several institutions
         • More permissive identity management/extranet
     • Support multiple versions
         • Fine-grained access control
         • Checkpointing
     •   Support supplementary data
     •   Provide basic data management capability
     •   Provide simple, cross-platform, persistent storage
     •   Provide some longevity
     •   Provide additional benefits





                  Exposure
     • Digital Curation Blog posts
       • Comments and feedback
     • JISC Repository Ideascale discussions
       • Comments, feedback, voting
     • Blue Ribbon Task Force Ideascale
       discussions
       • voting








               Some comments…
     •   Need to be careful of doing this as if it complicates the workflow, it just
         won't happen
     •   I think the RRS you envisage sounds fantastic and would be a 'good
         thing', what worries me is the 'function creep' taking us a few miles on
         from some of the more basic, simpler 'few keystrokes' approach…
     •   The RRS sounds to have many features of a Virtual Research
         Environment, albeit perhaps a less data centric VRE
     •   The availability of research via Open Access would increase if the same
         systems that provide Open Access also provided, or were integrated
         with, tools which support the authoring process
     •   My experience is, that feature requests of this sort [authoring support]
         are exactly the ones which end up in the "users didn't know what they
         wanted bin" (I'm a developer).
     •   A system to help streamline the editorial/publication process is fine (if
         we can persuade academics to use it)






                       CRIS
     • Several spotted that Current Research
       Information Systems can provide “fill the
       blanks” metadata
       •   Reduces workload
       •   Provides context
       •   Supports research disclosure
       •   Needs administration support








Is there something useful here?



     • DISCUSS!








           Thank you
     c.rusbridge@ed.ac.uk




H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 

Moving the Repository upstream to support research workflows

  • 1. a centre of expertise in data curation and preservation
      Moving the Repository upstream
      Chris Rusbridge, ARROW Repositories Day, 14 October 2008
      This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 2.5 UK: Scotland License. To view a copy of this license, (a) visit http://creativecommons.org/licenses/by-nc-sa/2.5/scotland/; or, (b) send a letter to Creative Commons, 543 Howard Street, 5th Floor, San Francisco, California, 94105, USA.
  • 2. Contents
      • The resistant scholar
      • Researcher work flow
      • On negative clicks…
      • Can the repository help rather than hinder?
      • Towards a Research Repository System?
  • 3. The resistant scholar
      • Edinburgh Research Archive has 1100+ (publicly accessible) items
      • Edinburgh scholarly output?
          • Wet finger in the air: annual output ~ number of academics? (RAE every 4 years wants 4 papers)
          • i.e. ~2500 papers per year!
      • So after 4 years we have <10% of output
      • A common story everywhere!
  • 4. Why is this so?
      • Uncertainty, risk?
          • About copyright
          • About Ingelfinger rule
      • Change
      • Too busy
      • Doesn’t fit in the way they do things now
      • Not well motivated by advantages to others
      • Little in it for them!
  • 5. Researcher work flow?
      • Many projects/tasks in parallel
      • All different stages
      • Teaching (several), research (several), writing up research, writing grant proposals, reviewing papers, administrative tasks, University governance, etc.
  • 6. Researcher work flow?
      • Think up research idea
      • Write grant proposal with colleagues
      • Submit, wait, refine/revise, resubmit
      • Hire/assign staff, plan project
      • Gather data, analyse data, refine hypothesis
      • Refine methods, more data, etc.
      • Write draft paper with colleagues
      • Refine, revise, submit paper, repeat until successful
      • New directions, more data, new paper
      • Conference presentations, discussion, new research ideas…
  • 7. Who are you working with?
      • Your group
      • Your department
      • Other departments in your university
      • Colleagues elsewhere worldwide
      • Often more of the latter than the former!
      • Wide variety of IT environments
  • 8. Write paper work flow?
      • PI and co-PIs outline structure
      • PI assigns sections to colleagues
      • Gather sections, edit, circulate
      • Identify weaknesses, gather more data
      • Select & organise citations, images, tables, graphs, supplementary data
      • Comment, revise, circulate, repeat until deadline
      • Submit, wait
      • Revise following review, circulate, resubmit
      • By now working on other research!
      • It’s published! Add to bibliography…
  • 9. When do you submit to repository?
      • There’s no obvious point
      • It’s always extra work
      • Any doubt, uncertainty or distraction is enough to put it off
      • “The library” wants you to do it? Sure, RSN
      • The repository doesn’t help, it hinders
  • 10. On negative clicks
      • Research in Glasgow for Effective Records Management project (JISC-funded)
      • Currall, Johnson, Johnston, Moss, Richmond
      • “How many extra clicks are you willing to make to ensure preservation of the records you are creating?”
      • Answer: zero
      • Design goal follows: reduce work for clerks (fewer clicks) AND ensure preservation
  • 11. How could a repository help?
      • Support the research
      • Support the researchers
      • Support the writing
      • Support the publishing
      • Be a natural part of the work flow…
      • How could we do that?
  • 12. Negative click repository?
      • Could a repository reduce the workload of researchers?
      • Perhaps not on its own…
      • What would be needed to make things easier?
      • Maybe a system and services with the repository embedded?
      • Research Repository System
  • 13. Some known research issues
      • Extended teams across institutions, even legal jurisdictions
      • Varying technology: Windows/Mac/Linux versions, MS Office, OpenOffice, LaTeX, EndNote, BibTeX, etc.
      • Localised, segregated identity management
      • Informal extranets
      • Distrust of anyone who (ever!) adds complexity or difficulty, even apparent
      • University, IT dept, Library, School…
      • Individualist local IT management
      • Backup, version control, security, patching
      • Data quantity, quality, provenance, metadata, version, sharing
      • Analytic software version
      • Lab notebooks…
  • 14. How could “we” help?
      • Who are “we”?
      • Repository managers
      • In/with the Library
      • And IT Services
      • Backed by the administration
  • 15. Maybe we could…
      • Help with publisher liaison
      • Support multiple authoring across several institutions
      • More permissive identity management/extranet
      • Support multiple versions
      • Fine-grained access control
      • Checkpointing
      • Support supplementary data
      • Provide basic data management capability
      • Provide simple, cross-platform, persistent storage
      • Provide some longevity
      • Provide additional benefits
  • 17. Exposure
      • Digital Curation Blog posts
      • Comments and feedback
      • JISC Repository Ideascale discussions
      • Comments, feedback, voting
      • Blue Ribbon Task Force Ideascale discussions
      • Voting
  • 19. Some comments…
      • Need to be careful of doing this as if it complicates the workflow, it just won't happen
      • I think the RRS you envisage sounds fantastic and would be a 'good thing', what worries me is the 'function creep' taking us a few miles on from some of the more basic, simpler 'few keystrokes' approach…
      • The RRS sounds to have many features of a Virtual Research Environment, albeit perhaps a less data centric VRE
      • The availability of research via Open Access would increase if the same systems that provide Open Access also provided, or were integrated with, tools which support the authoring process
      • My experience is, that feature requests of this sort [authoring support] are exactly the ones which end up in the "users didn't know what they wanted bin" (I'm a developer).
      • A system to help streamline the editorial/publication process is fine (if we can persuade academics to use it)
  • 20. CRIS
      • Several spotted that Current Research Information Systems can provide “fill the blanks” metadata
      • Reduces workload
      • Provides context
      • Supports research disclosure
      • Needs administration support
  • 21. Is there something useful here?
      • DISCUSS!
  • 22. Thank you
      c.rusbridge@ed.ac.uk