SlideShare a Scribd company logo
1 of 19
Download to read offline
A Generational Refresh:
Continuing Harvard's Digital Repository Service
legacy of innovation
Stefano Cossu, Harvard University IT stefano_cossu@harvard.edu
Open Repositories 2023, Stellenbosch University
Cape Town, South Africa
Harvard's Digital Repository Service
(DRS)
Established in 2000
No viable DP solutions at the time
Needed to address specific, complex needs of HUIT
2
DRS today
10 million objects
900 million replicated files
2Pb replicated data
63 departments
Actively supported
3
DRS challenges today
Technical debt
Inflexible content model
Inefficient UI/UX
4
DRS Futures
3-year capital-funded R&D and implementation project
Phase 1: discovery (Jul 2022 – Jun 2023)
Phase 2: planning (Jul – Dec 2023)
Phase 3: implementation (Jan 2024 – Jun 2025)
Open to different options (commercial, open source, home built)
Opportunity to re-envision digital preservation
5
DRS Futures team
Leaderless group
HUIT (central IT) and Library members
Made up of diverse skills, seniority levels, and backgrounds
Connected with many involved departments
Highly collaborative with specialized task forces
6
Dual approach to the problem
Inductive (bottom-up)
Department-specific interviews (focused on workflows)
Cross-department focus groups (focused on specific areas)
Office hours (free format)
Deductive (top-down)
Build up from previous experience
Set DP theoretical foundations & long-term vision
Anticipate challenges of a more capable system
7
DRS Futures tenets
Separation of storage and services
Separation of archive and workspace
Task automation
Re-envision digital preservation
Revolving feedback
Build for future scale
8
Separation of storage and services
Data store migrated to OCFL in 2022 (>1 year timeline)
Plan to keep storage fabric intact
Replace services on top of OCFL
9
Separation of archive and workspace
Provide users with a mid-term workspace services & store
Keep OCFL focused on preservation
Keep solution search focused on discrete functional areas
Multiple products can fulfill parts of the complete solution
10
Task automation
Remove repetitive tasks from staff duties
Set up event-driven architecture
Move preservation actions to the background
Added complexity paid off by volume
11
Re-envision digital preservation
Preserving the semantic context as well as the content
Archival resources are live materials, changing over time
Facilitate reuse and cumulative evolution of information
12
Revolving feedback
From Reference Model for an Open Archival Information System:
The Monitor Designated Community function interacts with Archive
Consumers and Producers to track changes in their service
requirements and available product technologies. [...] This function may
be accomplished via surveys, via a periodic formal review process, via
community workshops [...]. It provides reports, requirements alerts
and emerging standards to the Develop Preservation Strategies and
Standards function. It sends preservation requirements to Develop
Packaging Designs.
“
“
13
Revolving feedback
Build upon relationships acquired during discovery phase
Develop processes for continuous & iterative improvement
Support communication workflows along with production
workflows
14
Build for future scale
Exponential growth and problems related to it
Large untapped sources
A/V materials
Research data
Whole major schools
Unexpected usage patterns and needs
15
Where we are at
Gathered and summarized all feedback
Working on reconciling top-down and bottom-up approaches
Preparing RFP
Hiring developers and change manager
16
Key artifacts for Phase 1
User requirements catalog from stakeholder input
Technical foundational principles & requirements
Weight matrix of requirements (MoSCoW notation)
Persona profiles
Abstract reference content model
RFP
17
Conclusions
Allocating time and budget for long discovery phase paid off
Approaching the project with an unbiased, fact-driven mindset
Unexpected priorities and direction emerged
Having an open mind requires an open mind from our partners' end
18
Thank you
https://sites.harvard.edu/drs-futures/
19

More Related Content

Similar to Stefano_Cossu_OR23_deck.pdf

VIVO at the University of Idaho
VIVO at the University of IdahoVIVO at the University of Idaho
VIVO at the University of Idahoanniegaines
 
The Software Sustainability Institute and engagement with the Digital Humanities
The Software Sustainability Institute and engagement with the Digital HumanitiesThe Software Sustainability Institute and engagement with the Digital Humanities
The Software Sustainability Institute and engagement with the Digital HumanitiesShoaib Sufi
 
The D4Science Infrastructure to Support Academic Courses
The D4Science Infrastructure to Support Academic CoursesThe D4Science Infrastructure to Support Academic Courses
The D4Science Infrastructure to Support Academic CoursesBlue BRIDGE
 
Supporting open science oriented skills building by virtual research environm...
Supporting open science oriented skills building by virtual research environm...Supporting open science oriented skills building by virtual research environm...
Supporting open science oriented skills building by virtual research environm...Blue BRIDGE
 
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...DuraSpace
 
Software Sustainability Institute
Software Sustainability InstituteSoftware Sustainability Institute
Software Sustainability InstituteNeil Chue Hong
 
The DCC: Helping you curate your reputation
The DCC: Helping you curate your reputationThe DCC: Helping you curate your reputation
The DCC: Helping you curate your reputationJisc
 
NSF SI2 program discussion at 2013 SI2 PI meeting
NSF SI2 program discussion at 2013 SI2 PI meetingNSF SI2 program discussion at 2013 SI2 PI meeting
NSF SI2 program discussion at 2013 SI2 PI meetingDaniel S. Katz
 
Research Data Support at the University of Edinburgh
Research Data Support at the University of EdinburghResearch Data Support at the University of Edinburgh
Research Data Support at the University of EdinburghRobin Rice
 
Scientific Software Innovation Institutes (S2I2s) as part of NSF’s SI2 program
Scientific Software Innovation Institutes (S2I2s) as part of NSF’s SI2 programScientific Software Innovation Institutes (S2I2s) as part of NSF’s SI2 program
Scientific Software Innovation Institutes (S2I2s) as part of NSF’s SI2 programDaniel S. Katz
 
SGCI Science Gateways: Ushering in a New Era of Sustainability
SGCI Science Gateways: Ushering in a New Era of Sustainability SGCI Science Gateways: Ushering in a New Era of Sustainability
SGCI Science Gateways: Ushering in a New Era of Sustainability Sandra Gesing
 
Bridging Gaps and Broadening Participation in Today's and Future Research Com...
Bridging Gaps and Broadening Participation inToday's and Future Research Com...Bridging Gaps and Broadening Participation inToday's and Future Research Com...
Bridging Gaps and Broadening Participation in Today's and Future Research Com...Sandra Gesing
 
Heather Williamson, Dvle Call2010
Heather Williamson, Dvle Call2010Heather Williamson, Dvle Call2010
Heather Williamson, Dvle Call2010Sheila MacNeill
 
Hans Hofman - European Perspectives on Digital Preservation
Hans Hofman - European Perspectives on Digital PreservationHans Hofman - European Perspectives on Digital Preservation
Hans Hofman - European Perspectives on Digital PreservationNational Digital Forum
 
ENVRIPLUS Data for Science Theme
ENVRIPLUS Data for Science ThemeENVRIPLUS Data for Science Theme
ENVRIPLUS Data for Science ThemeEUDAT
 
IT Infrastructure for the Digital Humanities Observatory
IT Infrastructure for the Digital Humanities ObservatoryIT Infrastructure for the Digital Humanities Observatory
IT Infrastructure for the Digital Humanities ObservatoryDon Gourley
 

Similar to Stefano_Cossu_OR23_deck.pdf (20)

VIVO at the University of Idaho
VIVO at the University of IdahoVIVO at the University of Idaho
VIVO at the University of Idaho
 
The Software Sustainability Institute and engagement with the Digital Humanities
The Software Sustainability Institute and engagement with the Digital HumanitiesThe Software Sustainability Institute and engagement with the Digital Humanities
The Software Sustainability Institute and engagement with the Digital Humanities
 
AUTH practice
AUTH practiceAUTH practice
AUTH practice
 
The D4Science Infrastructure to Support Academic Courses
The D4Science Infrastructure to Support Academic CoursesThe D4Science Infrastructure to Support Academic Courses
The D4Science Infrastructure to Support Academic Courses
 
Sgci esip-7-20-18
Sgci esip-7-20-18Sgci esip-7-20-18
Sgci esip-7-20-18
 
Supporting open science oriented skills building by virtual research environm...
Supporting open science oriented skills building by virtual research environm...Supporting open science oriented skills building by virtual research environm...
Supporting open science oriented skills building by virtual research environm...
 
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
 
Software Sustainability Institute
Software Sustainability InstituteSoftware Sustainability Institute
Software Sustainability Institute
 
Sgci xsede-gateways-07-08-16
Sgci xsede-gateways-07-08-16Sgci xsede-gateways-07-08-16
Sgci xsede-gateways-07-08-16
 
The DCC: Helping you curate your reputation
The DCC: Helping you curate your reputationThe DCC: Helping you curate your reputation
The DCC: Helping you curate your reputation
 
Sustainability Training Workshop - Intro to the SSI
Sustainability Training Workshop - Intro to the SSISustainability Training Workshop - Intro to the SSI
Sustainability Training Workshop - Intro to the SSI
 
NSF SI2 program discussion at 2013 SI2 PI meeting
NSF SI2 program discussion at 2013 SI2 PI meetingNSF SI2 program discussion at 2013 SI2 PI meeting
NSF SI2 program discussion at 2013 SI2 PI meeting
 
Research Data Support at the University of Edinburgh
Research Data Support at the University of EdinburghResearch Data Support at the University of Edinburgh
Research Data Support at the University of Edinburgh
 
Scientific Software Innovation Institutes (S2I2s) as part of NSF’s SI2 program
Scientific Software Innovation Institutes (S2I2s) as part of NSF’s SI2 programScientific Software Innovation Institutes (S2I2s) as part of NSF’s SI2 program
Scientific Software Innovation Institutes (S2I2s) as part of NSF’s SI2 program
 
SGCI Science Gateways: Ushering in a New Era of Sustainability
SGCI Science Gateways: Ushering in a New Era of Sustainability SGCI Science Gateways: Ushering in a New Era of Sustainability
SGCI Science Gateways: Ushering in a New Era of Sustainability
 
Bridging Gaps and Broadening Participation in Today's and Future Research Com...
Bridging Gaps and Broadening Participation inToday's and Future Research Com...Bridging Gaps and Broadening Participation inToday's and Future Research Com...
Bridging Gaps and Broadening Participation in Today's and Future Research Com...
 
Heather Williamson, Dvle Call2010
Heather Williamson, Dvle Call2010Heather Williamson, Dvle Call2010
Heather Williamson, Dvle Call2010
 
Hans Hofman - European Perspectives on Digital Preservation
Hans Hofman - European Perspectives on Digital PreservationHans Hofman - European Perspectives on Digital Preservation
Hans Hofman - European Perspectives on Digital Preservation
 
ENVRIPLUS Data for Science Theme
ENVRIPLUS Data for Science ThemeENVRIPLUS Data for Science Theme
ENVRIPLUS Data for Science Theme
 
IT Infrastructure for the Digital Humanities Observatory
IT Infrastructure for the Digital Humanities ObservatoryIT Infrastructure for the Digital Humanities Observatory
IT Infrastructure for the Digital Humanities Observatory
 

More from Stefano Cossu

The Oxford Common File Layout
The Oxford Common File LayoutThe Oxford Common File Layout
The Oxford Common File LayoutStefano Cossu
 
Scossu gdi iiif_r+d_report_2019
Scossu gdi iiif_r+d_report_2019Scossu gdi iiif_r+d_report_2019
Scossu gdi iiif_r+d_report_2019Stefano Cossu
 
Brace yourselves, the Archives are Coming – Code4Lib 2020, Pittsburgh
Brace yourselves, the Archives are Coming – Code4Lib 2020, PittsburghBrace yourselves, the Archives are Coming – Code4Lib 2020, Pittsburgh
Brace yourselves, the Archives are Coming – Code4Lib 2020, PittsburghStefano Cossu
 
IIIF at the Getty: Vision & Tactics
IIIF at the Getty: Vision & TacticsIIIF at the Getty: Vision & Tactics
IIIF at the Getty: Vision & TacticsStefano Cossu
 
Reconciliation is a Necessity – IIIF Meeting, Edinburgh 2018
Reconciliation is a Necessity – IIIF Meeting, Edinburgh 2018 Reconciliation is a Necessity – IIIF Meeting, Edinburgh 2018
Reconciliation is a Necessity – IIIF Meeting, Edinburgh 2018 Stefano Cossu
 
Labours of Love & Convenience - Open Repositories 2018
Labours of Love & Convenience - Open Repositories 2018Labours of Love & Convenience - Open Repositories 2018
Labours of Love & Convenience - Open Repositories 2018Stefano Cossu
 
Cossu ford the_lake_experience_mw2017
Cossu ford the_lake_experience_mw2017Cossu ford the_lake_experience_mw2017
Cossu ford the_lake_experience_mw2017Stefano Cossu
 
A Little Sweat Goes A Long Way - Museums and The Web 2016
A Little Sweat Goes A Long Way - Museums and The Web 2016A Little Sweat Goes A Long Way - Museums and The Web 2016
A Little Sweat Goes A Long Way - Museums and The Web 2016Stefano Cossu
 
Libraries, Archives, Museums discussion - MCN 2015
Libraries, Archives, Museums discussion - MCN 2015Libraries, Archives, Museums discussion - MCN 2015
Libraries, Archives, Museums discussion - MCN 2015Stefano Cossu
 
AIC Linked Open Data panel Museums and the Web 2015
AIC Linked Open Data panel Museums and the Web 2015AIC Linked Open Data panel Museums and the Web 2015
AIC Linked Open Data panel Museums and the Web 2015Stefano Cossu
 
Stefano Cossu, The Art Institute of Chicago - Open Repositories 2014 presenta...
Stefano Cossu, The Art Institute of Chicago - Open Repositories 2014 presenta...Stefano Cossu, The Art Institute of Chicago - Open Repositories 2014 presenta...
Stefano Cossu, The Art Institute of Chicago - Open Repositories 2014 presenta...Stefano Cossu
 

More from Stefano Cossu (12)

The Oxford Common File Layout
The Oxford Common File LayoutThe Oxford Common File Layout
The Oxford Common File Layout
 
Scossu gdi iiif_r+d_report_2019
Scossu gdi iiif_r+d_report_2019Scossu gdi iiif_r+d_report_2019
Scossu gdi iiif_r+d_report_2019
 
Brace yourselves, the Archives are Coming – Code4Lib 2020, Pittsburgh
Brace yourselves, the Archives are Coming – Code4Lib 2020, PittsburghBrace yourselves, the Archives are Coming – Code4Lib 2020, Pittsburgh
Brace yourselves, the Archives are Coming – Code4Lib 2020, Pittsburgh
 
Behind 12 sunsets
Behind 12 sunsetsBehind 12 sunsets
Behind 12 sunsets
 
IIIF at the Getty: Vision & Tactics
IIIF at the Getty: Vision & TacticsIIIF at the Getty: Vision & Tactics
IIIF at the Getty: Vision & Tactics
 
Reconciliation is a Necessity – IIIF Meeting, Edinburgh 2018
Reconciliation is a Necessity – IIIF Meeting, Edinburgh 2018 Reconciliation is a Necessity – IIIF Meeting, Edinburgh 2018
Reconciliation is a Necessity – IIIF Meeting, Edinburgh 2018
 
Labours of Love & Convenience - Open Repositories 2018
Labours of Love & Convenience - Open Repositories 2018Labours of Love & Convenience - Open Repositories 2018
Labours of Love & Convenience - Open Repositories 2018
 
Cossu ford the_lake_experience_mw2017
Cossu ford the_lake_experience_mw2017Cossu ford the_lake_experience_mw2017
Cossu ford the_lake_experience_mw2017
 
A Little Sweat Goes A Long Way - Museums and The Web 2016
A Little Sweat Goes A Long Way - Museums and The Web 2016A Little Sweat Goes A Long Way - Museums and The Web 2016
A Little Sweat Goes A Long Way - Museums and The Web 2016
 
Libraries, Archives, Museums discussion - MCN 2015
Libraries, Archives, Museums discussion - MCN 2015Libraries, Archives, Museums discussion - MCN 2015
Libraries, Archives, Museums discussion - MCN 2015
 
AIC Linked Open Data panel Museums and the Web 2015
AIC Linked Open Data panel Museums and the Web 2015AIC Linked Open Data panel Museums and the Web 2015
AIC Linked Open Data panel Museums and the Web 2015
 
Stefano Cossu, The Art Institute of Chicago - Open Repositories 2014 presenta...
Stefano Cossu, The Art Institute of Chicago - Open Repositories 2014 presenta...Stefano Cossu, The Art Institute of Chicago - Open Repositories 2014 presenta...
Stefano Cossu, The Art Institute of Chicago - Open Repositories 2014 presenta...
 

Recently uploaded

Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)Paige Cruz
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...marcuskenyatta275
 
ADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxFIDO Alliance
 
Top 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTop 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTopCSSGallery
 
Vector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptxVector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptxjbellis
 
Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024Hiroshi SHIBATA
 
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptxCyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptxMasterG
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...panagenda
 
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdfFrisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdfAnubhavMangla3
 
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...ScyllaDB
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuidePixlogix Infotech
 
Syngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdfSyngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdfSyngulon
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe中 央社
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfSrushith Repakula
 
Revolutionizing SAP® Processes with Automation and Artificial Intelligence
Revolutionizing SAP® Processes with Automation and Artificial IntelligenceRevolutionizing SAP® Processes with Automation and Artificial Intelligence
Revolutionizing SAP® Processes with Automation and Artificial IntelligencePrecisely
 
UiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overviewUiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overviewDianaGray10
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceSamy Fodil
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftshyamraj55
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptxFIDO Alliance
 

Recently uploaded (20)

Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
 
ADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptx
 
Top 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTop 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development Companies
 
Vector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptxVector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptx
 
Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024
 
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptxCyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
 
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdfFrisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
 
Overview of Hyperledger Foundation
Overview of Hyperledger FoundationOverview of Hyperledger Foundation
Overview of Hyperledger Foundation
 
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
Syngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdfSyngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdf
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
Revolutionizing SAP® Processes with Automation and Artificial Intelligence
Revolutionizing SAP® Processes with Automation and Artificial IntelligenceRevolutionizing SAP® Processes with Automation and Artificial Intelligence
Revolutionizing SAP® Processes with Automation and Artificial Intelligence
 
UiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overviewUiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overview
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM Performance
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoft
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
 

Stefano_Cossu_OR23_deck.pdf

  • 1. A Generational Refresh: Continuing Harvard's Digital Repository Service legacy of innovation Stefano Cossu, Harvard University IT stefano_cossu@harvard.edu Open Repositories 2023, Stellenbosch University Cape Town, South Africa
  • 2. Harvard's Digital Repository Service (DRS) Established in 2000 No viable DP solutions at the time Needed to address specific, complex needs of HUIT 2
  • 3. DRS today 10 million objects 900 million replicated files 2Pb replicated data 63 departments Actively supported 3
  • 4. DRS challenges today Technical debt Inflexible content model Inefficient UI/UX 4
  • 5. DRS Futures 3-year capital-funded R&D and implementation project Phase 1: discovery (Jul 2022 – Jun 2023) Phase 2: planning (Jul – Dec 2023) Phase 3: implementation (Jan 2024 – Jun 2025) Open to different options (commercial, open source, home built) Opportunity to re-envision digital preservation 5
  • 6. DRS Futures team Leaderless group HUIT (central IT) and Library members Made up of diverse skills, seniority levels, and backgrounds Connected with many involved departments Highly collaborative with specialized task forces 6
  • 7. Dual approach to the problem Inductive (bottom-up) Department-specific interviews (focused on workflows) Cross-department focus groups (focused on specific areas) Office hours (free format) Deductive (top-down) Build up from previous experience Set DP theoretical foundations & long-term vision Anticipate challenges of a more capable system 7
  • 8. DRS Futures tenets Separation of storage and services Separation of archive and workspace Task automation Re-envision digital preservation Revolving feedback Build for future scale 8
  • 9. Separation of storage and services Data store migrated to OCFL in 2022 (>1 year timeline) Plan to keep storage fabric intact Replace services on top of OCFL 9
  • 10. Separation of archive and workspace Provide users with a mid-term workspace services & store Keep OCFL focused on preservation Keep solution search focused on discrete functional areas Multiple products can fulfill parts of the complete solution 10
  • 11. Task automation Remove repetitive tasks from staff duties Set up event-driven architecture Move preservation actions to the background Added complexity paid off by volume 11
  • 12. Re-envision digital preservation Preserving the semantic context as well as the content Archival resources are live materials, changing over time Facilitate reuse and cumulative evolution of information 12
  • 13. Revolving feedback From Reference Model for an Open Archival Information System: The Monitor Designated Community function interacts with Archive Consumers and Producers to track changes in their service requirements and available product technologies. [...] This function may be accomplished via surveys, via a periodic formal review process, via community workshops [...]. It provides reports, requirements alerts and emerging standards to the Develop Preservation Strategies and Standards function. It sends preservation requirements to Develop Packaging Designs. “ “ 13
  • 14. Revolving feedback Build upon relationships acquired during discovery phase Develop processes for continuous & iterative improvement Support communication workflows along with production workflows 14
  • 15. Build for future scale Exponential growth and problems related to it Large untapped sources A/V materials Research data Whole major schools Unexpected usage patterns and needs 15
  • 16. Where we are at Gathered and summarized all feedback Working on reconciling top-down and bottom-up approaches Preparing RFP Hiring developers and change manager 16
  • 17. Key artifacts for Phase 1 User requirements catalog from stakeholder input Technical foundational principles & requirements Weight matrix of requirements (MoSCoW notation) Persona profiles Abstract reference content model RFP 17
  • 18. Conclusions Allocating time and budget for long discovery phase paid off Approaching the project with an unbiased, fact-driven mindset Unexpected priorities and direction emerged Having an open mind requires an open mind from our partners' end 18