Tim Donohue
DSpace + DuraCloud Integrations
DuraSpace


Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                         To request other use: info@duracloud.org
Basis for DSpace Integration

1. DSpace AIP Backup & Restore (1.7+)
   (Initial DuraCloud use case: Backup & Restore)
2. DSpace Curation Task System (1.7+)
3. DSpace Replication Task Suite (1.8)



Intro to Archival Info Pkgs (1.7+)

• Primary Use Cases
  – Backup & Restore of DSpace Content
     • All content or just partial (Community/Collection/Item)
  – Migration/Export of DSpace Content
     • All content or just partial (Community/Collection/Item)
  – DuraCloud Integration




How to Backup DSpace (pre-1.7)

Diagram: Database -> Full Database Backup; Assetstore Folder -> Folder Backup



How to Restore All (pre-1.7)

Diagram: Full Database Backup -> Database; Folder Backup -> Assetstore Folder



How to Restore a Collection (pre-1.7)

Diagram: Full Database Backup -> Temporary Database -> Database; Folder Backup -> Temporary Folder? -> Assetstore Folder
Backup via Archival Info Pkgs

Diagram: AIP backup made up of a package for each Community, Collection & Item
Restore All via Archival Info Pkgs

Diagram: everything is restored from the AIP backup, using the package for each Community, Collection & Item
Restore a Collection via AIPs

Diagram: from the AIP backup, (1) restore the Collection AIP, then (2) restore the Items in Collection
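
The two numbered steps on this slide (Collection AIP first, then the Items in that Collection) amount to restoring containers before their contents. A minimal sketch of that ordering follows. The type names, the `restore_order` helper and the file names are hypothetical and not part of DSpace, and the sketch does not order sub-communities relative to their parent communities.

```python
from typing import Iterable, List, Tuple

# Hypothetical ranking: containers are restored before the objects they hold.
RESTORE_RANK = {"SITE": 0, "COMMUNITY": 1, "COLLECTION": 2, "ITEM": 3}


def restore_order(aips: Iterable[Tuple[str, str]]) -> List[str]:
    """Return AIP file names sorted so that parents come before children.

    `aips` is an iterable of (object_type, file_name) pairs. The sort is
    stable, so AIPs of the same type keep their original relative order.
    """
    ranked = sorted(aips, key=lambda aip: RESTORE_RANK[aip[0]])
    return [file_name for _object_type, file_name in ranked]


backup = [
    ("ITEM", "item-1.zip"),
    ("COLLECTION", "collection.zip"),
    ("ITEM", "item-2.zip"),
]
print(restore_order(backup))  # ['collection.zip', 'item-1.zip', 'item-2.zip']
```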
What’s in an AIP?

Archival Information Package (AIP):
• METS (DIM / MODS / PREMIS / METSRights)
• License
• Content Files or Logos
• Other Files in Bundles (optional)

*A BagIt version is also in the works

What’s in an AIP?

Diagram: an AIP (License; METS with DIM / MODS / PREMIS / METSRights; Content Files or Logos; optional Other Files in Bundles) linked to Related Object AIPs

• Descriptive Metadata: DIM & MODS
• Tech/Preservation Metadata: PREMIS
• Rights Metadata: METSRights
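
Since an AIP is a zip file holding a METS manifest plus content files, the metadata formats it carries can be listed by reading that manifest. The sketch below is illustrative only. It assumes the manifest is named `mets.xml` and that non-standard formats such as DIM are declared with `MDTYPE="OTHER"` plus an `OTHERMDTYPE` attribute; both should be checked against a real AIP. The sample package is synthetic and far smaller than anything DSpace would produce.

```python
import io
import xml.etree.ElementTree as ET
import zipfile

METS_NS = "http://www.loc.gov/METS/"


def metadata_types(aip_zip, manifest="mets.xml"):
    """List the metadata formats declared by mdWrap elements in the manifest."""
    with zipfile.ZipFile(aip_zip) as zf:
        root = ET.fromstring(zf.read(manifest))
    found = []
    for wrap in root.iter(f"{{{METS_NS}}}mdWrap"):
        md_type = wrap.get("MDTYPE")
        if md_type == "OTHER":
            # Assumption: the real format name is carried in OTHERMDTYPE.
            md_type = wrap.get("OTHERMDTYPE", md_type)
        if md_type and md_type not in found:
            found.append(md_type)
    return found


def make_sample_aip():
    """Build a tiny synthetic AIP in memory (not a real DSpace export)."""
    mets = f"""<mets xmlns="{METS_NS}">
  <dmdSec ID="dmd_1"><mdWrap MDTYPE="MODS"/></dmdSec>
  <dmdSec ID="dmd_2"><mdWrap MDTYPE="OTHER" OTHERMDTYPE="DIM"/></dmdSec>
  <amdSec ID="amd_1">
    <techMD ID="tech_1"><mdWrap MDTYPE="PREMIS"/></techMD>
    <rightsMD ID="rights_1"><mdWrap MDTYPE="OTHER" OTHERMDTYPE="METSRights"/></rightsMD>
  </amdSec>
</mets>"""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as zf:
        zf.writestr("mets.xml", mets)
        zf.writestr("bitstream_1.pdf", b"placeholder content")
    buffer.seek(0)
    return buffer


print(metadata_types(make_sample_aip()))  # ['MODS', 'DIM', 'PREMIS', 'METSRights']
```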
The “Site” AIP

Diagram: Site AIP containing METS (DIM / MODS / PREMIS / METSRights), linked to the Top-Level Community AIPs

Special AIP for site-wide info/metadata (e.g. Group Memberships, EPeople)

What can AIPs restore?
Restore All In-Archive Content (Files + Metadata)
Restore All People & Groups
Restore All Permissions / Access Rights
Restore Community / Collection Logos, Metadata, Rights & Item Templates
Restore Community / Collection / Item Hierarchy
Not restored from AIPs, according to the DSpace AIP Backup & Restore documentation:
In-Process / Incomplete Items
Collection OAI-PMH/ORE Harvest Settings
Configuration files (dspace.cfg, etc.)
Migrate a Collection

Diagram: One DSpace Install -> Another DSpace Install, in two numbered steps covering the Collection AIP and the Items in Collection

Migrate Content

Diagram: One DSpace Install -> (Future work), in two numbered steps covering the Collection AIP and the Items in Collection

DuraCloud Integration (1.7.x)

[1] ./dspace packager -d
[2] java -jar synctool.jar

Diagram: step 1 writes a package for each Community, Collection & Item to a Local “Watch” Folder
DuraCloud Integration (1.7.x)

[1] java -jar retrievaltool.jar
[2] ./dspace packager -r

Diagram: step 2 reads the package for each Community, Collection & Item from a Local Folder
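
The two 1.7.x workflows are mirror images: backup runs the packager and then the sync tool, while restore runs the retrieval tool and then the packager. The sketch below only assembles the command lines so the ordering is explicit; it does not run anything. Only `-d`, `-r` and the two jar names come from the slides. The other packager options (`-a`, `-t AIP`, `-e`, `-i`) are recalled from the AIP Backup & Restore documentation and should be verified against your DSpace version. The e-mail address, handle and paths are placeholders, and the sync and retrieval tool options are deliberately left to the caller rather than guessed.

```python
import shlex
from typing import List, Sequence


def backup_steps(eperson: str, handle: str, aip_path: str,
                 sync_args: Sequence[str] = ()) -> List[List[str]]:
    """Backup: [1] export AIPs with the packager, [2] sync them to DuraCloud."""
    return [
        # -d is from the slide; -a, -t, -e and -i are assumptions to verify.
        ["./dspace", "packager", "-d", "-a", "-t", "AIP",
         "-e", eperson, "-i", handle, aip_path],
        # Sync tool options are supplied by the caller, not invented here.
        ["java", "-jar", "synctool.jar", *sync_args],
    ]


def restore_steps(eperson: str, aip_path: str,
                  retrieval_args: Sequence[str] = ()) -> List[List[str]]:
    """Restore: [1] retrieve AIPs from DuraCloud, [2] ingest with the packager."""
    return [
        ["java", "-jar", "retrievaltool.jar", *retrieval_args],
        # -r is from the slide; -a, -t and -e are assumptions to verify.
        ["./dspace", "packager", "-r", "-a", "-t", "AIP", "-e", eperson, aip_path],
    ]


for step in backup_steps("admin@example.edu", "123456789/1", "watch/site-aip.zip"):
    print(shlex.join(step))
```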
New: DSpace Replication Suite in 1.8




DSpace Curation System (1.7+)

• Enables a basic ‘microservices’ approach to curating DSpace objects
• Anyone can build a task & share it.
• Currently tasks must be written in Java
  – Working on JRuby & Jython integration (1.8?)
• “Frees” admin tasks from Command Line
  – Can now run from Admin UI or CLI
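
The ‘microservices’ idea can be pictured as small, single-purpose tasks that a runner applies to an object and everything beneath it. The sketch below illustrates that pattern in Python; it is not the DSpace curation API, which is Java. The `Node` class, the `curate` runner, the status strings and the `has_children` task are all hypothetical, and it assumes a task may be applied to a container and its members.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Node:
    """Hypothetical stand-in for a Community, Collection or Item."""
    handle: str
    children: List["Node"] = field(default_factory=list)


# A task is any callable that inspects one object and reports a status string.
Task = Callable[[Node], str]


def curate(task: Task, root: Node) -> Dict[str, str]:
    """Apply one task to `root` and every object below it; map handle -> status."""
    results = {root.handle: task(root)}
    for child in root.children:
        results.update(curate(task, child))
    return results


def has_children(node: Node) -> str:
    """Example task: succeed on containers that hold something, skip the rest."""
    return "success" if node.children else "skip"


collection = Node("123456789/1", [Node("123456789/2"), Node("123456789/3")])
print(curate(has_children, collection))
```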


DSpace Replication Suite (1.8)

• A set of curation tasks geared towards ‘replicating’ (backup/restore/audit) content
• “Wraps” 1.7 DSpace AIP Backup & Restore

• Backup content to AIP (filesystem or DuraCloud)
• Restore/Replace from AIP
• Audit AIP (compare to DSpace content)
• Basic IO Tracking of AIP Uploads/Downloads
• All replication tasks can be run via Admin UI
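
The audit step, comparing an AIP to the DSpace content, can be thought of as a checksum comparison. The sketch below shows one way such a check could work. It is not the Replication Task Suite's implementation; the `audit` function, the report keys and the use of MD5 are assumptions made for illustration.

```python
import hashlib
from typing import Dict, List


def md5_hex(data: bytes) -> str:
    """Checksum helper; the choice of MD5 here is an assumption."""
    return hashlib.md5(data).hexdigest()


def audit(aip_files: Dict[str, bytes],
          repo_checksums: Dict[str, str]) -> Dict[str, List[str]]:
    """Compare files found in an AIP with checksums recorded in the repository.

    aip_files maps file name -> bytes read from the AIP.
    repo_checksums maps file name -> checksum recorded for the live object.
    """
    report = {"missing_from_aip": [], "missing_from_repo": [], "mismatched": []}
    for name, expected in sorted(repo_checksums.items()):
        if name not in aip_files:
            report["missing_from_aip"].append(name)
        elif md5_hex(aip_files[name]) != expected:
            report["mismatched"].append(name)
    report["missing_from_repo"] = sorted(set(aip_files) - set(repo_checksums))
    return report


aip = {"a.pdf": b"alpha", "b.pdf": b"changed", "d.pdf": b"extra"}
repo = {"a.pdf": md5_hex(b"alpha"), "b.pdf": md5_hex(b"beta"), "c.pdf": md5_hex(b"gamma")}
print(audit(aip, repo))
```

An empty report in all three lists would mean the AIP and the repository agree on every file.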

DuraCloud Integration (1.8.x)

“Replication Task Suite”:
• Suite of Curation Tasks
• One step Sync process
• Via UI or CLI

Diagram: Command line OR Curation Tools; a single step moves the package for each Community, Collection & Item through a Local Temp Folder (Cache)
DuraCloud Integration (1.8.x)

“Replication Task Suite”:
• One step Retrieve process
• Via UI or CLI
• Also ‘auditing’ tools

Diagram: Command line OR Curation Tools; a single step retrieves the package for each Community, Collection & Item through a Local Temp Folder (Cache)
DSpace Replication Suite Demo




Known Limitations

• Cannot yet take advantage of DuraCloud streaming capabilities (AIPs are zip files)
• Cannot yet take advantage of DuraCloud transformation services (AIPs are zip files)




Next Steps

• Working towards “unzipped” AIPs (1.8?)
  – METS file & Content files stored in an AIP ‘folder’
    but NOT zipped up
  – Support for DuraCloud streaming, etc.
• DSpace UI Streaming Integration (@mire)
• ‘Auto-Sync’ options
  – Updates in DSpace -> DuraCloud (queued?)
  – Updates via DuraCloud services -> DSpace?
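
The first item above, an “unzipped” AIP, would keep the METS file and the content files side by side in a folder instead of inside a zip. A minimal sketch of producing that layout from a zipped AIP follows. The `unzip_aip` helper and the sample file names are hypothetical, and the sample package is synthetic rather than a real DSpace export.

```python
import io
import os
import tempfile
import zipfile


def unzip_aip(aip_zip, target_dir):
    """Extract a zipped AIP into target_dir; return the sorted relative paths."""
    with zipfile.ZipFile(aip_zip) as zf:
        zf.extractall(target_dir)
    found = []
    for base, _dirs, files in os.walk(target_dir):
        for name in files:
            found.append(os.path.relpath(os.path.join(base, name), target_dir))
    return sorted(found)


def sample_zip():
    """Build a tiny synthetic AIP in memory: one manifest and one content file."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as zf:
        zf.writestr("mets.xml", "<mets/>")
        zf.writestr("bitstream_1.pdf", b"placeholder")
    buffer.seek(0)
    return buffer


with tempfile.TemporaryDirectory() as folder:
    print(unzip_aip(sample_zip(), folder))  # ['bitstream_1.pdf', 'mets.xml']
```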

In Large Thanks to…

• MIT : Richard Rodgers & Wendy Bossons
  – Developed Curation Task Framework
  – Developed initial Replication Suite tasks
• @mire : Mark Diggory
  – Look for @mire’s “Integrating DuraCloud Services
    in DSpace” talk on Friday at 3:30pm




For More Information

• Replication Task Suite:
  – https://wiki.duraspace.org/display/DSPACE/ReplicationTaskSuite
• AIP Backup & Restore:
  – https://wiki.duraspace.org/display/DSDOC/AIP+Backup+and+Restore
• Curation Task System:
  – https://wiki.duraspace.org/display/DSDOC/Curation+System
Photo/Icon Acknowledgments
•   Package : http://www.flickr.com/photos/halfbisqued/2353845688/
•   Harddrive & Terminal icons: http://tango.freedesktop.org/Tango_Desktop_Project
•   Folder icon: http://www.openclipart.org/detail/13740
•   Database icon: http://www.openclipart.org/detail/68413
•   Zip Pkg icon: http://veryicon.com/icons/system/capital-icon-suite-mac/zip-10.html
•   File icons: http://veryicon.com/icons/system/rhor-v2-part-3/
•   Checkmark & Delete icons: http://veryicon.com/icons/system/on-stage/
•   Tools Icon: http://veryicon.com/icons/system/azullustre/




                 Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                          To request other use: info@duracloud.org

More Related Content

Similar to DSpace & DuraCloud Integrations

Online storage for the masses and the case of pithos
Online storage for the masses and the case of pithosOnline storage for the masses and the case of pithos
Online storage for the masses and the case of pithosnkoziris
 
CS9222 ADVANCED OPERATING SYSTEMS
CS9222 ADVANCED OPERATING SYSTEMSCS9222 ADVANCED OPERATING SYSTEMS
CS9222 ADVANCED OPERATING SYSTEMSKathirvel Ayyaswamy
 
storage & file strucure in dbms
storage & file strucure in dbmsstorage & file strucure in dbms
storage & file strucure in dbmssachin2690
 
File system interfacefinal
File system interfacefinalFile system interfacefinal
File system interfacefinalmarangburu42
 
Integrating Fedora with DuraCloud 1-11-12
Integrating Fedora with DuraCloud 1-11-12Integrating Fedora with DuraCloud 1-11-12
Integrating Fedora with DuraCloud 1-11-12DuraSpace
 
Hadoop Distributed File System for Big Data Analytics
Hadoop Distributed File System for Big Data AnalyticsHadoop Distributed File System for Big Data Analytics
Hadoop Distributed File System for Big Data AnalyticsDrPDShebaKeziaMalarc
 
Distributed File System
Distributed File SystemDistributed File System
Distributed File SystemNtu
 
Ch11 OS
Ch11 OSCh11 OS
Ch11 OSC.U
 
Cloud Data De Duplication in Multiuser Environment DeposM2
Cloud Data De Duplication in Multiuser Environment DeposM2Cloud Data De Duplication in Multiuser Environment DeposM2
Cloud Data De Duplication in Multiuser Environment DeposM2ijtsrd
 
Chapter 10 - File System Interface
Chapter 10 - File System InterfaceChapter 10 - File System Interface
Chapter 10 - File System InterfaceWayne Jones Jnr
 
File management
File managementFile management
File managementMohd Arif
 
File system.
File system.File system.
File system.elyza12
 
(BAC309) Automating Backup and Archiving with AWS and CommVault | AWS re:Inve...
(BAC309) Automating Backup and Archiving with AWS and CommVault | AWS re:Inve...(BAC309) Automating Backup and Archiving with AWS and CommVault | AWS re:Inve...
(BAC309) Automating Backup and Archiving with AWS and CommVault | AWS re:Inve...Amazon Web Services
 

Similar to DSpace & DuraCloud Integrations (20)

Online storage for the masses and the case of pithos
Online storage for the masses and the case of pithosOnline storage for the masses and the case of pithos
Online storage for the masses and the case of pithos
 
CS9222 ADVANCED OPERATING SYSTEMS
CS9222 ADVANCED OPERATING SYSTEMSCS9222 ADVANCED OPERATING SYSTEMS
CS9222 ADVANCED OPERATING SYSTEMS
 
storage & file strucure in dbms
storage & file strucure in dbmsstorage & file strucure in dbms
storage & file strucure in dbms
 
File system interfacefinal
File system interfacefinalFile system interfacefinal
File system interfacefinal
 
Integrating Fedora with DuraCloud 1-11-12
Integrating Fedora with DuraCloud 1-11-12Integrating Fedora with DuraCloud 1-11-12
Integrating Fedora with DuraCloud 1-11-12
 
Hadoop Distributed File System for Big Data Analytics
Hadoop Distributed File System for Big Data AnalyticsHadoop Distributed File System for Big Data Analytics
Hadoop Distributed File System for Big Data Analytics
 
CIT173_Ch15_Mnstr_23.pdf
CIT173_Ch15_Mnstr_23.pdfCIT173_Ch15_Mnstr_23.pdf
CIT173_Ch15_Mnstr_23.pdf
 
009709863.pdf
009709863.pdf009709863.pdf
009709863.pdf
 
Distributed File System
Distributed File SystemDistributed File System
Distributed File System
 
OSCh11
OSCh11OSCh11
OSCh11
 
OS_Ch11
OS_Ch11OS_Ch11
OS_Ch11
 
Ch11 OS
Ch11 OSCh11 OS
Ch11 OS
 
Cloud Data De Duplication in Multiuser Environment DeposM2
Cloud Data De Duplication in Multiuser Environment DeposM2Cloud Data De Duplication in Multiuser Environment DeposM2
Cloud Data De Duplication in Multiuser Environment DeposM2
 
Windows Forensics
Windows ForensicsWindows Forensics
Windows Forensics
 
Chapter 10 - File System Interface
Chapter 10 - File System InterfaceChapter 10 - File System Interface
Chapter 10 - File System Interface
 
File management
File managementFile management
File management
 
File system.
File system.File system.
File system.
 
Personal Digital Archiving 2015 - NYU - Workshop
Personal Digital Archiving 2015 - NYU - WorkshopPersonal Digital Archiving 2015 - NYU - Workshop
Personal Digital Archiving 2015 - NYU - Workshop
 
Posscon2013
Posscon2013Posscon2013
Posscon2013
 
(BAC309) Automating Backup and Archiving with AWS and CommVault | AWS re:Inve...
(BAC309) Automating Backup and Archiving with AWS and CommVault | AWS re:Inve...(BAC309) Automating Backup and Archiving with AWS and CommVault | AWS re:Inve...
(BAC309) Automating Backup and Archiving with AWS and CommVault | AWS re:Inve...
 

More from Tim Donohue

On the Road to DSpace 7: Angular UI + REST
On the Road to DSpace 7: Angular UI + RESTOn the Road to DSpace 7: Angular UI + REST
On the Road to DSpace 7: Angular UI + RESTTim Donohue
 
Introducing the New DSpace User Interface
Introducing the New DSpace User InterfaceIntroducing the New DSpace User Interface
Introducing the New DSpace User InterfaceTim Donohue
 
DSpace UI Prototype Challenge: Spring Boot + Thymeleaf
DSpace UI Prototype Challenge: Spring Boot + ThymeleafDSpace UI Prototype Challenge: Spring Boot + Thymeleaf
DSpace UI Prototype Challenge: Spring Boot + ThymeleafTim Donohue
 
Discussion on DSpace's Two UIs : DuraSpace 2015 Summit
Discussion on DSpace's Two UIs : DuraSpace 2015 SummitDiscussion on DSpace's Two UIs : DuraSpace 2015 Summit
Discussion on DSpace's Two UIs : DuraSpace 2015 SummitTim Donohue
 
How to "Hack" the DSpace Community
How to "Hack" the DSpace CommunityHow to "Hack" the DSpace Community
How to "Hack" the DSpace CommunityTim Donohue
 
DSpace Overview / Roadmap 2014
DSpace Overview / Roadmap 2014DSpace Overview / Roadmap 2014
DSpace Overview / Roadmap 2014Tim Donohue
 
DSpace RoadMap & Vision 2013 (OR13)
DSpace RoadMap & Vision 2013 (OR13)DSpace RoadMap & Vision 2013 (OR13)
DSpace RoadMap & Vision 2013 (OR13)Tim Donohue
 
DSpace RoadMap 2012
DSpace RoadMap 2012DSpace RoadMap 2012
DSpace RoadMap 2012Tim Donohue
 
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)Tim Donohue
 
Future Trends for Repositories
Future Trends for RepositoriesFuture Trends for Repositories
Future Trends for RepositoriesTim Donohue
 
DSpace RoadMap 2011
DSpace RoadMap 2011DSpace RoadMap 2011
DSpace RoadMap 2011Tim Donohue
 
DSpace RoadMap 2010
DSpace RoadMap 2010DSpace RoadMap 2010
DSpace RoadMap 2010Tim Donohue
 
BibApp 1.0 : Information In, Impact Out
BibApp 1.0 : Information In, Impact OutBibApp 1.0 : Information In, Impact Out
BibApp 1.0 : Information In, Impact OutTim Donohue
 
Making DSpace XMLUI Your Own
Making DSpace XMLUI Your OwnMaking DSpace XMLUI Your Own
Making DSpace XMLUI Your OwnTim Donohue
 
Digital Preservation in the Wild
Digital Preservation in the WildDigital Preservation in the Wild
Digital Preservation in the WildTim Donohue
 

More from Tim Donohue (15)

On the Road to DSpace 7: Angular UI + REST
On the Road to DSpace 7: Angular UI + RESTOn the Road to DSpace 7: Angular UI + REST
On the Road to DSpace 7: Angular UI + REST
 
Introducing the New DSpace User Interface
Introducing the New DSpace User InterfaceIntroducing the New DSpace User Interface
Introducing the New DSpace User Interface
 
DSpace UI Prototype Challenge: Spring Boot + Thymeleaf
DSpace UI Prototype Challenge: Spring Boot + ThymeleafDSpace UI Prototype Challenge: Spring Boot + Thymeleaf
DSpace UI Prototype Challenge: Spring Boot + Thymeleaf
 
Discussion on DSpace's Two UIs : DuraSpace 2015 Summit
Discussion on DSpace's Two UIs : DuraSpace 2015 SummitDiscussion on DSpace's Two UIs : DuraSpace 2015 Summit
Discussion on DSpace's Two UIs : DuraSpace 2015 Summit
 
How to "Hack" the DSpace Community
How to "Hack" the DSpace CommunityHow to "Hack" the DSpace Community
How to "Hack" the DSpace Community
 
DSpace Overview / Roadmap 2014
DSpace Overview / Roadmap 2014DSpace Overview / Roadmap 2014
DSpace Overview / Roadmap 2014
 
DSpace RoadMap & Vision 2013 (OR13)
DSpace RoadMap & Vision 2013 (OR13)DSpace RoadMap & Vision 2013 (OR13)
DSpace RoadMap & Vision 2013 (OR13)
 
DSpace RoadMap 2012
DSpace RoadMap 2012DSpace RoadMap 2012
DSpace RoadMap 2012
 
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)
 
Future Trends for Repositories
Future Trends for RepositoriesFuture Trends for Repositories
Future Trends for Repositories
 
DSpace RoadMap 2011
DSpace RoadMap 2011DSpace RoadMap 2011
DSpace RoadMap 2011
 
DSpace RoadMap 2010
DSpace RoadMap 2010DSpace RoadMap 2010
DSpace RoadMap 2010
 
BibApp 1.0 : Information In, Impact Out
BibApp 1.0 : Information In, Impact OutBibApp 1.0 : Information In, Impact Out
BibApp 1.0 : Information In, Impact Out
 
Making DSpace XMLUI Your Own
Making DSpace XMLUI Your OwnMaking DSpace XMLUI Your Own
Making DSpace XMLUI Your Own
 
Digital Preservation in the Wild
Digital Preservation in the WildDigital Preservation in the Wild
Digital Preservation in the Wild
 

Recently uploaded

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 

Recently uploaded (20)

DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024

DSpace & DuraCloud Integrations

  • 1. Tim Donohue DSpace + DuraCloud Integrations DuraSpace Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 2. Basis for DSpace Integration: (1) DSpace AIP Backup & Restore (1.7+) (initial DuraCloud use case: Backup & Restore); (2) DSpace Curation Task System (1.7+); (3) DSpace Replication Task Suite (1.8)
  • 3. Intro to Archival Info Pkgs (1.7+). Primary use cases: Backup & Restore of DSpace content (all content or just partial: Community/Collection/Item); Migration/Export of DSpace content (all content or just partial: Community/Collection/Item); DuraCloud Integration
  • 4. How to Backup DSpace (pre-1.7): Database (Full Database Backup); Assetstore Folder (Folder Backup)
  • 5. How to Restore All (pre-1.7): Database (Full Database Backup); Assetstore Folder (Folder Backup)
  • 6. How to Restore a Collection (pre-1.7): Database (Full Database Backup, Temporary Database); Assetstore Folder (Folder Backup, Temporary Folder?)
  • 7. How to Restore a Collection (pre-1.7): Database (Full Database Backup, Temporary Database); Assetstore Folder (Folder Backup, Temporary Folder?)
  • 8. Backup via Archival Info Pkgs: Package for each Community, Collection & Item; AIP backup
  • 9. Restore All via Archival Info Pkgs: Package for each Community, Collection & Item; AIP backup
  • 10. Restore a Collection via AIPs: 1, 2: Collection AIP, Items in Collection; AIP backup
  • 11. What’s in an AIP? Archival Information Package (AIP): Content Files; License or Logos; Other Files; METS in Bundles (DIM / MODS / PREMIS / (optional) METSRights); *Also a BagIt version in works
  • 12. What’s in an AIP? Related Object AIPs; Content Files; License or Logos; Other Files; METS in Bundles (DIM / MODS / PREMIS / (optional) METSRights). Descriptive Metadata: DIM & MODS. Tech/Preservation Metadata: PREMIS. Rights Metadata: METSRights
  • 13. The “Site” AIP: Top-Level Community AIPs; METS (DIM / MODS / PREMIS / METSRights). Special AIP for site-wide info/metadata (e.g. Group Memberships, EPeople)
  • 14. What can AIPs restore? Restore All In-Archive Content (Files + Metadata); Restore All People & Groups; Restore All Permissions / Access Rights; Restore Community / Collection Logos, Metadata, Rights & Item Templates; Restore Community / Collection / Item Hierarchy; Restore In-Process / Incomplete Items; Restore Collection OAI-PMH/ORE Harvest Settings; Restore all configuration files (dspace.cfg, etc.)
  • 15. Migrate a Collection: One DSpace Install; Another DSpace Install; 2, 1: Collection AIP, Items in Collection
  • 16. Migrate Content: One DSpace Install; (Future work); 2, 1: Collection AIP, Items in Collection
  • 17. DuraCloud Integration (1.7.x): [1] ./dspace packager -d [2] java -jar synctool.jar; Package for each Community, Collection & Item; Local “Watch” Folder
  • 18. DuraCloud Integration (1.7.x): [1] java -jar retrievaltool.jar [2] ./dspace packager -r; Package for each Community, Collection & Item; Local Folder
  • 19. New: DSpace Replication Suite in 1.8
  • 20. DSpace Curation System (1.7+): Enables a basic ‘microservices’ approach to curating DSpace objects; Anyone can build a task & share it; Currently tasks must be written in Java (working on JRuby & Jython integration (1.8?)); “Frees” admin tasks from Command Line (can now run from Admin UI or CLI)
  • 21. DSpace Replication Suite (1.8): A set of curation tasks geared towards ‘replicating’ (backup/restore/audit) content; “Wraps” 1.7 DSpace AIP Backup & Restore; Backup content to AIP (filesystem or DuraCloud); Restore/Replace from AIP; Audit AIP (compare to DSpace content); Basic IO Tracking of AIP Upload/Downloads; All replication tasks can be run via Admin UI
  • 22. DuraCloud Integration (1.8.x). “Replication Task Suite”: Suite of Curation Tasks; One step Sync process; Via UI or CLI. OR Command line Curation Tools. Package for each Community, Collection & Item; Local Temp Folder (Cache)
  • 23. DuraCloud Integration (1.8.x). “Replication Task Suite”: One step Retrieve process; Via UI or CLI; Also ‘auditing’ tools. OR Command line Curation Tools. Package for each Community, Collection & Item; Local Temp Folder (Cache)
  • 24. DSpace Replication Suite Demo
  • 29. Known Limitations: Cannot yet take advantage of DuraCloud streaming capabilities (AIPs are zip files); Cannot yet take advantage of DuraCloud transformation services (AIPs are zip files)
  • 30. Next Steps: Working towards “unzipped” AIPs (1.8?) (METS file & Content files stored in an AIP ‘folder’ but NOT zipped up; support for DuraCloud streaming, etc.); DSpace UI Streaming Integration (@mire); ‘Auto-Sync’ options (Updates in DSpace -> DuraCloud (queued?); Updates via DuraCloud services -> DSpace?)
  • 31. In Large Thanks to… MIT: Richard Rodgers & Wendy Bossons (developed Curation Task Framework; developed initial Replication Suite tasks); @mire: Mark Diggory (look for @mire’s “Integrating DuraCloud Services in DSpace” talk on Friday at 3:30pm)
  • 32. For More Information: Replication Task Suite: https://wiki.duraspace.org/display/DSPACE/ReplicationTaskSuite; AIP Backup & Restore: https://wiki.duraspace.org/display/DSDOC/AIP+Backup+and+Restore; Curation Task System: https://wiki.duraspace.org/display/DSDOC/Curation+System
  • 33. Photo/Icon Acknowledgments: Package: http://www.flickr.com/photos/halfbisqued/2353845688/; Harddrive & Terminal icons: http://tango.freedesktop.org/Tango_Desktop_Project; Folder icon: http://www.openclipart.org/detail/13740; Database icon: http://www.openclipart.org/detail/68413; Zip Pkg icon: http://veryicon.com/icons/system/capital-icon-suite-mac/zip-10.html; File icons: http://veryicon.com/icons/system/rhor-v2-part-3/; Checkmark & Delete icons: http://veryicon.com/icons/system/on-stage/; Tools Icon: http://veryicon.com/icons/system/azullustre/
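
Slides 17 and 18 give the 1.7.x workflow only as abbreviated commands (`./dspace packager -d` and `./dspace packager -r`). The following is a minimal sketch of how those packager calls might be assembled, not the presenter's own script. It assumes the flags described in the DSpace AIP Backup and Restore documentation (`-d` disseminate, `-r` restore, `-a` include child objects, `-f` force replace, `-t AIP`, `-e` e-person, `-i` handle); check them against your DSpace version. The install path, e-mail address, handle, and file names are hypothetical placeholders, and the DuraCloud sync and retrieval tool steps are left out because their options are not shown in the slides.

```python
from typing import List, Optional


def aip_backup_command(dspace_bin: str, eperson: str, handle: str,
                       out_zip: str) -> List[str]:
    """Build the argv for exporting an object and its children as AIPs."""
    return [dspace_bin, "packager",
            "-d",            # disseminate (export) mode
            "-a",            # also export all child objects
            "-t", "AIP",     # package type
            "-e", eperson,   # e-person who runs the export
            "-i", handle,    # handle of the object to export
            out_zip]


def aip_restore_command(dspace_bin: str, eperson: str, in_zip: str,
                        handle: Optional[str] = None,
                        replace: bool = False) -> List[str]:
    """Build the argv for restoring (or force-replacing) from AIPs."""
    cmd = [dspace_bin, "packager", "-r", "-a"]
    if replace:
        cmd.append("-f")     # overwrite objects that already exist
    cmd += ["-t", "AIP", "-e", eperson]
    if handle is not None:
        cmd += ["-i", handle]
    cmd.append(in_zip)
    return cmd


if __name__ == "__main__":
    # Placeholder values; nothing is executed, the commands are only printed.
    backup = aip_backup_command("/dspace/bin/dspace", "admin@example.org",
                                "123456789/10", "/backup/collection-aip.zip")
    restore = aip_restore_command("/dspace/bin/dspace", "admin@example.org",
                                  "/backup/collection-aip.zip",
                                  handle="123456789/10", replace=True)
    print(" ".join(backup))
    print(" ".join(restore))
```

Building the argument lists separately from running them makes it easy to log or review the exact command before handing it to `subprocess.run`. In the 1.7.x workflow, the exported zip files would be written to the local folder that the DuraCloud sync tool watches.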