SlideShare a Scribd company logo
1 of 32
Fedora migration considerations
Juliet L. Hardesty
Metadata Analyst, Indiana University
Open Repositories, June 15, 2016
Fedora 3 diagram key
Repository object
Structural metadata datastream
Descriptive metadata datastream
Other metadata datastream
Indiana University President's Office records, 1937-1962.
Subject files, 1937-1962. Aeons, Board of, 1939-1940.
Fedora 3 - documents
VAA8877-06169 Fedora
DC
RELS-
EXT
METS
VAA8877-U-03624
Fedora
DC
RELS-
EXT
PDF
VAA8877-U-03624-001
Fedora
DC
RELS-
EXT
MASTER
Derivatives
MASTER-
MIX
Folder
Document in
folder
Page in document
isMemberOfCollection
isPartOf
isPartOf
Portable soft drink stand at Bowling Green
Fedora 3 - images
P02668 Fedora
DC
RELS-
EXT
METS
PURL
Redirect
isMemberOfCollection
DCMODS
Image
[Program, 2013-2014, no. 117]
Fedora 3 – time-based media
avalon:22187
Fedora
DC
RELS-
EXT
MODSRIGHTS
SECTIONS
TECH/DIS
PLAY
WORKFLOW
Fedora
DC
RELS-
EXT
STRUCTURE MATTERHORN
avalon:22191
Fedora
DC
RELS-
EXT
STRUCTURE MATTERHORN
Item
CD 1 CD 2
avalon:22189
isMemberOfCollection
TECH/DIS
PLAY
TECH/DIS
PLAY
isPartOf
isPartOf
Focusing on Fedora 3 structure
• RELS-EXT defines structure going up
• METS or special datastreams required to
define structure going down
• All in XML as datastreams on objects that
connect together down to the digital file
https://github.com/duraspace/pcdm/wiki
Portland
Common
Data
Model
Document example in PCDM
Wilcox, David and Andrew Woods. “Hands-On: Seeing Fedora 4 Firsthand.” Fedora 4 Training
Workshop. Open Repositories 2015.
METS: fileSec
METS: structMap
VAA8877-06169 VAA8877-U-03626 VAA8877-U-03626-page1
filesdocuments pages
VAA8877-U-03626Proxy VAA8877-U-03626-page1Proxy
thumb.jpg
screen.jpg
large.jpg
pcdm:
hasMember
pcdm:
hasMember
Fedora 4 structure diagram
http://localhost:8080/fcrepo/rest/VAA8877-U-
03625/pages/VAA8877-U-03625-page1Proxy
METS as PCDM in Fedora – SPARQL
query
select DISTINCT ?document ?firstPage ?nextPage where {
{ <http://localhost:8080/fcrepo/rest/VAA8877-06169> pcdm:hasMember ?document .
?document iana:first ?firstPageProxy .
?firstPageProxy ore:proxyFor ?firstPage .
}
union
{
<http://localhost:8080/fcrepo/rest/VAA8877-06169> pcdm:hasMember+ ?page .
?page pcdm:hasFile ?f .
?f ebucore:hasMimeType "image/jpeg"^^xsd:string .
?pageProxy ore:proxyFor ?page .
?pageProxy iana:next ?nextPageProxy .
?nextPageProxy ore:proxyFor ?nextPage .
}
}
SPARQL query results for structure of
folder VAA8877-06169
Transforming METS to PCDM
• Consider structure for collections using METS
• fileSec and structMap
– @GROUPID, @ID, @FILEID (unique identifiers
from both sections) in combination with <div>
structure within structMap can point to grouping
and ordering structure for objects
• Migrating easier if identifiers brought over
Considerations for structural metadata
• RELS-EXT in Fedora 3 migrated to Fedora 4
• Other structure datastreams are on you to
map
• METS – possible to map programmatically but
maybe not in a standard way
Descriptive Metadata - IU
MODS
Fedora 4 options for descriptive
metadata: Option 1
• Migration tools
– migration-utils
– fedora-migrate gem
Neither really takes advantage of Fedora 4/external triplestore
unless original is already RDF; fedora-migrate requires Hydra
Available now, keeps all metadata, nothing lost; can have RDF
statements on object if already in RDF (fedora-migrate)
Descriptive metadata to Fedora 4:
Option 2
• Map only simple statements to RDF
– Minimal descriptive metadata (title, date) or
descriptive metadata indexed for discovery (title,
date, creator, type, subject, genre, language)
– Use ontologies that allow for simple statements
Descriptive metadata to Fedora 4:
Option 2
• Map only simple statements to RDF
– Minimal descriptive metadata (title, date) or
descriptive metadata indexed for discovery (title,
date, creator, type, subject, genre, language)
– Use ontologies that allow for simple statements
Not all metadata is in RDF; changes in
ontologies/standards might not be 1:1 match with original
Creates RDF statements in Fedora 4/external triplestore; great
option if metadata is DC or other non-hierarchical standard
Descriptive metadata to Fedora 4:
Option 3
• Map complex/hierarchical information into
external triplestore
– All RDF statements (simple and complex) go into
external triplestore but only simple statements
are on Fedora 4 object
Cushman photograph – Fedora 4
Cushman photograph – external triplestore
Descriptive metadata to Fedora 4:
Option 3
• Map complex/hierarchical information into
external triplestore
– All RDF statements (simple and complex) go into
external triplestore but only simple statements
are on Fedora 4 object
Separates metadata between repository and triplestore,
problematic if original metadata not kept in repository
More/all metadata available as RDF; updates easier to manage
through triplestore/Fedora 4 functionality
Descriptive metadata to Fedora 4:
Almost Option 4
• MODS “unofficial standard” mapping to RDF
– MODS and RDF Group’s mapping/transformation
scenario (http://mods2rdf.xyz/)
– Available as conversion code to see MODS mapped
into Fedora 4
Work in progress (not always up, transforming few elements);
you might not agree with suggested mappings
Available now to try; might offer standard way to transform
MODS similarly across institutions
Considerations for descriptive
metadata
• Current state of descriptive metadata
• Management needs of Fedora 4
• Transform to RDF statements or also to Linked
Data
• Decision: keep original descriptive metadata
or not?
– At IU, keep original descriptive metadata (for full
item view)
Thank you!
• Julie Hardesty
• jlhardes@iu.edu
• @jlhardes

More Related Content

What's hot

Another RDF Encoding Form
Another RDF Encoding FormAnother RDF Encoding Form
Another RDF Encoding FormJakob .
 
The OpenOffice.org ODF Toolkit Project
The OpenOffice.org ODF Toolkit ProjectThe OpenOffice.org ODF Toolkit Project
The OpenOffice.org ODF Toolkit ProjectAlexandro Colorado
 
FedX - Optimization Techniques for Federated Query Processing on Linked Data
FedX - Optimization Techniques for Federated Query Processing on Linked DataFedX - Optimization Techniques for Federated Query Processing on Linked Data
FedX - Optimization Techniques for Federated Query Processing on Linked Dataaschwarte
 
Publishing and Using Linked Open Data - Day 2
Publishing and Using Linked Open Data - Day 2Publishing and Using Linked Open Data - Day 2
Publishing and Using Linked Open Data - Day 2Richard Urban
 
Querying Linked Data on Android
Querying Linked Data on AndroidQuerying Linked Data on Android
Querying Linked Data on AndroidEUCLID project
 
Chicago LOMRDF update 2003-06-19
Chicago LOMRDF update 2003-06-19 Chicago LOMRDF update 2003-06-19
Chicago LOMRDF update 2003-06-19 Mikael Nilsson
 
Dublin Core In Practice
Dublin Core In PracticeDublin Core In Practice
Dublin Core In PracticeMarcia Zeng
 
Federated SPARQL query processing over the Web of Data
Federated SPARQL query processing over the Web of DataFederated SPARQL query processing over the Web of Data
Federated SPARQL query processing over the Web of DataMuhammad Saleem
 
Linked data for librarians
Linked data for librariansLinked data for librarians
Linked data for librarianstrevorthornton
 
ESWC SS 2013 - Tuesday Tutorial 1 Maribel Acosta and Barry Norton: Providing ...
ESWC SS 2013 - Tuesday Tutorial 1 Maribel Acosta and Barry Norton: Providing ...ESWC SS 2013 - Tuesday Tutorial 1 Maribel Acosta and Barry Norton: Providing ...
ESWC SS 2013 - Tuesday Tutorial 1 Maribel Acosta and Barry Norton: Providing ...eswcsummerschool
 
The Semantic Web #9 - Web Ontology Language (OWL)
The Semantic Web #9 - Web Ontology Language (OWL)The Semantic Web #9 - Web Ontology Language (OWL)
The Semantic Web #9 - Web Ontology Language (OWL)Myungjin Lee
 
morph-LDP: An R2RML-based Linked Data Platform implementation
morph-LDP: An R2RML-based Linked Data Platform implementationmorph-LDP: An R2RML-based Linked Data Platform implementation
morph-LDP: An R2RML-based Linked Data Platform implementationNandana Mihindukulasooriya
 
Linked Open Data: A simple how-to
Linked Open Data: A simple how-toLinked Open Data: A simple how-to
Linked Open Data: A simple how-tonvitucci
 
Connections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystifiedConnections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystifiedJakob .
 
Semantic Pipes and Semantic Mashups
Semantic Pipes and Semantic MashupsSemantic Pipes and Semantic Mashups
Semantic Pipes and Semantic Mashupsgiurca
 

What's hot (19)

Another RDF Encoding Form
Another RDF Encoding FormAnother RDF Encoding Form
Another RDF Encoding Form
 
SWT Lecture Session 2 - RDF
SWT Lecture Session 2 - RDFSWT Lecture Session 2 - RDF
SWT Lecture Session 2 - RDF
 
The OpenOffice.org ODF Toolkit Project
The OpenOffice.org ODF Toolkit ProjectThe OpenOffice.org ODF Toolkit Project
The OpenOffice.org ODF Toolkit Project
 
FedX - Optimization Techniques for Federated Query Processing on Linked Data
FedX - Optimization Techniques for Federated Query Processing on Linked DataFedX - Optimization Techniques for Federated Query Processing on Linked Data
FedX - Optimization Techniques for Federated Query Processing on Linked Data
 
Publishing and Using Linked Open Data - Day 2
Publishing and Using Linked Open Data - Day 2Publishing and Using Linked Open Data - Day 2
Publishing and Using Linked Open Data - Day 2
 
Querying Linked Data on Android
Querying Linked Data on AndroidQuerying Linked Data on Android
Querying Linked Data on Android
 
Chicago LOMRDF update 2003-06-19
Chicago LOMRDF update 2003-06-19 Chicago LOMRDF update 2003-06-19
Chicago LOMRDF update 2003-06-19
 
Scaling the (evolving) web data –at low cost-
Scaling the (evolving) web data –at low cost-Scaling the (evolving) web data –at low cost-
Scaling the (evolving) web data –at low cost-
 
Efficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data StreamsEfficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data Streams
 
Dublin Core In Practice
Dublin Core In PracticeDublin Core In Practice
Dublin Core In Practice
 
Federated SPARQL query processing over the Web of Data
Federated SPARQL query processing over the Web of DataFederated SPARQL query processing over the Web of Data
Federated SPARQL query processing over the Web of Data
 
Dublin Core Intro
Dublin Core IntroDublin Core Intro
Dublin Core Intro
 
Linked data for librarians
Linked data for librariansLinked data for librarians
Linked data for librarians
 
ESWC SS 2013 - Tuesday Tutorial 1 Maribel Acosta and Barry Norton: Providing ...
ESWC SS 2013 - Tuesday Tutorial 1 Maribel Acosta and Barry Norton: Providing ...ESWC SS 2013 - Tuesday Tutorial 1 Maribel Acosta and Barry Norton: Providing ...
ESWC SS 2013 - Tuesday Tutorial 1 Maribel Acosta and Barry Norton: Providing ...
 
The Semantic Web #9 - Web Ontology Language (OWL)
The Semantic Web #9 - Web Ontology Language (OWL)The Semantic Web #9 - Web Ontology Language (OWL)
The Semantic Web #9 - Web Ontology Language (OWL)
 
morph-LDP: An R2RML-based Linked Data Platform implementation
morph-LDP: An R2RML-based Linked Data Platform implementationmorph-LDP: An R2RML-based Linked Data Platform implementation
morph-LDP: An R2RML-based Linked Data Platform implementation
 
Linked Open Data: A simple how-to
Linked Open Data: A simple how-toLinked Open Data: A simple how-to
Linked Open Data: A simple how-to
 
Connections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystifiedConnections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystified
 
Semantic Pipes and Semantic Mashups
Semantic Pipes and Semantic MashupsSemantic Pipes and Semantic Mashups
Semantic Pipes and Semantic Mashups
 

Similar to Fedora Migration Considerations

Fedora Commons in the CLARIN Infrastructure
Fedora Commons in the CLARIN InfrastructureFedora Commons in the CLARIN Infrastructure
Fedora Commons in the CLARIN InfrastructureMenzo Windhouwer
 
DHT2 - O Brother, Where Art Thou with Shyam Ranganathan
DHT2 - O Brother, Where Art Thou with 	Shyam RanganathanDHT2 - O Brother, Where Art Thou with 	Shyam Ranganathan
DHT2 - O Brother, Where Art Thou with Shyam RanganathanGluster.org
 
Publishing Linked Data 3/5 Semtech2011
Publishing Linked Data 3/5 Semtech2011Publishing Linked Data 3/5 Semtech2011
Publishing Linked Data 3/5 Semtech2011Juan Sequeda
 
MPTStore: A Fast, Scalable, and Stable Resource Index
MPTStore: A Fast, Scalable, and Stable Resource IndexMPTStore: A Fast, Scalable, and Stable Resource Index
MPTStore: A Fast, Scalable, and Stable Resource IndexChris Wilper
 
An RDF Metadata Model for OpenDocument Format 1.2
An RDF Metadata Model for OpenDocument Format 1.2An RDF Metadata Model for OpenDocument Format 1.2
An RDF Metadata Model for OpenDocument Format 1.2Alexandro Colorado
 
Force11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, OxfordForce11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, OxfordMark Wilkinson
 
How to describe a dataset. Interoperability issues
How to describe a dataset. Interoperability issuesHow to describe a dataset. Interoperability issues
How to describe a dataset. Interoperability issuesValeria Pesce
 
HydraDAM2: Repository Challenges and Solutions for Large Media Files
HydraDAM2: Repository Challenges and Solutions for Large Media FilesHydraDAM2: Repository Challenges and Solutions for Large Media Files
HydraDAM2: Repository Challenges and Solutions for Large Media FilesJon W. Dunn
 
Elns and research data management case study of RSpace at the University of ...
Elns and research data management  case study of RSpace at the University of ...Elns and research data management  case study of RSpace at the University of ...
Elns and research data management case study of RSpace at the University of ...rmacneil88
 
11.5.14 Presentation Slides, “Fedora 4.0 in Action at Penn State and Stanford”
11.5.14 Presentation Slides, “Fedora 4.0 in Action at Penn State and Stanford”11.5.14 Presentation Slides, “Fedora 4.0 in Action at Penn State and Stanford”
11.5.14 Presentation Slides, “Fedora 4.0 in Action at Penn State and Stanford”DuraSpace
 
Saveface - Save your Facebook content as RDF data
Saveface - Save your Facebook content as RDF dataSaveface - Save your Facebook content as RDF data
Saveface - Save your Facebook content as RDF dataFuming Shih
 
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...Alasdair Gray
 
Islandora and Linked Open Data
Islandora and Linked Open Data Islandora and Linked Open Data
Islandora and Linked Open Data eohallor
 

Similar to Fedora Migration Considerations (20)

Introduction to HDF5 Data Model, Programming Model and Library APIs
Introduction to HDF5 Data Model, Programming Model and Library APIsIntroduction to HDF5 Data Model, Programming Model and Library APIs
Introduction to HDF5 Data Model, Programming Model and Library APIs
 
Fedora Commons in the CLARIN Infrastructure
Fedora Commons in the CLARIN InfrastructureFedora Commons in the CLARIN Infrastructure
Fedora Commons in the CLARIN Infrastructure
 
Second Thoughts about Metadata Standards for Data
Second Thoughts about Metadata Standards for DataSecond Thoughts about Metadata Standards for Data
Second Thoughts about Metadata Standards for Data
 
DHT2 - O Brother, Where Art Thou with Shyam Ranganathan
DHT2 - O Brother, Where Art Thou with 	Shyam RanganathanDHT2 - O Brother, Where Art Thou with 	Shyam Ranganathan
DHT2 - O Brother, Where Art Thou with Shyam Ranganathan
 
Omeka sc4l
Omeka sc4lOmeka sc4l
Omeka sc4l
 
Publishing Linked Data 3/5 Semtech2011
Publishing Linked Data 3/5 Semtech2011Publishing Linked Data 3/5 Semtech2011
Publishing Linked Data 3/5 Semtech2011
 
Hdf5 intro
Hdf5 introHdf5 intro
Hdf5 intro
 
MPTStore: A Fast, Scalable, and Stable Resource Index
MPTStore: A Fast, Scalable, and Stable Resource IndexMPTStore: A Fast, Scalable, and Stable Resource Index
MPTStore: A Fast, Scalable, and Stable Resource Index
 
An RDF Metadata Model for OpenDocument Format 1.2
An RDF Metadata Model for OpenDocument Format 1.2An RDF Metadata Model for OpenDocument Format 1.2
An RDF Metadata Model for OpenDocument Format 1.2
 
Force11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, OxfordForce11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, Oxford
 
Technical Background
Technical BackgroundTechnical Background
Technical Background
 
How to describe a dataset. Interoperability issues
How to describe a dataset. Interoperability issuesHow to describe a dataset. Interoperability issues
How to describe a dataset. Interoperability issues
 
How to Describe a Dataset. Interoperability Issues, by Valeria Pesce
How to Describe a Dataset. Interoperability Issues, by Valeria PesceHow to Describe a Dataset. Interoperability Issues, by Valeria Pesce
How to Describe a Dataset. Interoperability Issues, by Valeria Pesce
 
HydraDAM2: Repository Challenges and Solutions for Large Media Files
HydraDAM2: Repository Challenges and Solutions for Large Media FilesHydraDAM2: Repository Challenges and Solutions for Large Media Files
HydraDAM2: Repository Challenges and Solutions for Large Media Files
 
Ld4 l triannon
Ld4 l triannonLd4 l triannon
Ld4 l triannon
 
Elns and research data management case study of RSpace at the University of ...
Elns and research data management  case study of RSpace at the University of ...Elns and research data management  case study of RSpace at the University of ...
Elns and research data management case study of RSpace at the University of ...
 
11.5.14 Presentation Slides, “Fedora 4.0 in Action at Penn State and Stanford”
11.5.14 Presentation Slides, “Fedora 4.0 in Action at Penn State and Stanford”11.5.14 Presentation Slides, “Fedora 4.0 in Action at Penn State and Stanford”
11.5.14 Presentation Slides, “Fedora 4.0 in Action at Penn State and Stanford”
 
Saveface - Save your Facebook content as RDF data
Saveface - Save your Facebook content as RDF dataSaveface - Save your Facebook content as RDF data
Saveface - Save your Facebook content as RDF data
 
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
 
Islandora and Linked Open Data
Islandora and Linked Open Data Islandora and Linked Open Data
Islandora and Linked Open Data
 

More from Avalon Media System

Researching Researchers: Avalon's Repository Usage
Researching Researchers: Avalon's Repository UsageResearching Researchers: Avalon's Repository Usage
Researching Researchers: Avalon's Repository UsageAvalon Media System
 
Avalon Overview Hydra Connect 2015
Avalon Overview Hydra Connect 2015Avalon Overview Hydra Connect 2015
Avalon Overview Hydra Connect 2015Avalon Media System
 
Avalon Poster for Hydra Connect 2015
Avalon Poster for Hydra Connect 2015Avalon Poster for Hydra Connect 2015
Avalon Poster for Hydra Connect 2015Avalon Media System
 
Open Repositories 2015: Avalon Media System: Community Implementation and Sus...
Open Repositories 2015: Avalon Media System: Community Implementation and Sus...Open Repositories 2015: Avalon Media System: Community Implementation and Sus...
Open Repositories 2015: Avalon Media System: Community Implementation and Sus...Avalon Media System
 
Avalon Variations webinar dec 2015
Avalon Variations webinar dec 2015Avalon Variations webinar dec 2015
Avalon Variations webinar dec 2015Avalon Media System
 
Avalon Media System: Implementation and Community
Avalon Media System: Implementation and CommunityAvalon Media System: Implementation and Community
Avalon Media System: Implementation and CommunityAvalon Media System
 
Avalon at Northwestern University
Avalon at Northwestern UniversityAvalon at Northwestern University
Avalon at Northwestern UniversityAvalon Media System
 
Avalon at the University of Virginia
Avalon at the University of VirginiaAvalon at the University of Virginia
Avalon at the University of VirginiaAvalon Media System
 
Avalon at Stanford University Libraries
Avalon at Stanford University LibrariesAvalon at Stanford University Libraries
Avalon at Stanford University LibrariesAvalon Media System
 
The Avalon Media System: Implementation and Community
The Avalon Media System: Implementation and CommunityThe Avalon Media System: Implementation and Community
The Avalon Media System: Implementation and CommunityAvalon Media System
 
A-sides, B-sides, Chapters, and Special Features: Describing Content and Stru...
A-sides, B-sides, Chapters, and Special Features: Describing Content and Stru...A-sides, B-sides, Chapters, and Special Features: Describing Content and Stru...
A-sides, B-sides, Chapters, and Special Features: Describing Content and Stru...Avalon Media System
 
The Avalon Media System: Open Source Audio and Video Access for Libraries and...
The Avalon Media System: Open Source Audio and Video Access for Libraries and...The Avalon Media System: Open Source Audio and Video Access for Libraries and...
The Avalon Media System: Open Source Audio and Video Access for Libraries and...Avalon Media System
 
OSDPA: One Body, Many Heads: Preservation and Access From Project Hydra
OSDPA: One Body, Many Heads: Preservation and Access From Project HydraOSDPA: One Body, Many Heads: Preservation and Access From Project Hydra
OSDPA: One Body, Many Heads: Preservation and Access From Project HydraAvalon Media System
 
Audio and Video Repositories at Scale - Indiana University’s Media Digitizati...
Audio and Video Repositories at Scale - Indiana University’s Media Digitizati...Audio and Video Repositories at Scale - Indiana University’s Media Digitizati...
Audio and Video Repositories at Scale - Indiana University’s Media Digitizati...Avalon Media System
 
Avalon Media System (Open Repositories 2014 poster)
Avalon Media System (Open Repositories 2014 poster)Avalon Media System (Open Repositories 2014 poster)
Avalon Media System (Open Repositories 2014 poster)Avalon Media System
 

More from Avalon Media System (20)

Researching Researchers: Avalon's Repository Usage
Researching Researchers: Avalon's Repository UsageResearching Researchers: Avalon's Repository Usage
Researching Researchers: Avalon's Repository Usage
 
CNI 2016: Avalon overview
CNI 2016: Avalon overviewCNI 2016: Avalon overview
CNI 2016: Avalon overview
 
Avalon 2016 Overview
Avalon 2016 OverviewAvalon 2016 Overview
Avalon 2016 Overview
 
Avalon 5.0 and Beyond
Avalon 5.0 and BeyondAvalon 5.0 and Beyond
Avalon 5.0 and Beyond
 
Avalon Overview Hydra Connect 2015
Avalon Overview Hydra Connect 2015Avalon Overview Hydra Connect 2015
Avalon Overview Hydra Connect 2015
 
Avalon Poster for Hydra Connect 2015
Avalon Poster for Hydra Connect 2015Avalon Poster for Hydra Connect 2015
Avalon Poster for Hydra Connect 2015
 
Open Repositories 2015: Avalon Media System: Community Implementation and Sus...
Open Repositories 2015: Avalon Media System: Community Implementation and Sus...Open Repositories 2015: Avalon Media System: Community Implementation and Sus...
Open Repositories 2015: Avalon Media System: Community Implementation and Sus...
 
Avalon Variations webinar dec 2015
Avalon Variations webinar dec 2015Avalon Variations webinar dec 2015
Avalon Variations webinar dec 2015
 
Avalon Media System: Implementation and Community
Avalon Media System: Implementation and CommunityAvalon Media System: Implementation and Community
Avalon Media System: Implementation and Community
 
Avalon at Northwestern University
Avalon at Northwestern UniversityAvalon at Northwestern University
Avalon at Northwestern University
 
Avalon at the University of Virginia
Avalon at the University of VirginiaAvalon at the University of Virginia
Avalon at the University of Virginia
 
Avalon at Stanford University Libraries
Avalon at Stanford University LibrariesAvalon at Stanford University Libraries
Avalon at Stanford University Libraries
 
Avalon at Indiana University
Avalon at Indiana UniversityAvalon at Indiana University
Avalon at Indiana University
 
The Avalon Media System: Implementation and Community
The Avalon Media System: Implementation and CommunityThe Avalon Media System: Implementation and Community
The Avalon Media System: Implementation and Community
 
A-sides, B-sides, Chapters, and Special Features: Describing Content and Stru...
A-sides, B-sides, Chapters, and Special Features: Describing Content and Stru...A-sides, B-sides, Chapters, and Special Features: Describing Content and Stru...
A-sides, B-sides, Chapters, and Special Features: Describing Content and Stru...
 
The Avalon Media System: Open Source Audio and Video Access for Libraries and...
The Avalon Media System: Open Source Audio and Video Access for Libraries and...The Avalon Media System: Open Source Audio and Video Access for Libraries and...
The Avalon Media System: Open Source Audio and Video Access for Libraries and...
 
OSDPA: One Body, Many Heads: Preservation and Access From Project Hydra
OSDPA: One Body, Many Heads: Preservation and Access From Project HydraOSDPA: One Body, Many Heads: Preservation and Access From Project Hydra
OSDPA: One Body, Many Heads: Preservation and Access From Project Hydra
 
Audio and Video Repositories at Scale - Indiana University’s Media Digitizati...
Audio and Video Repositories at Scale - Indiana University’s Media Digitizati...Audio and Video Repositories at Scale - Indiana University’s Media Digitizati...
Audio and Video Repositories at Scale - Indiana University’s Media Digitizati...
 
Avalon Media System update
Avalon Media System updateAvalon Media System update
Avalon Media System update
 
Avalon Media System (Open Repositories 2014 poster)
Avalon Media System (Open Repositories 2014 poster)Avalon Media System (Open Repositories 2014 poster)
Avalon Media System (Open Repositories 2014 poster)
 

Recently uploaded

Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 

Recently uploaded (20)

Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 

Fedora Migration Considerations

  • 1. Fedora migration considerations Juliet L. Hardesty Metadata Analyst, Indiana University Open Repositories, June 15, 2016
  • 2. Fedora 3 diagram key Repository object Structural metadata datastream Descriptive metadata datastream Other metadata datastream
  • 3. Indiana University President's Office records, 1937-1962. Subject files, 1937-1962. Aeons, Board of, 1939-1940.
  • 4. Fedora 3 - documents VAA8877-06169 Fedora DC RELS- EXT METS VAA8877-U-03624 Fedora DC RELS- EXT PDF VAA8877-U-03624-001 Fedora DC RELS- EXT MASTER Derivatives MASTER- MIX Folder Document in folder Page in document isMemberOfCollection isPartOf isPartOf
  • 5. Portable soft drink stand at Bowling Green
  • 6. Fedora 3 - images P02668 Fedora DC RELS- EXT METS PURL Redirect isMemberOfCollection DCMODS Image
  • 8. Fedora 3 – time-based media avalon:22187 Fedora DC RELS- EXT MODSRIGHTS SECTIONS TECH/DIS PLAY WORKFLOW Fedora DC RELS- EXT STRUCTURE MATTERHORN avalon:22191 Fedora DC RELS- EXT STRUCTURE MATTERHORN Item CD 1 CD 2 avalon:22189 isMemberOfCollection TECH/DIS PLAY TECH/DIS PLAY isPartOf isPartOf
  • 9. Focusing on Fedora 3 structure • RELS-EXT defines structure going up • METS or special datastreams required to define structure going down • All in XML as datastreams on objects that connect together down to the digital file
  • 11. Document example in PCDM Wilcox, David and Andrew Woods. “Hands-On: Seeing Fedora 4 Firsthand.” Fedora 4 Training Workshop. Open Repositories 2015.
  • 14.
  • 15. VAA8877-06169 VAA8877-U-03626 VAA8877-U-03626-page1 filesdocuments pages VAA8877-U-03626Proxy VAA8877-U-03626-page1Proxy thumb.jpg screen.jpg large.jpg pcdm: hasMember pcdm: hasMember Fedora 4 structure diagram
  • 17. METS as PCDM in Fedora – SPARQL query select DISTINCT ?document ?firstPage ?nextPage where { { <http://localhost:8080/fcrepo/rest/VAA8877-06169> pcdm:hasMember ?document . ?document iana:first ?firstPageProxy . ?firstPageProxy ore:proxyFor ?firstPage . } union { <http://localhost:8080/fcrepo/rest/VAA8877-06169> pcdm:hasMember+ ?page . ?page pcdm:hasFile ?f . ?f ebucore:hasMimeType "image/jpeg"^^xsd:string . ?pageProxy ore:proxyFor ?page . ?pageProxy iana:next ?nextPageProxy . ?nextPageProxy ore:proxyFor ?nextPage . } }
  • 18. SPARQL query results for structure of folder VAA8877-06169
  • 19. Transforming METS to PCDM • Consider structure for collections using METS • fileSec and structMap – @GROUPID, @ID, @FILEID (unique identifiers from both sections) in combination with <div> structure within structMap can point to grouping and ordering structure for objects • Migrating easier if identifiers brought over
  • 20. Considerations for structural metadata • RELS-EXT in Fedora 3 migrated to Fedora 4 • Other structure datastreams are on you to map • METS – possible to map programmatically but maybe not in a standard way
  • 22. Fedora 4 options for descriptive metadata: Option 1 • Migration tools – migration-utils – fedora-migrate gem Neither really takes advantage of Fedora 4/external triplestore unless original is already RDF; fedora-migrate requires Hydra Available now, keeps all metadata, nothing lost; can have RDF statements on object if already in RDF (fedora-migrate)
  • 23. Descriptive metadata to Fedora 4: Option 2 • Map only simple statements to RDF – Minimal descriptive metadata (title, date) or descriptive metadata indexed for discovery (title, date, creator, type, subject, genre, language) – Use ontologies that allow for simple statements
  • 24.
  • 25. Descriptive metadata to Fedora 4: Option 2 • Map only simple statements to RDF – Minimal descriptive metadata (title, date) or descriptive metadata indexed for discovery (title, date, creator, type, subject, genre, language) – Use ontologies that allow for simple statements Not all metadata is in RDF; changes in ontologies/standards might not be 1:1 match with original Creates RDF statements in Fedora 4/external triplestore; great option if metadata is DC or other non-hierarchical standard
  • 26. Descriptive metadata to Fedora 4: Option 3 • Map complex/hierarchical information into external triplestore – All RDF statements (simple and complex) go into external triplestore but only simple statements are on Fedora 4 object
  • 28. Cushman photograph – external triplestore
  • 29. Descriptive metadata to Fedora 4: Option 3 • Map complex/hierarchical information into external triplestore – All RDF statements (simple and complex) go into external triplestore but only simple statements are on Fedora 4 object Separates metadata between repository and triplestore, problematic if original metadata not kept in repository More/all metadata available as RDF; updates easier to manage through triplestore/Fedora 4 functionality
  • 30. Descriptive metadata to Fedora 4: Almost Option 4 • MODS “unofficial standard” mapping to RDF – MODS and RDF Group’s mapping/transformation scenario (http://mods2rdf.xyz/) – Available as conversion code to see MODS mapped into Fedora 4 Work in progress (not always up, transforming few elements); you might not agree with suggested mappings Available now to try; might offer standard way to transform MODS similarly across institutions
  • 31. Considerations for descriptive metadata • Current state of descriptive metadata • Management needs of Fedora 4 • Transform to RDF statements or also to Linked Data • Decision: keep original descriptive metadata or not? – At IU, keep original descriptive metadata (for full item view)
  • 32. Thank you! • Julie Hardesty • jlhardes@iu.edu • @jlhardes

Editor's Notes

  1. HyBox Idea - https://github.com/projecthydra-labs/hybox-ideas/issues/19 Odering question on Fedora-tech - https://groups.google.com/forum/#!topic/fedora-tech/1dlfy9Nx76Q
  2. Can add MODSRDF namespace and add properties using that namespace Tried using mods:genre since it is is a flat non-hierarchical element in XML; even brought in a URI for the genre Documentary films from the LC Genre/Form Terms vocabulary Problem is: this is not MODSRDF MODSRDF is often complex (nested) RDF - How does it work to add a complex MODS field to an object?