3TU Datacentrum Tech Overview

Presented at the 3TU Datacentrum project group in Utrecht, April 2009

Published in: Education, Technology

  1. 3TU Datacentrum Back-end. An overview of the current status of the technical infrastructure. Etienne Posthumus, TU Delft Library, 14-04-2009
  2. Overview • Fedora Repository Software • Python middleware • Fedora Front-end • Dataset types • Current Datasets
  3. Fedora Repository Software: Investigation of use in the past year. Institutional repository implemented. Not perfect, but the most flexible of comparable systems. The DIY downside is paradoxically also a key benefit.
  4. Python middleware • Django application framework • Agile software development • Fedora coupling via REST HTTP API • Solr also used for indexing • Fall-through to Fedora-provided services always possible
  5. Fedora Front-end • XSLT based • Dynamic queries based on Resource Index • Multiple output formats possible • Used CMA (Content Model Architecture) for behaviours
  6. Diagram legend. Ellipse: Fedora object. Arrow: relation (RDF). Small rectangle: textual metadata. Large rectangle: datastream (other than DC or RELS-EXT). Diagram by E Gramsbergen
  7. Fedora Front-end • XSLT based • Dynamic queries based on Resource Index • Multiple output formats possible • Used CMA (Content Model Architecture) for behaviours • Demo: link and link
  8. Dataset types: We identify two different types • archival ingested submissions • enriched objects
  9. Dataset types: Archival Submissions • Recorded, checksum as-is • For reference purposes
  10. Dataset types: Archival Submissions • Recorded, checksum as-is • For reference purposes • Considering BagIt (from the Library of Congress) • Or accepting the Fedora FOXML format
  11. Dataset types: Enriched objects. Possible conversions to other formats, for example CSV to XML. Manageable chunks.
  12. Dataset types: Enriched objects. Possible conversions to other formats, for example CSV to XML. Manageable chunks. Selected metadata as RDF, stored in the Resource Index.
  13. Current Datasets • DARELUX • WindZon • Flame • Asfalt • Water
  14. Discussion/Questions
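
The two dataset types in slides 9 to 12 can be illustrated with a minimal sketch. It is not the 3TU Datacentrum implementation: the slides name neither a checksum algorithm nor an XML layout, so SHA-256, the element names `chunk`, `row` and `field`, and the function names `sha256_of` and `csv_to_xml_chunks` are all assumptions made for illustration. The sketch records a checksum of a submission exactly as received (the archival case) and converts a CSV file into XML documents of bounded size (the enriched case, "manageable chunks").

```python
import csv
import hashlib
import io
import xml.etree.ElementTree as ET


def sha256_of(data: bytes) -> str:
    """Checksum of the submission bytes exactly as received.

    SHA-256 is an assumption; the slides do not name an algorithm.
    """
    return hashlib.sha256(data).hexdigest()


def csv_to_xml_chunks(data: bytes, chunk_size: int = 1000):
    """Yield one XML string per group of at most chunk_size CSV rows.

    The element names "chunk", "row" and "field" are hypothetical.
    Column names go into a "name" attribute so that headers which are
    not valid XML element names still serialize correctly.
    """
    reader = csv.reader(io.StringIO(data.decode("utf-8"), newline=""))
    header = next(reader, None)
    if header is None:
        return  # empty input: nothing to convert
    chunk = ET.Element("chunk")
    for row in reader:
        row_el = ET.SubElement(chunk, "row")
        # zip stops at the shorter sequence, so ragged rows do not fail
        for name, value in zip(header, row):
            field = ET.SubElement(row_el, "field", name=name)
            field.text = value
        if len(chunk) == chunk_size:
            yield ET.tostring(chunk, encoding="unicode")
            chunk = ET.Element("chunk")
    if len(chunk):
        # final, partially filled chunk
        yield ET.tostring(chunk, encoding="unicode")


if __name__ == "__main__":
    submission = b"time,speed\n0,1.5\n1,2.0\n2,2.5\n"
    print("archival checksum:", sha256_of(submission))
    for xml_doc in csv_to_xml_chunks(submission, chunk_size=2):
        print(xml_doc)
```

Keeping the checksum step separate from the conversion step mirrors the split on the slides: the archival copy is never altered, while the enriched form can be regenerated from it at any time.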
