The Fedora Project
The Fedora Project <ul><li>Fedora Digital Object Repository System </li></ul><ul><ul><li>Extensible digital object model <...
Priorities for digital libraries <ul><li>Managing digital resources as if they are all the same </li></ul><ul><li>Deliveri...
Shortcomings of commercial digital library products <ul><li>Narrow focus on specific media formats (e.g. image databases, ...
Fedora History <ul><li>Research  (1997-present) :  </li></ul><ul><ul><li>DARPA and NSF-funded research project at Cornell ...
Fedora 1.x <ul><li>Architecture </li></ul><ul><li>Software  </li></ul><ul><li>Release 1.2 Features </li></ul><ul><li>Demo ...
Persistent ID ( PID ) Disseminators System Metadata Datastreams Globally unique persistent id Public view :  access method...
Persistent ID ( PID ) Default Disseminators Simple Image System Metadata Datastreams Digital Object Model  Example Dissemi...
Data Object Web Service Object Behavior Contracts   behavior contract behavior subscription data contract Behavior Mechani...
DEMO:  Basic Use Cases Image (multiple datastreams) Image (Mr. SID) EAD (Rita Mae Brown papers) Text conversion (TEI to PD...
Users access data objects through behaviors (or disseminations). Application <ul><li>Dynamic </li></ul><ul><li>data </li><...
Managers have direct access to each component of a data object.
Fedora and Web Services <ul><li>Fedora Repository system is a web service </li></ul><ul><ul><li>Access/Search (API-A) and ...
Fedora Repository System Client and Web Service Interactions Fedora Repository System Content Transform Service Content Tr...
Fedora Repository  Service Interfaces <ul><li>Management  Service (API-M) </li></ul><ul><ul><li>Ingest - XML-encoded objec...
Fedora Repository System
Fedora 1.2  Software Feature Set <ul><li>Open Fedora APIs  </li></ul><ul><ul><li>Repository as web services (REST and SOAP...
Fedora  Software Distribution Package <ul><li>Open Source (Mozilla Public License) </li></ul><ul><li>100% Java (Sun Java J...
DEMO:  Basic Use Cases Image (multiple datastreams) Image (Mr. SID) EAD (Rita Mae Brown papers) Text conversion (TEI to PD...
Projects using Fedora <ul><li>University of Virginia:   digital library (images,  EAD , e-texts) </li></ul><ul><li>Tufts U...
Fedora Downloads since May 2003 <ul><li>Total downloads:  >1500 </li></ul><ul><li>Average downloads per day: 9 </li></ul><...
Future Software Releases <ul><li>Fedora Object XML (FOXML) </li></ul><ul><ul><li>Internal storage format; direct expressio...
Future Development Proposals <ul><li>Digital Library in a Box </li></ul><ul><ul><li>Full-featured DL application with “Fed...
End
Upcoming SlideShare
Loading in...5
×

Fedora

1,396

Published on

Open Source Operating System

Published in: Education, Technology
0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
1,396
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
0
Comments
0
Likes
2
Embeds 0
No embeds

No notes for slide

Transcript of "Fedora"

  1. 1. The Fedora Project
  2. 2. The Fedora Project <ul><li>Fedora Digital Object Repository System </li></ul><ul><ul><li>Extensible digital object model </li></ul></ul><ul><ul><li>Repository System exposed via Web service APIs </li></ul></ul><ul><ul><li>Scalable, persistent storage for content and metadata </li></ul></ul><ul><ul><li>Local and remote content </li></ul></ul><ul><ul><li>Associate services with objects </li></ul></ul><ul><ul><li>Content versioning </li></ul></ul><ul><li>Fedora Use cases </li></ul><ul><ul><li>Content Management (CMS) </li></ul></ul><ul><ul><li>Digital Library architecture </li></ul></ul><ul><ul><li>Digital Asset Management </li></ul></ul><ul><ul><li>Institutional Repository </li></ul></ul><ul><ul><li>Scholarly publishing </li></ul></ul><ul><ul><li>Preservation </li></ul></ul><ul><li>Open source software </li></ul>
  3. 3. Priorities for digital libraries <ul><li>Managing digital resources as if they are all the same </li></ul><ul><li>Delivering digital resources as if they are all unique and free to participate in any number of contexts </li></ul><ul><li>Supporting digital scholarship wherever it may lead </li></ul>
  4. 4. Shortcomings of commercial digital library products <ul><li>Narrow focus on specific media formats (e.g. image databases, document management) </li></ul><ul><li>Fail to effectively address interrelationships among digital entities </li></ul><ul><li>Fail to address interoperability </li></ul><ul><li>Fail to provide facilities for managing programs and tools that deliver digital content. </li></ul><ul><li>Not extensible; do not enable easy integration of new tools and services </li></ul>
  5. 5. Fedora History <ul><li>Research (1997-present) : </li></ul><ul><ul><li>DARPA and NSF-funded research project at Cornell </li></ul></ul><ul><ul><li>(Carl Lagoze and Sandy Payette) </li></ul></ul><ul><ul><li>Reference implementation developed at Cornell </li></ul></ul><ul><li>First Application (1999-2001) : </li></ul><ul><ul><li>University of Virginia digital library prototype </li></ul></ul><ul><ul><li>(Thorny Staples and Ross Wayland) </li></ul></ul><ul><ul><li>Scale/stress testing for 10,000,000 objects </li></ul></ul><ul><li>Open Source Software (2002-present): </li></ul><ul><ul><li>Andrew W. Mellon Foundation granted Virginia and Cornell $1 million to develop a production-quality Fedora system </li></ul></ul><ul><ul><li>Fedora 1.0 released in May 2003 </li></ul></ul><ul><ul><li>www.fedora.info </li></ul></ul>
  6. 6. Fedora 1.x <ul><li>Architecture </li></ul><ul><li>Software </li></ul><ul><li>Release 1.2 Features </li></ul><ul><li>Demo Use Cases </li></ul>
  7. 7. Persistent ID ( PID ) Disseminators System Metadata Datastreams Globally unique persistent id Public view : access methods for obtaining “disseminations” of digital object content Internal view : metadata necessary to manage the object Protected view : content that makes up the “basis” of the object Digital Object Model Architectural View
  8. 8. Persistent ID ( PID ) Default Disseminators Simple Image System Metadata Datastreams Digital Object Model Example Disseminators Get Profile List Items Get Item List Methods Get DC Record Get Thumbnail Get Medium Get High Get VeryHigh
  9. 9. Data Object Web Service Object Behavior Contracts behavior contract behavior subscription data contract Behavior Mechanism Object Behavior Definition Object Persistent ID (PID) Behavior Definition Metadata System Metadata Datastreams Persistent ID (PID) Service Binding Metadata (WSDL) System Metadata Datastreams Persistent ID (PID) Disseminators Datastreams System Metadata
  10. 10. DEMO: Basic Use Cases Image (multiple datastreams) Image (Mr. SID) EAD (Rita Mae Brown papers) Text conversion (TEI to PDF) Basic Search
  11. 11. Users access data objects through behaviors (or disseminations). Application <ul><li>Dynamic </li></ul><ul><li>data </li></ul>services
  12. 12. Managers have direct access to each component of a data object.
  13. 13. Fedora and Web Services <ul><li>Fedora Repository system is a web service </li></ul><ul><ul><li>Access/Search (API-A) and Management (API-M) </li></ul></ul><ul><ul><li>Service descriptions published using WSDL </li></ul></ul><ul><ul><li>Both SOAP and HTTP bindings </li></ul></ul><ul><li>Back-end services </li></ul><ul><ul><li>Digital object behaviors implemented as linkages to other distributed web services </li></ul></ul><ul><ul><li>Service binding metadata (WSDL) stored in special Fedora Behavior Mechanism objects. </li></ul></ul><ul><ul><li>Fedora acts as mediator to these services. </li></ul></ul>
  14. 14. Fedora Repository System Client and Web Service Interactions Fedora Repository System Content Transform Service Content Transform Service user Web Service Dispatch Web Service Service Service Backend Frontend client application client application web browser user
  15. 15. Fedora Repository Service Interfaces <ul><li>Management Service (API-M) </li></ul><ul><ul><li>Ingest - XML-encoded object submission </li></ul></ul><ul><ul><li>Create - interactive object creation via API requests </li></ul></ul><ul><ul><li>Maintain - interactive object modification via API requests </li></ul></ul><ul><ul><li>Validate – application of integrity rules to objects </li></ul></ul><ul><ul><li>Identify - generate unique object identifiers </li></ul></ul><ul><ul><li>Security - authentication and access control </li></ul></ul><ul><ul><li>Preserve - automatic content versioning and audit trail </li></ul></ul><ul><ul><li>Export - XML-encoded object formats </li></ul></ul><ul><li>Access Service (API-A and API-A-LITE) </li></ul><ul><ul><li>Search - search repository for objects </li></ul></ul><ul><ul><li>Object Reflection - what disseminations can the object provide? </li></ul></ul><ul><ul><li>Object Dissemination - request a view of the object’s content </li></ul></ul><ul><li>OAI-PMH Provider Service </li></ul><ul><ul><li>OAI-DC records </li></ul></ul>
  16. 16. Fedora Repository System
  17. 17. Fedora 1.2 Software Feature Set <ul><li>Open Fedora APIs </li></ul><ul><ul><li>Repository as web services (REST and SOAP bindings); WSDL interface defs </li></ul></ul><ul><li>Flexible Digital Object Model </li></ul><ul><ul><li>Content View: objects as bundle of items (content and metadata) </li></ul></ul><ul><ul><li>Service View: objects as a set of service methods (“behaviors”) </li></ul></ul><ul><ul><li>Extensible functionality by associating services with objects </li></ul></ul><ul><li>Repository System </li></ul><ul><ul><li>Core Services: Management, Access/Search, OAI-PMH </li></ul></ul><ul><ul><li>Storage: XML object store; relational db object cache; relational db object registry </li></ul></ul><ul><ul><li>Mediation - auto-dispatching to distributed web services for content transformation </li></ul></ul><ul><ul><li>Auto-Indexing – system metadata and DC record of each object </li></ul></ul><ul><ul><li>HTTP Basic Authentication and Access Control </li></ul></ul><ul><ul><li>Built-in disseminator services: XSLT x-form, image manipulation, xml-to-PDF </li></ul></ul><ul><li>Content Versioning </li></ul><ul><ul><li>Automatic version control (saves version of content/metadata when modified) </li></ul></ul><ul><ul><li>Enables date-time stamped API requests (see object as it looked at a point in time) </li></ul></ul><ul><li>Clients </li></ul><ul><ul><li>Fedora Administrator: GUI client to create/maintain objects </li></ul></ul><ul><ul><li>Default Web browser interface: search; access objects via default disseminator </li></ul></ul><ul><ul><li>Command line utilities (batch load, ingest, purge, others) </li></ul></ul><ul><ul><li>Migration Utility – mass export/ingest </li></ul></ul>
  18. 18. Fedora Software Distribution Package <ul><li>Open Source (Mozilla Public License) </li></ul><ul><li>100% Java (Sun Java J2SDK1.4) </li></ul><ul><li>Supporting Technologies </li></ul><ul><ul><li>Apache Tomcat 4.1 and Apache Axis (SOAP) </li></ul></ul><ul><ul><li>Xerces 2-2.0.2 for XML parsing and validation </li></ul></ul><ul><ul><li>Saxon 6.5 for XSLT transformation </li></ul></ul><ul><ul><li>Schematron 1.5 for validation </li></ul></ul><ul><ul><li>MySQL and Mckoi relational database </li></ul></ul><ul><ul><li>Oracle 9i support </li></ul></ul><ul><li>Deployment Platforms </li></ul><ul><ul><li>Windows 2000, NT, XP </li></ul></ul><ul><ul><li>Solaris </li></ul></ul><ul><ul><li>Linux </li></ul></ul>
  19. 19. DEMO: Basic Use Cases Image (multiple datastreams) Image (Mr. SID) EAD (Rita Mae Brown papers) Text conversion (TEI to PDF) Basic Search
  20. 20. Projects using Fedora <ul><li>University of Virginia: digital library (images, EAD , e-texts) </li></ul><ul><li>Tufts University: educational (VUE/ concept maps); digital library </li></ul><ul><li>VTLS: basis for new commercial product (library system) </li></ul><ul><li>Indiana University: EVIA Digital Archive (video) </li></ul><ul><li>Northwestern: academic technologies ( images , art , video, e-texts) </li></ul><ul><li>Rutgers University: digital library (e-journals, numeric data) </li></ul><ul><li>Yale University: Electronic Records Archive </li></ul><ul><li>New York University: Humanities Computing Group </li></ul>
  21. 21. Fedora Downloads since May 2003 <ul><li>Total downloads: >1500 </li></ul><ul><li>Average downloads per day: 9 </li></ul><ul><li># Countries: 32 </li></ul><ul><li>Types of orgs: </li></ul><ul><ul><li>Universities: libraries, IT, departments </li></ul></ul><ul><ul><li>Software and technology companies </li></ul></ul><ul><ul><li>Defense/military </li></ul></ul><ul><ul><li>Banks </li></ul></ul><ul><ul><li>National libraries and archives </li></ul></ul><ul><ul><li>Publishers </li></ul></ul><ul><ul><li>Research labs </li></ul></ul><ul><ul><li>Library automation vendors </li></ul></ul><ul><ul><li>Scholarly societies </li></ul></ul>
  22. 22. Future Software Releases <ul><li>Fedora Object XML (FOXML) </li></ul><ul><ul><li>Internal storage format; direct expression of Fedora object model </li></ul></ul><ul><ul><li>Better support for relationships (“kinship” metadata) </li></ul></ul><ul><ul><li>Better support for audit trail (event history) </li></ul></ul><ul><ul><li>Format identifiers for dynamic service binding </li></ul></ul><ul><li>Shibboleth authentication </li></ul><ul><li>Policy Enforcement </li></ul><ul><ul><li>XACML expression language </li></ul></ul><ul><ul><li>Fedora policy enforcement module </li></ul></ul><ul><li>Web interface for easy content submission </li></ul><ul><li>Batch object modification utility </li></ul><ul><li>Administrative Reporting </li></ul><ul><li>Object Event History (ABC/RDF disseminations) </li></ul><ul><li>Better support for “collections” </li></ul><ul><li>New ingest and export formats (METS1.3, DIDL) </li></ul>December 2003 – December 2004
  23. 23. Future Development Proposals <ul><li>Digital Library in a Box </li></ul><ul><ul><li>Full-featured DL application with “Fedora inside” </li></ul></ul><ul><ul><li>Optimized for common set of content types </li></ul></ul><ul><li>Fedora Power Server </li></ul><ul><ul><li>Integrity Management Tools </li></ul></ul><ul><ul><li>Service and link liveness checker </li></ul></ul><ul><ul><li>Fault Tolerance </li></ul></ul><ul><ul><li>Mirroring and Replication </li></ul></ul><ul><ul><li>Peer-to-peer interoperability features </li></ul></ul><ul><ul><li>Repository clustering </li></ul></ul><ul><ul><li>Load balancing </li></ul></ul><ul><li>Object Creation Tools </li></ul><ul><ul><li>Workflow applications based on content models </li></ul></ul><ul><ul><li>Web interface for document/content submission </li></ul></ul>
  24. 24. End

×