
2011 ELLAK, EKT conference presentation, link at http://helios-eie.ekt.gr/EIE/handle/10442/8702


Permanent link @ http://helios-eie.ekt.gr/EIE/handle/10442/8702

Published in: Technology
  • Animations have been suppressed; for reference, please use http://hanlde.net/10442/8702

  1. Building an open source based, open standards infrastructure for the large scale provisioning of reusable open content. Π. Σταθόπουλος, Ν. Χούσσος, Γ. Σταύρου, Ε. Σαχίνη, Ι.Ο. Σταθοπούλου, Κ. Σταμάτης, Α. Σουμπλής. email: pstath; [email_address]. ELLAK (FLOSS) Conference 2011, Athens. National Documentation Centre / N.H.R.F.
  2. EKT: The National Documentation Centre
  3. "Open Standards & Open Content: the foundations of the Internet's development"
     "Building an open source based, open standards infrastructure for the large scale provisioning of reusable open content"
     • "Reusable" and "open" content:
         • functionality?
         • standards?
         • software?
         • emerging requirements?
     • An open source software stack for the delivery of:
         • reusable, open content
         • at large scale, using open standards
         • and addressing emerging requirements
  9. Content types and characteristics (I)
     • Different content types but similar functional requirements. Examples:
         • Research output, e.g. papers, reports, patents, documentation, conferences, books, data: didaktorika.gr, helios-eie.ekt.gr, etc.
         • Cultural and humanities repositories, from research papers to annotated cultural artifacts: pandektis.ekt.gr
         • Educational and reference material: studies, books, multimedia courses, project outcomes, e.g. repository.edulll.gr
         • Digitised and/or born digital e-books: ebooks.serrelib.gr (pre-release)
         • Library automation systems and OPACs, linking the digital with the physical realm: abekt.gr
         • Digital objects in general
  15. Content types and characteristics (II)
     • Common characteristics:
         • Online databases providing access to digital objects (e.g. books, articles, images, multimedia) accompanied by rich metadata
         • Collection, dissemination and preservation of material that has archival value for future reference
     • Different requirements and emphasis in the features implemented, compared to Web Content Management Systems
     • Fast evolving landscape. Full range of features required:
         • copyright information management
         • metadata and bibliographic data management
         • metadata organisation and interoperability mechanisms
         • support for highly automated back office digitisation processes
         • digital content interoperability and reuse frameworks
         • persistent identifiers
         • end user interfaces and intuitive content interaction
         • highly efficient management of storage and computing resources
  18. Standards (… so far)
     • Open and reusable:
         • Digital content, file formats, encoders, etc.
         • Metadata description:
             • UNIMARC, MARC21 to DC and MODS
         • Interoperability protocols and interfaces:
             • OAI-PMH, OAI-ORE, Z39.50, SRU/SRW, Europeana ESE, etc.
         • Unique and persistent identifiers:
             • Independent from repository software
             • HANDLE.NET (RFC3652), OpenURL
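     As an illustration of the interoperability protocols listed on this slide, the following is a minimal Python sketch of how a harvester might form an OAI-PMH `ListRecords` request and read the Dublin Core fields of one returned record. The base URL `https://repository.example.org/oai/request`, the sample record and the helper names are assumptions made for the example; they are not taken from the EKT repositories.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespace of simple Dublin Core elements, as used by the oai_dc format.
DC_NS = "http://purl.org/dc/elements/1.1/"


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build an OAI-PMH ListRecords request URL (base_url is hypothetical)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec
    return base_url + "?" + urlencode(params)


def dc_fields(record_xml):
    """Collect Dublin Core elements of one oai_dc record into a dict of lists."""
    root = ET.fromstring(record_xml)
    fields = {}
    for element in root.iter():
        if element.tag.startswith("{" + DC_NS + "}"):
            name = element.tag.split("}", 1)[1]
            fields.setdefault(name, []).append((element.text or "").strip())
    return fields


# Illustrative record only; the handle below is made up.
SAMPLE = """<oai_dc:dc
    xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
    xmlns:dc="http://purl.org/dc/elements/1.1/">
  <dc:title>Example thesis</dc:title>
  <dc:creator>Doe, Jane</dc:creator>
  <dc:identifier>http://hdl.handle.net/12345/678</dc:identifier>
</oai_dc:dc>"""

if __name__ == "__main__":
    print(list_records_url("https://repository.example.org/oai/request"))
    print(dc_fields(SAMPLE))
```

     Because the request is a plain HTTP GET and the payload is plain XML, any harvester can consume the repository without knowing which repository software sits behind it.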
  21. An open content, open source software stack
     No easy task to tackle the full range of these issues in a scalable and viable manner
     • Custom built, problem specific solutions pay a high toll for standards compliance and are locked into a specific solution/architecture
     Solution: best of breed open source systems, assembled into a fully open source open content delivery software stack
     • Not a trivial task, or just an integration/deployment issue
     • High quality FLOSS tools are available: a pluggable and modular stack, where each tool does one thing but does it at "world class" level
     FLOSS/Open Standards is the enabler for scalable, viable, large scale open content infrastructures. Just like the Internet, no long term alternative to open standards/FLOSS seems to exist!
  23. … viewers and enhanced user experiences
  24. … novel reading devices
  25. … mashups and linked data
  26. The open software stack components. Best of breed: open source, setting the standards pace, world class features, scalable and robust communities
  27. Case studies
     • Examples of "what can open source do for you"
     • 3 case studies:
         • The Education and Life Long Learning Repository
             • Persistent IDs, rich user experience, custom bilingual tag clouds based on documentation metadata
         • The e-OPAC "alpha" version of openABEKT
             • Connecting the digital with the physical realm
         • Public library e-books repository: the Serres Public Library pilot
             • Backwards interoperability, repositories, enhanced viewers and image server infrastructure
  29. repository.edulll.gr
  30. http://ebooks.serrelib.gr
  31. http://hdl.handle.net/10812/1929
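     The link on this slide is a Handle identifier resolved through the public hdl.handle.net proxy. As a small sketch of why such identifiers stay independent of the repository software, the Python below splits a handle into its naming authority prefix and local suffix and rebuilds the proxy URL. The function names are hypothetical helpers written for this example, not part of any Handle library.

```python
from urllib.parse import quote

# Public Handle proxy, as it appears in the URL on the slide.
HANDLE_PROXY = "http://hdl.handle.net/"


def split_handle(handle):
    """Split 'prefix/suffix' into (prefix, suffix); the prefix names the authority."""
    prefix, separator, suffix = handle.partition("/")
    if not separator or not prefix or not suffix:
        raise ValueError("a handle has the form <prefix>/<suffix>: %r" % handle)
    return prefix, suffix


def handle_url(handle, proxy=HANDLE_PROXY):
    """Build the resolvable proxy URL for a handle."""
    prefix, suffix = split_handle(handle)
    # The suffix is opaque to the proxy; percent-encode anything unsafe in a URL.
    return proxy + prefix + "/" + quote(suffix, safe="/")


if __name__ == "__main__":
    print(split_handle("10812/1929"))
    print(handle_url("10812/1929"))
```

     The proxy redirects to whatever location is currently registered for the handle, so a repository can change its software or its URLs without breaking citations that use the handle.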
  32. ABEKT e-OPAC
  33. Enhanced image viewer/server infrastructure
     [Architecture diagram on EKT's virtualised infrastructure. Labels: Internet requests, frontend load balancer, webnode(s) pool, datanode(s) pool, backend processing, shared storage]
     • Shared storage: 1 master (RW) + N "slaves" (RO)
     • Currently: 300K pages, 3M imminent from didaktorika.gr, 10 resolutions/page
  34. Characteristics
     • Varnish: caching; load balancing; error compensation through VCL; backend health check
     • NginX: lightweight and simple; stable; right for the job
     • Djatoka: on the fly image processing; no need for several images with different resolutions; open and extensible API
     • Overall: stateless; "infinite" scalability; highly available; end to end FLOSS; layer security
     • Apache Tomcat/DSpace, NGINX and Varnish cache engine in high availability configuration, PostgreSQL DB, on CentOS VMs
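     To make the "on the fly image processing" point concrete, here is a hedged Python sketch of two things: the pixel size of each resolution level of a JPEG 2000 master (each level halves the one above it), and a region request URL of the kind a viewer would send through the cache to the image server. The resolver address is hypothetical, and the query parameter names follow the djatoka OpenURL syntax as generally documented; check them against the deployed version before relying on them.

```python
from urllib.parse import urlencode


def level_size(full_width, full_height, level, max_level):
    """Pixel size at a resolution level, where max_level is full size
    and every lower level halves both dimensions (rounded up)."""
    if not 0 <= level <= max_level:
        raise ValueError("level must be between 0 and max_level")
    scale = 1 << (max_level - level)
    return (-(-full_width // scale), -(-full_height // scale))


def region_url(resolver, image_id, level, region=None):
    """Build a getRegion request; parameter names assumed from djatoka docs."""
    params = {
        "url_ver": "Z39.88-2004",
        "rft_id": image_id,
        "svc_id": "info:lanl-repo/svc/getRegion",
        "svc_val_fmt": "info:ofi/fmt:kev:mtx:jpeg2000",
        "svc.format": "image/jpeg",
        "svc.level": str(level),
    }
    if region is not None:
        # region is given as (y, x, height, width)
        params["svc.region"] = ",".join(str(value) for value in region)
    return resolver + "?" + urlencode(params)


if __name__ == "__main__":
    # Hypothetical 4000 x 6000 pixel page scan with levels 0..9.
    for level in (9, 8, 0):
        print(level, level_size(4000, 6000, level, 9))
    print(region_url("http://images.example.org/resolver",
                     "http://images.example.org/pages/0001.jp2", 3,
                     region=(0, 0, 256, 256)))
```

     With 10 resolutions per page, pre-generating derivatives for the current 300K pages would mean 3M image files. Deriving each resolution on request from one master, and letting the cache keep the popular ones, avoids storing them all.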
  35. (open)ABEKT e-OPAC
     • ABEKT new e-OPAC
         • Different need: manage the library (digital and physical) using bibliographic tools and a librarian's approach
         • The interoperability layer between the physical and the digital library
         • "Richer" metadata and information
     • ROR over OSS components
  38. Digital repositories software
     • DSpace 2.0: with Fedora inside
         • EKT participates in the design team
     • DSpace + Fedora = DuraSpace
     • Advanced content provisioning and preservation functionalities
     • The core element for an interoperable repositories infrastructure
     • Preparing for operation as SaaS and over IaaS cloud
         • Delivers a complex service as SaaS
  42. Future work and informational links
     • Current and forthcoming work
         • Viewer with full text search: OCR XML
         • Backport features to existing repositories
         • Integrate the page by page functionality into OJS
         • openABEKT
         • Automate already existing economies of scale: offer SaaS capabilities for the whole range of tools
     • Links
         • http://www.duraspace.org/
         • http://openlibrary.org/dev/docs/bookreader
         • http://www.indexdata.com/zebra
  49. Thank you for your attention!