UGent Datacenter, or why we are buying 140 TB

VLENGEL meeting

  1. UGent Datacenter... or why we are buying 140 TB
  2. 300,000 volumes...
  3. 140,000 volumes...
  4. 70,000 objects...
  5. 12,500 full-text documents
  6. Storage requirements
  7. Storage requirements
  8. Ideal model
  9. Ideal model
  10. Ideal model: Observations
  11. Ideal model: Observations, Raw data
  12. Ideal model: Hypothesis, Observations, Raw data
  13. Ideal model: Hypothesis, Observations, Raw data, Test
  14. Ideal model: Hypothesis, Observations, Raw data, Test
  15. Ideal model: Hypothesis, Observations, Raw data, Test, Paper, Annex
  16. Ideal model: Hypothesis, Observations, Raw data, Test, Paper, Library, Annex
  17. Ideal model: Hypothesis, Observations, Raw data, Test, Paper, Library, Annex
  18. In practice: Hypothesis, Observations, Raw data, Test, Paper, Library, Annex
  19. In practice: Hypothesis, Observations, Raw data, Test, Paper, Library, Annex
  20. In practice: Hypothesis, Observations, Paper, Library
  21. Emerging standards for enhanced publications and repository technology: survey on technology. Amsterdam University Press, 2009. ISBN 9789089641892. Karen Van Godtsenhoven, Mikael Karstensen Elbæk, Gert Schmeltz Pedersen, Barbara Sierman, Magchiel Bijsterbosch, Patrick Hochstenbach, Rosemary Russell, Maurice Vanderfeesten
  22. (Meta)datastandaarden voor digitale archieven. UGent MMLab & Universiteitsbibliotheek Gent, 2009. ISBN 9789052230009. Paul Bastijns, Sam Coppens, Siska Corneillie, Patrick Hochstenbach, Erik Mannens, Liesbeth Van Melle
  23. Institutional Repository
      • 150,530 bibliographic records
      • 12,413 flagged full-text available
      • 50 file types
        • 94.3% .PDF, 2.7% .DOC
        • 3% .ZIP, .JPG, .TEX + 46 others
      • 1-2 files per record
  24. ETDs
      • 6,336 bibliographic records
      • 1,370 flagged full-text available
      • 1,180 file types
        • PDF, DOC, TXT, XLS, ACC, TIF, JPG, EXE, CLASS, ...
      • 1 to 10,000 files per record
  25. Risks: format interpretations, bit errors/bugs, file format changes, technology shift, organizational changes (time axis: 1980, 1990, 2000)
  26. Sources Digital Library LTP Biblio Meercat IR Discovery Waalse Krook Preservation Minerva Aleph ?ICA-Atom? Lib Catalog Archive System LMS ??? Scanning Ingest Repository Workflow
  27. GREP: Catmandu [Gent|Lund|Bielefeld Perl Framework], Fedora Commons, SOLR, ActiveMQ, Amazon.com S3, NetApp
  28. GREP in numbers
      • Operational since May 2010
      • ~ 200,000 objects
      • ~ 50,000,000 files
      • 1,200 file types
      • Smallest object: 10 KB
      • Largest object: 250 GB
      • 8.3 TB ingested
      • 2.0 TB in queue + ( >> TB Google Books)
      • 100,000 page views per year (via Meercat)
  29. Demo
  30. Ingest
  31. Automatic Aleph Import
  32. Access
  33. OAI-PMH, SPARQL, REST, SOAP
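
Of the access protocols on the last slide, OAI-PMH is the simplest to illustrate: a harvester sends an HTTP request with a `verb` and a few arguments. The sketch below only builds such a request URL. The base URL is a placeholder, since the slides do not give the repository's real endpoint, and the helper name `oai_request_url` is made up for this example. The verb `ListRecords` and the arguments `metadataPrefix` and `from` come from the OAI-PMH 2.0 specification.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: the slides do not give the repository's real OAI-PMH URL.
BASE_URL = "https://repository.example.org/oai"


def oai_request_url(verb, arguments=None):
    """Build an OAI-PMH request URL from a verb and a dict of arguments."""
    params = {"verb": verb}
    params.update(arguments or {})
    return f"{BASE_URL}?{urlencode(params)}"


# Ask for Dublin Core records changed on or after 1 May 2010.
# The date is only an example, chosen to match "operational since May 2010".
url = oai_request_url(
    "ListRecords",
    {"metadataPrefix": "oai_dc", "from": "2010-05-01"},
)
print(url)
```

A real harvester would also follow the `resumptionToken` that the server returns when a result list is split over several responses.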

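The figures on slide 28 allow a rough back-of-the-envelope check of what the repository holds. The sketch below uses only the numbers quoted there. It assumes that the object and file counts refer to the 8.3 TB already ingested and that 1 TB means 10^12 bytes; the slides confirm neither, so the averages are indicative only.

```python
# Figures quoted on slide 28 (approximate, as the "~" on the slide indicates).
objects = 200_000
files = 50_000_000
ingested_tb = 8.3
queued_tb = 2.0

# Assumption: decimal units (1 TB = 10**12 bytes); the slide does not say.
BYTES_PER_TB = 10**12

ingested_bytes = ingested_tb * BYTES_PER_TB

files_per_object = files / objects
avg_object_mb = ingested_bytes / objects / 10**6
avg_file_kb = ingested_bytes / files / 10**3
known_total_tb = ingested_tb + queued_tb  # excludes the unquantified Google Books data

print(f"files per object:    {files_per_object:.0f}")
print(f"average object size: {avg_object_mb:.1f} MB")
print(f"average file size:   {avg_file_kb:.0f} KB")
print(f"ingested + queued:   {known_total_tb:.1f} TB")
```

Under these assumptions the averages hide a very wide spread, since the same slide gives 10 KB for the smallest object and 250 GB for the largest.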