Cooperating with Google
Upcoming SlideShare
Loading in...5
×
 

Like this? Share it with your network

Share

Cooperating with Google

on

  • 1,135 views

Presentation at the EVA / MINERVA 2011 Conference in Jerusalem, 15 November 2011

Presentation at the EVA / MINERVA 2011 Conference in Jerusalem, 15 November 2011

Statistics

Views

Total Views
1,135
Views on SlideShare
1,132
Embed Views
3

Actions

Likes
1
Downloads
8
Comments
0

2 Embeds 3

http://paper.li 2
https://twitter.com 1

Accessibility

Categories

Upload Details

Uploaded via as Adobe PDF

Usage Rights

CC Attribution License

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Cooperating with Google Presentation Transcript

  • 1. Cooperating with Google The Austrian National Library’s large-scale digitisation partnership with Google Max Kaiser Head R&D, Austrian National Library EVA / MINERVA 2011 Jerusalem 14–16 November, 2011@maxkaiser
  • 2. Austrian Books Onlinewww.onb.ac.at/ev/austrianbooksonline/@maxkaiser
  • 3. www.slideshare.net/maxkaiser@maxkaiser
  • 4. Digitisationof the entire historicalbook holdings of theAustrian National Library @maxkaiser
  • 5. Largest AustrianPublic Private Partnershipin the cultural sector @maxkaiser
  • 6. @maxkaiser
  • 7. History back to the14th century @maxkaiser
  • 8. One of the world‘s most significant historical collections@maxkaiser
  • 9. „Legal Deposit“ Quelle: http://commons.wikimedia.org/wiki/File:A ustria_Hungary_ethnic_de.svg@maxkaiser
  • 10. State Hall@maxkaiser
  • 11. @maxkaiser http://www.onb.ac.at/sammlungen/siawd/siawd_halev.htm
  • 12. @maxkaiser http://www.onb.ac.at/sammlungen/siawd/100hebraica.htm
  • 13. @maxkaiser
  • 14. @maxkaiser
  • 15. @maxkaiser
  • 16. @maxkaiser
  • 17. →Digitisation@maxkaiser
  • 18. →Digitisation→Online Publications→Web Archiving@maxkaiser
  • 19. 600,000 volumes200 Mio pages@maxkaiser
  • 20. 16th century 2nd half of 19th century _ @maxkaiser
  • 21. Google Books Digital Library Austrian National Library@maxkaiser
  • 22. Partner ProgramGoogle Books Library Program @maxkaiser
  • 23. 13 Libraries in Europe5 National Libraries  Italy  Austria  The Netherlands  Czech Republic  Great Britain @maxkaiser
  • 24. >15 Mio. books >3 Mio. public domain@maxkaiser
  • 25. 10 January 2011http://ec.europa.eu/information_society/activities/digital_libraries/doc/reflection_group/final_report_%20cds.pdf
  • 26. „Stimulating the flow of private fundsfor the digitisation of cultural assets throughequitable public private partnershipsappears as a viable and sustainable wayof tackling the pressing questionof making Europe’s cultural wealthaccessible online and preserving itfor future generations.“ @maxkaiser
  • 27. „The key question is not whetherpublic-private partnerships for digitisationshould be encouraged, but how‚and under which conditions.“ @maxkaiser
  • 28. 27 October 2011
  • 29. „(...) recommends that Member States (...)encourage partnerships between culturalinstitutions and the private sector inorder to create new ways of fundingdigitisation of cultural material and tostimulate innovative uses of the material,while ensuring that public privatepartnerships for digitisation are fair andbalanced (…).“ @maxkaiser
  • 30. Key principles:1. Respect for intellectual property rights → Only public-domain works digitised2. Non-exclusivity → Free to digitise material with other partners3. Transparency of the process → Public Tender @maxkaiser
  • 31. Key principles:4. Transparency of agreements → Very detailed FAQs online5. Accessibility through Europeana → All files available for non-commercial use → Access via platforms like Europeana → Provision to research partners6. Key criteria → [Next slide] @maxkaiser
  • 32. Key criteria for assessing PPPs(selection):→ (Free) access to material for general public→ Cross-border access→ Quality of digital copies for public partner→ Usage conditions for public partner @maxkaiser
  • 33. Additional key elements:→ Selection of books by library→ Institute for Restoration involved→ Termination @maxkaiser
  • 34. Who is paying for what?http://www.bildarchivaustria.at/downl/1148453/layout/CE%2043_3.jpg
  • 35. Costs→ Full text-digitisation: very expensive→ Report by Collections Trust for Comité des Sages http://ec.europa.eu/information_society/activities/digital_libraries/ doc/refgroup/annexes/digiti_report.pdf @maxkaiser
  • 36. Google:→ Transport→ Insurance→ Scanning→ OCR→ Image processing→ Quality control→ Google Books @maxkaiser
  • 37. Austrian National Library:→ Provision of Metadata→ Selection→ Internal logistics→ Conservational assessment→ Barcoding→ Metadata adjustments→ Data download and control→ Data storage & digital preservation→ Digital Library @maxkaiser
  • 38. →Conservation→Preservation http://www.mediathek.at/akustische-chronik/popup/popup.php?document_id=1000115&zone_id= 1000043&template_id=1000016&zone_name=IMAGE_ZONE1
  • 39. Which books?@maxkaiser
  • 40. Entire historical book holdings16th –19th century
  • 41. 200.000 volumes State Hall@maxkaiser
  • 42. Department of Manuscriptsand Rare Books Map Department Quelle: http://deu.archinform.net/projekte/107
  • 43. Department of Music
  • 44. Theatre Museum Quelle: http://commons.wikimedia.org/wiki/File:Palais_Lobkowitz_Vienna_Oct._2006_006.jpg
  • 45. Fidei Commiss Library
  • 46. @maxkaiser
  • 47. 7 Work Packages  Book logistics  Metadata / Catalogues  Conservation / Restoration  Data download / Quality control  Access  IT infrastructure  Project management@maxkaiser
  • 48. Preparatory projectMid - end 2010→ Integration with organisational processes→ Personnel resources→ Logistics workflows @maxkaiser
  • 49. Internal communication→ Change processes→ Re-evaluation of workflows→ Availability of internal resources @maxkaiser
  • 50. Consultation with otherGoogle partners Quelle: http://commons.wikimedia.org/wiki/File:M%C3%BCnchen_Bayerische_Staatsbibliothek_001.JPG
  • 51. 70+ staff members20+ exclusively for project → Book logistics → Metadata adaptation → Cataloguing → Conservation / restoration → Quality control → Software implementation → Project management@maxkaiser
  • 52. End of 2010Test shipment & Start operational projectSpring 2011Start of digitisation @maxkaiser
  • 53. No individual selection …
  • 54. Format
  • 55. Format
  • 56. Condition
  • 57. Preparation
  • 58. Conservationalevaluation
  • 59. Value
  • 60. Logistics in theState Hall
  • 61. Logistics in theState Hall
  • 62. Logistics in theState Hall
  • 63. Challenges…
  • 64. Challenges…
  • 65. Challenges…
  • 66. Challenges…
  • 67. Challenges…
  • 68. Logistics in the„Aurum“ Depot
  • 69. Logistics in the„Aurum“ Depot
  • 70. Preparation forDigitisation
  • 71. Manipulation area …
  • 72. Barcoding
  • 73. Adaptation of metadata
  • 74. @maxkaiser
  • 75. 8 minutes / volume
  • 76. books@maxkaiser
  • 77. hours@maxkaiser
  • 78. working days@maxkaiser
  • 79. person years@maxkaiser
  • 80. Complex cases …
  • 81. Bound-Togethers …
  • 82. Bound-Togethers …
  • 83. Bound-Togethers …
  • 84. „Slim“ volumes …
  • 85. Special collections …
  • 86. Conservational protection
  • 87. Conservational protection
  • 88. Conservational protection
  • 89. @maxkaiser Conservational protection
  • 90. Cataloguing of theFidei Commiss Library
  • 91. Cataloguing of theFidei Commiss Library
  • 92. Ready for Digitisation …
  • 93. Digitisation→ Scanning Center in Germany→ Procedures agreed→ Austrian Federal Office for Monuments involved→ Each volume checked after return→ Books unavailable to users for ~ 3 months @maxkaiser
  • 94. @maxkaiser
  • 95. Book Logistics Digitisation Data Download ADOCO Quality Control (Austrian Books Online Download & Control) Storage @maxkaiser Access
  • 96. digitised items / day @maxkaiser
  • 97. Quality control→ Automated jobs→ Representative samples→ IT assisted discovery of error clusters→ Error candidates checked manually @maxkaiser
  • 98. @maxkaiser
  • 99. @maxkaiser
  • 100. Technologies and Workflowsfrom EC co-funded FP7 projects:→ SCAPE (Scalable Preservation Environments) →http://www.scape-project.eu/→ IMPACT (Improving Access to Text) →http://www.impact-project.eu/ @maxkaiser
  • 101. http://www.digitisation.eu/
  • 102. Data Management & Access→ Data storage: inhouse→ JPEG-2000 master files stored redundantly→ Access copies generated on-the-fly→ URN resolver for permanent identification @maxkaiser
  • 103. Book ViewerCatalogue /“Quick Search” [Mobile Apps] Full-Text Search @maxkaiser
  • 104. USER Book Viewer Fulltext Index Server Quick Search Image Server URN Resolver Catalogue Digital Repository Master ImagesGoogle ADOCO @maxkaiser
  • 105. Outlook→ Full-Text: new possibilities for research→ Data enrichment→ Named Entity Recognition→ Linked Data→ New data centric research in the Humanities & Social Sciences→ http://www.diggingintodata.org/ @maxkaiser
  • 106. @maxkaiser
  • 107. http://books.google.com/books?vid=ONB%2BZ119586207@maxkaiser
  • 108. http://books.google.com/books?vid=ONB%2BZ119586207@maxkaiser
  • 109. More informationwww.onb.ac.at/ev/austrianbooksonlinewww.onb.ac.at/ev/austrianbooksonline/faq.htmtwitter.com/abooksonline @maxkaiser
  • 110. Thank you!max.kaiser@onb.ac.atwww.onb.ac.atwww.slideshare.net/maxkaiserwww.linkedin.com/in/maxkaisergplus.to/maxkaisertwitter.com/maxkaiser @maxkaiser