Cooperating with Google          The Austrian National Library’s  large-scale digitisation partnership with Google        ...
Austrian Books Onlinewww.onb.ac.at/ev/austrianbooksonline/@maxkaiser
www.slideshare.net/maxkaiser@maxkaiser
Digitisationof the entire historicalbook holdings of theAustrian National Library @maxkaiser
Largest AustrianPublic Private Partnershipin the cultural sector @maxkaiser
@maxkaiser
History back to the14th century @maxkaiser
One of the world‘s                  most significant             historical collections@maxkaiser
„Legal Deposit“             Quelle:                        http://commons.wikimedia.org/wiki/File:A                       ...
State Hall@maxkaiser
@maxkaiser   http://www.onb.ac.at/sammlungen/siawd/siawd_halev.htm
@maxkaiser   http://www.onb.ac.at/sammlungen/siawd/100hebraica.htm
@maxkaiser
@maxkaiser
@maxkaiser
@maxkaiser
→Digitisation@maxkaiser
→Digitisation→Online Publications→Web Archiving@maxkaiser
600,000 volumes200 Mio pages@maxkaiser
16th century                     2nd half of              19th   century _ @maxkaiser
Google Books             Digital Library             Austrian National Library@maxkaiser
Partner ProgramGoogle Books                Library Program   @maxkaiser
13 Libraries in Europe5 National Libraries             Italy             Austria             The Netherlands          ...
>15 Mio. books     >3 Mio.   public domain@maxkaiser
10 January 2011http://ec.europa.eu/information_society/activities/digital_libraries/doc/reflection_group/final_report_%20c...
„Stimulating the flow of private fundsfor the digitisation of cultural assets throughequitable public private partnerships...
„The key question is not whetherpublic-private partnerships for digitisationshould be encouraged, but how‚and under which ...
27 October 2011
„(...) recommends that Member States (...)encourage partnerships between culturalinstitutions and the private sector inord...
Key principles:1. Respect for intellectual property rights     → Only public-domain works digitised2. Non-exclusivity     ...
Key principles:4. Transparency of agreements     → Very detailed FAQs online5. Accessibility through Europeana     → All f...
Key criteria for assessing PPPs(selection):→ (Free) access to material for general  public→ Cross-border access→ Quality o...
Additional key elements:→ Selection of books by library→ Institute for Restoration involved→ Termination @maxkaiser
Who is paying                                                                     for what?http://www.bildarchivaustria.at...
Costs→ Full text-digitisation:  very expensive→ Report by  Collections Trust  for Comité des Sages                        ...
Google:→ Transport→ Insurance→ Scanning→ OCR→ Image processing→ Quality control→ Google Books @maxkaiser
Austrian National Library:→ Provision of Metadata→ Selection→ Internal logistics→ Conservational assessment→ Barcoding→ Me...
→Conservation→Preservation            http://www.mediathek.at/akustische-chronik/popup/popup.php?document_id=1000115&zone_...
Which books?@maxkaiser
Entire historical book holdings16th –19th century
200.000 volumes             State Hall@maxkaiser
Department of Manuscriptsand Rare Books    Map Department                            Quelle: http://deu.archinform.net/pro...
Department of Music
Theatre Museum          Quelle: http://commons.wikimedia.org/wiki/File:Palais_Lobkowitz_Vienna_Oct._2006_006.jpg
Fidei Commiss Library
@maxkaiser
7      Work Packages            Book logistics            Metadata / Catalogues            Conservation / Restoration  ...
Preparatory projectMid - end 2010→ Integration with organisational processes→ Personnel resources→ Logistics workflows @ma...
Internal communication→ Change processes→ Re-evaluation of workflows→ Availability of internal resources @maxkaiser
Consultation with   otherGoogle partners                    Quelle: http://commons.wikimedia.org/wiki/File:M%C3%BCnchen_Ba...
70+ staff members20+ exclusively for project        → Book logistics        → Metadata adaptation        → Cataloguing    ...
End of 2010Test shipment & Start operational projectSpring 2011Start of digitisation  @maxkaiser
No   individual selection …
Format
Format
Condition
Preparation
Conservationalevaluation
Value
Logistics in theState Hall
Logistics in theState Hall
Logistics in theState Hall
Challenges…
Challenges…
Challenges…
Challenges…
Challenges…
Logistics in the„Aurum“ Depot
Logistics in the„Aurum“ Depot
Preparation forDigitisation
Manipulation area …
Barcoding
Adaptation of metadata
@maxkaiser
8 minutes / volume
books@maxkaiser
hours@maxkaiser
working days@maxkaiser
person years@maxkaiser
Complex cases …
Bound-Togethers …
Bound-Togethers …
Bound-Togethers …
„Slim“ volumes …
Special collections …
Conservational protection
Conservational protection
Conservational protection
@maxkaiser             Conservational protection
Cataloguing of theFidei Commiss Library
Cataloguing of theFidei Commiss Library
Ready for Digitisation …
Digitisation→ Scanning Center in Germany→ Procedures agreed→ Austrian Federal Office for Monuments involved→ Each volume c...
@maxkaiser
Book Logistics              Digitisation                           Data Download     ADOCO                 Quality Control...
digitised items / day @maxkaiser
Quality control→ Automated jobs→ Representative samples→ IT assisted discovery of error clusters→ Error candidates checked...
@maxkaiser
@maxkaiser
Technologies and Workflowsfrom EC co-funded FP7 projects:→ SCAPE  (Scalable Preservation Environments)   →http://www.scape...
http://www.digitisation.eu/
Data Management & Access→ Data storage: inhouse→ JPEG-2000 master files stored redundantly→ Access copies generated on-the...
Book ViewerCatalogue /“Quick Search”                                  [Mobile Apps]               Full-Text Search  @maxka...
USER                                       Book Viewer                        Fulltext                        Index Server...
Outlook→ Full-Text: new possibilities for research→ Data enrichment→ Named Entity Recognition→ Linked Data→ New data centr...
@maxkaiser
http://books.google.com/books?vid=ONB%2BZ119586207@maxkaiser
http://books.google.com/books?vid=ONB%2BZ119586207@maxkaiser
More informationwww.onb.ac.at/ev/austrianbooksonlinewww.onb.ac.at/ev/austrianbooksonline/faq.htmtwitter.com/abooksonline  ...
Thank you!max.kaiser@onb.ac.atwww.onb.ac.atwww.slideshare.net/maxkaiserwww.linkedin.com/in/maxkaisergplus.to/maxkaisertwit...
Cooperating with Google
Upcoming SlideShare
Loading in …5
×

Cooperating with Google

1,130 views
1,059 views

Published on

Presentation at the EVA / MINERVA 2011 Conference in Jerusalem, 15 November 2011

Published in: Technology, Education
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,130
On SlideShare
0
From Embeds
0
Number of Embeds
6
Actions
Shares
0
Downloads
9
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Cooperating with Google

  1. 1. Cooperating with Google The Austrian National Library’s large-scale digitisation partnership with Google Max Kaiser Head R&D, Austrian National Library EVA / MINERVA 2011 Jerusalem 14–16 November, 2011@maxkaiser
  2. 2. Austrian Books Onlinewww.onb.ac.at/ev/austrianbooksonline/@maxkaiser
  3. 3. www.slideshare.net/maxkaiser@maxkaiser
  4. 4. Digitisationof the entire historicalbook holdings of theAustrian National Library @maxkaiser
  5. 5. Largest AustrianPublic Private Partnershipin the cultural sector @maxkaiser
  6. 6. @maxkaiser
  7. 7. History back to the14th century @maxkaiser
  8. 8. One of the world‘s most significant historical collections@maxkaiser
  9. 9. „Legal Deposit“ Quelle: http://commons.wikimedia.org/wiki/File:A ustria_Hungary_ethnic_de.svg@maxkaiser
  10. 10. State Hall@maxkaiser
  11. 11. @maxkaiser http://www.onb.ac.at/sammlungen/siawd/siawd_halev.htm
  12. 12. @maxkaiser http://www.onb.ac.at/sammlungen/siawd/100hebraica.htm
  13. 13. @maxkaiser
  14. 14. @maxkaiser
  15. 15. @maxkaiser
  16. 16. @maxkaiser
  17. 17. →Digitisation@maxkaiser
  18. 18. →Digitisation→Online Publications→Web Archiving@maxkaiser
  19. 19. 600,000 volumes200 Mio pages@maxkaiser
  20. 20. 16th century 2nd half of 19th century _ @maxkaiser
  21. 21. Google Books Digital Library Austrian National Library@maxkaiser
  22. 22. Partner ProgramGoogle Books Library Program @maxkaiser
  23. 23. 13 Libraries in Europe5 National Libraries  Italy  Austria  The Netherlands  Czech Republic  Great Britain @maxkaiser
  24. 24. >15 Mio. books >3 Mio. public domain@maxkaiser
  25. 25. 10 January 2011http://ec.europa.eu/information_society/activities/digital_libraries/doc/reflection_group/final_report_%20cds.pdf
  26. 26. „Stimulating the flow of private fundsfor the digitisation of cultural assets throughequitable public private partnershipsappears as a viable and sustainable wayof tackling the pressing questionof making Europe’s cultural wealthaccessible online and preserving itfor future generations.“ @maxkaiser
  27. 27. „The key question is not whetherpublic-private partnerships for digitisationshould be encouraged, but how‚and under which conditions.“ @maxkaiser
  28. 28. 27 October 2011
  29. 29. „(...) recommends that Member States (...)encourage partnerships between culturalinstitutions and the private sector inorder to create new ways of fundingdigitisation of cultural material and tostimulate innovative uses of the material,while ensuring that public privatepartnerships for digitisation are fair andbalanced (…).“ @maxkaiser
  30. 30. Key principles:1. Respect for intellectual property rights → Only public-domain works digitised2. Non-exclusivity → Free to digitise material with other partners3. Transparency of the process → Public Tender @maxkaiser
  31. 31. Key principles:4. Transparency of agreements → Very detailed FAQs online5. Accessibility through Europeana → All files available for non-commercial use → Access via platforms like Europeana → Provision to research partners6. Key criteria → [Next slide] @maxkaiser
  32. 32. Key criteria for assessing PPPs(selection):→ (Free) access to material for general public→ Cross-border access→ Quality of digital copies for public partner→ Usage conditions for public partner @maxkaiser
  33. 33. Additional key elements:→ Selection of books by library→ Institute for Restoration involved→ Termination @maxkaiser
  34. 34. Who is paying for what?http://www.bildarchivaustria.at/downl/1148453/layout/CE%2043_3.jpg
  35. 35. Costs→ Full text-digitisation: very expensive→ Report by Collections Trust for Comité des Sages http://ec.europa.eu/information_society/activities/digital_libraries/ doc/refgroup/annexes/digiti_report.pdf @maxkaiser
  36. 36. Google:→ Transport→ Insurance→ Scanning→ OCR→ Image processing→ Quality control→ Google Books @maxkaiser
  37. 37. Austrian National Library:→ Provision of Metadata→ Selection→ Internal logistics→ Conservational assessment→ Barcoding→ Metadata adjustments→ Data download and control→ Data storage & digital preservation→ Digital Library @maxkaiser
  38. 38. →Conservation→Preservation http://www.mediathek.at/akustische-chronik/popup/popup.php?document_id=1000115&zone_id= 1000043&template_id=1000016&zone_name=IMAGE_ZONE1
  39. 39. Which books?@maxkaiser
  40. 40. Entire historical book holdings16th –19th century
  41. 41. 200.000 volumes State Hall@maxkaiser
  42. 42. Department of Manuscriptsand Rare Books Map Department Quelle: http://deu.archinform.net/projekte/107
  43. 43. Department of Music
  44. 44. Theatre Museum Quelle: http://commons.wikimedia.org/wiki/File:Palais_Lobkowitz_Vienna_Oct._2006_006.jpg
  45. 45. Fidei Commiss Library
  46. 46. @maxkaiser
  47. 47. 7 Work Packages  Book logistics  Metadata / Catalogues  Conservation / Restoration  Data download / Quality control  Access  IT infrastructure  Project management@maxkaiser
  48. 48. Preparatory projectMid - end 2010→ Integration with organisational processes→ Personnel resources→ Logistics workflows @maxkaiser
  49. 49. Internal communication→ Change processes→ Re-evaluation of workflows→ Availability of internal resources @maxkaiser
  50. 50. Consultation with otherGoogle partners Quelle: http://commons.wikimedia.org/wiki/File:M%C3%BCnchen_Bayerische_Staatsbibliothek_001.JPG
  51. 51. 70+ staff members20+ exclusively for project → Book logistics → Metadata adaptation → Cataloguing → Conservation / restoration → Quality control → Software implementation → Project management@maxkaiser
  52. 52. End of 2010Test shipment & Start operational projectSpring 2011Start of digitisation @maxkaiser
  53. 53. No individual selection …
  54. 54. Format
  55. 55. Format
  56. 56. Condition
  57. 57. Preparation
  58. 58. Conservationalevaluation
  59. 59. Value
  60. 60. Logistics in theState Hall
  61. 61. Logistics in theState Hall
  62. 62. Logistics in theState Hall
  63. 63. Challenges…
  64. 64. Challenges…
  65. 65. Challenges…
  66. 66. Challenges…
  67. 67. Challenges…
  68. 68. Logistics in the„Aurum“ Depot
  69. 69. Logistics in the„Aurum“ Depot
  70. 70. Preparation forDigitisation
  71. 71. Manipulation area …
  72. 72. Barcoding
  73. 73. Adaptation of metadata
  74. 74. @maxkaiser
  75. 75. 8 minutes / volume
  76. 76. books@maxkaiser
  77. 77. hours@maxkaiser
  78. 78. working days@maxkaiser
  79. 79. person years@maxkaiser
  80. 80. Complex cases …
  81. 81. Bound-Togethers …
  82. 82. Bound-Togethers …
  83. 83. Bound-Togethers …
  84. 84. „Slim“ volumes …
  85. 85. Special collections …
  86. 86. Conservational protection
  87. 87. Conservational protection
  88. 88. Conservational protection
  89. 89. @maxkaiser Conservational protection
  90. 90. Cataloguing of theFidei Commiss Library
  91. 91. Cataloguing of theFidei Commiss Library
  92. 92. Ready for Digitisation …
  93. 93. Digitisation→ Scanning Center in Germany→ Procedures agreed→ Austrian Federal Office for Monuments involved→ Each volume checked after return→ Books unavailable to users for ~ 3 months @maxkaiser
  94. 94. @maxkaiser
  95. 95. Book Logistics Digitisation Data Download ADOCO Quality Control (Austrian Books Online Download & Control) Storage @maxkaiser Access
  96. 96. digitised items / day @maxkaiser
  97. 97. Quality control→ Automated jobs→ Representative samples→ IT assisted discovery of error clusters→ Error candidates checked manually @maxkaiser
  98. 98. @maxkaiser
  99. 99. @maxkaiser
  100. 100. Technologies and Workflowsfrom EC co-funded FP7 projects:→ SCAPE (Scalable Preservation Environments) →http://www.scape-project.eu/→ IMPACT (Improving Access to Text) →http://www.impact-project.eu/ @maxkaiser
  101. 101. http://www.digitisation.eu/
  102. 102. Data Management & Access→ Data storage: inhouse→ JPEG-2000 master files stored redundantly→ Access copies generated on-the-fly→ URN resolver for permanent identification @maxkaiser
  103. 103. Book ViewerCatalogue /“Quick Search” [Mobile Apps] Full-Text Search @maxkaiser
  104. 104. USER Book Viewer Fulltext Index Server Quick Search Image Server URN Resolver Catalogue Digital Repository Master ImagesGoogle ADOCO @maxkaiser
  105. 105. Outlook→ Full-Text: new possibilities for research→ Data enrichment→ Named Entity Recognition→ Linked Data→ New data centric research in the Humanities & Social Sciences→ http://www.diggingintodata.org/ @maxkaiser
  106. 106. @maxkaiser
  107. 107. http://books.google.com/books?vid=ONB%2BZ119586207@maxkaiser
  108. 108. http://books.google.com/books?vid=ONB%2BZ119586207@maxkaiser
  109. 109. More informationwww.onb.ac.at/ev/austrianbooksonlinewww.onb.ac.at/ev/austrianbooksonline/faq.htmtwitter.com/abooksonline @maxkaiser
  110. 110. Thank you!max.kaiser@onb.ac.atwww.onb.ac.atwww.slideshare.net/maxkaiserwww.linkedin.com/in/maxkaisergplus.to/maxkaisertwitter.com/maxkaiser @maxkaiser

×