Putting 600,000 books online The large-scale digitisation partnership between    the Austrian National Library and Google ...
Austrian Books Onlinewww.onb.ac.at/ev/austrianbooksonline/@maxkaiser
Digitisationof the entire historicalbook holdings of theAustrian National Library @maxkaiser
Largest AustrianPublic Private Partnershipin the cultural sector @maxkaiser
@maxkaiser
History back to the14th century @maxkaiser
One of the world‘s                  most significant             historical collections@maxkaiser
„Legal Deposit“             Quelle:                        http://commons.wikimedia.org/wiki/File:A                       ...
→Digitisation@maxkaiser
→Digitisation→Online Publications→Web Archiving@maxkaiser
6 years600,000 volumes180 Mio pages@maxkaiser
Google Books             Digital Library             Austrian National Library@maxkaiser
Partner ProgramGoogle Books                Library Program   @maxkaiser
13 Libraries in Europe5 National Libraries             Italy             Austria             The Netherlands          ...
http://ec.europa.eu/information_society/activities/digital_libraries/doc/reflection_group/final_report_%20cds.pdf
„Stimulating the flow of private fundsfor the digitisation of cultural assets throughequitable public private partnerships...
„The key question is not whetherpublic-private partnerships for digitisationshould be encouraged, but how‚and under which ...
Cornerstones:→ Respect for rights holders     →Only public-domain works digitised→ Transparency     →Very detailed FAQs on...
Cornerstones:→ Digital copies for library     →Identical with copies used by Google→ Re-use     →All files available for n...
Additional key elements:→ Selection of books by library→ Institute for Restoration involved→ Termination @maxkaiser
Who is paying                                                                     for what?http://www.bildarchivaustria.at...
Costs→ Full text-digitisation:  very expensive→ Report by  Collections Trust  for Comité des Sages                        ...
70–100 Euro / book Typical costs for projects withexternal service providers @maxkaiser
Google:→ Transport→ Insurance→ Scanning→ OCR→ Image processing→ Quality control→ Google Books @maxkaiser
Austrian National Library:→ Selection→ Internal logistics→ Conservational assessment→ Metadata→ Barcoding→ Data download a...
Which books?@maxkaiser
200.000 volumes             State Hall@maxkaiser
Entire historical book holdings16th –19th century
Department of Manuscriptsand Rare Books    Map Department                            Quelle: http://deu.archinform.net/pro...
Department of Music
Theatre Museum          Quelle: http://commons.wikimedia.org/wiki/File:Palais_Lobkowitz_Vienna_Oct._2006_006.jpg
Fidei Commiss Library
→Conservation→Preservation            http://www.mediathek.at/akustische-chronik/popup/popup.php?document_id=1000115&zone_...
7      Work Packages            Book logistics            Metadata / Catalogues            Conservation / Restoration  ...
Preparatory projectMid - end 2010→ Integration with organisational processes→ Personnel resources→ Logistics workflows @ma...
Internal communication→ Change processes→ Re-evaluation of workflows→ Availability of internal resources @maxkaiser
Consultation with   otherGoogle partners                    Quelle: http://commons.wikimedia.org/wiki/File:M%C3%BCnchen_Ba...
70+ staff members20+ exclusively for project        → Book logistics        → Metadata adaptation        → Cataloguing    ...
End of 2010Test shipment & Start operational projectSpring 2011Start of digitisation  @maxkaiser
No   individual selection …
Format
Format
Condition
Preparation
Conservationalevaluation     <1%     excluded for     conservational reasons
Value
Logistics in theState Hall
Logistics in theState Hall
Logistics in theState Hall
Challenges…
Challenges…
Challenges…
Challenges…
Challenges…
Logistics in the„Aurum“ Depot
Logistics in the„Aurum“ Depot
Preparation forDigitisation
Manipulation area …
Barcoding
Adaptation of metadata
4 minutes / volume
books@maxkaiser
hours@maxkaiser
working days@maxkaiser
person years@maxkaiser
Complex cases …
Adligata …
Adligata …
Adligata …
„Slim“ volumes …
Special collections …
Conservational protection
Conservational protection
Conservational protection
Cataloguing of theFidei Commiss Library
Cataloguing of theFidei Commiss Library
Ready for Digitisation …
Digitisation→ Scanning Center in Germany→ Procedures agreed→ Austrian Federal Office for Monuments involved→ Each volume c...
Book Logistics              Digitisation                           Data Download     ADOCO                 Quality Control...
digitised items / year @maxkaiser
digitised items / day @maxkaiser
Quality control→ Automated jobs→ Representative samples→ IT assisted discovery of error clusters→ Error candidates checked...
Data Management & Access→ Data storage: inhouse→ JPEG-2000 master files stored redundantly→ Access copies generated on-the...
Book ViewerCatalogue /“Quick Search”                                  [Mobile Apps]               Full-Text Search  @maxka...
USER                                       Book Viewer                        Fulltext                        Index Server...
Outlook→ Full-Text: new possibilities for research→ Data enrichment→ Named Entity Recognition→ Linked Data→ New data centr...
@maxkaiser
@maxkaiser
@maxkaiser
More informationwww.onb.ac.at/ev/austrianbooksonlinewww.onb.ac.at/ev/austrianbooksonline/faq.htmtwitter.com/abooksonline  ...
Happy Birthday!@maxkaiser
Thank you!max.kaiser@onb.ac.atwww.onb.ac.atwww.linkedin.com/in/maxkaisertwitter.com/maxkaiser  @maxkaiser
Putting 600,000 books online: The large-scale digitisation partnership between the Austrian National Library and Google
Upcoming SlideShare
Loading in …5
×

Putting 600,000 books online: The large-scale digitisation partnership between the Austrian National Library and Google

827 views
768 views

Published on

Presentation at the 40th LIBER Annual Conference, Barcelona, 29 June - 2 July 2011

Published in: Technology, Education
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
827
On SlideShare
0
From Embeds
0
Number of Embeds
83
Actions
Shares
0
Downloads
18
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Putting 600,000 books online: The large-scale digitisation partnership between the Austrian National Library and Google

  1. 1. Putting 600,000 books online The large-scale digitisation partnership between the Austrian National Library and Google Max Kaiser Head R&D, Austrian National Library 40th LIBER Annual Conference Barcelona 29 June – 2 July, 2011@maxkaiser
  2. 2. Austrian Books Onlinewww.onb.ac.at/ev/austrianbooksonline/@maxkaiser
  3. 3. Digitisationof the entire historicalbook holdings of theAustrian National Library @maxkaiser
  4. 4. Largest AustrianPublic Private Partnershipin the cultural sector @maxkaiser
  5. 5. @maxkaiser
  6. 6. History back to the14th century @maxkaiser
  7. 7. One of the world‘s most significant historical collections@maxkaiser
  8. 8. „Legal Deposit“ Quelle: http://commons.wikimedia.org/wiki/File:A ustria_Hungary_ethnic_de.svg@maxkaiser
  9. 9. →Digitisation@maxkaiser
  10. 10. →Digitisation→Online Publications→Web Archiving@maxkaiser
  11. 11. 6 years600,000 volumes180 Mio pages@maxkaiser
  12. 12. Google Books Digital Library Austrian National Library@maxkaiser
  13. 13. Partner ProgramGoogle Books Library Program @maxkaiser
  14. 14. 13 Libraries in Europe5 National Libraries  Italy  Austria  The Netherlands  Czech Republic  Great Britain @maxkaiser
  15. 15. http://ec.europa.eu/information_society/activities/digital_libraries/doc/reflection_group/final_report_%20cds.pdf
  16. 16. „Stimulating the flow of private fundsfor the digitisation of cultural assets throughequitable public private partnershipsappears as a viable and sustainable wayof tackling the pressing questionof making Europe’s cultural wealthaccessible online and preserving itfor future generations.“ @maxkaiser
  17. 17. „The key question is not whetherpublic-private partnerships for digitisationshould be encouraged, but how‚and under which conditions.“ @maxkaiser
  18. 18. Cornerstones:→ Respect for rights holders →Only public-domain works digitised→ Transparency →Very detailed FAQs online→ Access →Free-of-charge access worldwide @maxkaiser
  19. 19. Cornerstones:→ Digital copies for library →Identical with copies used by Google→ Re-use →All files available for non-commercial use →Access via platforms like Europeana →Provision to research partners→ Non-exclusivity →Free to digitise material with other partners @maxkaiser
  20. 20. Additional key elements:→ Selection of books by library→ Institute for Restoration involved→ Termination @maxkaiser
  21. 21. Who is paying for what?http://www.bildarchivaustria.at/downl/1148453/layout/CE%2043_3.jpg
  22. 22. Costs→ Full text-digitisation: very expensive→ Report by Collections Trust for Comité des Sages http://ec.europa.eu/information_society/activities/digital_libraries/ doc/refgroup/annexes/digiti_report.pdf @maxkaiser
  23. 23. 70–100 Euro / book Typical costs for projects withexternal service providers @maxkaiser
  24. 24. Google:→ Transport→ Insurance→ Scanning→ OCR→ Image processing→ Quality control→ Google Books @maxkaiser
  25. 25. Austrian National Library:→ Selection→ Internal logistics→ Conservational assessment→ Metadata→ Barcoding→ Data download and control→ Data storage & digital preservation→ Digital Library @maxkaiser
  26. 26. Which books?@maxkaiser
  27. 27. 200.000 volumes State Hall@maxkaiser
  28. 28. Entire historical book holdings16th –19th century
  29. 29. Department of Manuscriptsand Rare Books Map Department Quelle: http://deu.archinform.net/projekte/107
  30. 30. Department of Music
  31. 31. Theatre Museum Quelle: http://commons.wikimedia.org/wiki/File:Palais_Lobkowitz_Vienna_Oct._2006_006.jpg
  32. 32. Fidei Commiss Library
  33. 33. →Conservation→Preservation http://www.mediathek.at/akustische-chronik/popup/popup.php?document_id=1000115&zone_id= 1000043&template_id=1000016&zone_name=IMAGE_ZONE1
  34. 34. 7 Work Packages  Book logistics  Metadata / Catalogues  Conservation / Restoration  Data download / Quality control  Access  IT infrastructure  Project management@maxkaiser
  35. 35. Preparatory projectMid - end 2010→ Integration with organisational processes→ Personnel resources→ Logistics workflows @maxkaiser
  36. 36. Internal communication→ Change processes→ Re-evaluation of workflows→ Availability of internal resources @maxkaiser
  37. 37. Consultation with otherGoogle partners Quelle: http://commons.wikimedia.org/wiki/File:M%C3%BCnchen_Bayerische_Staatsbibliothek_001.JPG
  38. 38. 70+ staff members20+ exclusively for project → Book logistics → Metadata adaptation → Cataloguing → Conservation / restoration → Quality control → Software implementation → Project management@maxkaiser
  39. 39. End of 2010Test shipment & Start operational projectSpring 2011Start of digitisation @maxkaiser
  40. 40. No individual selection …
  41. 41. Format
  42. 42. Format
  43. 43. Condition
  44. 44. Preparation
  45. 45. Conservationalevaluation <1% excluded for conservational reasons
  46. 46. Value
  47. 47. Logistics in theState Hall
  48. 48. Logistics in theState Hall
  49. 49. Logistics in theState Hall
  50. 50. Challenges…
  51. 51. Challenges…
  52. 52. Challenges…
  53. 53. Challenges…
  54. 54. Challenges…
  55. 55. Logistics in the„Aurum“ Depot
  56. 56. Logistics in the„Aurum“ Depot
  57. 57. Preparation forDigitisation
  58. 58. Manipulation area …
  59. 59. Barcoding
  60. 60. Adaptation of metadata
  61. 61. 4 minutes / volume
  62. 62. books@maxkaiser
  63. 63. hours@maxkaiser
  64. 64. working days@maxkaiser
  65. 65. person years@maxkaiser
  66. 66. Complex cases …
  67. 67. Adligata …
  68. 68. Adligata …
  69. 69. Adligata …
  70. 70. „Slim“ volumes …
  71. 71. Special collections …
  72. 72. Conservational protection
  73. 73. Conservational protection
  74. 74. Conservational protection
  75. 75. Cataloguing of theFidei Commiss Library
  76. 76. Cataloguing of theFidei Commiss Library
  77. 77. Ready for Digitisation …
  78. 78. Digitisation→ Scanning Center in Germany→ Procedures agreed→ Austrian Federal Office for Monuments involved→ Each volume checked after return→ Books unavailable to users for ~ 3 months @maxkaiser
  79. 79. Book Logistics Digitisation Data Download ADOCO Quality Control (Austrian Books Online Download & Control) Storage @maxkaiser Access
  80. 80. digitised items / year @maxkaiser
  81. 81. digitised items / day @maxkaiser
  82. 82. Quality control→ Automated jobs→ Representative samples→ IT assisted discovery of error clusters→ Error candidates checked manually→Crowdsourcing? @maxkaiser
  83. 83. Data Management & Access→ Data storage: inhouse→ JPEG-2000 master files stored redundantly→ Access copies generated on-the-fly→ URN resolver for permanent identification @maxkaiser
  84. 84. Book ViewerCatalogue /“Quick Search” [Mobile Apps] Full-Text Search @maxkaiser
  85. 85. USER Book Viewer Fulltext Index Server Quick Search Image Server URN Resolver Catalogue Digital Repository Master ImagesGoogle ADOCO @maxkaiser
  86. 86. Outlook→ Full-Text: new possibilities for research→ Data enrichment→ Named Entity Recognition→ Linked Data→ New data centric research in the Humanities & Social Sciences→ http://www.diggingintodata.org/ @maxkaiser
  87. 87. @maxkaiser
  88. 88. @maxkaiser
  89. 89. @maxkaiser
  90. 90. More informationwww.onb.ac.at/ev/austrianbooksonlinewww.onb.ac.at/ev/austrianbooksonline/faq.htmtwitter.com/abooksonline @maxkaiser
  91. 91. Happy Birthday!@maxkaiser
  92. 92. Thank you!max.kaiser@onb.ac.atwww.onb.ac.atwww.linkedin.com/in/maxkaisertwitter.com/maxkaiser @maxkaiser

×