Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google

664 views

Published on

Presentation at the European Business Press Editors’ Seminar , Vienna, 26 March 2014

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
664
On SlideShare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
4
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google

  1. 1. @maxkaiser Austrian Books Online Max Kaiser Head of Research and Development Austrian National Library max.kaiser@onb.ac.at European Business Press Editors’ Seminar Vienna, 26 March 2014 The Public Private Partnership of the Austrian National Library with Google
  2. 2. @maxkaiser www.slideshare.net/maxkaiser
  3. 3. @maxkaiser@maxkaiser
  4. 4. @maxkaiser history back to the 14th century
  5. 5. @maxkaiser@maxkaiser one of the world‘s most significant collections
  6. 6. @maxkaiser@maxkaiser Quelle: http://commons.wikimedia.org/wiki/File:A ustria_Hungary_ethnic_de.svg „legal deposit“
  7. 7. @maxkaiser@maxkaiser
  8. 8. @maxkaiser → Picture Archives and Graphics Department → Map Department → Music Department → Literary Archives → Papyri Department → Department of Planned Languages → Department of Rare Books and Manuscripts
  9. 9. @maxkaiser@maxkaiser
  10. 10. @maxkaiser → State Hall → Papyrus Museum → Globe Museum → Esperanto Museum
  11. 11. @maxkaiser@maxkaiser
  12. 12. @maxkaiser collect preserve describe make available foster research
  13. 13. @maxkaiser@maxkaiser
  14. 14. @maxkaiser@maxkaiser
  15. 15. @maxkaiser@maxkaiser
  16. 16. @maxkaiser@maxkaiser
  17. 17. @maxkaiser@maxkaiser
  18. 18. @maxkaiser@maxkaiser
  19. 19. @maxkaiser@maxkaiser
  20. 20. @maxkaiser@maxkaiser
  21. 21. @maxkaiser@maxkaiser
  22. 22. @maxkaiser
  23. 23. @maxkaiser
  24. 24. September 2012 http://www.onb.ac.at/ vision2025
  25. 25. @maxkaiser
  26. 26. @maxkaiser
  27. 27. @maxkaiser
  28. 28. @maxkaiser Vision 2025Knowledge for the world of tomorrow Our holdings are digitized We collect and sustain knowledge Access to our knowledge is simple With us, research is more faceted and effective We enrich cultural and social life
  29. 29. @maxkaiser@maxkaiser
  30. 30. @maxkaiser → substantial part of our book collections digitised → full-text search → important parts of other collections digitised → all our services are digital our holdings are digitised2025
  31. 31. @maxkaiser
  32. 32. @maxkaiser → focal point of our collection policy is digital → collect user-generated content and new digital formats → scalable system for digital long-term preservation we collect and sustain knowledge2025
  33. 33. @maxkaiser@maxkaiser
  34. 34. @maxkaiser → enrich metadata and connect with semantic web → link with external metadata (e.g. geo data) → build innovative (e.g. visual) interfaces → Open (Linked) Data access to knowledge is simple2025
  35. 35. @maxkaiser@maxkaiser
  36. 36. @maxkaiser →digital content integrated virtual research environments →tailored digital services for researchers →digital humanities →crowdsourcing with us, research is more faceted and simple2025
  37. 37. @maxkaiser@maxkaiser we enrich cultural and social life
  38. 38. @maxkaiser → digital services and reading rooms and museums → reinforce library as social space → foster user participation with our digital resources → user generated content we enrich cultural and social life2025
  39. 39. @maxkaiser@maxkaiser
  40. 40. @maxkaiser access for everyone from anywhere
  41. 41. @maxkaiser@maxkaiser
  42. 42. @maxkaiser@maxkaiser
  43. 43. @maxkaiser@maxkaiser
  44. 44. @maxkaiser@maxkaiser
  45. 45. @maxkaiser@maxkaiser
  46. 46. @maxkaiser Austrian Books Online
  47. 47. @maxkaiser Austrian Books Online www.onb.ac.at/ev/austrianbooksonline/
  48. 48. @maxkaiser digitisation of the entire historical book holdings of the Austrian National Library
  49. 49. @maxkaiser largest Austrian public private partnership in the cultural sector
  50. 50. @maxkaiser 600,000 volumes 200 Mio pages
  51. 51. @maxkaiser Google Books Digital Library Austrian National Library
  52. 52. @maxkaiser Partner Program Library Program Google Books
  53. 53. @maxkaiser 13 Libraries in Europe 5 National Libraries  Italy  Austria  The Netherlands  Czech Republic  Great Britain
  54. 54. @maxkaiser >20 Mio. books > 50% non-English ~ 75% from libraries ~ 2 Mio. books from European libraries > 3 Mio. books public domain
  55. 55. @maxkaiser
  56. 56. @maxkaiser →long duration of the cooperation →substantial investment by both partners →distribution of responsibilities and risks
  57. 57. @maxkaiser → intellectual property rights → public domain works only → non-exclusivity → ONB free to digitise material with other partners → transparency of process and agreement → public tender → detailed online FAQs
  58. 58. @maxkaiser@maxkaiser@maxkaiser → access → all files available free-of-charge for non- commercial use → access via platforms like Europeana → provision to research partners
  59. 59. @maxkaiser
  60. 60. @maxkaiser who is paying for what? http://www.bildarchivaustria.at/downl/1148453/layout/CE%2043_3.jpg
  61. 61. @maxkaiser Google: →transport →insurance →scanning →OCR →image processing →quality control →Google Books
  62. 62. @maxkaiser Austrian National Library: → provision of metadata → selection → internal logistics → conservational assessment → barcoding → metadata adjustments → data download and control → data storage & digital preservation → Digital Library
  63. 63. @maxkaiser 70+ ONB staff members 20+ exclusively for project → book logistics → metadata adaptation → cataloguing → conservation / restoration → quality control → software implementation → project management
  64. 64. @maxkaiser
  65. 65. @maxkaiser entire historical book holdings 16th–19th century
  66. 66. @maxkaiser@maxkaiser 200.000 volumes State Hall
  67. 67. @maxkaiser Quelle: http://deu.archinform.net/projekte/1073 Department of Manuscripts and Rare Books Map Department
  68. 68. @maxkaiser Department of Music
  69. 69. @maxkaiser Quelle: http://commons.wikimedia.org/wiki/File:Palais_Lobkowitz_Vienna_Oct._2006_006.jpg Theatre Museum
  70. 70. @maxkaiser Fidei Commiss Library
  71. 71. @maxkaiser Workflow
  72. 72. @maxkaiser „book flow“ „digital flow“
  73. 73. @maxkaiser book flow
  74. 74. @maxkaiser no individual selection …
  75. 75. @maxkaiser size
  76. 76. @maxkaiser size
  77. 77. @maxkaiser condition
  78. 78. @maxkaiser conservational evaluation
  79. 79. @maxkaiser value
  80. 80. @maxkaiser logistics in the State Hall
  81. 81. @maxkaiser challenges…
  82. 82. @maxkaiser challenges…
  83. 83. @maxkaiser challenges…
  84. 84. @maxkaiser logistics in the „Aurum“ Depot
  85. 85. @maxkaiser preparation for digitisation
  86. 86. @maxkaiser manipulation area …
  87. 87. @maxkaiser adaptation of metadata
  88. 88. @maxkaiser 8 minutes / volume
  89. 89. @maxkaiser 600.000 books
  90. 90. @maxkaiser 80.000 hours
  91. 91. @maxkaiser 10.256 working days
  92. 92. @maxkaiser 48,8 person years
  93. 93. @maxkaiser complex cases …
  94. 94. @maxkaiser bound-togethers …
  95. 95. @maxkaiser bound-togethers …
  96. 96. @maxkaiser bound-togethers …
  97. 97. @maxkaiser conservational protection
  98. 98. @maxkaiser conservational protection
  99. 99. @maxkaiser cataloguing the Fidei Commiss Library
  100. 100. @maxkaiser ready for digitisation …
  101. 101. @maxkaiser digitisation → scanning Center in Germany → procedures agreed → Austrian Federal Office for Monuments involved → each volume checked after return → books unavailable to users for ~ 3 months
  102. 102. @maxkaiser@maxkaiser
  103. 103. @maxkaiser book flowdigital flow
  104. 104. @maxkaiser digitisation data download book logistics quality control storage access ADOCO (Austrian Books Online Download & Control)
  105. 105. @maxkaiser quality control
  106. 106. @maxkaiser quality control →goal: automated jobs →representative samples →IT assisted discovery of error clusters →error candidates checked manually →detect systematic and critical errors
  107. 107. @maxkaiser bleedthrough non-critical
  108. 108. @maxkaiser cropping error critical!
  109. 109. @maxkaiser quality control via sampling re-processing re-download
  110. 110. @maxkaiser cropping error fixed!
  111. 111. @maxkaiser
  112. 112. @maxkaiser~215.000volumes digitised March 2013
  113. 113. @maxkaiser~68,5 Mio.pages March 2013
  114. 114. @maxkaiser 10% 13% 31% 44% 2% 16. Jh. 17. Jh. 18. Jh. 19. Jh. no year centuries…Austrian Books Online
  115. 115. @maxkaiser 3% 12% 14% 29% 33% 9% eng ita fre lat ger others languages…Austrian Books Online
  116. 116. @maxkaiser 0% 10% 20% 30% 40% 50% 60% 70% 16. Jh. 17. Jh. 18. Jh. 19. Jh. eng ita fre lat ger Austrian Books Online
  117. 117. @maxkaiser
  118. 118. @maxkaiser Catalogue / “Quick Search” full-text search ABO Book Viewer ANNO newspaper portal
  119. 119. @maxkaiser
  120. 120. @maxkaiser
  121. 121. @maxkaiser
  122. 122. @maxkaiser
  123. 123. @maxkaiser
  124. 124. @maxkaiser
  125. 125. @maxkaiser
  126. 126. @maxkaiser
  127. 127. @maxkaiser ABO Book Viewer
  128. 128. @maxkaiser outlook
  129. 129. @maxkaiser
  130. 130. @maxkaiser
  131. 131. @maxkaiser outlook → full-text: new possibilities for research →e.g. named entities search → data enrichment → linked data → new data centric research in the Humanities & Social Sciences
  132. 132. @maxkaiser critical mass of digitally available texts and (meta) data new research questions to textual material?
  133. 133. @maxkaiser Data
  134. 134. @maxkaiser ÖNB Hadoop- Cluster
  135. 135. @maxkaiser close reading distant reading interpretation / analysis / edition of individual texts analysis of Big Data textmining
  136. 136. @maxkaiser metadata digitised collections data fata data Server Server Server Server Server data processing Tool Tool Tool Tool
  137. 137. @maxkaiser thank you! max.kaiser@onb.ac.at www.onb.ac.at twitter.com/maxkaiser www.linkedin.com/in/maxkaiser plus.google.com/+maxkaiser1

×