@maxkaiser
Austrian Books Online
Max Kaiser
Head of Research and Development
Austrian National Library
max.kaiser@onb.ac.a...
@maxkaiser
www.slideshare.net/maxkaiser
@maxkaiser@maxkaiser
@maxkaiser
history back to the
14th century
@maxkaiser@maxkaiser
one of the world‘s
most significant
collections
@maxkaiser@maxkaiser
Quelle:
http://commons.wikimedia.org/wiki/File:A
ustria_Hungary_ethnic_de.svg
„legal deposit“
@maxkaiser@maxkaiser
@maxkaiser
→ Picture Archives and Graphics Department
→ Map Department
→ Music Department
→ Literary Archives
→ Papyri Dep...
@maxkaiser@maxkaiser
@maxkaiser
→ State Hall
→ Papyrus Museum
→ Globe Museum
→ Esperanto Museum
@maxkaiser@maxkaiser
@maxkaiser
collect preserve
describe make
available foster
research
@maxkaiser@maxkaiser
@maxkaiser@maxkaiser
@maxkaiser@maxkaiser
@maxkaiser@maxkaiser
@maxkaiser@maxkaiser
@maxkaiser@maxkaiser
@maxkaiser@maxkaiser
@maxkaiser@maxkaiser
@maxkaiser@maxkaiser
@maxkaiser
@maxkaiser
September 2012
http://www.onb.ac.at/
vision2025
@maxkaiser
@maxkaiser
@maxkaiser
@maxkaiser
Vision 2025Knowledge for the world of tomorrow
Our holdings are digitized
We collect and sustain knowledge
Acce...
@maxkaiser@maxkaiser
@maxkaiser
→ substantial part of our book collections digitised
→ full-text search
→ important parts of other collections ...
@maxkaiser
@maxkaiser
→ focal point of our collection policy is digital
→ collect user-generated content and new digital
formats
→ sc...
@maxkaiser@maxkaiser
@maxkaiser
→ enrich metadata and connect with semantic
web
→ link with external metadata (e.g. geo data)
→ build innovativ...
@maxkaiser@maxkaiser
@maxkaiser
→digital content integrated virtual research
environments
→tailored digital services for researchers
→digital h...
@maxkaiser@maxkaiser
we enrich cultural
and social life
@maxkaiser
→ digital services and reading rooms and museums
→ reinforce library as social space
→ foster user participatio...
@maxkaiser@maxkaiser
@maxkaiser
access for everyone
from anywhere
@maxkaiser@maxkaiser
@maxkaiser@maxkaiser
@maxkaiser@maxkaiser
@maxkaiser@maxkaiser
@maxkaiser@maxkaiser
@maxkaiser
Austrian Books Online
@maxkaiser
Austrian Books Online
www.onb.ac.at/ev/austrianbooksonline/
@maxkaiser
digitisation
of the entire historical
book holdings of the
Austrian National Library
@maxkaiser
largest Austrian
public private partnership
in the cultural sector
@maxkaiser
600,000 volumes
200 Mio pages
@maxkaiser
Google Books
Digital Library
Austrian National Library
@maxkaiser
Partner Program
Library Program
Google Books
@maxkaiser
13 Libraries in Europe
5 National Libraries
 Italy
 Austria
 The Netherlands
 Czech Republic
 Great Britain
@maxkaiser
>20 Mio. books
> 50% non-English
~ 75% from libraries
~ 2 Mio. books from European libraries
> 3 Mio. books pub...
@maxkaiser
@maxkaiser
→long duration of the cooperation
→substantial investment by both partners
→distribution of responsibilities an...
@maxkaiser
→ intellectual property rights
→ public domain works only
→ non-exclusivity
→ ONB free to digitise material wit...
@maxkaiser@maxkaiser@maxkaiser
→ access
→ all files available free-of-charge for non-
commercial use
→ access via platform...
@maxkaiser
@maxkaiser
who is paying
for what?
http://www.bildarchivaustria.at/downl/1148453/layout/CE%2043_3.jpg
@maxkaiser
Google:
→transport
→insurance
→scanning
→OCR
→image processing
→quality control
→Google Books
@maxkaiser
Austrian National Library:
→ provision of metadata
→ selection
→ internal logistics
→ conservational assessment...
@maxkaiser
70+ ONB staff members
20+ exclusively for project
→ book logistics
→ metadata adaptation
→ cataloguing
→ conser...
@maxkaiser
@maxkaiser
entire historical
book holdings
16th–19th century
@maxkaiser@maxkaiser
200.000 volumes
State Hall
@maxkaiser
Quelle: http://deu.archinform.net/projekte/1073
Department of Manuscripts
and Rare Books
Map Department
@maxkaiser
Department of Music
@maxkaiser
Quelle: http://commons.wikimedia.org/wiki/File:Palais_Lobkowitz_Vienna_Oct._2006_006.jpg
Theatre Museum
@maxkaiser
Fidei Commiss Library
@maxkaiser
Workflow
@maxkaiser
„book flow“
„digital flow“
@maxkaiser
book flow
@maxkaiser
no individual selection …
@maxkaiser
size
@maxkaiser
size
@maxkaiser
condition
@maxkaiser
conservational
evaluation
@maxkaiser
value
@maxkaiser
logistics in the
State Hall
@maxkaiser
challenges…
@maxkaiser
challenges…
@maxkaiser
challenges…
@maxkaiser
logistics in the
„Aurum“ Depot
@maxkaiser
preparation for
digitisation
@maxkaiser
manipulation area …
@maxkaiser
adaptation of metadata
@maxkaiser
8 minutes / volume
@maxkaiser
600.000 books
@maxkaiser
80.000 hours
@maxkaiser
10.256 working days
@maxkaiser
48,8 person years
@maxkaiser
complex cases …
@maxkaiser
bound-togethers …
@maxkaiser
bound-togethers …
@maxkaiser
bound-togethers …
@maxkaiser
conservational protection
@maxkaiser
conservational protection
@maxkaiser
cataloguing the
Fidei Commiss Library
@maxkaiser
ready for digitisation …
@maxkaiser
digitisation
→ scanning Center in Germany
→ procedures agreed
→ Austrian Federal Office for Monuments involved
...
@maxkaiser@maxkaiser
@maxkaiser
book flowdigital flow
@maxkaiser
digitisation
data download
book logistics
quality control
storage
access
ADOCO
(Austrian Books Online
Download ...
@maxkaiser
quality control
@maxkaiser
quality control
→goal: automated jobs
→representative samples
→IT assisted discovery of error clusters
→error c...
@maxkaiser
bleedthrough
non-critical
@maxkaiser
cropping
error
critical!
@maxkaiser
quality control
via sampling
re-processing
re-download
@maxkaiser
cropping
error
fixed!
@maxkaiser
@maxkaiser~215.000volumes digitised
March 2013
@maxkaiser~68,5 Mio.pages
March 2013
@maxkaiser
10%
13%
31%
44%
2%
16. Jh.
17. Jh.
18. Jh.
19. Jh.
no year
centuries…Austrian Books
Online
@maxkaiser
3%
12%
14%
29%
33%
9%
eng
ita
fre
lat
ger
others
languages…Austrian Books
Online
@maxkaiser
0%
10%
20%
30%
40%
50%
60%
70%
16. Jh. 17. Jh. 18. Jh. 19. Jh.
eng
ita
fre
lat
ger
Austrian Books
Online
@maxkaiser
@maxkaiser
Catalogue /
“Quick Search”
full-text search
ABO
Book Viewer
ANNO
newspaper portal
@maxkaiser
@maxkaiser
@maxkaiser
@maxkaiser
@maxkaiser
@maxkaiser
@maxkaiser
@maxkaiser
@maxkaiser
ABO
Book Viewer
@maxkaiser
outlook
@maxkaiser
@maxkaiser
@maxkaiser
outlook
→ full-text: new possibilities for research
→e.g. named entities search
→ data enrichment
→ linked data...
@maxkaiser
critical mass
of digitally available texts
and (meta) data
new research questions
to textual material?
@maxkaiser
Data
@maxkaiser
ÖNB
Hadoop-
Cluster
@maxkaiser
close reading
distant reading
interpretation / analysis /
edition of individual texts
analysis of Big Data
text...
@maxkaiser
metadata
digitised
collections
data fata
data
Server
Server
Server
Server
Server
data
processing
Tool
Tool
Tool...
@maxkaiser
thank you!
max.kaiser@onb.ac.at
www.onb.ac.at
twitter.com/maxkaiser
www.linkedin.com/in/maxkaiser
plus.google.c...
Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google
Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google
Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google
Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google
Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google
Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google
Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google
Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google
Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google
Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google
Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google
Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google
Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google
Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google
Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google
Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google
Upcoming SlideShare
Loading in...5
×

Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google

280

Published on

Presentation at the European Business Press Editors’ Seminar , Vienna, 26 March 2014

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
280
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
3
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Autrian Books Online - The Public Private Partnership of the Austrian National Library with Google

  1. 1. @maxkaiser Austrian Books Online Max Kaiser Head of Research and Development Austrian National Library max.kaiser@onb.ac.at European Business Press Editors’ Seminar Vienna, 26 March 2014 The Public Private Partnership of the Austrian National Library with Google
  2. 2. @maxkaiser www.slideshare.net/maxkaiser
  3. 3. @maxkaiser@maxkaiser
  4. 4. @maxkaiser history back to the 14th century
  5. 5. @maxkaiser@maxkaiser one of the world‘s most significant collections
  6. 6. @maxkaiser@maxkaiser Quelle: http://commons.wikimedia.org/wiki/File:A ustria_Hungary_ethnic_de.svg „legal deposit“
  7. 7. @maxkaiser@maxkaiser
  8. 8. @maxkaiser → Picture Archives and Graphics Department → Map Department → Music Department → Literary Archives → Papyri Department → Department of Planned Languages → Department of Rare Books and Manuscripts
  9. 9. @maxkaiser@maxkaiser
  10. 10. @maxkaiser → State Hall → Papyrus Museum → Globe Museum → Esperanto Museum
  11. 11. @maxkaiser@maxkaiser
  12. 12. @maxkaiser collect preserve describe make available foster research
  13. 13. @maxkaiser@maxkaiser
  14. 14. @maxkaiser@maxkaiser
  15. 15. @maxkaiser@maxkaiser
  16. 16. @maxkaiser@maxkaiser
  17. 17. @maxkaiser@maxkaiser
  18. 18. @maxkaiser@maxkaiser
  19. 19. @maxkaiser@maxkaiser
  20. 20. @maxkaiser@maxkaiser
  21. 21. @maxkaiser@maxkaiser
  22. 22. @maxkaiser
  23. 23. @maxkaiser
  24. 24. September 2012 http://www.onb.ac.at/ vision2025
  25. 25. @maxkaiser
  26. 26. @maxkaiser
  27. 27. @maxkaiser
  28. 28. @maxkaiser Vision 2025Knowledge for the world of tomorrow Our holdings are digitized We collect and sustain knowledge Access to our knowledge is simple With us, research is more faceted and effective We enrich cultural and social life
  29. 29. @maxkaiser@maxkaiser
  30. 30. @maxkaiser → substantial part of our book collections digitised → full-text search → important parts of other collections digitised → all our services are digital our holdings are digitised2025
  31. 31. @maxkaiser
  32. 32. @maxkaiser → focal point of our collection policy is digital → collect user-generated content and new digital formats → scalable system for digital long-term preservation we collect and sustain knowledge2025
  33. 33. @maxkaiser@maxkaiser
  34. 34. @maxkaiser → enrich metadata and connect with semantic web → link with external metadata (e.g. geo data) → build innovative (e.g. visual) interfaces → Open (Linked) Data access to knowledge is simple2025
  35. 35. @maxkaiser@maxkaiser
  36. 36. @maxkaiser →digital content integrated virtual research environments →tailored digital services for researchers →digital humanities →crowdsourcing with us, research is more faceted and simple2025
  37. 37. @maxkaiser@maxkaiser we enrich cultural and social life
  38. 38. @maxkaiser → digital services and reading rooms and museums → reinforce library as social space → foster user participation with our digital resources → user generated content we enrich cultural and social life2025
  39. 39. @maxkaiser@maxkaiser
  40. 40. @maxkaiser access for everyone from anywhere
  41. 41. @maxkaiser@maxkaiser
  42. 42. @maxkaiser@maxkaiser
  43. 43. @maxkaiser@maxkaiser
  44. 44. @maxkaiser@maxkaiser
  45. 45. @maxkaiser@maxkaiser
  46. 46. @maxkaiser Austrian Books Online
  47. 47. @maxkaiser Austrian Books Online www.onb.ac.at/ev/austrianbooksonline/
  48. 48. @maxkaiser digitisation of the entire historical book holdings of the Austrian National Library
  49. 49. @maxkaiser largest Austrian public private partnership in the cultural sector
  50. 50. @maxkaiser 600,000 volumes 200 Mio pages
  51. 51. @maxkaiser Google Books Digital Library Austrian National Library
  52. 52. @maxkaiser Partner Program Library Program Google Books
  53. 53. @maxkaiser 13 Libraries in Europe 5 National Libraries  Italy  Austria  The Netherlands  Czech Republic  Great Britain
  54. 54. @maxkaiser >20 Mio. books > 50% non-English ~ 75% from libraries ~ 2 Mio. books from European libraries > 3 Mio. books public domain
  55. 55. @maxkaiser
  56. 56. @maxkaiser →long duration of the cooperation →substantial investment by both partners →distribution of responsibilities and risks
  57. 57. @maxkaiser → intellectual property rights → public domain works only → non-exclusivity → ONB free to digitise material with other partners → transparency of process and agreement → public tender → detailed online FAQs
  58. 58. @maxkaiser@maxkaiser@maxkaiser → access → all files available free-of-charge for non- commercial use → access via platforms like Europeana → provision to research partners
  59. 59. @maxkaiser
  60. 60. @maxkaiser who is paying for what? http://www.bildarchivaustria.at/downl/1148453/layout/CE%2043_3.jpg
  61. 61. @maxkaiser Google: →transport →insurance →scanning →OCR →image processing →quality control →Google Books
  62. 62. @maxkaiser Austrian National Library: → provision of metadata → selection → internal logistics → conservational assessment → barcoding → metadata adjustments → data download and control → data storage & digital preservation → Digital Library
  63. 63. @maxkaiser 70+ ONB staff members 20+ exclusively for project → book logistics → metadata adaptation → cataloguing → conservation / restoration → quality control → software implementation → project management
  64. 64. @maxkaiser
  65. 65. @maxkaiser entire historical book holdings 16th–19th century
  66. 66. @maxkaiser@maxkaiser 200.000 volumes State Hall
  67. 67. @maxkaiser Quelle: http://deu.archinform.net/projekte/1073 Department of Manuscripts and Rare Books Map Department
  68. 68. @maxkaiser Department of Music
  69. 69. @maxkaiser Quelle: http://commons.wikimedia.org/wiki/File:Palais_Lobkowitz_Vienna_Oct._2006_006.jpg Theatre Museum
  70. 70. @maxkaiser Fidei Commiss Library
  71. 71. @maxkaiser Workflow
  72. 72. @maxkaiser „book flow“ „digital flow“
  73. 73. @maxkaiser book flow
  74. 74. @maxkaiser no individual selection …
  75. 75. @maxkaiser size
  76. 76. @maxkaiser size
  77. 77. @maxkaiser condition
  78. 78. @maxkaiser conservational evaluation
  79. 79. @maxkaiser value
  80. 80. @maxkaiser logistics in the State Hall
  81. 81. @maxkaiser challenges…
  82. 82. @maxkaiser challenges…
  83. 83. @maxkaiser challenges…
  84. 84. @maxkaiser logistics in the „Aurum“ Depot
  85. 85. @maxkaiser preparation for digitisation
  86. 86. @maxkaiser manipulation area …
  87. 87. @maxkaiser adaptation of metadata
  88. 88. @maxkaiser 8 minutes / volume
  89. 89. @maxkaiser 600.000 books
  90. 90. @maxkaiser 80.000 hours
  91. 91. @maxkaiser 10.256 working days
  92. 92. @maxkaiser 48,8 person years
  93. 93. @maxkaiser complex cases …
  94. 94. @maxkaiser bound-togethers …
  95. 95. @maxkaiser bound-togethers …
  96. 96. @maxkaiser bound-togethers …
  97. 97. @maxkaiser conservational protection
  98. 98. @maxkaiser conservational protection
  99. 99. @maxkaiser cataloguing the Fidei Commiss Library
  100. 100. @maxkaiser ready for digitisation …
  101. 101. @maxkaiser digitisation → scanning Center in Germany → procedures agreed → Austrian Federal Office for Monuments involved → each volume checked after return → books unavailable to users for ~ 3 months
  102. 102. @maxkaiser@maxkaiser
  103. 103. @maxkaiser book flowdigital flow
  104. 104. @maxkaiser digitisation data download book logistics quality control storage access ADOCO (Austrian Books Online Download & Control)
  105. 105. @maxkaiser quality control
  106. 106. @maxkaiser quality control →goal: automated jobs →representative samples →IT assisted discovery of error clusters →error candidates checked manually →detect systematic and critical errors
  107. 107. @maxkaiser bleedthrough non-critical
  108. 108. @maxkaiser cropping error critical!
  109. 109. @maxkaiser quality control via sampling re-processing re-download
  110. 110. @maxkaiser cropping error fixed!
  111. 111. @maxkaiser
  112. 112. @maxkaiser~215.000volumes digitised March 2013
  113. 113. @maxkaiser~68,5 Mio.pages March 2013
  114. 114. @maxkaiser 10% 13% 31% 44% 2% 16. Jh. 17. Jh. 18. Jh. 19. Jh. no year centuries…Austrian Books Online
  115. 115. @maxkaiser 3% 12% 14% 29% 33% 9% eng ita fre lat ger others languages…Austrian Books Online
  116. 116. @maxkaiser 0% 10% 20% 30% 40% 50% 60% 70% 16. Jh. 17. Jh. 18. Jh. 19. Jh. eng ita fre lat ger Austrian Books Online
  117. 117. @maxkaiser
  118. 118. @maxkaiser Catalogue / “Quick Search” full-text search ABO Book Viewer ANNO newspaper portal
  119. 119. @maxkaiser
  120. 120. @maxkaiser
  121. 121. @maxkaiser
  122. 122. @maxkaiser
  123. 123. @maxkaiser
  124. 124. @maxkaiser
  125. 125. @maxkaiser
  126. 126. @maxkaiser
  127. 127. @maxkaiser ABO Book Viewer
  128. 128. @maxkaiser outlook
  129. 129. @maxkaiser
  130. 130. @maxkaiser
  131. 131. @maxkaiser outlook → full-text: new possibilities for research →e.g. named entities search → data enrichment → linked data → new data centric research in the Humanities & Social Sciences
  132. 132. @maxkaiser critical mass of digitally available texts and (meta) data new research questions to textual material?
  133. 133. @maxkaiser Data
  134. 134. @maxkaiser ÖNB Hadoop- Cluster
  135. 135. @maxkaiser close reading distant reading interpretation / analysis / edition of individual texts analysis of Big Data textmining
  136. 136. @maxkaiser metadata digitised collections data fata data Server Server Server Server Server data processing Tool Tool Tool Tool
  137. 137. @maxkaiser thank you! max.kaiser@onb.ac.at www.onb.ac.at twitter.com/maxkaiser www.linkedin.com/in/maxkaiser plus.google.com/+maxkaiser1
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×