SlideShare a Scribd company logo
Surveying Newspaper Digitisation in European
Libraries, Then Aggregating Them !
Europeana Newspapers
Alastair Dunning
Programme Manager, The European Library
@alastairdunning, alastair.dunning AT kb.nl
LIBER Conference, June 2013, Munich
This presentation is at http://www.slideshare.net/alastairdunning
On November 3, 1948,
the early edition of the
Chicago Tribune
proclaimed Thomas
Dewey as winner of the
US presidential
campaign
http://www.chicagotribune.com/news/politics/chi-histdewey_defeats_an20080104104816,0,547284.photo
In actual fact, the
campaign was won by
Harry Truman, who
became the 33rd
President of the United
States
http://en.wikipedia.org/wiki/File:Deweytruman12.jpg
Later editions of the
Chicago Tribune
corrected this mistake
with headline
"DEMOCRATS MAKE
SWEEP OF STATE
OFFICES"
However, I cannot find
these online !
http://en.wikipedia.org/wiki/File:Deweytruman12.jpg
As we shall see, presenting
comprehensive digital archives,
where everything is digitised, is
difficult... yet this is what users
often demand !
"This lack of collocation and collection
presents efficiency challenges and deepens
scholars’ concerns about
comprehensiveness. The anxiety over
“missing something” was quite common
across interviews."
Ithaka S+R, Supporting the Changing
Research Practices of Historians,
http://www.sr.ithaka.org/research-publications/supporting-changing-
research-practices-historians
"When lined up against the non-digital
object upon which it is based, the digital
object can only ever appear impoverished."
Jim Mussell, Historian at
University of Birmingham
http://jimmussell.com/2013/05/23/the-proximal-past-
digital-archives-and-the-here-and-now/
Genealogists - those studying family
history
"Genealogists represent the majority of
users in many archives. And yet, the
traditional archival information system
does not meet their needs."
Wendy M. Duff, Catherine A. Johnson, Where Is the
List with All the Names? Information-Seeking Behavior
of Genealogists, American Archivist, Volume 66(1),
2003, http://archivists.metapress.com/content/L375UJ047224737N
Despite this, European
libraries have made great
strides in digitising their
newspapers
(These results taken from first
Europeana Newspapers
survey, 2012. 47 libraries
responded.)
http://www.europeana-newspapers.eu/wp-content/uploads/2012/04/D4.1-Europeana-
newspapers-survey-report.pdf
129, 041, 663pages
from
23,987 titles
11 libraries have digitised more than 3m pages
1. National Library of Czech Republic
2. Koninklijke Bibliotheek van België
3. National Library of Spain
4. National Library of Norway
5. National and Univeristy Library of Iceland
6. BCU Lausanne
7. Hamburg State and University Library
8. Bibliothèque nationale de France
9. British Library
10. Koninklijke Bibliotheek
11. Austrian National Library
But, only 12 (26%) of the
libraries had digitised more than 10%
of their collection
(either in terms of titles or page numbers)
National Library of Luxembourg
620.000
pages digitised
4.000.000
pages in collection
National Library of Finland
620.000
pages digitised
2.010.246
pages in collection
Hamburg State and University Library
c. 2.000.000 pages digitised
c. 12.000.000 pages
in collection
What else did the survey discover ?
Access to digitised newspapers is nearly always
free of charge. At least 40 (85%)
offered free access to their digitised
newspapers.
One library had pay per view, whilst another three offered
subscription services for users (ie paid access per day or per
month).
Only four libraries licensed their newspaper contents to
other groups (e.g. school, universities).
Access to twentieth-century content remains
problematic.
27 out of 47 libraries (57%)have a cut off date
beyond which they will not publish digitised newspapers on
the web. Most frequently, this is based on a 70 year sliding
scale.
23%(11 out of 47) had an agreement with a rights
organisation so that in-copyright digitised newspapers could
be published, but often restricted to individual titles
There is still much to be done to exploit the richness
of digitised newspaper content
64%(37 from 47) of libraries made use of OCR
But only 17 of these libraries (36%) exposed the resulting
full text to the viewer
36%had undertaken zoning and segmentation and only six
libraries (13%) had included features such as facetted
browsing or extracting entities such as place or name
--> Motivation for Europeana
Newspapers
Others WPs will explain process of
improving digitised archives but I
want to return to one earlier
quote
"... the lack of comprehensive search
tools for primary sources ..."
Locating primary sources presents a
crucial challenge for reserachers.
--> TEL aggregator as part of
Europeana Newspapers project
Timetable: Early version with
limited content added to The
European Library website in
September 20
More content being added in 2013
and 2014
http://theeuropeanlibrary.org will
deliver a search interface to help
locate 18mpages digitised
at European libraires
Users will also be able to search
over titles of newspapers. Title
metadata will also be forwarded to
Europeana
Some Issues:
Copyright means that some
images cannot be shared at all,
only metadata (e.g. names and
dates of newspapers)
Some Issues:
OCR and zoning quality will affect
search results significantly. Eg
Higher quality OCR will be
returned more often in search
results
Some Issues:
Some pages have no OCR
whatsoever - more difficult to find
Some Issues:
Different libraries are willing to
share different amounts of
content
Some libraries happy for full
content to be shared; for others it
is just snippets of images
Last Thoughts and What Next ?:
The European Library will sustain access
beyond project funding; but adding more
content will require membership of TEL
How can we allow for transcription?
What do non-academic users want?
How do we create full-text APIs ?
Oh, the results here
were all based on the
first edition of the
project survey.
If your library want to
contribute to later
editions, see links by
July 2013
http://www.europeana-newspapers.eu/tell-us-about-your-newspaper-
digitisation-project/
http://www.surveymonkey.com/s/BQ28579

More Related Content

What's hot

Europeana Newspapers LIBER2013 Workshop intro
Europeana Newspapers LIBER2013 Workshop introEuropeana Newspapers LIBER2013 Workshop intro
Europeana Newspapers LIBER2013 Workshop intro
Europeana Newspapers
 
Charper.lawdi.20130531
Charper.lawdi.20130531Charper.lawdi.20130531
Charper.lawdi.20130531
charper
 

What's hot (20)

Library ad Information Science Education in Germany
Library ad Information Science Education in GermanyLibrary ad Information Science Education in Germany
Library ad Information Science Education in Germany
 
Volkswagen haus
Volkswagen hausVolkswagen haus
Volkswagen haus
 
LIBER DH Working Group Workshop: Digital Humanities Activities at Göttingen S...
LIBER DH Working Group Workshop: Digital Humanities Activities at Göttingen S...LIBER DH Working Group Workshop: Digital Humanities Activities at Göttingen S...
LIBER DH Working Group Workshop: Digital Humanities Activities at Göttingen S...
 
The Europeana Newspapers Project
The Europeana Newspapers ProjectThe Europeana Newspapers Project
The Europeana Newspapers Project
 
Sciences Po Grenoble library and Research, France
Sciences Po Grenoble library and Research, FranceSciences Po Grenoble library and Research, France
Sciences Po Grenoble library and Research, France
 
Europeana Newspapers LIBER2013 Workshop intro
Europeana Newspapers LIBER2013 Workshop introEuropeana Newspapers LIBER2013 Workshop intro
Europeana Newspapers LIBER2013 Workshop intro
 
Associations and Infrastructures of Germany's Library Community
Associations and Infrastructures of Germany's Library CommunityAssociations and Infrastructures of Germany's Library Community
Associations and Infrastructures of Germany's Library Community
 
Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020
Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020
Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020
 
Toward research data management in ktu, lithuania
Toward research data management in ktu, lithuania Toward research data management in ktu, lithuania
Toward research data management in ktu, lithuania
 
Open Cultural Data in Switzerland
Open Cultural Data in SwitzerlandOpen Cultural Data in Switzerland
Open Cultural Data in Switzerland
 
British Library Labs Presentation at Ed Tech Hackathon 2013 - hackathoncentra...
British Library Labs Presentation at Ed Tech Hackathon 2013 - hackathoncentra...British Library Labs Presentation at Ed Tech Hackathon 2013 - hackathoncentra...
British Library Labs Presentation at Ed Tech Hackathon 2013 - hackathoncentra...
 
Enriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaEnriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpedia
 
“Archäologische Informationen” and Open Journal Systems. Chances and Possibil...
“Archäologische Informationen” and Open Journal Systems. Chances and Possibil...“Archäologische Informationen” and Open Journal Systems. Chances and Possibil...
“Archäologische Informationen” and Open Journal Systems. Chances and Possibil...
 
Estermann Panel on Authority Files, 3 June 2020
Estermann Panel on Authority Files, 3 June 2020Estermann Panel on Authority Files, 3 June 2020
Estermann Panel on Authority Files, 3 June 2020
 
Charper.lawdi.20130531
Charper.lawdi.20130531Charper.lawdi.20130531
Charper.lawdi.20130531
 
Launch of Welsh Newspapers Online
Launch of Welsh Newspapers OnlineLaunch of Welsh Newspapers Online
Launch of Welsh Newspapers Online
 
Use Cases From Digital Humanities for Library Linked Data
Use Cases From Digital Humanities for Library Linked DataUse Cases From Digital Humanities for Library Linked Data
Use Cases From Digital Humanities for Library Linked Data
 
Future Directions of the European Library
Future Directions of the European LibraryFuture Directions of the European Library
Future Directions of the European Library
 
BL Labs presentation given to the Digital Scholarship Team
BL Labs presentation given to the Digital Scholarship TeamBL Labs presentation given to the Digital Scholarship Team
BL Labs presentation given to the Digital Scholarship Team
 
Quo vadis University Presses
Quo vadis University PressesQuo vadis University Presses
Quo vadis University Presses
 

Similar to Digitised historic newspapers in Europe

Europeana Newspapers: Surveying Newspaper Digitisation in European Libraries,...
Europeana Newspapers: Surveying Newspaper Digitisation in European Libraries,...Europeana Newspapers: Surveying Newspaper Digitisation in European Libraries,...
Europeana Newspapers: Surveying Newspaper Digitisation in European Libraries,...
The European Library
 
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
The European Library
 
Electronic resources in academic libraries
Electronic resources in academic librariesElectronic resources in academic libraries
Electronic resources in academic libraries
estambulcervantes
 
LIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers ProjectLIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers Project
Europeana Newspapers
 
Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02
The European Library
 
Rluk dunning-2012-130218124338-phpapp02
Rluk dunning-2012-130218124338-phpapp02Rluk dunning-2012-130218124338-phpapp02
Rluk dunning-2012-130218124338-phpapp02
The European Library
 

Similar to Digitised historic newspapers in Europe (20)

Europeana Newspapers: Surveying Newspaper Digitisation in European Libraries,...
Europeana Newspapers: Surveying Newspaper Digitisation in European Libraries,...Europeana Newspapers: Surveying Newspaper Digitisation in European Libraries,...
Europeana Newspapers: Surveying Newspaper Digitisation in European Libraries,...
 
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
 
What's up, Europeana Newspapers?
What's up, Europeana Newspapers?What's up, Europeana Newspapers?
What's up, Europeana Newspapers?
 
You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?
 
You've Digitised. What Next ?
You've Digitised. What Next ?You've Digitised. What Next ?
You've Digitised. What Next ?
 
Europeana Libraries Review
Europeana Libraries ReviewEuropeana Libraries Review
Europeana Libraries Review
 
Update on our Wikipedia activities in 2015 - National library & Archives of t...
Update on our Wikipedia activities in 2015 - National library & Archives of t...Update on our Wikipedia activities in 2015 - National library & Archives of t...
Update on our Wikipedia activities in 2015 - National library & Archives of t...
 
They have left the building: The Web Route to Library Users
They have left the building: The Web Route to Library UsersThey have left the building: The Web Route to Library Users
They have left the building: The Web Route to Library Users
 
Electronic resources in academic libraries
Electronic resources in academic librariesElectronic resources in academic libraries
Electronic resources in academic libraries
 
Quantifying the impacts of investment in humanities archives
Quantifying the impacts of investment in humanities archivesQuantifying the impacts of investment in humanities archives
Quantifying the impacts of investment in humanities archives
 
LIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers ProjectLIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers Project
 
Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02
 
Finding Primary Sources and Digital Collections on the Web
Finding Primary Sources and Digital Collections on the WebFinding Primary Sources and Digital Collections on the Web
Finding Primary Sources and Digital Collections on the Web
 
Europeana Cloud Aggregator Forum 2014
Europeana Cloud Aggregator Forum 2014Europeana Cloud Aggregator Forum 2014
Europeana Cloud Aggregator Forum 2014
 
Situation Dänemark
Situation DänemarkSituation Dänemark
Situation Dänemark
 
Opening up the archives: from basement to browser
Opening up the archives: from basement to browserOpening up the archives: from basement to browser
Opening up the archives: from basement to browser
 
Tonta World Is Flat Yet Not Open Oslo Workshop 10 May 2006 Final Revised
Tonta World Is Flat Yet Not Open Oslo Workshop 10 May 2006 Final RevisedTonta World Is Flat Yet Not Open Oslo Workshop 10 May 2006 Final Revised
Tonta World Is Flat Yet Not Open Oslo Workshop 10 May 2006 Final Revised
 
Terry Weech: Public Computing: Libraries and Volunteers
Terry Weech: Public Computing: Libraries and Volunteers Terry Weech: Public Computing: Libraries and Volunteers
Terry Weech: Public Computing: Libraries and Volunteers
 
Rluk dunning-2012-130218124338-phpapp02
Rluk dunning-2012-130218124338-phpapp02Rluk dunning-2012-130218124338-phpapp02
Rluk dunning-2012-130218124338-phpapp02
 
The European(a) Newspapers Project
The European(a) Newspapers ProjectThe European(a) Newspapers Project
The European(a) Newspapers Project
 

More from TU Delft, Netherlands

More from TU Delft, Netherlands (16)

The Landscape of Research Data Management
The Landscape of Research Data Management The Landscape of Research Data Management
The Landscape of Research Data Management
 
Winning the Tour de France, Research Data and Data Stewardship
Winning the Tour de France, Research Data and Data StewardshipWinning the Tour de France, Research Data and Data Stewardship
Winning the Tour de France, Research Data and Data Stewardship
 
Europeana and Researchers
Europeana and ResearchersEuropeana and Researchers
Europeana and Researchers
 
Introduction to eCloud
Introduction to eCloudIntroduction to eCloud
Introduction to eCloud
 
Short Presentation on Europeana Cloud at Europeana AGM 2013
Short Presentation on Europeana Cloud at Europeana AGM 2013Short Presentation on Europeana Cloud at Europeana AGM 2013
Short Presentation on Europeana Cloud at Europeana AGM 2013
 
Presentation on Europeana Cloud at Internet Librarian Conference 2013
Presentation on Europeana Cloud at Internet Librarian Conference 2013Presentation on Europeana Cloud at Internet Librarian Conference 2013
Presentation on Europeana Cloud at Internet Librarian Conference 2013
 
Challenges and Solutions in Creating a European Historic newspapers Browser
Challenges and Solutions in Creating a European Historic newspapers Browser Challenges and Solutions in Creating a European Historic newspapers Browser
Challenges and Solutions in Creating a European Historic newspapers Browser
 
Open Data from the European Library
Open Data from the European LibraryOpen Data from the European Library
Open Data from the European Library
 
Why aggregate European Historic Newspapers
Why aggregate European Historic NewspapersWhy aggregate European Historic Newspapers
Why aggregate European Historic Newspapers
 
Europeana Cloud Work Package 1: Assessing Researchers' Needs in the Cloud
Europeana Cloud Work Package 1: Assessing Researchers' Needs in the CloudEuropeana Cloud Work Package 1: Assessing Researchers' Needs in the Cloud
Europeana Cloud Work Package 1: Assessing Researchers' Needs in the Cloud
 
A general introduction to the Europeana Cloud project
A general introduction to the Europeana Cloud project A general introduction to the Europeana Cloud project
A general introduction to the Europeana Cloud project
 
Introduction to Europeana Cloud project
Introduction to Europeana Cloud projectIntroduction to Europeana Cloud project
Introduction to Europeana Cloud project
 
Presentation for Launch of Welsh Newspapers Online
Presentation for Launch of Welsh Newspapers OnlinePresentation for Launch of Welsh Newspapers Online
Presentation for Launch of Welsh Newspapers Online
 
Breaking the Waves
Breaking the WavesBreaking the Waves
Breaking the Waves
 
Presentation on The European Library
Presentation on The European LibraryPresentation on The European Library
Presentation on The European Library
 
The European Library
The European LibraryThe European Library
The European Library
 

Recently uploaded

Accounting and finance exit exam 2016 E.C.pdf
Accounting and finance exit exam 2016 E.C.pdfAccounting and finance exit exam 2016 E.C.pdf
Accounting and finance exit exam 2016 E.C.pdf
YibeltalNibretu
 
Industrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training ReportIndustrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training Report
Avinash Rai
 

Recently uploaded (20)

Basic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.pptBasic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.ppt
 
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Salient features of Environment protection Act 1986.pptx
Salient features of Environment protection Act 1986.pptxSalient features of Environment protection Act 1986.pptx
Salient features of Environment protection Act 1986.pptx
 
Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chips
 
Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
 
[GDSC YCCE] Build with AI Online Presentation
[GDSC YCCE] Build with AI Online Presentation[GDSC YCCE] Build with AI Online Presentation
[GDSC YCCE] Build with AI Online Presentation
 
B.ed spl. HI pdusu exam paper-2023-24.pdf
B.ed spl. HI pdusu exam paper-2023-24.pdfB.ed spl. HI pdusu exam paper-2023-24.pdf
B.ed spl. HI pdusu exam paper-2023-24.pdf
 
Accounting and finance exit exam 2016 E.C.pdf
Accounting and finance exit exam 2016 E.C.pdfAccounting and finance exit exam 2016 E.C.pdf
Accounting and finance exit exam 2016 E.C.pdf
 
Forest and Wildlife Resources Class 10 Free Study Material PDF
Forest and Wildlife Resources Class 10 Free Study Material PDFForest and Wildlife Resources Class 10 Free Study Material PDF
Forest and Wildlife Resources Class 10 Free Study Material PDF
 
NLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptxNLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptx
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
NCERT Solutions Power Sharing Class 10 Notes pdf
NCERT Solutions Power Sharing Class 10 Notes pdfNCERT Solutions Power Sharing Class 10 Notes pdf
NCERT Solutions Power Sharing Class 10 Notes pdf
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Basic Civil Engg Notes_Chapter-6_Environment Pollution & Engineering
Basic Civil Engg Notes_Chapter-6_Environment Pollution & EngineeringBasic Civil Engg Notes_Chapter-6_Environment Pollution & Engineering
Basic Civil Engg Notes_Chapter-6_Environment Pollution & Engineering
 
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptxJose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumers
 
Industrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training ReportIndustrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training Report
 

Digitised historic newspapers in Europe

  • 1. Surveying Newspaper Digitisation in European Libraries, Then Aggregating Them ! Europeana Newspapers Alastair Dunning Programme Manager, The European Library @alastairdunning, alastair.dunning AT kb.nl LIBER Conference, June 2013, Munich This presentation is at http://www.slideshare.net/alastairdunning
  • 2. On November 3, 1948, the early edition of the Chicago Tribune proclaimed Thomas Dewey as winner of the US presidential campaign http://www.chicagotribune.com/news/politics/chi-histdewey_defeats_an20080104104816,0,547284.photo
  • 3. In actual fact, the campaign was won by Harry Truman, who became the 33rd President of the United States http://en.wikipedia.org/wiki/File:Deweytruman12.jpg
  • 4. Later editions of the Chicago Tribune corrected this mistake with headline "DEMOCRATS MAKE SWEEP OF STATE OFFICES" However, I cannot find these online ! http://en.wikipedia.org/wiki/File:Deweytruman12.jpg
  • 5. As we shall see, presenting comprehensive digital archives, where everything is digitised, is difficult... yet this is what users often demand !
  • 6. "This lack of collocation and collection presents efficiency challenges and deepens scholars’ concerns about comprehensiveness. The anxiety over “missing something” was quite common across interviews." Ithaka S+R, Supporting the Changing Research Practices of Historians, http://www.sr.ithaka.org/research-publications/supporting-changing- research-practices-historians
  • 7. "When lined up against the non-digital object upon which it is based, the digital object can only ever appear impoverished." Jim Mussell, Historian at University of Birmingham http://jimmussell.com/2013/05/23/the-proximal-past- digital-archives-and-the-here-and-now/
  • 8. Genealogists - those studying family history "Genealogists represent the majority of users in many archives. And yet, the traditional archival information system does not meet their needs." Wendy M. Duff, Catherine A. Johnson, Where Is the List with All the Names? Information-Seeking Behavior of Genealogists, American Archivist, Volume 66(1), 2003, http://archivists.metapress.com/content/L375UJ047224737N
  • 9. Despite this, European libraries have made great strides in digitising their newspapers (These results taken from first Europeana Newspapers survey, 2012. 47 libraries responded.) http://www.europeana-newspapers.eu/wp-content/uploads/2012/04/D4.1-Europeana- newspapers-survey-report.pdf
  • 11.
  • 12. 11 libraries have digitised more than 3m pages 1. National Library of Czech Republic 2. Koninklijke Bibliotheek van België 3. National Library of Spain 4. National Library of Norway 5. National and Univeristy Library of Iceland 6. BCU Lausanne 7. Hamburg State and University Library 8. Bibliothèque nationale de France 9. British Library 10. Koninklijke Bibliotheek 11. Austrian National Library
  • 13. But, only 12 (26%) of the libraries had digitised more than 10% of their collection (either in terms of titles or page numbers)
  • 14. National Library of Luxembourg 620.000 pages digitised 4.000.000 pages in collection
  • 15. National Library of Finland 620.000 pages digitised 2.010.246 pages in collection
  • 16. Hamburg State and University Library c. 2.000.000 pages digitised c. 12.000.000 pages in collection
  • 17. What else did the survey discover ?
  • 18. Access to digitised newspapers is nearly always free of charge. At least 40 (85%) offered free access to their digitised newspapers. One library had pay per view, whilst another three offered subscription services for users (ie paid access per day or per month). Only four libraries licensed their newspaper contents to other groups (e.g. school, universities).
  • 19. Access to twentieth-century content remains problematic. 27 out of 47 libraries (57%)have a cut off date beyond which they will not publish digitised newspapers on the web. Most frequently, this is based on a 70 year sliding scale. 23%(11 out of 47) had an agreement with a rights organisation so that in-copyright digitised newspapers could be published, but often restricted to individual titles
  • 20. There is still much to be done to exploit the richness of digitised newspaper content 64%(37 from 47) of libraries made use of OCR But only 17 of these libraries (36%) exposed the resulting full text to the viewer 36%had undertaken zoning and segmentation and only six libraries (13%) had included features such as facetted browsing or extracting entities such as place or name
  • 21. --> Motivation for Europeana Newspapers Others WPs will explain process of improving digitised archives but I want to return to one earlier quote
  • 22. "... the lack of comprehensive search tools for primary sources ..." Locating primary sources presents a crucial challenge for reserachers. --> TEL aggregator as part of Europeana Newspapers project
  • 23. Timetable: Early version with limited content added to The European Library website in September 20 More content being added in 2013 and 2014
  • 24. http://theeuropeanlibrary.org will deliver a search interface to help locate 18mpages digitised at European libraires Users will also be able to search over titles of newspapers. Title metadata will also be forwarded to Europeana
  • 25. Some Issues: Copyright means that some images cannot be shared at all, only metadata (e.g. names and dates of newspapers)
  • 26. Some Issues: OCR and zoning quality will affect search results significantly. Eg Higher quality OCR will be returned more often in search results
  • 27. Some Issues: Some pages have no OCR whatsoever - more difficult to find
  • 28. Some Issues: Different libraries are willing to share different amounts of content Some libraries happy for full content to be shared; for others it is just snippets of images
  • 29. Last Thoughts and What Next ?: The European Library will sustain access beyond project funding; but adding more content will require membership of TEL How can we allow for transcription? What do non-academic users want? How do we create full-text APIs ?
  • 30. Oh, the results here were all based on the first edition of the project survey. If your library want to contribute to later editions, see links by July 2013 http://www.europeana-newspapers.eu/tell-us-about-your-newspaper- digitisation-project/ http://www.surveymonkey.com/s/BQ28579