SlideShare a Scribd company logo
1 of 22
Extrablatt: The Latest News on
Newspaper Digitisation in Europe
Clemens Neudecker @cneudecker
Staatsbibliothek zu Berlin
Preußischer Kulturbesitz
Bibliotheca Baltica | 4-5 October 2018
Contents
• Newspaper Digitisation in Germany
• Europeana Newspapers
• Digital Humanities Newspaper Research
• Newspaper Digitisation in Germany
Germany: Pilot Project & Masterplan
• DFG-funded Pilot Project (2013 – 2015)
investigating the feasibility of large-scale
digitisation of German historical newspapers
• Outcomes (unfortunately only in German):
– Master Plan & Online Guide
– Checklist in-house digitisation vs. Outsourcing
– Checklist digitisation from Microfilm vs. Original
– Guidelines for cataloguing in the ZDB
Zeitschriftendatenbank (ZDB)
• ZDB = database of serial publications
• New user interface with novel exploration
and visualization features
Germany: A National Newspaper
Portal
• Due to historical fragmentation (federalism),
already digitized newspapers in Germany are
spread across dozens of institutions, which is a
major burden for users
• Solution:
– Establish a single access point for digitized
newspapers from Germany
– Include this with the German Digital Library (DDB)
– Reuse results and findings of the pilot project,
Europeana Newspapers and others
– Starting November 1st 2018, initially for 2 years
Germany: DFG Call for Newspaper
Digitisation Projects
• DFG Call for Projects: Digitisation of historical
newspapers of the German-speaking area
– Goals:
• Significantly increase the availability of digitized German
newspapers - with a special focus on researchers needs
– Requirements:
• Proposals must be for a thematic cluster of titles
• Justification for digitisation has to be on a title level
• OCR is a must, a detailed OCR evaluation needs to be included
• Entries about planned digitisation in the ZDB catalogue
• Presentation must be compatible with DFG-Viewer
Elsewhere in Europe…
• UK: In the context of the British Library's Heritage
Made Digital programme, a major part involves
digitizing 19th century British newspapers and
publishing them open access [1]
• Sweden: All Swedish daily newspapers from 1734–
1906 will soon be available online, free for everyone
to read and download thanks to a donation from the
Arcadia research institution [2]
• Denmark: Digitization of 32m newspaper pages [3]
• Switzerland: Swiss National Library launches common
digital newspaper portal with correction facility [4]
• Europeana Newspapers
Europeana Newspapers
• Background:
– EU Project Europeana Newspapers (2012 – 2015)
– TEL Historic Newspaper Browser
– 2016: CENL decides to discontinue TEL 
– 2017: Finding a new home for 10 TB of newspaper
data…
• Europeana to the rescue!
• Problem: Europeana does not provide full-text search
• Problem: Europeana infrastructure, data model & API need
significant changes to support the newspapers
• Problem: Interdepencies with other content & services
 A lot of work!
• Digital Humanities Newspaper Research
Digital Humanities Newspaper
Research
• Strong (and growing) interest in digitized
historical newspapers from the research
community
• Diverse interests and research questions
from the humanities, social sciences,
linguistics, computer science
• Collaboration with CLARIN.EU and Europeana
Research to make newspaper data available
• Pilot project with EUDAT
Thank you for your attention!
Questions?
Clemens Neudecker @cneudecker
Staatsbibliothek zu Berlin
Preußischer Kulturbesitz
Bibliotheca Baltica | 4-5 October 2018

More Related Content

What's hot

You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?The European Library
 
Europeana Newspapers - Data, Tools & Future Plans
 Europeana Newspapers - Data, Tools & Future Plans  Europeana Newspapers - Data, Tools & Future Plans
Europeana Newspapers - Data, Tools & Future Plans cneudecker
 
Europeana Newspapers in a Nutshell
Europeana Newspapers in a NutshellEuropeana Newspapers in a Nutshell
Europeana Newspapers in a Nutshellcneudecker
 
Europeana Newspapers Aggregator Forum 2018 Berlin
Europeana Newspapers Aggregator Forum 2018 BerlinEuropeana Newspapers Aggregator Forum 2018 Berlin
Europeana Newspapers Aggregator Forum 2018 Berlincneudecker
 
Historical newspapers in the context of Digital Library of Slovenia
Historical newspapers in the context of Digital Library of SloveniaHistorical newspapers in the context of Digital Library of Slovenia
Historical newspapers in the context of Digital Library of SloveniaEuropeana Newspapers
 
Global References index to Biodiversity (GRIB), a bibliographic index of EDIT...
Global References index to Biodiversity (GRIB), a bibliographic index of EDIT...Global References index to Biodiversity (GRIB), a bibliographic index of EDIT...
Global References index to Biodiversity (GRIB), a bibliographic index of EDIT...dduin
 
02 europeana collections 1914 1918
02 europeana collections 1914 191802 europeana collections 1914 1918
02 europeana collections 1914 1918Europeana
 
Linked data in the German National Library at the OCLC IFLA round table 2013
Linked data in the German National Library at the OCLC IFLA round table 2013Linked data in the German National Library at the OCLC IFLA round table 2013
Linked data in the German National Library at the OCLC IFLA round table 2013Lars G. Svensson
 
Magnus Bognerud - Current digital collection management projects at nasjonalm...
Magnus Bognerud - Current digital collection management projects at nasjonalm...Magnus Bognerud - Current digital collection management projects at nasjonalm...
Magnus Bognerud - Current digital collection management projects at nasjonalm...lab_SNG
 
Open data and reuse: Issues and challenges for cultural institutions
Open data and reuse: Issues and challenges for cultural institutionsOpen data and reuse: Issues and challenges for cultural institutions
Open data and reuse: Issues and challenges for cultural institutionsAna Alvarez Lacambra
 
Europeana Newspapers Aggregation and Indexing Plan
Europeana Newspapers Aggregation and Indexing PlanEuropeana Newspapers Aggregation and Indexing Plan
Europeana Newspapers Aggregation and Indexing PlanThe European Library
 
Digitisation and Digital Humanities - what is the role of Libraries?
Digitisation and Digital Humanities - what is the role of Libraries?Digitisation and Digital Humanities - what is the role of Libraries?
Digitisation and Digital Humanities - what is the role of Libraries?cneudecker
 
Jan Braeckman- Day 2, Session 1
Jan Braeckman- Day 2, Session 1Jan Braeckman- Day 2, Session 1
Jan Braeckman- Day 2, Session 1Civic Agenda EU
 
Réseaux de bibliothèques à l'ère du cloud : que partager ? comment travailler...
Réseaux de bibliothèques à l'ère du cloud : que partager ? comment travailler...Réseaux de bibliothèques à l'ère du cloud : que partager ? comment travailler...
Réseaux de bibliothèques à l'ère du cloud : que partager ? comment travailler...Michèle Furer-Benedetti
 
Multimodal Perspectives for Digitised Historical Newspapers
Multimodal Perspectives for Digitised Historical NewspapersMultimodal Perspectives for Digitised Historical Newspapers
Multimodal Perspectives for Digitised Historical Newspaperscneudecker
 
Europeana 1914 18_collections+open+house
Europeana 1914 18_collections+open+houseEuropeana 1914 18_collections+open+house
Europeana 1914 18_collections+open+houseAquiles Alencar Brayner
 
Research partnerships, user participation, extended outreach – some of ETH L...
Research partnerships, user participation, extended outreach –  some of ETH L...Research partnerships, user participation, extended outreach –  some of ETH L...
Research partnerships, user participation, extended outreach – some of ETH L...ETH-Bibliothek
 

What's hot (20)

You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?
 
Europeana Newspapers - Data, Tools & Future Plans
 Europeana Newspapers - Data, Tools & Future Plans  Europeana Newspapers - Data, Tools & Future Plans
Europeana Newspapers - Data, Tools & Future Plans
 
Europeana Newspapers in a Nutshell
Europeana Newspapers in a NutshellEuropeana Newspapers in a Nutshell
Europeana Newspapers in a Nutshell
 
Europeana Newspapers Aggregator Forum 2018 Berlin
Europeana Newspapers Aggregator Forum 2018 BerlinEuropeana Newspapers Aggregator Forum 2018 Berlin
Europeana Newspapers Aggregator Forum 2018 Berlin
 
Historical newspapers in the context of Digital Library of Slovenia
Historical newspapers in the context of Digital Library of SloveniaHistorical newspapers in the context of Digital Library of Slovenia
Historical newspapers in the context of Digital Library of Slovenia
 
Global References index to Biodiversity (GRIB), a bibliographic index of EDIT...
Global References index to Biodiversity (GRIB), a bibliographic index of EDIT...Global References index to Biodiversity (GRIB), a bibliographic index of EDIT...
Global References index to Biodiversity (GRIB), a bibliographic index of EDIT...
 
02 europeana collections 1914 1918
02 europeana collections 1914 191802 europeana collections 1914 1918
02 europeana collections 1914 1918
 
Aly
AlyAly
Aly
 
Archiving News on the Web
Archiving News on the WebArchiving News on the Web
Archiving News on the Web
 
Linked data in the German National Library at the OCLC IFLA round table 2013
Linked data in the German National Library at the OCLC IFLA round table 2013Linked data in the German National Library at the OCLC IFLA round table 2013
Linked data in the German National Library at the OCLC IFLA round table 2013
 
Magnus Bognerud - Current digital collection management projects at nasjonalm...
Magnus Bognerud - Current digital collection management projects at nasjonalm...Magnus Bognerud - Current digital collection management projects at nasjonalm...
Magnus Bognerud - Current digital collection management projects at nasjonalm...
 
Open data and reuse: Issues and challenges for cultural institutions
Open data and reuse: Issues and challenges for cultural institutionsOpen data and reuse: Issues and challenges for cultural institutions
Open data and reuse: Issues and challenges for cultural institutions
 
Europeana Newspapers Aggregation and Indexing Plan
Europeana Newspapers Aggregation and Indexing PlanEuropeana Newspapers Aggregation and Indexing Plan
Europeana Newspapers Aggregation and Indexing Plan
 
Digitisation and Digital Humanities - what is the role of Libraries?
Digitisation and Digital Humanities - what is the role of Libraries?Digitisation and Digital Humanities - what is the role of Libraries?
Digitisation and Digital Humanities - what is the role of Libraries?
 
Jan Braeckman- Day 2, Session 1
Jan Braeckman- Day 2, Session 1Jan Braeckman- Day 2, Session 1
Jan Braeckman- Day 2, Session 1
 
Réseaux de bibliothèques à l'ère du cloud : que partager ? comment travailler...
Réseaux de bibliothèques à l'ère du cloud : que partager ? comment travailler...Réseaux de bibliothèques à l'ère du cloud : que partager ? comment travailler...
Réseaux de bibliothèques à l'ère du cloud : que partager ? comment travailler...
 
Multimodal Perspectives for Digitised Historical Newspapers
Multimodal Perspectives for Digitised Historical NewspapersMultimodal Perspectives for Digitised Historical Newspapers
Multimodal Perspectives for Digitised Historical Newspapers
 
Europeana 1914 18_collections+open+house
Europeana 1914 18_collections+open+houseEuropeana 1914 18_collections+open+house
Europeana 1914 18_collections+open+house
 
Digital Research Support by Stella Wisdom
Digital Research Support by Stella WisdomDigital Research Support by Stella Wisdom
Digital Research Support by Stella Wisdom
 
Research partnerships, user participation, extended outreach – some of ETH L...
Research partnerships, user participation, extended outreach –  some of ETH L...Research partnerships, user participation, extended outreach –  some of ETH L...
Research partnerships, user participation, extended outreach – some of ETH L...
 

Similar to Extrablatt: The Latest News on Newspaper Digitisation in Europe

The European(a) Newspapers Project
The European(a) Newspapers ProjectThe European(a) Newspapers Project
The European(a) Newspapers ProjectEuropeana Newspapers
 
GI2012 pekarek-liber
GI2012 pekarek-liberGI2012 pekarek-liber
GI2012 pekarek-liberIGN Vorstand
 
The Europeana Newspapers Presentation - Cyberspace 2012
The Europeana Newspapers Presentation - Cyberspace 2012The Europeana Newspapers Presentation - Cyberspace 2012
The Europeana Newspapers Presentation - Cyberspace 2012Europeana Newspapers
 
The Europeana Newspapers Project at IMPACT Final Event
The Europeana Newspapers Project at IMPACT Final EventThe Europeana Newspapers Project at IMPACT Final Event
The Europeana Newspapers Project at IMPACT Final EventEuropeana Newspapers
 
IMPACT Final Event 26-06-2012 - Use of IMPACT tools in the Europeana Newspap...
IMPACT Final Event 26-06-2012  - Use of IMPACT tools in the Europeana Newspap...IMPACT Final Event 26-06-2012  - Use of IMPACT tools in the Europeana Newspap...
IMPACT Final Event 26-06-2012 - Use of IMPACT tools in the Europeana Newspap...IMPACT Centre of Competence
 
ENP Belgrade Workshop Project Overview
ENP Belgrade Workshop Project OverviewENP Belgrade Workshop Project Overview
ENP Belgrade Workshop Project OverviewEuropeana Newspapers
 
LIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers ProjectLIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers ProjectEuropeana Newspapers
 
LIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers ProjectLIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers ProjectLIBER Europe
 
Realising the value of Europe's newspaper heritage
Realising the value of Europe's newspaper heritage Realising the value of Europe's newspaper heritage
Realising the value of Europe's newspaper heritage Europeana Newspapers
 
20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...
20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...
20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...Neil Beagrie
 
Ifla 2013 newspapers_kiisa_day2_15082013
Ifla 2013 newspapers_kiisa_day2_15082013Ifla 2013 newspapers_kiisa_day2_15082013
Ifla 2013 newspapers_kiisa_day2_15082013Europeana Newspapers
 
Europeana Newspapers LIBER2013 Workshop intro
Europeana Newspapers LIBER2013 Workshop introEuropeana Newspapers LIBER2013 Workshop intro
Europeana Newspapers LIBER2013 Workshop introEuropeana Newspapers
 
Europeana Libraries: the value of a library domain aggregator
Europeana Libraries: the value of a library domain aggregatorEuropeana Libraries: the value of a library domain aggregator
Europeana Libraries: the value of a library domain aggregatorLIBER Europe
 
Naple presentation danish digital library
Naple presentation danish digital libraryNaple presentation danish digital library
Naple presentation danish digital libraryJakobheide
 
Developing a national digital library stapel - meijers 20160302
Developing a national digital library   stapel - meijers 20160302Developing a national digital library   stapel - meijers 20160302
Developing a national digital library stapel - meijers 20160302Enno Meijers
 
Europeana Libraries: bringing content to the researcher
Europeana Libraries: bringing content to the researcherEuropeana Libraries: bringing content to the researcher
Europeana Libraries: bringing content to the researcherLIBER Europe
 

Similar to Extrablatt: The Latest News on Newspaper Digitisation in Europe (20)

The European(a) Newspapers Project
The European(a) Newspapers ProjectThe European(a) Newspapers Project
The European(a) Newspapers Project
 
GI2012 pekarek-liber
GI2012 pekarek-liberGI2012 pekarek-liber
GI2012 pekarek-liber
 
The Europeana Newspapers Project
The Europeana Newspapers ProjectThe Europeana Newspapers Project
The Europeana Newspapers Project
 
The Europeana Newspapers Presentation - Cyberspace 2012
The Europeana Newspapers Presentation - Cyberspace 2012The Europeana Newspapers Presentation - Cyberspace 2012
The Europeana Newspapers Presentation - Cyberspace 2012
 
The Europeana Newspapers Project at IMPACT Final Event
The Europeana Newspapers Project at IMPACT Final EventThe Europeana Newspapers Project at IMPACT Final Event
The Europeana Newspapers Project at IMPACT Final Event
 
IMPACT Final Event 26-06-2012 - Use of IMPACT tools in the Europeana Newspap...
IMPACT Final Event 26-06-2012  - Use of IMPACT tools in the Europeana Newspap...IMPACT Final Event 26-06-2012  - Use of IMPACT tools in the Europeana Newspap...
IMPACT Final Event 26-06-2012 - Use of IMPACT tools in the Europeana Newspap...
 
ENP Belgrade Workshop Project Overview
ENP Belgrade Workshop Project OverviewENP Belgrade Workshop Project Overview
ENP Belgrade Workshop Project Overview
 
LIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers ProjectLIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers Project
 
LIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers ProjectLIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers Project
 
Europeana Newspapers Project
Europeana Newspapers ProjectEuropeana Newspapers Project
Europeana Newspapers Project
 
Realising the value of Europe's newspaper heritage
Realising the value of Europe's newspaper heritage Realising the value of Europe's newspaper heritage
Realising the value of Europe's newspaper heritage
 
20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...
20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...
20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...
 
Ifla 2013 newspapers_kiisa_day2_15082013
Ifla 2013 newspapers_kiisa_day2_15082013Ifla 2013 newspapers_kiisa_day2_15082013
Ifla 2013 newspapers_kiisa_day2_15082013
 
Europeana Newspapers LIBER2013 Workshop intro
Europeana Newspapers LIBER2013 Workshop introEuropeana Newspapers LIBER2013 Workshop intro
Europeana Newspapers LIBER2013 Workshop intro
 
Europeana Libraries: the value of a library domain aggregator
Europeana Libraries: the value of a library domain aggregatorEuropeana Libraries: the value of a library domain aggregator
Europeana Libraries: the value of a library domain aggregator
 
How to Build a Digital Library
How to Build a Digital LibraryHow to Build a Digital Library
How to Build a Digital Library
 
Naple presentation danish digital library
Naple presentation danish digital libraryNaple presentation danish digital library
Naple presentation danish digital library
 
Developing a national digital library stapel - meijers 20160302
Developing a national digital library   stapel - meijers 20160302Developing a national digital library   stapel - meijers 20160302
Developing a national digital library stapel - meijers 20160302
 
Europeana Libraries: bringing content to the researcher
Europeana Libraries: bringing content to the researcherEuropeana Libraries: bringing content to the researcher
Europeana Libraries: bringing content to the researcher
 
Luca Martinelli Europeana
Luca Martinelli EuropeanaLuca Martinelli Europeana
Luca Martinelli Europeana
 

More from cneudecker

EuropeanaTech x AI: Qurator.ai @ Berlin State Library
EuropeanaTech x AI: Qurator.ai @ Berlin State LibraryEuropeanaTech x AI: Qurator.ai @ Berlin State Library
EuropeanaTech x AI: Qurator.ai @ Berlin State Librarycneudecker
 
ALTO, PAGE & Co. Formate für Volltexte
ALTO, PAGE & Co. Formate für VolltexteALTO, PAGE & Co. Formate für Volltexte
ALTO, PAGE & Co. Formate für Volltextecneudecker
 
OCR und Strukturerkennung für Zeitungen
OCR und Strukturerkennung für ZeitungenOCR und Strukturerkennung für Zeitungen
OCR und Strukturerkennung für Zeitungencneudecker
 
OCR und Strukturerkennung: Herausforderungen und Ansätze für die Zeitungsdigi...
OCR und Strukturerkennung: Herausforderungen und Ansätze für die Zeitungsdigi...OCR und Strukturerkennung: Herausforderungen und Ansätze für die Zeitungsdigi...
OCR und Strukturerkennung: Herausforderungen und Ansätze für die Zeitungsdigi...cneudecker
 
AI for digitized cultural heritage
AI for digitized cultural heritageAI for digitized cultural heritage
AI for digitized cultural heritagecneudecker
 
Kuratieren mit künstlicher Intelligenz
Kuratieren mit künstlicher IntelligenzKuratieren mit künstlicher Intelligenz
Kuratieren mit künstlicher Intelligenzcneudecker
 
Überblick zum DFG-Projekt OCR-D
Überblick zum DFG-Projekt OCR-DÜberblick zum DFG-Projekt OCR-D
Überblick zum DFG-Projekt OCR-Dcneudecker
 
The many uses of digitized newspapers
The many uses of digitized newspapersThe many uses of digitized newspapers
The many uses of digitized newspaperscneudecker
 
Digitalisate kuratieren mit KI - von unstrukturierten Daten zu strukturierten...
Digitalisate kuratieren mit KI - von unstrukturierten Daten zu strukturierten...Digitalisate kuratieren mit KI - von unstrukturierten Daten zu strukturierten...
Digitalisate kuratieren mit KI - von unstrukturierten Daten zu strukturierten...cneudecker
 
Von der Zeitungsdigitalisierung zu historischen Netzwerken - Methoden und Her...
Von der Zeitungsdigitalisierung zu historischen Netzwerken - Methoden und Her...Von der Zeitungsdigitalisierung zu historischen Netzwerken - Methoden und Her...
Von der Zeitungsdigitalisierung zu historischen Netzwerken - Methoden und Her...cneudecker
 
OCR-D: An end-to-end open source OCR framework for historical printed documents
OCR-D: An end-to-end open source OCR framework for historical printed documentsOCR-D: An end-to-end open source OCR framework for historical printed documents
OCR-D: An end-to-end open source OCR framework for historical printed documentscneudecker
 
Text and Data Mining
Text and Data MiningText and Data Mining
Text and Data Miningcneudecker
 
Formate für Volltexte
Formate für VolltexteFormate für Volltexte
Formate für Volltextecneudecker
 
Reise durch Europeana Collections in 11 Minuten
Reise durch Europeana Collections in 11 MinutenReise durch Europeana Collections in 11 Minuten
Reise durch Europeana Collections in 11 Minutencneudecker
 
lab.sbb.berlin
lab.sbb.berlinlab.sbb.berlin
lab.sbb.berlincneudecker
 
Named Entity Recognition for Europeana Newspapers
Named Entity Recognition for Europeana NewspapersNamed Entity Recognition for Europeana Newspapers
Named Entity Recognition for Europeana Newspaperscneudecker
 
Active archives @SBB
Active archives @SBBActive archives @SBB
Active archives @SBBcneudecker
 
Coding da Vinci Berlin 2017 - Europeana Newspapers
Coding da Vinci Berlin 2017 - Europeana NewspapersCoding da Vinci Berlin 2017 - Europeana Newspapers
Coding da Vinci Berlin 2017 - Europeana Newspaperscneudecker
 
Coding da Vinci Berlin 2017 - Europeana Collections 1914-1918
Coding da Vinci Berlin 2017 - Europeana Collections 1914-1918Coding da Vinci Berlin 2017 - Europeana Collections 1914-1918
Coding da Vinci Berlin 2017 - Europeana Collections 1914-1918cneudecker
 
Digitale Kuratierungstechnologien: Anwendungsfälle in Digitalen Bibliotheken
Digitale Kuratierungstechnologien: Anwendungsfälle in Digitalen BibliothekenDigitale Kuratierungstechnologien: Anwendungsfälle in Digitalen Bibliotheken
Digitale Kuratierungstechnologien: Anwendungsfälle in Digitalen Bibliothekencneudecker
 

More from cneudecker (20)

EuropeanaTech x AI: Qurator.ai @ Berlin State Library
EuropeanaTech x AI: Qurator.ai @ Berlin State LibraryEuropeanaTech x AI: Qurator.ai @ Berlin State Library
EuropeanaTech x AI: Qurator.ai @ Berlin State Library
 
ALTO, PAGE & Co. Formate für Volltexte
ALTO, PAGE & Co. Formate für VolltexteALTO, PAGE & Co. Formate für Volltexte
ALTO, PAGE & Co. Formate für Volltexte
 
OCR und Strukturerkennung für Zeitungen
OCR und Strukturerkennung für ZeitungenOCR und Strukturerkennung für Zeitungen
OCR und Strukturerkennung für Zeitungen
 
OCR und Strukturerkennung: Herausforderungen und Ansätze für die Zeitungsdigi...
OCR und Strukturerkennung: Herausforderungen und Ansätze für die Zeitungsdigi...OCR und Strukturerkennung: Herausforderungen und Ansätze für die Zeitungsdigi...
OCR und Strukturerkennung: Herausforderungen und Ansätze für die Zeitungsdigi...
 
AI for digitized cultural heritage
AI for digitized cultural heritageAI for digitized cultural heritage
AI for digitized cultural heritage
 
Kuratieren mit künstlicher Intelligenz
Kuratieren mit künstlicher IntelligenzKuratieren mit künstlicher Intelligenz
Kuratieren mit künstlicher Intelligenz
 
Überblick zum DFG-Projekt OCR-D
Überblick zum DFG-Projekt OCR-DÜberblick zum DFG-Projekt OCR-D
Überblick zum DFG-Projekt OCR-D
 
The many uses of digitized newspapers
The many uses of digitized newspapersThe many uses of digitized newspapers
The many uses of digitized newspapers
 
Digitalisate kuratieren mit KI - von unstrukturierten Daten zu strukturierten...
Digitalisate kuratieren mit KI - von unstrukturierten Daten zu strukturierten...Digitalisate kuratieren mit KI - von unstrukturierten Daten zu strukturierten...
Digitalisate kuratieren mit KI - von unstrukturierten Daten zu strukturierten...
 
Von der Zeitungsdigitalisierung zu historischen Netzwerken - Methoden und Her...
Von der Zeitungsdigitalisierung zu historischen Netzwerken - Methoden und Her...Von der Zeitungsdigitalisierung zu historischen Netzwerken - Methoden und Her...
Von der Zeitungsdigitalisierung zu historischen Netzwerken - Methoden und Her...
 
OCR-D: An end-to-end open source OCR framework for historical printed documents
OCR-D: An end-to-end open source OCR framework for historical printed documentsOCR-D: An end-to-end open source OCR framework for historical printed documents
OCR-D: An end-to-end open source OCR framework for historical printed documents
 
Text and Data Mining
Text and Data MiningText and Data Mining
Text and Data Mining
 
Formate für Volltexte
Formate für VolltexteFormate für Volltexte
Formate für Volltexte
 
Reise durch Europeana Collections in 11 Minuten
Reise durch Europeana Collections in 11 MinutenReise durch Europeana Collections in 11 Minuten
Reise durch Europeana Collections in 11 Minuten
 
lab.sbb.berlin
lab.sbb.berlinlab.sbb.berlin
lab.sbb.berlin
 
Named Entity Recognition for Europeana Newspapers
Named Entity Recognition for Europeana NewspapersNamed Entity Recognition for Europeana Newspapers
Named Entity Recognition for Europeana Newspapers
 
Active archives @SBB
Active archives @SBBActive archives @SBB
Active archives @SBB
 
Coding da Vinci Berlin 2017 - Europeana Newspapers
Coding da Vinci Berlin 2017 - Europeana NewspapersCoding da Vinci Berlin 2017 - Europeana Newspapers
Coding da Vinci Berlin 2017 - Europeana Newspapers
 
Coding da Vinci Berlin 2017 - Europeana Collections 1914-1918
Coding da Vinci Berlin 2017 - Europeana Collections 1914-1918Coding da Vinci Berlin 2017 - Europeana Collections 1914-1918
Coding da Vinci Berlin 2017 - Europeana Collections 1914-1918
 
Digitale Kuratierungstechnologien: Anwendungsfälle in Digitalen Bibliotheken
Digitale Kuratierungstechnologien: Anwendungsfälle in Digitalen BibliothekenDigitale Kuratierungstechnologien: Anwendungsfälle in Digitalen Bibliotheken
Digitale Kuratierungstechnologien: Anwendungsfälle in Digitalen Bibliotheken
 

Recently uploaded

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 

Recently uploaded (20)

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 

Extrablatt: The Latest News on Newspaper Digitisation in Europe

  • 1. Extrablatt: The Latest News on Newspaper Digitisation in Europe Clemens Neudecker @cneudecker Staatsbibliothek zu Berlin Preußischer Kulturbesitz Bibliotheca Baltica | 4-5 October 2018
  • 2. Contents • Newspaper Digitisation in Germany • Europeana Newspapers • Digital Humanities Newspaper Research
  • 4. Germany: Pilot Project & Masterplan • DFG-funded Pilot Project (2013 – 2015) investigating the feasibility of large-scale digitisation of German historical newspapers • Outcomes (unfortunately only in German): – Master Plan & Online Guide – Checklist in-house digitisation vs. Outsourcing – Checklist digitisation from Microfilm vs. Original – Guidelines for cataloguing in the ZDB
  • 5. Zeitschriftendatenbank (ZDB) • ZDB = database of serial publications • New user interface with novel exploration and visualization features
  • 6. Germany: A National Newspaper Portal • Due to historical fragmentation (federalism), already digitized newspapers in Germany are spread across dozens of institutions, which is a major burden for users • Solution: – Establish a single access point for digitized newspapers from Germany – Include this with the German Digital Library (DDB) – Reuse results and findings of the pilot project, Europeana Newspapers and others – Starting November 1st 2018, initially for 2 years
  • 7. Germany: DFG Call for Newspaper Digitisation Projects • DFG Call for Projects: Digitisation of historical newspapers of the German-speaking area – Goals: • Significantly increase the availability of digitized German newspapers - with a special focus on researchers needs – Requirements: • Proposals must be for a thematic cluster of titles • Justification for digitisation has to be on a title level • OCR is a must, a detailed OCR evaluation needs to be included • Entries about planned digitisation in the ZDB catalogue • Presentation must be compatible with DFG-Viewer
  • 8. Elsewhere in Europe… • UK: In the context of the British Library's Heritage Made Digital programme, a major part involves digitizing 19th century British newspapers and publishing them open access [1] • Sweden: All Swedish daily newspapers from 1734– 1906 will soon be available online, free for everyone to read and download thanks to a donation from the Arcadia research institution [2] • Denmark: Digitization of 32m newspaper pages [3] • Switzerland: Swiss National Library launches common digital newspaper portal with correction facility [4]
  • 10. Europeana Newspapers • Background: – EU Project Europeana Newspapers (2012 – 2015) – TEL Historic Newspaper Browser – 2016: CENL decides to discontinue TEL  – 2017: Finding a new home for 10 TB of newspaper data… • Europeana to the rescue! • Problem: Europeana does not provide full-text search • Problem: Europeana infrastructure, data model & API need significant changes to support the newspapers • Problem: Interdepencies with other content & services  A lot of work!
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16. • Digital Humanities Newspaper Research
  • 17. Digital Humanities Newspaper Research • Strong (and growing) interest in digitized historical newspapers from the research community • Diverse interests and research questions from the humanities, social sciences, linguistics, computer science • Collaboration with CLARIN.EU and Europeana Research to make newspaper data available • Pilot project with EUDAT
  • 18.
  • 19.
  • 20.
  • 21.
  • 22. Thank you for your attention! Questions? Clemens Neudecker @cneudecker Staatsbibliothek zu Berlin Preußischer Kulturbesitz Bibliotheca Baltica | 4-5 October 2018