Copyright: Olmsted County Historical Society 
Europeana Newspapers 
…in brief 
Austrian Information Day 
16 October 2014, Vienna, ÖNB 
Hans-Jörg Lieder, Staatsbibliothek zu Berlin – 
Preußischer Kulturbesitz
Figures, data, facts (ZDF: Zahlen, Daten, Fakten) 
• Europeana Newspapers – EU ICT-PSP Best Practice Network 
• Started in February 2012, ends in January 2015 
• 18 partners, 11 associated partners, 22 networking partners from 28 countries 
• Budget: € 5.16M; EC funding: € 4.12M 
• Project lead: Staatsbibliothek zu Berlin - Preußischer Kulturbesitz 
This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the 
Competitiveness and Innovation Framework Programme by the European Community 
http://ec.europa.eu/ict_psp
Europeana Newspapers in the real world 
Red = project partners 
Blue = associated partners 
Green = networking partners
From image to text 
• 8 million pages of OCR 
• 2 million pages of OLR / article segmentation 
• Named Entity Recognition (NER) in 3 languages (Dutch, German, French) 
• Metadata for >18 million pages in Europeana 
• Metadata for >28 million pages in the ZDB (plus "born digital" newspapers) 
Content 
20 languages 
approx. 950 titles 

Country      Partner  Titles  Start date (overall)  End date (overall)
France       BnF      80      1814                  1944
Germany      SBB      6       1872                  1940
Germany      SUB-HH   16      1721                  1945
Netherlands  KB       203     1618                  1900
Italy        LFT      15      1813                  1949
Estonia      NLE      43      1852                  1944
Finland      NLF      11      1900                  1910
Latvia       NLL      117     1868                  1955
Poland       NLP      118     1914                  1939
Turkey       NLT      22      1818                  1928
Austria      ONB      275     1686                  1945
Serbia       UB       45      1830                  1944
Full-text search 
Calendar navigation 
Map-based browsing 
Navigating the title list 
Display in Europeana 
Metadata 
• Europeana Newspapers METS/ALTO Profile (ENMAP) 
• Contributions to the development of ALTO v2.x, v3.0 
• Structural data capture supported by Structify 
Development of supporting software 
• Tools for standards-compliant image creation / conversion 
• Tools for estimating the quality levels to be expected 
• Tools for analysing OCR results 
• Structify 
• … 
Media, news, events 
Where we stand 
Thank you! 
hans-joerg.lieder@sbb.spk-berlin.de 
http://www.europeana-newspapers.eu 
http://www.theeuropeanlibrary.org/tel4/newspapers 
http://www.europeana.eu/

Editor's Notes

  • #5 NER is a great opportunity for libraries: authority data.
  • #6 Respects library wishes: the available content is influenced by the copyright and business-model restrictions of each contributing library: A) location of digital image, B) size of image, C) format of image. Reflects the relationship to the original physical newspaper collections: not all issues of a newspaper title will be available to TEL, or even digitised. Documents hosted by TEL will be of a different quality than those … Contextual information is vital to ensure user confidence.
  • #15 Digitisation method (microfilm or original); volumes and selection criteria; services: from full-text search to data mining, APIs, labs, etc.; legal issues; business models.