SlideShare a Scribd company logo
The Europeana Newspapers
Project
IMPACT Final Event
Den Haag, 26-06-2012
Lotte Wilms
Europeana Newspapers
Why newspapers?
  • Important source of information for researchers
  • Relevant for general public

Europeana Newspapers:
  • Aims at the aggregation and refinement of newspapers for The European
    Library and Europeana.
  • Will use refinement methods for OCR, OLR (article segmentation), and named
    entity (NER) and class recognition
  • The libraries participating in the project will provide around 18 million digitised
    newspaper pages to Europeana
  • More libraries will be encouraged to contribute newspapers to Europeana and
    TEL by the project
  • Builds on work from IMPACT


                                                                                      2
Project Profile: Consortium & stakeholders

• 17 partners from 12 countries within the consortium
    • National libraries
    • University libraries
    • SME

• External partners and stakeholders:
    • Involvement of libraries outside the project consortium

• Framework:
    • Funded as a Best Practice Network in the ICT-PSP program of the
      European Commission
    • Project Duration: February 2012 – January 2015

                                                                        3
Europeana Newspapers Consortium


                                    NL E                       NLF
                   LIBER
       TEL
                              SUB HH
                                                         NLL
                                        CCS
USAL
                                                   NLP

       BL                         SBB
                      KB                   ONB

                                                                 NLT
                           UIBK
             BnF

                                              UB
                             LFT
Project Profile: Objectives
1) Selection, Refinement & Aggregation of content
   • Provision of more than 18 million newspaper pages to Europeana,
     many of those with full-text
   • Support move from images to texts in Europeana

2) Analysis of existing newspaper collections
   • Survey of newspaper holdings in Europe

3) Quality Assurance & Best practice recommendations
   • Contribute to optimised workflows
   • Provide best practice recommendations for digitisation, refinement,
     workflows, metadata etc.

4) Presentation and full-text search
   • Improve access to newspaper collections within Europeana

                                                                           5
1) Selection, Refinement & Aggregation of content

• Aggregation of 18 million pages of digitised
  newspapers to Europeana and to The
  European Library
    • 8 million pages “as is” (content providers)
    • 8 million refined pages: OCR (UIBK,
      Austria)                                      www.europeana.eu/
    • 2 million refined pages: OCR/OLR (article
      segmentation) (CCS, Germany)
• Analysis of available digital newspaper
  collections and selection of subsets
  suitable for refinement

                                                    www.theeuropeanlibrary.org/


                                                                              6
1) Refinement – OCR and OLR - UIBK

• 8 million refined pages:
 OCR using ABBYY FRE10 (UIBK,
 Austria)

   • UIBK enriches the OCR with structural
     information from the Document
     Understanding Platform (FEP)
     developed within IMPACT

   • Dedicated profiles will be produced
     which are specifically tuned to the
     characteristics of newspapers to yield
     optimal results
1) Refinement – OCR and OLR - CCS

• 2 million refined pages:
 OCR/OLR (article segmentation)
 (CCS, Germany)

   • CCS produces OCR and verification of
     column recognition, zoning, article
     segmentation, and page class
     recognition

   • CCS provides libraries with a client
     technology for manual correction of
     recognition and segmentation results

   • OCRing done with ABBYY FRE10,
     which includes improvements developed
                                             CCS: Column recognition, article segmentation
     within IMPACT
1) Refinement - Named Entity Recognition

• KB provides named entities recognition (NER) for material from up to
 three languages (Dutch, English, and German)
   • Pilot planned for second half of 2012




            Image by Frank Landsbergen (INL)
2) Analysis of existing digitised newspaper collections


• Project partners and others are contacted to provide input until 31 July
  2012 to analyse the extent of digitised newspapers collections at their
  institutions
        • Results will be embedded in “Zeitschriftendatenbank” of
          Staatsbibliothek zu Berlin (Union Catalogue of Serials)
        • Potential new partners for the extension of the network will be
          suggested by survey
• Also useful to ascertain the technical status of digitised data


If you have a digital newspaper collection and would like to participate in
the survey  please go to: http://www.surveymonkey.com/s/BQ28579
3) Quality Assurance & Best practice recommendations


• The digitisation workflow for newspapers, including
 refinement, will be evaluation through an evaluation and
 quality assessment framework, containing tools developed
 in IMPACT
   • Document Management System
   • Ground truth production tool Aletheia
   • Evaluation tools


• Provide recommendations on best
  practices for digitisation and
  refinement of newspapers
3) Quality Assurance & Best practice recommendations


• Analysis of metadata formats in use by libraries in
 digitisation projects


• Align metadata models with the METS/ALTO
 standard


• Release best practice recommendation on how to
 apply these formats in newspaper digitisation and
 refinement


• Supports content browser
4) Presentation & Access to full-text

• Within the lifetime of the project, a content browser
 will be built within TEL portal so that users can …
  • Search full text, e.g.
     •   by search term,
     •   by named entities
     •   by collections of newspapers
     •   by date ….
  • See newspaper images
  • Be linked to relevant library sources
  • This browser will be built in TEL during the project;
    and exported to Europeana after the project
5) Dissemination

• Objectives:
   • Establishment of publicity
   • Increasing usage of Europeana
   • Awareness raising among target groups
• Tasks:
   1. Media Communication
   2. Workshops and conferences
   • Three main dissemination workshops
   • National information days
   • Network extension
   3. Exploitation



                                             14
Thank you for your attention!
http://www.europeana-newspapers.eu/

 Lotte Wilms
 Lotte.wilms@kb.nl

More Related Content

What's hot

GI2012 pekarek-liber
GI2012 pekarek-liberGI2012 pekarek-liber
GI2012 pekarek-liber
IGN Vorstand
 
Building Bridges: from Europeana Libraries to Europeana Newspapers
Building Bridges: from Europeana Libraries to Europeana NewspapersBuilding Bridges: from Europeana Libraries to Europeana Newspapers
Building Bridges: from Europeana Libraries to Europeana Newspapers
LIBER Europe
 
EuropeanaLocal: overview, progress, aggregation
EuropeanaLocal: overview, progress, aggregationEuropeanaLocal: overview, progress, aggregation
EuropeanaLocal: overview, progress, aggregation
EuropeanaLocal Project
 
What library associations can do, advocacy experiences from Germany
What library associations can do, advocacy experiences from GermanyWhat library associations can do, advocacy experiences from Germany
What library associations can do, advocacy experiences from Germany
nvbonline
 
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
The European Library
 
Ewelina Rockenbauer - WP1
Ewelina Rockenbauer - WP1Ewelina Rockenbauer - WP1
Ewelina Rockenbauer - WP1
Digitised Manuscripts to Europeana
 
Positioning libraries in the digital preservation landscape
Positioning libraries in the digital preservation landscapePositioning libraries in the digital preservation landscape
Positioning libraries in the digital preservation landscape
LIBER Europe
 
Europeana Newspapers LFT Infoday Genereux
Europeana Newspapers LFT Infoday GenereuxEuropeana Newspapers LFT Infoday Genereux
Europeana Newspapers LFT Infoday Genereux
Europeana Newspapers
 
Challenges on modeling annotations in the Europeana Sounds project
Challenges on modeling annotations in the Europeana Sounds projectChallenges on modeling annotations in the Europeana Sounds project
Challenges on modeling annotations in the Europeana Sounds project
Hugo Manguinhas
 
Barcelona oldmapsonline
Barcelona oldmapsonlineBarcelona oldmapsonline
Barcelona oldmapsonline
Petr Pridal
 
Europeana Newspapers - Data, Tools & Future Plans
 Europeana Newspapers - Data, Tools & Future Plans  Europeana Newspapers - Data, Tools & Future Plans
Europeana Newspapers - Data, Tools & Future Plans
cneudecker
 
Exploring comparative evaluation of semantic enrichment tools for cultural he...
Exploring comparative evaluation of semantic enrichment tools for cultural he...Exploring comparative evaluation of semantic enrichment tools for cultural he...
Exploring comparative evaluation of semantic enrichment tools for cultural he...
Hugo Manguinhas
 
Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02
The European Library
 
You've Digitised. What Next ?
You've Digitised. What Next ?You've Digitised. What Next ?
You've Digitised. What Next ?
TU Delft, Netherlands
 
You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?
The European Library
 
Europeana en de digitale ontsluiting van cultureel erfgoed
Europeana en de digitale ontsluiting van cultureel erfgoedEuropeana en de digitale ontsluiting van cultureel erfgoed
Europeana en de digitale ontsluiting van cultureel erfgoed
EuropeanaLocal Project
 
Linking subject labels in Cultural Heritage Metadata to MIMO vocabulary using...
Linking subject labels in Cultural Heritage Metadata to MIMO vocabulary using...Linking subject labels in Cultural Heritage Metadata to MIMO vocabulary using...
Linking subject labels in Cultural Heritage Metadata to MIMO vocabulary using...
Hugo Manguinhas
 
The Successes of Europeana Libraries
The Successes of Europeana LibrariesThe Successes of Europeana Libraries
The Successes of Europeana Libraries
The European Library
 
IFLA 2014 Europeana Newspapers Rossitza Atanassova
IFLA 2014 Europeana Newspapers Rossitza AtanassovaIFLA 2014 Europeana Newspapers Rossitza Atanassova
IFLA 2014 Europeana Newspapers Rossitza Atanassova
Europeana Newspapers
 
Global References index to Biodiversity (GRIB), a bibliographic index of EDIT...
Global References index to Biodiversity (GRIB), a bibliographic index of EDIT...Global References index to Biodiversity (GRIB), a bibliographic index of EDIT...
Global References index to Biodiversity (GRIB), a bibliographic index of EDIT...
dduin
 

What's hot (20)

GI2012 pekarek-liber
GI2012 pekarek-liberGI2012 pekarek-liber
GI2012 pekarek-liber
 
Building Bridges: from Europeana Libraries to Europeana Newspapers
Building Bridges: from Europeana Libraries to Europeana NewspapersBuilding Bridges: from Europeana Libraries to Europeana Newspapers
Building Bridges: from Europeana Libraries to Europeana Newspapers
 
EuropeanaLocal: overview, progress, aggregation
EuropeanaLocal: overview, progress, aggregationEuropeanaLocal: overview, progress, aggregation
EuropeanaLocal: overview, progress, aggregation
 
What library associations can do, advocacy experiences from Germany
What library associations can do, advocacy experiences from GermanyWhat library associations can do, advocacy experiences from Germany
What library associations can do, advocacy experiences from Germany
 
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
 
Ewelina Rockenbauer - WP1
Ewelina Rockenbauer - WP1Ewelina Rockenbauer - WP1
Ewelina Rockenbauer - WP1
 
Positioning libraries in the digital preservation landscape
Positioning libraries in the digital preservation landscapePositioning libraries in the digital preservation landscape
Positioning libraries in the digital preservation landscape
 
Europeana Newspapers LFT Infoday Genereux
Europeana Newspapers LFT Infoday GenereuxEuropeana Newspapers LFT Infoday Genereux
Europeana Newspapers LFT Infoday Genereux
 
Challenges on modeling annotations in the Europeana Sounds project
Challenges on modeling annotations in the Europeana Sounds projectChallenges on modeling annotations in the Europeana Sounds project
Challenges on modeling annotations in the Europeana Sounds project
 
Barcelona oldmapsonline
Barcelona oldmapsonlineBarcelona oldmapsonline
Barcelona oldmapsonline
 
Europeana Newspapers - Data, Tools & Future Plans
 Europeana Newspapers - Data, Tools & Future Plans  Europeana Newspapers - Data, Tools & Future Plans
Europeana Newspapers - Data, Tools & Future Plans
 
Exploring comparative evaluation of semantic enrichment tools for cultural he...
Exploring comparative evaluation of semantic enrichment tools for cultural he...Exploring comparative evaluation of semantic enrichment tools for cultural he...
Exploring comparative evaluation of semantic enrichment tools for cultural he...
 
Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02
 
You've Digitised. What Next ?
You've Digitised. What Next ?You've Digitised. What Next ?
You've Digitised. What Next ?
 
You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?
 
Europeana en de digitale ontsluiting van cultureel erfgoed
Europeana en de digitale ontsluiting van cultureel erfgoedEuropeana en de digitale ontsluiting van cultureel erfgoed
Europeana en de digitale ontsluiting van cultureel erfgoed
 
Linking subject labels in Cultural Heritage Metadata to MIMO vocabulary using...
Linking subject labels in Cultural Heritage Metadata to MIMO vocabulary using...Linking subject labels in Cultural Heritage Metadata to MIMO vocabulary using...
Linking subject labels in Cultural Heritage Metadata to MIMO vocabulary using...
 
The Successes of Europeana Libraries
The Successes of Europeana LibrariesThe Successes of Europeana Libraries
The Successes of Europeana Libraries
 
IFLA 2014 Europeana Newspapers Rossitza Atanassova
IFLA 2014 Europeana Newspapers Rossitza AtanassovaIFLA 2014 Europeana Newspapers Rossitza Atanassova
IFLA 2014 Europeana Newspapers Rossitza Atanassova
 
Global References index to Biodiversity (GRIB), a bibliographic index of EDIT...
Global References index to Biodiversity (GRIB), a bibliographic index of EDIT...Global References index to Biodiversity (GRIB), a bibliographic index of EDIT...
Global References index to Biodiversity (GRIB), a bibliographic index of EDIT...
 

Similar to The Europeana Newspapers Project at IMPACT Final Event

ENP Belgrade Workshop Project Overview
ENP Belgrade Workshop Project OverviewENP Belgrade Workshop Project Overview
ENP Belgrade Workshop Project Overview
Europeana Newspapers
 
Neudecker who-cares-about-yesterday’s-news-–-use-cases-and-requirements-for-n...
Neudecker who-cares-about-yesterday’s-news-–-use-cases-and-requirements-for-n...Neudecker who-cares-about-yesterday’s-news-–-use-cases-and-requirements-for-n...
Neudecker who-cares-about-yesterday’s-news-–-use-cases-and-requirements-for-n...
cneudecker
 
EuropeanaLocal: objectives, progress and aggregation
EuropeanaLocal: objectives, progress and aggregationEuropeanaLocal: objectives, progress and aggregation
EuropeanaLocal: objectives, progress and aggregation
EuropeanaLocal Project
 
How to Build a Digital Library
How to Build a Digital LibraryHow to Build a Digital Library
How to Build a Digital Library
Charleston Conference
 
Realising the value of Europe's newspaper heritage
Realising the value of Europe's newspaper heritage Realising the value of Europe's newspaper heritage
Realising the value of Europe's newspaper heritage
Europeana Newspapers
 
Europeana Newspaper metadata LIBER2013
Europeana Newspaper metadata LIBER2013Europeana Newspaper metadata LIBER2013
Europeana Newspaper metadata LIBER2013
Europeana Newspapers
 
Digitisation and Digital Humanities - what is the role of Libraries?
Digitisation and Digital Humanities - what is the role of Libraries?Digitisation and Digital Humanities - what is the role of Libraries?
Digitisation and Digital Humanities - what is the role of Libraries?
cneudecker
 
Representation and Absence in Digital Resources: The Case of Europeana Newspa...
Representation and Absence in Digital Resources: The Case of Europeana Newspa...Representation and Absence in Digital Resources: The Case of Europeana Newspa...
Representation and Absence in Digital Resources: The Case of Europeana Newspa...
TU Delft, Netherlands
 
in Europeana and the projects
in Europeana and the projectsin Europeana and the projects
in Europeana and the projects
EuropeanaConnect
 
Europeana Libraries: the value of a library domain aggregator
Europeana Libraries: the value of a library domain aggregatorEuropeana Libraries: the value of a library domain aggregator
Europeana Libraries: the value of a library domain aggregator
LIBER Europe
 
Data Mining Newspapers Metadata
Data Mining Newspapers MetadataData Mining Newspapers Metadata
Data Mining Newspapers Metadata
Jean-Philippe Moreux
 
Europeana Newspapers in a Nutshell
Europeana Newspapers in a NutshellEuropeana Newspapers in a Nutshell
Europeana Newspapers in a Nutshell
cneudecker
 
Europeana Newspapers LFT Infoday Muehlberger
Europeana Newspapers LFT Infoday MuehlbergerEuropeana Newspapers LFT Infoday Muehlberger
Europeana Newspapers LFT Infoday Muehlberger
Europeana Newspapers
 
Summary of Day 1
Summary of Day 1Summary of Day 1
Summary of Day 1
Europeana
 
The ABES Discovery Study
The ABES Discovery StudyThe ABES Discovery Study
The ABES Discovery Study
ABES
 
2012.03.20 ihr farquhar v03
2012.03.20 ihr   farquhar v032012.03.20 ihr   farquhar v03
2012.03.20 ihr farquhar v03
Digital History
 
ENP Belgrade WS Metadata
ENP Belgrade WS MetadataENP Belgrade WS Metadata
ENP Belgrade WS Metadata
Europeana Newspapers
 
Europeana Newspapers ICT2013 networking session
Europeana Newspapers ICT2013 networking sessionEuropeana Newspapers ICT2013 networking session
Europeana Newspapers ICT2013 networking session
Europeana Newspapers
 
Naple presentation danish digital library
Naple presentation danish digital libraryNaple presentation danish digital library
Naple presentation danish digital library
Jakobheide
 
Des nouvelles d’Europeana
Des nouvelles d’EuropeanaDes nouvelles d’Europeana
Des nouvelles d’Europeana
Douglas McCarthy
 

Similar to The Europeana Newspapers Project at IMPACT Final Event (20)

ENP Belgrade Workshop Project Overview
ENP Belgrade Workshop Project OverviewENP Belgrade Workshop Project Overview
ENP Belgrade Workshop Project Overview
 
Neudecker who-cares-about-yesterday’s-news-–-use-cases-and-requirements-for-n...
Neudecker who-cares-about-yesterday’s-news-–-use-cases-and-requirements-for-n...Neudecker who-cares-about-yesterday’s-news-–-use-cases-and-requirements-for-n...
Neudecker who-cares-about-yesterday’s-news-–-use-cases-and-requirements-for-n...
 
EuropeanaLocal: objectives, progress and aggregation
EuropeanaLocal: objectives, progress and aggregationEuropeanaLocal: objectives, progress and aggregation
EuropeanaLocal: objectives, progress and aggregation
 
How to Build a Digital Library
How to Build a Digital LibraryHow to Build a Digital Library
How to Build a Digital Library
 
Realising the value of Europe's newspaper heritage
Realising the value of Europe's newspaper heritage Realising the value of Europe's newspaper heritage
Realising the value of Europe's newspaper heritage
 
Europeana Newspaper metadata LIBER2013
Europeana Newspaper metadata LIBER2013Europeana Newspaper metadata LIBER2013
Europeana Newspaper metadata LIBER2013
 
Digitisation and Digital Humanities - what is the role of Libraries?
Digitisation and Digital Humanities - what is the role of Libraries?Digitisation and Digital Humanities - what is the role of Libraries?
Digitisation and Digital Humanities - what is the role of Libraries?
 
Representation and Absence in Digital Resources: The Case of Europeana Newspa...
Representation and Absence in Digital Resources: The Case of Europeana Newspa...Representation and Absence in Digital Resources: The Case of Europeana Newspa...
Representation and Absence in Digital Resources: The Case of Europeana Newspa...
 
in Europeana and the projects
in Europeana and the projectsin Europeana and the projects
in Europeana and the projects
 
Europeana Libraries: the value of a library domain aggregator
Europeana Libraries: the value of a library domain aggregatorEuropeana Libraries: the value of a library domain aggregator
Europeana Libraries: the value of a library domain aggregator
 
Data Mining Newspapers Metadata
Data Mining Newspapers MetadataData Mining Newspapers Metadata
Data Mining Newspapers Metadata
 
Europeana Newspapers in a Nutshell
Europeana Newspapers in a NutshellEuropeana Newspapers in a Nutshell
Europeana Newspapers in a Nutshell
 
Europeana Newspapers LFT Infoday Muehlberger
Europeana Newspapers LFT Infoday MuehlbergerEuropeana Newspapers LFT Infoday Muehlberger
Europeana Newspapers LFT Infoday Muehlberger
 
Summary of Day 1
Summary of Day 1Summary of Day 1
Summary of Day 1
 
The ABES Discovery Study
The ABES Discovery StudyThe ABES Discovery Study
The ABES Discovery Study
 
2012.03.20 ihr farquhar v03
2012.03.20 ihr   farquhar v032012.03.20 ihr   farquhar v03
2012.03.20 ihr farquhar v03
 
ENP Belgrade WS Metadata
ENP Belgrade WS MetadataENP Belgrade WS Metadata
ENP Belgrade WS Metadata
 
Europeana Newspapers ICT2013 networking session
Europeana Newspapers ICT2013 networking sessionEuropeana Newspapers ICT2013 networking session
Europeana Newspapers ICT2013 networking session
 
Naple presentation danish digital library
Naple presentation danish digital libraryNaple presentation danish digital library
Naple presentation danish digital library
 
Des nouvelles d’Europeana
Des nouvelles d’EuropeanaDes nouvelles d’Europeana
Des nouvelles d’Europeana
 

More from Europeana Newspapers

Presentation of Philippe Mezzasalma at the BnF Information Day in Paris
Presentation of Philippe Mezzasalma at the BnF Information Day in ParisPresentation of Philippe Mezzasalma at the BnF Information Day in Paris
Presentation of Philippe Mezzasalma at the BnF Information Day in Paris
Europeana Newspapers
 
Presentation of Ioannis Anagnostopoulos at BnF Information Day
Presentation of Ioannis Anagnostopoulos at BnF Information DayPresentation of Ioannis Anagnostopoulos at BnF Information Day
Presentation of Ioannis Anagnostopoulos at BnF Information Day
Europeana Newspapers
 
Presentation of Clemens Neudecker, BnF Information Day
Presentation of Clemens Neudecker, BnF Information DayPresentation of Clemens Neudecker, BnF Information Day
Presentation of Clemens Neudecker, BnF Information Day
Europeana Newspapers
 
Presentation of Hans-Jörg Lieder, BnF Information Day
Presentation of Hans-Jörg Lieder, BnF Information DayPresentation of Hans-Jörg Lieder, BnF Information Day
Presentation of Hans-Jörg Lieder, BnF Information Day
Europeana Newspapers
 
Présentation Günter Mühlberger, BnF Information Day
Présentation Günter Mühlberger, BnF Information DayPrésentation Günter Mühlberger, BnF Information Day
Présentation Günter Mühlberger, BnF Information Day
Europeana Newspapers
 
Presentation of Claus Gravenhorst, BnF Information Day
Presentation of Claus Gravenhorst, BnF Information DayPresentation of Claus Gravenhorst, BnF Information Day
Presentation of Claus Gravenhorst, BnF Information Day
Europeana Newspapers
 
Presentation of Alaa Abi Haidar at the BnF Information Day
Presentation of Alaa Abi Haidar at the BnF Information DayPresentation of Alaa Abi Haidar at the BnF Information Day
Presentation of Alaa Abi Haidar at the BnF Information Day
Europeana Newspapers
 
Europeana Newspapers Estonian Infoday Ragne Kouts
Europeana Newspapers Estonian Infoday Ragne KoutsEuropeana Newspapers Estonian Infoday Ragne Kouts
Europeana Newspapers Estonian Infoday Ragne Kouts
Europeana Newspapers
 
Europeana Newspapers Estonian Infoday Kristel Veimann
Europeana Newspapers Estonian Infoday Kristel VeimannEuropeana Newspapers Estonian Infoday Kristel Veimann
Europeana Newspapers Estonian Infoday Kristel Veimann
Europeana Newspapers
 
Europeana Newspapers Estonian Infoday Krista Kiisa
Europeana Newspapers Estonian Infoday Krista KiisaEuropeana Newspapers Estonian Infoday Krista Kiisa
Europeana Newspapers Estonian Infoday Krista Kiisa
Europeana Newspapers
 
Europeana Newspapers Estonian Infoday Krista Aru
Europeana Newspapers Estonian Infoday Krista AruEuropeana Newspapers Estonian Infoday Krista Aru
Europeana Newspapers Estonian Infoday Krista Aru
Europeana Newspapers
 
Europeana Newspapers Estonian Infoday Fred Puss
Europeana Newspapers Estonian Infoday Fred PussEuropeana Newspapers Estonian Infoday Fred Puss
Europeana Newspapers Estonian Infoday Fred Puss
Europeana Newspapers
 
Europeana Newpapers LFT Infoday Neudecker
Europeana Newpapers LFT Infoday NeudeckerEuropeana Newpapers LFT Infoday Neudecker
Europeana Newpapers LFT Infoday Neudecker
Europeana Newspapers
 
Europeana Newspapers LFT Infoday Thompson
Europeana Newspapers LFT Infoday ThompsonEuropeana Newspapers LFT Infoday Thompson
Europeana Newspapers LFT Infoday Thompson
Europeana Newspapers
 
Europeana Newspapers LFT Infoday Rossi
Europeana Newspapers LFT Infoday RossiEuropeana Newspapers LFT Infoday Rossi
Europeana Newspapers LFT Infoday Rossi
Europeana Newspapers
 
Europeana Newspapers LFT Infoday Messina
Europeana Newspapers LFT Infoday MessinaEuropeana Newspapers LFT Infoday Messina
Europeana Newspapers LFT Infoday Messina
Europeana Newspapers
 
Europeana Newspapers Infoday Marchetti
Europeana Newspapers Infoday MarchettiEuropeana Newspapers Infoday Marchetti
Europeana Newspapers Infoday Marchetti
Europeana Newspapers
 
Europeana Newspapers LFT Infoday Kempf
Europeana Newspapers LFT Infoday KempfEuropeana Newspapers LFT Infoday Kempf
Europeana Newspapers LFT Infoday Kempf
Europeana Newspapers
 
Europeana Newspapers LFT Infoday Bolioli
Europeana Newspapers LFT Infoday BolioliEuropeana Newspapers LFT Infoday Bolioli
Europeana Newspapers LFT Infoday Bolioli
Europeana Newspapers
 

More from Europeana Newspapers (20)

Presentation of Philippe Mezzasalma at the BnF Information Day in Paris
Presentation of Philippe Mezzasalma at the BnF Information Day in ParisPresentation of Philippe Mezzasalma at the BnF Information Day in Paris
Presentation of Philippe Mezzasalma at the BnF Information Day in Paris
 
Presentation of Ioannis Anagnostopoulos at BnF Information Day
Presentation of Ioannis Anagnostopoulos at BnF Information DayPresentation of Ioannis Anagnostopoulos at BnF Information Day
Presentation of Ioannis Anagnostopoulos at BnF Information Day
 
Presentation of Clemens Neudecker, BnF Information Day
Presentation of Clemens Neudecker, BnF Information DayPresentation of Clemens Neudecker, BnF Information Day
Presentation of Clemens Neudecker, BnF Information Day
 
Presentation of Hans-Jörg Lieder, BnF Information Day
Presentation of Hans-Jörg Lieder, BnF Information DayPresentation of Hans-Jörg Lieder, BnF Information Day
Presentation of Hans-Jörg Lieder, BnF Information Day
 
Présentation Günter Mühlberger, BnF Information Day
Présentation Günter Mühlberger, BnF Information DayPrésentation Günter Mühlberger, BnF Information Day
Présentation Günter Mühlberger, BnF Information Day
 
Presentation of Claus Gravenhorst, BnF Information Day
Presentation of Claus Gravenhorst, BnF Information DayPresentation of Claus Gravenhorst, BnF Information Day
Presentation of Claus Gravenhorst, BnF Information Day
 
Presentation of Alaa Abi Haidar at the BnF Information Day
Presentation of Alaa Abi Haidar at the BnF Information DayPresentation of Alaa Abi Haidar at the BnF Information Day
Presentation of Alaa Abi Haidar at the BnF Information Day
 
Europeana Newspapers Estonian Infoday Ragne Kouts
Europeana Newspapers Estonian Infoday Ragne KoutsEuropeana Newspapers Estonian Infoday Ragne Kouts
Europeana Newspapers Estonian Infoday Ragne Kouts
 
Europeana Newspapers Estonian Infoday Kristel Veimann
Europeana Newspapers Estonian Infoday Kristel VeimannEuropeana Newspapers Estonian Infoday Kristel Veimann
Europeana Newspapers Estonian Infoday Kristel Veimann
 
Europeana Newspapers Estonian Infoday Krista Kiisa
Europeana Newspapers Estonian Infoday Krista KiisaEuropeana Newspapers Estonian Infoday Krista Kiisa
Europeana Newspapers Estonian Infoday Krista Kiisa
 
Europeana Newspapers Estonian Infoday Krista Aru
Europeana Newspapers Estonian Infoday Krista AruEuropeana Newspapers Estonian Infoday Krista Aru
Europeana Newspapers Estonian Infoday Krista Aru
 
Europeana Newspapers Estonian Infoday Fred Puss
Europeana Newspapers Estonian Infoday Fred PussEuropeana Newspapers Estonian Infoday Fred Puss
Europeana Newspapers Estonian Infoday Fred Puss
 
Europeana Newpapers LFT Infoday Neudecker
Europeana Newpapers LFT Infoday NeudeckerEuropeana Newpapers LFT Infoday Neudecker
Europeana Newpapers LFT Infoday Neudecker
 
Europeana Newspapers LFT Infoday Thompson
Europeana Newspapers LFT Infoday ThompsonEuropeana Newspapers LFT Infoday Thompson
Europeana Newspapers LFT Infoday Thompson
 
Europeana Newspapers LFT Infoday Rossi
Europeana Newspapers LFT Infoday RossiEuropeana Newspapers LFT Infoday Rossi
Europeana Newspapers LFT Infoday Rossi
 
Enp lft infoday_neudecker
Enp lft infoday_neudeckerEnp lft infoday_neudecker
Enp lft infoday_neudecker
 
Europeana Newspapers LFT Infoday Messina
Europeana Newspapers LFT Infoday MessinaEuropeana Newspapers LFT Infoday Messina
Europeana Newspapers LFT Infoday Messina
 
Europeana Newspapers Infoday Marchetti
Europeana Newspapers Infoday MarchettiEuropeana Newspapers Infoday Marchetti
Europeana Newspapers Infoday Marchetti
 
Europeana Newspapers LFT Infoday Kempf
Europeana Newspapers LFT Infoday KempfEuropeana Newspapers LFT Infoday Kempf
Europeana Newspapers LFT Infoday Kempf
 
Europeana Newspapers LFT Infoday Bolioli
Europeana Newspapers LFT Infoday BolioliEuropeana Newspapers LFT Infoday Bolioli
Europeana Newspapers LFT Infoday Bolioli
 

Recently uploaded

CHAPTER-8 COMPONENTS OF COMPUTER SYSTEM CLASS 9 CBSE
CHAPTER-8 COMPONENTS OF COMPUTER SYSTEM CLASS 9 CBSECHAPTER-8 COMPONENTS OF COMPUTER SYSTEM CLASS 9 CBSE
CHAPTER-8 COMPONENTS OF COMPUTER SYSTEM CLASS 9 CBSE
kumarjarun2010
 
Implementations of Fused Deposition Modeling in real world
Implementations of Fused Deposition Modeling  in real worldImplementations of Fused Deposition Modeling  in real world
Implementations of Fused Deposition Modeling in real world
Emerging Tech
 
How to Build a Profitable IoT Product.pptx
How to Build a Profitable IoT Product.pptxHow to Build a Profitable IoT Product.pptx
How to Build a Profitable IoT Product.pptx
Adam Dunkels
 
Google I/O Extended Harare Merged Slides
Google I/O Extended Harare Merged SlidesGoogle I/O Extended Harare Merged Slides
Google I/O Extended Harare Merged Slides
Google Developer Group - Harare
 
How to build a generative AI solution A step-by-step guide (2).pdf
How to build a generative AI solution A step-by-step guide (2).pdfHow to build a generative AI solution A step-by-step guide (2).pdf
How to build a generative AI solution A step-by-step guide (2).pdf
ChristopherTHyatt
 
WhatsApp Spy Online Trackers and Monitoring Apps
WhatsApp Spy Online Trackers and Monitoring AppsWhatsApp Spy Online Trackers and Monitoring Apps
WhatsApp Spy Online Trackers and Monitoring Apps
HackersList
 
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
aslasdfmkhan4750
 
EuroPython 2024 - Streamlining Testing in a Large Python Codebase
EuroPython 2024 - Streamlining Testing in a Large Python CodebaseEuroPython 2024 - Streamlining Testing in a Large Python Codebase
EuroPython 2024 - Streamlining Testing in a Large Python Codebase
Jimmy Lai
 
"Mastering Graphic Design: Essential Tips and Tricks for Beginners and Profes...
"Mastering Graphic Design: Essential Tips and Tricks for Beginners and Profes..."Mastering Graphic Design: Essential Tips and Tricks for Beginners and Profes...
"Mastering Graphic Design: Essential Tips and Tricks for Beginners and Profes...
Anant Gupta
 
Evolution of iPaaS - simplify IT workloads to provide a unified view of data...
Evolution of iPaaS - simplify IT workloads to provide a unified view of  data...Evolution of iPaaS - simplify IT workloads to provide a unified view of  data...
Evolution of iPaaS - simplify IT workloads to provide a unified view of data...
Torry Harris
 
Three New Criminal Laws in India 1 July 2024
Three New Criminal Laws in India 1 July 2024Three New Criminal Laws in India 1 July 2024
Three New Criminal Laws in India 1 July 2024
aakash malhotra
 
WhatsApp Image 2024-03-27 at 08.19.52_bfd93109.pdf
WhatsApp Image 2024-03-27 at 08.19.52_bfd93109.pdfWhatsApp Image 2024-03-27 at 08.19.52_bfd93109.pdf
WhatsApp Image 2024-03-27 at 08.19.52_bfd93109.pdf
ArgaBisma
 
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdfAcumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
BrainSell Technologies
 
How RPA Help in the Transportation and Logistics Industry.pptx
How RPA Help in the Transportation and Logistics Industry.pptxHow RPA Help in the Transportation and Logistics Industry.pptx
How RPA Help in the Transportation and Logistics Industry.pptx
SynapseIndia
 
Recent Advancements in the NIST-JARVIS Infrastructure
Recent Advancements in the NIST-JARVIS InfrastructureRecent Advancements in the NIST-JARVIS Infrastructure
Recent Advancements in the NIST-JARVIS Infrastructure
KAMAL CHOUDHARY
 
Use Cases & Benefits of RPA in Manufacturing in 2024.pptx
Use Cases & Benefits of RPA in Manufacturing in 2024.pptxUse Cases & Benefits of RPA in Manufacturing in 2024.pptx
Use Cases & Benefits of RPA in Manufacturing in 2024.pptx
SynapseIndia
 
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and OllamaTirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Zilliz
 
Active Inference is a veryyyyyyyyyyyyyyyyyyyyyyyy
Active Inference is a veryyyyyyyyyyyyyyyyyyyyyyyyActive Inference is a veryyyyyyyyyyyyyyyyyyyyyyyy
Active Inference is a veryyyyyyyyyyyyyyyyyyyyyyyy
RaminGhanbari2
 
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdfBT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
Neo4j
 
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
Priyanka Aash
 

Recently uploaded (20)

CHAPTER-8 COMPONENTS OF COMPUTER SYSTEM CLASS 9 CBSE
CHAPTER-8 COMPONENTS OF COMPUTER SYSTEM CLASS 9 CBSECHAPTER-8 COMPONENTS OF COMPUTER SYSTEM CLASS 9 CBSE
CHAPTER-8 COMPONENTS OF COMPUTER SYSTEM CLASS 9 CBSE
 
Implementations of Fused Deposition Modeling in real world
Implementations of Fused Deposition Modeling  in real worldImplementations of Fused Deposition Modeling  in real world
Implementations of Fused Deposition Modeling in real world
 
How to Build a Profitable IoT Product.pptx
How to Build a Profitable IoT Product.pptxHow to Build a Profitable IoT Product.pptx
How to Build a Profitable IoT Product.pptx
 
Google I/O Extended Harare Merged Slides
Google I/O Extended Harare Merged SlidesGoogle I/O Extended Harare Merged Slides
Google I/O Extended Harare Merged Slides
 
How to build a generative AI solution A step-by-step guide (2).pdf
How to build a generative AI solution A step-by-step guide (2).pdfHow to build a generative AI solution A step-by-step guide (2).pdf
How to build a generative AI solution A step-by-step guide (2).pdf
 
WhatsApp Spy Online Trackers and Monitoring Apps
WhatsApp Spy Online Trackers and Monitoring AppsWhatsApp Spy Online Trackers and Monitoring Apps
WhatsApp Spy Online Trackers and Monitoring Apps
 
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
 
EuroPython 2024 - Streamlining Testing in a Large Python Codebase
EuroPython 2024 - Streamlining Testing in a Large Python CodebaseEuroPython 2024 - Streamlining Testing in a Large Python Codebase
EuroPython 2024 - Streamlining Testing in a Large Python Codebase
 
"Mastering Graphic Design: Essential Tips and Tricks for Beginners and Profes...
"Mastering Graphic Design: Essential Tips and Tricks for Beginners and Profes..."Mastering Graphic Design: Essential Tips and Tricks for Beginners and Profes...
"Mastering Graphic Design: Essential Tips and Tricks for Beginners and Profes...
 
Evolution of iPaaS - simplify IT workloads to provide a unified view of data...
Evolution of iPaaS - simplify IT workloads to provide a unified view of  data...Evolution of iPaaS - simplify IT workloads to provide a unified view of  data...
Evolution of iPaaS - simplify IT workloads to provide a unified view of data...
 
Three New Criminal Laws in India 1 July 2024
Three New Criminal Laws in India 1 July 2024Three New Criminal Laws in India 1 July 2024
Three New Criminal Laws in India 1 July 2024
 
WhatsApp Image 2024-03-27 at 08.19.52_bfd93109.pdf
WhatsApp Image 2024-03-27 at 08.19.52_bfd93109.pdfWhatsApp Image 2024-03-27 at 08.19.52_bfd93109.pdf
WhatsApp Image 2024-03-27 at 08.19.52_bfd93109.pdf
 
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdfAcumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
 
How RPA Help in the Transportation and Logistics Industry.pptx
How RPA Help in the Transportation and Logistics Industry.pptxHow RPA Help in the Transportation and Logistics Industry.pptx
How RPA Help in the Transportation and Logistics Industry.pptx
 
Recent Advancements in the NIST-JARVIS Infrastructure
Recent Advancements in the NIST-JARVIS InfrastructureRecent Advancements in the NIST-JARVIS Infrastructure
Recent Advancements in the NIST-JARVIS Infrastructure
 
Use Cases & Benefits of RPA in Manufacturing in 2024.pptx
Use Cases & Benefits of RPA in Manufacturing in 2024.pptxUse Cases & Benefits of RPA in Manufacturing in 2024.pptx
Use Cases & Benefits of RPA in Manufacturing in 2024.pptx
 
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and OllamaTirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
 
Active Inference is a veryyyyyyyyyyyyyyyyyyyyyyyy
Active Inference is a veryyyyyyyyyyyyyyyyyyyyyyyyActive Inference is a veryyyyyyyyyyyyyyyyyyyyyyyy
Active Inference is a veryyyyyyyyyyyyyyyyyyyyyyyy
 
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdfBT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
 
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
 

The Europeana Newspapers Project at IMPACT Final Event

  • 1. The Europeana Newspapers Project IMPACT Final Event Den Haag, 26-06-2012 Lotte Wilms
  • 2. Europeana Newspapers Why newspapers? • Important source of information for researchers • Relevant for general public Europeana Newspapers: • Aims at the aggregation and refinement of newspapers for The European Library and Europeana. • Will use refinement methods for OCR, OLR (article segmentation), and named entity (NER) and class recognition • The libraries participating in the project will provide around 18 million digitised newspaper pages to Europeana • More libraries will be encouraged to contribute newspapers to Europeana and TEL by the project • Builds on work from IMPACT 2
  • 3. Project Profile: Consortium & stakeholders • 17 partners from 12 countries within the consortium • National libraries • University libraries • SME • External partners and stakeholders: • Involvement of libraries outside the project consortium • Framework: • Funded as a Best Practice Network in the ICT-PSP program of the European Commission • Project Duration: February 2012 – January 2015 3
  • 4. Europeana Newspapers Consortium NL E NLF LIBER TEL SUB HH NLL CCS USAL NLP BL SBB KB ONB NLT UIBK BnF UB LFT
  • 5. Project Profile: Objectives 1) Selection, Refinement & Aggregation of content • Provision of more than 18 million newspaper pages to Europeana, many of those with full-text • Support move from images to texts in Europeana 2) Analysis of existing newspaper collections • Survey of newspaper holdings in Europe 3) Quality Assurance & Best practice recommendations • Contribute to optimised workflows • Provide best practice recommendations for digitisation, refinement, workflows, metadata etc. 4) Presentation and full-text search • Improve access to newspaper collections within Europeana 5
  • 6. 1) Selection, Refinement & Aggregation of content • Aggregation of 18 million pages of digitised newspapers to Europeana and to The European Library • 8 million pages “as is” (content providers) • 8 million refined pages: OCR (UIBK, Austria) www.europeana.eu/ • 2 million refined pages: OCR/OLR (article segmentation) (CCS, Germany) • Analysis of available digital newspaper collections and selection of subsets suitable for refinement www.theeuropeanlibrary.org/ 6
  • 7. 1) Refinement – OCR and OLR - UIBK • 8 million refined pages: OCR using ABBYY FRE10 (UIBK, Austria) • UIBK enriches the OCR with structural information from the Document Understanding Platform (FEP) developed within IMPACT • Dedicated profiles will be produced which are specifically tuned to the characteristics of newspapers to yield optimal results
  • 8. 1) Refinement – OCR and OLR - CCS • 2 million refined pages: OCR/OLR (article segmentation) (CCS, Germany) • CCS produces OCR and verification of column recognition, zoning, article segmentation, and page class recognition • CCS provides libraries with a client technology for manual correction of recognition and segmentation results • OCRing done with ABBYY FRE10, which includes improvements developed CCS: Column recognition, article segmentation within IMPACT
  • 9. 1) Refinement - Named Entity Recognition • KB provides named entities recognition (NER) for material from up to three languages (Dutch, English, and German) • Pilot planned for second half of 2012 Image by Frank Landsbergen (INL)
  • 10. 2) Analysis of existing digitised newspaper collections • Project partners and others are contacted to provide input until 31 July 2012 to analyse the extent of digitised newspapers collections at their institutions • Results will be embedded in “Zeitschriftendatenbank” of Staatsbibliothek zu Berlin (Union Catalogue of Serials) • Potential new partners for the extension of the network will be suggested by survey • Also useful to ascertain the technical status of digitised data If you have a digital newspaper collection and would like to participate in the survey  please go to: http://www.surveymonkey.com/s/BQ28579
  • 11. 3) Quality Assurance & Best practice recommendations • The digitisation workflow for newspapers, including refinement, will be evaluation through an evaluation and quality assessment framework, containing tools developed in IMPACT • Document Management System • Ground truth production tool Aletheia • Evaluation tools • Provide recommendations on best practices for digitisation and refinement of newspapers
  • 12. 3) Quality Assurance & Best practice recommendations • Analysis of metadata formats in use by libraries in digitisation projects • Align metadata models with the METS/ALTO standard • Release best practice recommendation on how to apply these formats in newspaper digitisation and refinement • Supports content browser
  • 13. 4) Presentation & Access to full-text • Within the lifetime of the project, a content browser will be built within TEL portal so that users can … • Search full text, e.g. • by search term, • by named entities • by collections of newspapers • by date …. • See newspaper images • Be linked to relevant library sources • This browser will be built in TEL during the project; and exported to Europeana after the project
  • 14. 5) Dissemination • Objectives: • Establishment of publicity • Increasing usage of Europeana • Awareness raising among target groups • Tasks: 1. Media Communication 2. Workshops and conferences • Three main dissemination workshops • National information days • Network extension 3. Exploitation 14
  • 15. Thank you for your attention! http://www.europeana-newspapers.eu/ Lotte Wilms Lotte.wilms@kb.nl