This document summarizes a workshop on the Europeana Newspapers Project. The project aims to digitize 18 million newspaper pages from 18 partners in 12 European countries. It will refine optical character recognition (OCR) and other metadata for 10 million pages and article segmentation for 2 million pages. The goals are to spread best practices for newspaper digitization, aggregate content for Europeana and The European Library, and encourage more libraries to contribute newspaper content to Europeana. Future work includes processing more content, addressing copyright issues for 20th century papers, and improving accessibility through full text search.
1. Europeana Newspapers Project
Workshop on Refinement and Quality Assessment
University Library "Svetozar Marković"
Belgrade, 13 June 2013
Hans-Jörg Lieder / Ulrike Kölsch
Project Coordinator
Berlin State Library, Germany
2. Content
• Project Profile
• Consortium & Stakeholders
• Aims and Objectives
• Adding value
• Where do we go from here?
This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the
Competitiveness and Innovation Framework Programme by the European Community
http://ec.europa.eu/ict_psp
3. Consortium & Stakeholders
• 18 partners from 12 countries within the consortium
- National and university libraries
- Universities
- SME
• External partners and stakeholders
- Involvement of libraries outside the project consortium via associated and network partnerships
• Framework
- Funded as a Best Practice Network in the ICT PSP programme of the European Commission
- Project duration: February 2012 – January 2015
4. Consortium Partners
01. State Library Berlin, Germany
02. National Library of the Netherlands
03. National Library of Estonia
04. National Library of Austria
05. National Library of Finland
06. State and University Library Hamburg, Germany
07. National Library of France
08. National Library of Poland
09. University of Salford
10. CCS Content Conversion Specialists GmbH
11. Stichting LIBER, Netherlands
12. National Library of Latvia
13. National Library of Turkey
14. University Library of Belgrade
15. University of Innsbruck
16. State Library Dr. Friedrich Tessmann, Italy
17. The British Library, UK
18. Europeana Foundation, Netherlands
5. Europeana Newspapers Consortium
[Map of consortium partners, labelled: NLF, SBB, ONB, NLP, BnF, NLE, SUB HH, USAL, NLL, LIBER, KB, EF, CCS, NLT, UB, UIBK, LFT, BL]
6. Associated Partners
1. National Library of Czech Republic
2. National Library of Wales
3. National and University Library Ljubljana, Slovenia
4. National Library of Portugal
5. National and University Library of Iceland
6. National Library of Spain
7. National and University Library Zagreb, Croatia
8. National Library of Belgium
9. St. Cyril and Methodius National Library, Bulgaria
10. National Library of Luxembourg
11. Lucian Blaga Central University Library, Romania
Since April 2013 the project has had eleven associated partners and has started
intensive networking with further libraries.
7. Europeana Newspapers: Aims and Objectives
• Refinement methods for OCR, OLR (article segmentation), Named Entity Recognition (NER) and class recognition
Creation of 18 million pages of digitised newspapers
- 10 million refined pages: OCR (UIBK, Austria)
- 2 million refined pages: OCR/OLR (article segmentation) (CCS, Germany)
Delivery of 8 million pages already available locally
• Quality evaluation and prediction tools
• Aggregation and refinement of newspapers for The European Library and Europeana
• Metadata: best practice recommendations for
- Creation of OCR-ready images
- Full texts and associated metadata
- NER
• Dissemination: further libraries are encouraged and supported in contributing newspaper content to Europeana
8. Value: Europeana Newspapers spreads best practice
Europeana Newspapers supports the creation of a larger window
into European culture by:
• Developing best practice for the digitisation of newspapers
• Sharing best practice and experiences through workshops with project partners,
associated partners, and networking partners
• Publishing best practice on our website
• Holding national information days
9. Added Value: Aggregation
Activities focused on three key messages:
1. The project and its outcomes (e.g. online access to a
collection of high-quality digitised newspapers);
2. The technological challenges (e.g. techniques for refining
content and the development of a standardised metadata
model);
3. The content-related issues (e.g. improving the extent of
newspaper digitisation, the changing nature of historical
research).
The European Library
• A single library domain aggregator
• Content from major European libraries
• Dedicated newspaper content browser
• Full-text search capabilities
• Portal for researchers
10. Added Value: Scenarios
• Keyword and Phrase Search
• Image Browsing
• Access via content structure (OLR and NER results)
• Geo-location based service
• Text mining
• Crowd sourced correction and enrichment
• Access through mobile apps
• ...
11. Where are we now?
• OCR processing completed for almost four million newspaper pages
• Specification of use scenarios available
• Initial versions of evaluation tools available
• Europeana Newspapers survey report
• Development of three tools to support highly standardised data
creation, data controlling and data delivery within the project
• Metadata recommendations ready to be published in October 2013
• Specifications for content browser
• CCS has started work (OLR)
• Dissemination and information
- Established associated and networking partnerships
12. Where do we go from here?
More newspaper content
• Most libraries have digitised less than 10% of their physical
newspaper collections
More recent content
• 20th-century content is unavailable or only available under licence at
national level: need to work with publishers and rights holders
Exploit richness of European digitised newspaper collections
• OCR is not applied across the board and is often applied selectively
Improved accessibility
• Richness of content has a knock-on effect on accessibility (e.g. full-text
search)
13. Why newspapers? …and how, anyway?
"Die Zeitungen sind die Sekundenzeiger der Geschichte."
(Newspapers are the second hands of history.)
(This hand, however, is not only of inferior metal to the other hands, it also
seldom works properly.)
Arthur Schopenhauer
Relevant to all customers/citizens
Relevant to regional and European policies incl. Europeana
Newspaper holdings in public institutions are…
• … sometimes: solid and complete, beautifully bound; excellent microfilm copies
• … frequently: frail and crumbly, missing editions, incomplete supplements,
poorly bound; poor microfilm copies, legal uncertainties with contemporary
material
14. Thank you for your attention!
Contact:
hans-joerg.lieder@sbb.spk-berlin.de
ulrike.koelsch@europeana-newspapers.eu
For more information, please see www.europeana-newspapers.eu
or follow our project news via Twitter (@eurnews) and
Facebook (https://www.facebook.com/EuropeanaNewspapers)