Ethical stalking by Mark Williams. UpliftLive 2024
Building Bridges: from Europeana Libraries to Europeana Newspapers
1. Building Bridges: from Europeana
Libraries to Europeana Newspapers
Susan Reilly, LIBER
Twitter: @skreilly
IFLA Newspapers/GENLOC, Helsinki, 13th Aug 2012
2. Overview
About LIBER
Introduction to Europeana Newspapers
The foundation stone: Europeana Libraries
This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp 2
3. LIBER & the European Digital Agenda
Association of European Research Libraries
Our projects:
Content
Europeana Libraries
Europeana Newspapers
Policy
MEDOANET
Infrastructure
APARSEN
AAA Study
ODE
This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp
4. Europeana Newspapers
• 17 partner institutions
• 3 years (2012-2015)
• Aggregation of more than 18 million newspapers
• Will use refinement methods for OCR, OLR (article
segmentation), and named entity (NER) and class
recognition
• Suvey existing collections in Europe
• Make content accessible
This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp
5. Why newspapers?
“The museum (and the
newspaper) today seeks
whatever represents normal life
in its own native locality and
with infinite pains its collections
are arranged in a manner which
is natural to them in their own
habitat”
Lucy Maynard Salmon (1976) in The Newspaper and the Historian
This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp
6. Europeana Newspapers: where the content
comes from…
We are looking for
more libraries! NL E
LIBER
NLF
SUB HH
NLL
CCS
USAL
NLP
BL KB SBB ONB
UIBK NLT
BnF
UB
LFT
This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp
7. What we do with the content
• Select 10 million items to be OCR’d
• Structural information by UKIB e.g. headings, table of contents
• Select 2 million items for OCR and OLR
• Article segmentation and page class recognition by CCS
• Libraries carry out manual correction of recognition and
segmentation results
• Named entity recognition applied to English, Dutch and
German material
This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp
8. Making the content accessible
• OCR enables full text searching
• OLR enables more targeted searching (titles and sections)
• NER enables searching by people, place,and the discover of
new relationships between entities
This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp
9. No access without aggregation
• Europeana Libraries
• A single library domain aggregator
• Content from European research libraries
• Full-text search capabilities
• Portal for researchers
Access = Sustainability
Access = Visibility
This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp
11. Thank you for your attention!
http://www.libereurope.eu
http://www.europeana-newspapers.eu/
http://www.europeana-libraries.eu/
Hall 4/5, stand H104
Editor's Notes
Before we get in to the drivers and barriers for data sharing I would like to ‘share’ 2 things about me with you.. First of all, I am a librarian. I work as project officer for LIBER, which is the Association of European Research Libraries. We have 380 member libraries from all over Europe. Our projects really focus on developing the role of the library as part of the Europeana Research Infrastructure and they fall into 3 main categories.
To this.. How do we get from the image of the research we have built up to a dedicated pan-European research portal with content from practically all the research libraries in Europe, including bibliographic records, full text and special tools for resaercher- all the things that we know that researchers want. Well of course I’m going to say though partnership, through enabling national, university and other research libraries to work together to build this service and provide research content in a sustainable mannor. Which is what the Europeana Libraries project sets out to do…