Alastair Dunning
Europeana Newspapers
September 2013
Challenges and Solutions in
Creating a European Historic
Newspapers B...
Task $
“...
Creation of a full-text index
of newspaper content
Development of a
newspaper content
browser
Work
Package 4 -...
The European Library is
building an interface to
allow cross-searching of
historic newspapers
digitised by project
partner...
Timetable
Sep 2013 - Beta version with limited
content and functionality made
available
2014 - Ongoing inclusion of more
c...
What content
will be
included ?
Full Images, Full Text, Metadata
Latvia, Belgrade, Hamburg, Berlin,
Estonia, Finland, Neth...
Complete
Newspaper image
can be shown
Eesti Potimees ehk
Naddaleleht , 2
November 1866
(National Library of
Estonia)
What content
will be
included ?
Full Images, Full Text, Metadata
Latvia, Belgrade, Hamburg, Berlin,
Estonia, Finland, Neth...
Fragment of Newspaper
image can be shown
Dziennik Slaskui, 10 June
1915
(National Library of
Poland)
Just Metadata
Turkey
(Partners with copyright issues)
All Associate Partners (for now)
The available content in influenced...
Just title level metadata can
be shown:
“Kleine Blatt, 15 November
1932”
(National Library of Austria)
(Although can we ha...
Creating a
newspapers
interface
that ...
• Provides unique value to users
• Reflects relationship to original
physical new...
Users can cross-search
European Newspapers
18m pages, 10m with full text
Users can see what was
published on a particular ...
Local historians
Researchers
Undergraduates
Genealogists
Teachers and / school pupils
‘Interested public’
….
(According to...
Respects
library wishes
The available content in
influenced by what
restrictions in copyright and
business model from each...
Reflects
relationship to
original
physical
newspaper
collections
Not all issues in a
newspaper title will be
available to ...
Embedded in The European
Library (TEL) portal
TEL membership fees will
help with ongoing costs
TEL members can add
content...
Logos and links back to
source of original content
But also evidence of usage
of library content via TEL /
what statistics...
Interface will respond to
usability testing
Harvesting of different
material will affect interface
Changing requests from
...
First
Iteration
Basic text search
Filtering of results by
●date
●country
●newspaper
●language
●library
● OCR shown
● Zoomable version of full
image
● Clickable links between
full text and image
(sometimes)
● Link to newspaper...
Second
Iteration
● Fragments (where requested by
library)
● See information on particular
title
● See what was published o...
Newspapers from national libraries of
Finland and Austria are available for
searching (Sample search terms: Linz,
Graz, Sa...
Upcoming SlideShare
Loading in...5
×

Challenges and solutions in creating a european historic newspapers browser

639

Published on

Alastair Dunning of The European Library presents the challenges and solutions in creating a web browser for Europe's historic newspapers.

Published in: Education, Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
639
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
5
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Challenges and solutions in creating a european historic newspapers browser

  1. 1. Alastair Dunning Europeana Newspapers September 2013 Challenges and Solutions in Creating a European Historic Newspapers Browser
  2. 2. Task $ “... Creation of a full-text index of newspaper content Development of a newspaper content browser Work Package 4 - Aggregation and presentation of digitized newspapers
  3. 3. The European Library is building an interface to allow cross-searching of historic newspapers digitised by project partners Title-level metadata exported to Europeana. In reality ...
  4. 4. Timetable Sep 2013 - Beta version with limited content and functionality made available 2014 - Ongoing inclusion of more content and functionality Spring 2014 - Usability testing I (subject to project funding) Winter 2014 - Usability testing II (subject to project funding) Jan 2015 - All scheduled content and functionality completed
  5. 5. What content will be included ? Full Images, Full Text, Metadata Latvia, Belgrade, Hamburg, Berlin, Estonia, Finland, Netherlands *, Austria * Snippets of Images, Full Text, Metadata Frederich Tessman, France *, Poland
  6. 6. Complete Newspaper image can be shown Eesti Potimees ehk Naddaleleht , 2 November 1866 (National Library of Estonia)
  7. 7. What content will be included ? Full Images, Full Text, Metadata Latvia, Belgrade, Hamburg, Berlin, Estonia, Finland, Netherlands *, Austria * Snippets of Images, Full Text, Metadata Frederich Tessman, France *, Poland
  8. 8. Fragment of Newspaper image can be shown Dziennik Slaskui, 10 June 1915 (National Library of Poland)
  9. 9. Just Metadata Turkey (Partners with copyright issues) All Associate Partners (for now) The available content in influenced by what restrictions in copyright and business model from each of the contributing libraries. What content will be included ?
  10. 10. Just title level metadata can be shown: “Kleine Blatt, 15 November 1932” (National Library of Austria) (Although can we have dark index of full text ?)
  11. 11. Creating a newspapers interface that ... • Provides unique value to users • Reflects relationship to original physical newspaper collections • Is sustainable • Offers contributors added value • Defines relationship to Europeana • Respects library wishes
  12. 12. Users can cross-search European Newspapers 18m pages, 10m with full text Users can see what was published on a particular day across Europe Users can see information on individual newspapers Provides unique value to users
  13. 13. Local historians Researchers Undergraduates Genealogists Teachers and / school pupils ‘Interested public’ …. (According to the project Description of Work it is for the ‘researcher’) But who are the users ?
  14. 14. Respects library wishes The available content in influenced by what restrictions in copyright and business model from each of the contributing libraries. ●Location of digital image ●Size of image ●Format of image
  15. 15. Reflects relationship to original physical newspaper collections Not all issues in a newspaper title will be available to TEL, or even digitised Documents hosted by TEL will be different quality than those Contextual information vital
  16. 16. Embedded in The European Library (TEL) portal TEL membership fees will help with ongoing costs TEL members can add content to newspaper browser over time Is sustainable
  17. 17. Logos and links back to source of original content But also evidence of usage of library content via TEL / what statistics are needed ? Offers contributors added value
  18. 18. Interface will respond to usability testing Harvesting of different material will affect interface Changing requests from libraries Uneven quality, especially in Is developed iteratively
  19. 19. First Iteration Basic text search Filtering of results by ●date ●country ●newspaper ●language ●library
  20. 20. ● OCR shown ● Zoomable version of full image ● Clickable links between full text and image (sometimes) ● Link to newspaper source library (where we have been provided with links) First Iteration
  21. 21. Second Iteration ● Fragments (where requested by library) ● See information on particular title ● See what was published on a particular day ● Search over titles (not just text) ● Other browseable visualisations of publication and library source ● Search / browse via entities
  22. 22. Newspapers from national libraries of Finland and Austria are available for searching (Sample search terms: Linz, Graz, Salzburg, Turku, Oulu, Tampere) Testing the Site
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×