Europeana Newspapers Aggregation Plan

A presentation by Markus Muhr at the Europeana Newspapers workshop in Amsterdam.



  1. Europeana Newspapers, WP 4: Aggregation & Indexing Plan (Markus Muhr)
  2. Agenda
       • Customer Relationship Management
       • Aggregation Workflow - Metadata
       • Aggregation Workflow - Full-text and Images
       • Newspaper Content Browser Options
       • Viewing Images
       • Delivery to Europeana / Zeitschriftendatenbank
       • Aggregation and Indexing Plan
       • Questions
  3. Customer Relationship Management
       • SugarCRM
       • Management of all administrative information
       • Organisations, contacts, datasets, projects, etc.
       • Important features for project handling
       • Newspaper collections
       • Cases per specific collection
       • Aggregation and Indexing Plan
       • Automatic reporting
  4. Customer Relationship Management
  5. Customer Relationship Management
  6. Customer Relationship Management
  7. Aggregation Workflow – Metadata
       • Scheduling of ingestion
       • Datasets ready for harvesting
       • Create case in CRM: case # to provider
       • Harvesting metadata (OAI-PMH, FTP, ...)
       • Enhance metadata (VIAF, Geonames, MACS, ...)
       • Indexing in acceptance portal
       • E-mail to provider to accept dataset
       • Live index = live portal
       • Delivery to Europeana
       • Enhancing and publishing in Europeana
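
The OAI-PMH harvesting step in the metadata workflow can be illustrated with a minimal sketch. It only builds `ListRecords` request URLs and reads record identifiers and the resumption token from a response page; the endpoint URL, set name and function names are hypothetical and not taken from the presentation, and the actual harvester used in the project is not described in the slides.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespace of the OAI-PMH 2.0 response envelope.
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"


def list_records_url(base_url, metadata_prefix="oai_dc",
                     set_spec=None, resumption_token=None):
    """Build an OAI-PMH ListRecords request URL (base_url is hypothetical)."""
    if resumption_token:
        # resumptionToken is an exclusive argument: it replaces all others.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if set_spec:
            params["set"] = set_spec
    return base_url + "?" + urlencode(params)


def parse_page(xml_text):
    """Return (record identifiers, next resumption token or None)."""
    root = ET.fromstring(xml_text)
    ids = [e.text for e in root.iter(OAI_NS + "identifier")]
    token_el = root.find(f".//{OAI_NS}resumptionToken")
    token = token_el.text if token_el is not None and token_el.text else None
    return ids, token
```

A harvester would call `list_records_url` once without a token, fetch the page, and then repeat with the token returned by `parse_page` until it comes back as `None`.
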
  8. Aggregation Workflow – Metadata
  9. Aggregation Workflow - Full-text and Images
       • Hard-disk delivery by UIBK/CCS
       • Hard-disk delivery to ULCC
       • Ingestion and alignment of full-text and images with harvested metadata
       • JPEG 2000 generation for hosted IIP image server
       • Enrichment with named entities from KB
       • Indexing into content browser
       • Adaptations of image viewer for external image servers
       • E-mail to partner
  10. Aggregation Workflow - Full-text and Images
  11. Aggregation Workflow - Full-text and Images
  12. Aggregation Workflow - Full-text and Images
  13. Newspaper Content Browser Options
       • A questionnaire sent to content providers determined how their content would appear in the newspaper content browser
       • Option 1 - Images and full-text
       • Option 2 - Snippets of images and full-text
       • Option 3 - Full-text only
       • Option 4 - Metadata only
       • Option 5 - Option 1 via external image server
       • Option 6 - Option 2 via external image server
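
The six content browser options can be read as combinations of three properties: how much of the image is shown, whether full text is shown, and who serves the images. The following is a minimal sketch of that reading as a lookup table; the class and field names are hypothetical, and treating Option 2 as "image snippets plus full text" is an interpretation of the slide wording.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BrowserOption:
    number: int
    images: str                  # "full", "snippets" or "none"
    full_text: bool              # whether any full text is shown
    external_image_server: bool  # True if the provider serves the images


# One entry per option listed on the slide.
OPTIONS = {
    1: BrowserOption(1, "full", True, False),
    2: BrowserOption(2, "snippets", True, False),
    3: BrowserOption(3, "none", True, False),
    4: BrowserOption(4, "none", False, False),
    5: BrowserOption(5, "full", True, True),
    6: BrowserOption(6, "snippets", True, True),
}


def needs_hosted_images(option_number):
    """True if the aggregator itself would have to host the images."""
    opt = OPTIONS[option_number]
    return opt.images != "none" and not opt.external_image_server
```

Under this reading, `needs_hosted_images` is true only for Options 1 and 2, which are the options that require image hosting on the aggregator side.
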
  14. Viewing Images
       • The European Library hosts images for Options 1 and 2
       • IIP Image Server with JPEG 2000
       • Images for viewing are transformed into JPEG 2000
       • Ingestion workflow includes a transformation step for TIFF and JPEG files
       • Time-demanding operation
       • Image viewer is IIPMooViewer
       • Open source projects
       • Europeana Regia http://www.theeuropeanlibrary.org/tel4/virtual/regia
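
To make the role of the IIP image server more concrete, here is a minimal sketch of how a client such as an image viewer might address a hosted JPEG 2000 file through the Internet Imaging Protocol. The parameters `FIF`, `WID`, `CVT` and `JTL` are standard IIP commands; the server URL and image path are hypothetical and do not come from the presentation.

```python
from urllib.parse import urlencode


def iip_jpeg_url(server_url, image_path, width):
    """URL for a whole-image JPEG rendition scaled to the given width."""
    # FIF selects the image; CVT comes last because it triggers the export.
    params = [("FIF", image_path), ("WID", str(width)), ("CVT", "jpeg")]
    return server_url + "?" + urlencode(params, safe="/,")


def iip_tile_url(server_url, image_path, resolution, tile_index):
    """URL for a single JPEG tile at a given resolution level."""
    params = [("FIF", image_path), ("JTL", f"{resolution},{tile_index}")]
    return server_url + "?" + urlencode(params, safe="/,")
```

A tiled viewer requests many small `JTL` tiles rather than one large rendition, which is why source images are converted to a multi-resolution format such as JPEG 2000 at ingestion time.
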
  15. Viewing Images
       • External image servers for Options 5 and 6
       • External viewers are currently supported via iframe
       • Alignment and highlighting are not available
       • An integrated image viewer would improve usage of the content browser
       • Adaptations are needed for each different kind of image server
       • Time-demanding task
       • Existing viewers that can be easily embedded in the Newspaper Content Browser are preferable
       • Technical support at partner libraries is necessary
  16. Delivery to Europeana / Zeitschriftendatenbank
       • Metadata from Full and Associate Partners should go into the Newspapers content browser, the Europeana portal and the Zeitschriftendatenbank (Union Catalogue of Serials)
       • EDM to Europeana
       • Dublin Core to Zeitschriftendatenbank
       • Europeana Data Model delivery should be finalised soon
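
As an illustration of the simpler of the two delivery formats, the sketch below serialises a handful of fields as Dublin Core elements. It uses the standard Dublin Core element namespace; the wrapper element `record`, the function name and the sample values are hypothetical, and the actual delivery schema agreed with the Zeitschriftendatenbank is not given in the slides.

```python
import xml.etree.ElementTree as ET

# Namespace of the 15 simple Dublin Core elements.
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC_NS)


def dublin_core_record(fields):
    """Serialise a dict of Dublin Core element names to values as XML."""
    record = ET.Element("record")  # hypothetical wrapper element
    for name, value in fields.items():
        element = ET.SubElement(record, f"{{{DC_NS}}}{name}")
        element.text = value
    return ET.tostring(record, encoding="unicode")


# Usage with made-up values:
xml_text = dublin_core_record({"title": "Example Gazette", "language": "de"})
```

An EDM delivery would be richer than this, since EDM is an RDF-based model with separate classes for the provided object and its aggregation, so this sketch covers only the Dublin Core side.
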
  17. Europeana Data Model
  18. Dublin Core
  19. Aggregation and Indexing Plan
       • Plan includes aggregation of partners and 11 associated partners
       • Q3 2013 is the first quarter with indexing work
       • Aggregation and indexing are aligned with deliveries from UIBK/CCS
       • Deliveries to Europeana & Zeitschriftendatenbank from Q4 onwards
       • Aggregation and indexing are split over multiple quarters for some partners
  20. Aggregation and Indexing Plan – Q3 2013
       • Österreichische Nationalbibliothek / Austrian National Library – Option 5
           • Currently working on first batch of 1.090k full-text pages
       • Kansalliskirjasto / National Library of Finland – Option 1 (new)
           • Currently working on first batch of 132k full-text pages and images
  21. Aggregation and Indexing Plan – Q4 2013
       • Landesbibliothek Dr. Friedrich Teßmann / Teßmann Library – Option 2
           • 857k full-text pages and thumbnail images
       • Österreichische Nationalbibliothek / Austrian National Library – Options 5 and 4
           • Remaining batches of 1.090k full-text pages
           • Metadata for 5.691k pages
  22. Aggregation and Indexing Plan – Q4 2013
       • Bibliothèque nationale de France / National Library of France – Option 5
           • First batch of 2.388k full-text pages
       • Latvijas Nacionālā bibliotēka / National Library of Latvia – Option 1
           • 450k full-text pages and images
  23. Aggregation and Indexing Plan – Q4 2013
       • Landsbókasafn Íslands - Háskólabókasafn / National and University Library of Iceland – Associated Partner
           • Metadata for 4.112k pages
       • National Library of Spain – Associated Partner
           • Metadata for 5.831k pages
       • Bibliothèque nationale de Luxembourg / National Library of Luxembourg – Associated Partner
           • Metadata for 620k pages
  24. Aggregation and Indexing Plan – Q1 2014
       • Bibliothèque nationale de France / National Library of France – Option 5
           • Next batch of 2.388k full-text pages
       • Eesti Rahvusraamatukogu / Estonian National Library – Option 1
           • First batch of 594k full-text pages and images
       • Milli Kütüphane Başkanlığı / National Library of Turkey – Option 4
           • Metadata for 9k pages
  25. Aggregation and Indexing Plan – Q1 2014
       • Staatsbibliothek zu Berlin / Berlin State Library – Option 1
           • First batch of 248k full-text pages and images
       • Staats- und Universitätsbibliothek Hamburg / State and University Library Hamburg – Option 1
           • First batch of 1.707k full-text pages and images
       • Univerzitet u Beogradu / University Library of Belgrade – Option 1
           • First batch of 408k full-text pages and images
  26. Aggregation and Indexing Plan – Q1 2014
       • National Library of Wales – Associated Partner
           • Metadata for 1.100k pages
       • National and University Library in Zagreb – Associated Partner
           • Metadata for 300k pages
  27. Aggregation and Indexing Plan – Q1 2014
       • St. Cyril and Methodius National Library / The National Library of Bulgaria – Associated Partner
           • Metadata for 12k pages
       • National Library of the Czech Republic – Associated Partner
           • Metadata for 5.760k pages
  28. Aggregation and Indexing Plan – Q2 2014
       • Bibliothèque nationale de France / National Library of France – Option 5
           • Next batch of 2.388k full-text pages
       • Eesti Rahvusraamatukogu / Estonian National Library – Option 1
           • Next batch of 594k full-text pages and images
  29. Aggregation and Indexing Plan – Q2 2014
       • Biblioteka Narodowa / National Library of Poland – Option 2
           • 83k full-text pages and images
       • Staats- und Universitätsbibliothek Hamburg / State and University Library Hamburg – Option 1
           • Next batch of 1.707k full-text pages and images
       • Koninklijke Bibliotheek / National Library of the Netherlands – Option 5
           • 1.900k full-text pages
  30. Aggregation and Indexing Plan – Q2 2014
       • Narodna in univerzitetna knjižnica / National and University Library of Slovenia – Associated Partner
           • Metadata for ?k pages
       • National Library of Portugal – Associated Partner
           • Metadata for 400k pages
       • National Library of Romania – Associated Partner
           • Metadata for 442k pages
  31. Aggregation and Indexing Plan – Q3 2014
       • Bibliothèque nationale de France / National Library of France – Option 5
           • Next batch of 2.388k full-text pages
       • Eesti Rahvusraamatukogu / Estonian National Library – Option 1
           • Next batch of 594k full-text pages and images
  32. Aggregation and Indexing Plan – Q3 2014
       • Staatsbibliothek zu Berlin / Berlin State Library – Option 1
           • Next batch of 248k full-text pages and images
       • Staats- und Universitätsbibliothek Hamburg / State and University Library Hamburg – Option 1
           • Next batch of 1.707k full-text pages and images
  33. Aggregation and Indexing Plan – Q4 2014
       • Bibliothèque nationale de France / National Library of France – Option 5
           • Final batch of 2.388k full-text pages
       • Eesti Rahvusraamatukogu / Estonian National Library – Option 1
           • Final batch of 594k full-text pages and images
  34. Aggregation and Indexing Plan – Q4 2014
       • Staatsbibliothek zu Berlin / Berlin State Library – Option 1
           • Final batch of 248k full-text pages and images
       • Staats- und Universitätsbibliothek Hamburg / State and University Library Hamburg – Option 1
           • Final batch of 1.707k full-text pages and images
       • Kansalliskirjasto / National Library of Finland – Option 1
           • Final batch of 132k full-text pages and images
  35. Operations Officers
       • Anastasia Gasia, Junior Operations Officer: anastasia.gasia@kb.nl
       • Chiara Latronico, Operations Officer: chiara.latronico@kb.nl
       • Operations Mailbox: collections@theeuropeanlibrary.org
  36. Thank you for your attention!
       • Markus Muhr (markus.muhr@kb.nl)
       • www.europeana-newspapers.eu
