ENP Belgrade Workshop Project Overview

1,149 views

Published on

Published in: Education, Business
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
1,149
On SlideShare
0
From Embeds
0
Number of Embeds
521
Actions
Shares
0
Downloads
2
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide
  • Titel Overview Mission statement Why newspapers iew, not 1 and 6 Special focus: Turkey Thanks and bye
  • ENP Belgrade Workshop Project Overview

    1. 1. Europeana Newspapers ProjectWorkshop on Refinement and Quality AssessmentUniversity Library "Svetozar Marković“Belgrade, June 13th2013Hans-Jörg Lieder/ Ulrike KölschProject CoordinatorBerlin State Library, GermanyBelgrade/June 13th 2013/University Library
    2. 2. This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of theCompetitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 2ContentProject Profile• Consortium & Stakeholders• Aims and Objectives• Adding value• Where do we go from here?
    3. 3. This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of theCompetitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 3Consortium & Stakeholders• 18 partners from 12 countries within the consortium National and University libraries Universities SME• External partners and stakeholders Involvement of libraries outside the project consortium via associated andnetwork partnerships• Framework Funded as a Best Practice Network in the ICT PSP program of theEuropean Commission Project duration: February 2012 – January 2015
    4. 4. This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of theCompetitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_pspConsortium Partners10. CCS Content ConversionSpecialists GmbH11. Stichting LIBER, Netherlands12. National Library of Latvia13. National Library of Turkey14. University Library of Belgrade15. University of Innsbruck16. State Library Dr. FriedrichTessmann, Italy17. The British Library, UK18. Europeana Foundation,Netherlands01. State Library Berlin, Germany02. National Library of theNetherlands03. National Library of Estonia04. National Library of Austria05. National Library of Finland06. State and University LibraryHamburg, Germany07. National Library of France08. National Library of Poland09. University of Salford
    5. 5. This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of theCompetitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_pspEuropeana Newspapers ConsortiumNLFSBB ONBNLPBnFNLESUB HHUSALNLLLIBER,KB, EFCCSNLTUBUIBKLFTBL
    6. 6. This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of theCompetitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_pspAssociated Partners1. National Library of Czech Republic2. National Library of Wales3. National and University Library Ljubljana, Slovenia4. National Library of Portugal5. National and University Library of Iceland6. National Library of Spain7. National and University Library Zagreb, Croatia8. National Library of Belgium9. St. Cyril and Methodius National Library, Bulgaria10.National Library of Luxembourg11.Lucian Blaga Central University Library, RomaniaSince April 2013 the project has eleven Associated partners and startedintensive networking with further libraries
    7. 7. This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of theCompetitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 7Europeana Newspapers: Aims and Objectives• Refinement methods for OCR, OLR (article segmentation),Named Entity Recognition (NER) and class recognition Creation of 18 million pages of digitised newspapers- 10 million refined pages: OCR (UIBK, Austria)- 2 million refined pages: OCR/OLR (article segmentation) (CCS, Germany) Delivery of 8 million pages already available locally• Quality evaluation and prediction tools• Aggregation and refinement of newspapers for The European Libraryand Europeana• Metadata: best practice recommendation for Creation of OCR-ready images Full-texts and associated metadata NER• Dissemination: Further libraries are encouraged and supported incontributing newspapers content to Europeana
    8. 8. This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of theCompetitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_pspValue: Europeana Newspapers spreads best practiceEuropeana Newspapers supports the creation of a larger windowinto European culture by:• Developing best practice for the digitisation of newspapers• Sharing best practice and experiences through workshop with project partners,associated partners, and networking partners• Publishing best practice on our website• National Information days
    9. 9. This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of theCompetitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_pspAdded Value: AggregationActivities focused on three key messages:1. The project and its outcomes (e.g. online access to acollection of high-quality digitised newspapers);2. The technological challenges (e.g. techniques for refiningcontent and the development of a standardised metadatamodel);3. The content-related issues (e.g. improving the extent ofnewspaper digitisation, the changing nature of historicalresearch).The European Library• A single library domain aggregator• Content from major European libraries• Dedicated newspaper content browser• Full-text search capabilities• Portal for researchers
    10. 10. This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of theCompetitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 10Added Value: Scenarios• Keyword and Phrase Search• Image Browsing• Access via content structure (OLR and NER results)• Geo-location based service• Text mining• Crowd sourced correction and enrichment• Access through mobile apps• ...
    11. 11. This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of theCompetitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_pspWhere are we now?• OCR-Processing completed almost four million newspaper pages• Available specification of use scenarios• Available initial versions of evaluation tools• Europeana Newspapers survey report• Development of three tools to support highly standardised datacreation, data controlling and data delivery within the project• Metadata recommendations ready to be published in October 2013• Specifications for content browser• CCS has started work (OLR)• Dissemination and Information- Established associated and networking partnerships
    12. 12. This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of theCompetitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_pspWhere do we go from hereActivities focused on three key messages:1. The project and its outcomes (e.g. online access to acollection of high-quality digitised newspapers);2. The technological challenges (e.g. techniques for refiningcontent and the development of a standardised metadatamodel);3. The content-related issues (e.g. improving the extent ofnewspaper digitisation, the changing nature of historicalresearch).More newspaper content• Most libraries have digitised less than 10% of their physicalnewspaper collectionMore recent content• 20th century content unavailable or only available under licence atnational level: need to work with publishers and rights holdersExploit richness of European digitised newspaper collections• OCR not applied across the board and often selectivelyImproved accessiblity• Richness of content has knock on effect on accessibility (e.g. fulltext search)
    13. 13. This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of theCompetitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 13Why newspapers? …and how, anyway?"Die Zeitungen sind die Sekundenzeiger der Geschichte.“(Newspapers are the second hands of history)(This hand however, is not only of inferior metal to the other hands, it alsoseldom works properly.)Arthur Schopenhauer Relevant to all customers/citizens Relevant to regional and European policies incl. Europeana Newspaper holdings in public institutions are…• … sometimes: solid and complete, beautiful bound; excellent microfilm copies• … frequently: frail and crumbly, missing editions, incomplete supplements,poorly bound; poor microfilm copies, legal uncertainties with contemporarymaterial
    14. 14. Thank you for your attention!Contact:hans-joerg.lieder@sbb.spk-berlin.deulrike.koelsch@europeana-newspapers.euFor more information, please see www.europeana-newspapers.euor follow our project news via Twitter (@eurnews) andFacebook (https://www.facebook.com/EuropeanaNewspapers)

    ×