SlideShare a Scribd company logo
1 of 33
Data scale and diversity
issues at Europeana
Antoine Isaac
Collections – storing and using the metadata
Finnish National Museum, 16 November 2017
Title here
CC BY-SA
What is Europeana?
CC BY-SA
Europeana’s markets and services
Title here
CC BY-SA
Access to Europe’s cultural heritage
Europeana Essentials
CC BY-SACC BY-SA
53 million digitized objects, from 3,700 institutions in 44 countries
Engage people
● Place the digital cultural heritage where
people are: Wikipedia, Pinterest, online but
also in the street, participating via collection
days, or transcribathons.
● And develop more browse entry points for
specific interest areas
Re-use with Partners
Make content available to:
●Educators, Teacher Networks and
Ministries of Education
●Research Networks
●the Creative industries/innovators
So it they can make it work for their
audiences
Title here
CC BY-SACC BY-SA
A network of data partners
●Data providers: Cultural heritage institutions providing content and metadata to
Europeana
●"Intermediate” Aggregators: organizations
or projects gathering metadata and content
for institutions from a specific country, sector,
or on a specific domain (music, archaeology,
theater…) and making it available for
Europeana and other data consumers
Title here
CC BY-SA
What data does Europeana
hold?
Europeana Essentials
CC BY-SACC BY-SA
● Descriptive and technical metadata
● Thumbnails
As a rule, content is still served from our data partners
● Some content for specific projects
● newspapers text and images
● user-generated content (Europeana 1914-1918)
Title here
CC BY-SA
Big data?
Europeana Essentials
CC BY-SACC BY-SA
Volume – not so big
● Metadata <1TB
● Thumbnails <4TB
● Some content for specific projects <10TB
53 million digitized objects, from 3,700 institutions in 44 countries
Title here
CC BY-SA
Big data?
Europeana Essentials
CC BY-SACC BY-SA
Velocity - manageable
● Approx. 2M records added or removed between May 2017
and September 2017
● Updates are done continuously, but not in real time (our
data ingestion team pushes the buttons)
Title here
CC BY-SA
Big data?
Europeana Essentials
CC BY-SACC BY-SA
Veracity - problematic
● We have big data quality issues
Title here
CC BY-SA
Big data?
Europeana Essentials
CC BY-SACC BY-SA
Variety – huge
53 million digitized objects, from 3,700 institutions in 44
countries
● Many different themes and types of objects
Books, newspapers, journals, letters, diaries, archival papers, paintings, maps, drawings,
photographs, music, spoken word, radio broadcasts, film, newsreels, television, fashion,
sculpture, 3D objects, and more
● Libraries, archives, museums have different ways to describe objects.
Even within a sector, big differences can be observed
● Heterogeneity makes quality issues even harder to cope with
Title here
CC BY-SA
Multilinguism
Europeana Essentials
CC BY-SACC BY-SA
● Officially we get metadata in 39 languages
● But there are more languages used in individual metadata
fields
Title here
CC BY-SA
Europeana Essentials
CC BY-SACC BY-SA
Work by Péter Kiraly (Göttingen Research alliance)
http://144.76.218.178/europeana-qa/languages.php?collectionId=
Title here
CC BY-SA
Europeana Essentials
CC BY-SACC BY-SA
Work by Péter Kiraly (Göttingen Research alliance)
http://144.76.218.178/europeana-qa/languages.php?collectionId=
Title here
CC BY-SA
Multilinguism
Europeana Essentials
CC BY-SACC BY-SA
● Officially we get metadata in 39 languages
● But there are more languages used in individual metadata
fields
• Over 400 language codes
• E.g., 6 values in x-aramaic-latn - not a valid code by the way
• But the most common case is lack of language information!
What do we do with it?
France, Public Domain
1914, National Library of France
Agence de presse Meurisse
Concours de cycles nautiques sur le lac
d’Enghien : Berregent piloté par Austerling
Segmenting - discovery
Title here
CC BY-SA
Europeana Collections
Europeana Essentials
CC BY-SACC BY-SA
Title here
CC BY-SACC BY-SA
Thematic collections
Title here
CC BY-SA
Contributing to third party sites -
Wikimedia Commons
Europeana Essentials
CC BY-SACC BY-SA
Europeana 1914-1918 content on Wikimedia Commons
Wikimedia Commons| CC BY-SA
Segmenting - re-use
France, Public Domain
1932, National Library of France
Agence de presse Mondial Photo-Presse.
Tournoi royal de motos à Londres :
changement d'une roue de side-car en marche
Still, we need to
work on data
Title here
CC BY-SA
Title here
CC BY-SA
Europeana Essentials
CC BY-SA
Data modelling for
interoperability and richer data
CC BY-SA
Clavecin, Bartolomeo Cristofori
Cite de la Musique,
MIMO - Musical Instruments Museums Online|CC BY-NC-SA
Europeana Data Model example
Title here
CC BY-SACC BY-SA
A community driven model
• Involving experts from libraries, archives and museums, as
well as academic partners
• The input from the different communities makes the model
stronger
Bible Translators
Anatolia College, Greece | Public Domain
Title here
CC BY-SACC BY-SA
The Europeana Data Model
(EDM)
OAI-
ORE
OWL
Title here
CC BY-SACC BY-SA
Following the Linked Open Data
principles
http://vimeo.com/36752317
Title here
CC BY-SA
Title here
CC BY-SA
Europeana Essentials
CC BY-SA
Enriching data
CC BY-SA
Several processes produce richer data that
we and others can use to build new and
innovative services
• Harvesting richer Linked Open Data from
data partners
• Crowdsourcing
•Automatic semantic enrichment
Latvijas dzelzce u karteļ
1937, National Library of Latvia, Latvia | Public Domain
CC BY-SA
Thumbnail
Metadata
Link to data
provider
Rights
CC BY-SA
Links to contextual entities
Title here
CC BY-SACC BY-SA
The Entity Collection
Contribution to multilingual coverage
Entities effectively used to enrich Europeana Objects
Entities present in the Entity Collection
Title here
CC BY-SACC BY-SA
Multilingual enrichment is not easy
Poisonous India or the Importance of a Semantic and
Multilingual Enrichment Strategy
Marlies Olensky, Juliane Stiller, Evelyn Dröge, MTSR 2012
http://link.springer.com/chapter/10.1007%2F978-3-642-35233-1_2
Title here
CC BY-SACC BY-SA
Data Quality Committee
Working on recommendations for the community on:
○ Mandatory metadata elements for ingestion of EDM data
○ Metadata checking and normalization
○ Meaningful metadata values (in the context of use)
○ Quality of content (digital media)
○ Coordination with other quality-related initiatives
http://pro.europeana.eu/get-involved/europeana-tech/data-quality-committee
Title here
CC BY-SACC BY-SA
Title here
CC BY-SA
Name of image | Creator
Providing organization|
Country, licence
Name of image | Creator
Providing organization| Country, licence
antoine.isaac@europeana.eu
@antoine_isaac

More Related Content

What's hot

Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02
The European Library
 

What's hot (20)

Europeana Network Association AGM 2016 - 8 November 2016 - Max Kaiser - Report
Europeana Network Association AGM 2016 - 8 November 2016 - Max Kaiser - ReportEuropeana Network Association AGM 2016 - 8 November 2016 - Max Kaiser - Report
Europeana Network Association AGM 2016 - 8 November 2016 - Max Kaiser - Report
 
Exploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practiceExploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practice
 
Ocd impact analysis presentation gla mwiki 2015
Ocd impact analysis presentation gla mwiki 2015Ocd impact analysis presentation gla mwiki 2015
Ocd impact analysis presentation gla mwiki 2015
 
EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015
 
Europeana at Ten: insights from our first decade
Europeana at Ten: insights from our first decadeEuropeana at Ten: insights from our first decade
Europeana at Ten: insights from our first decade
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018
 
You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?
 
You've Digitised. What Next ?
You've Digitised. What Next ?You've Digitised. What Next ?
You've Digitised. What Next ?
 
Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02
 
Europeana Cloud as part of the Europeana Ecosystem
Europeana Cloud as part of the Europeana EcosystemEuropeana Cloud as part of the Europeana Ecosystem
Europeana Cloud as part of the Europeana Ecosystem
 
Introduction to CARARE
Introduction to CARAREIntroduction to CARARE
Introduction to CARARE
 
Enriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaEnriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpedia
 
Europeana Music Channel, wireframes
Europeana Music Channel, wireframesEuropeana Music Channel, wireframes
Europeana Music Channel, wireframes
 
CARARE: Connecting Archaeology and Architecture in Europeana
CARARE: Connecting Archaeology and Architecture in EuropeanaCARARE: Connecting Archaeology and Architecture in Europeana
CARARE: Connecting Archaeology and Architecture in Europeana
 
AGM 2015 - 4/11 - BP 2016
AGM 2015 - 4/11 - BP 2016AGM 2015 - 4/11 - BP 2016
AGM 2015 - 4/11 - BP 2016
 
Museums and Europeana
Museums and EuropeanaMuseums and Europeana
Museums and Europeana
 
Europeana Archaeology
Europeana ArchaeologyEuropeana Archaeology
Europeana Archaeology
 
Europeana Creative - What is this Europeana thing?
Europeana Creative - What is this Europeana thing?Europeana Creative - What is this Europeana thing?
Europeana Creative - What is this Europeana thing?
 
Des nouvelles d’Europeana
Des nouvelles d’EuropeanaDes nouvelles d’Europeana
Des nouvelles d’Europeana
 
Consolidating Openness : Developing Rijksmuseum Research Services
Consolidating Openness : Developing Rijksmuseum Research ServicesConsolidating Openness : Developing Rijksmuseum Research Services
Consolidating Openness : Developing Rijksmuseum Research Services
 

Similar to Data scale and diversity issues at Europeana

Europeana Essentials (updated June 2014)
Europeana Essentials (updated June 2014)Europeana Essentials (updated June 2014)
Europeana Essentials (updated June 2014)
Europeana
 
Evaluation of Schema.org for Aggregation of Cultural Heritage Metadata
Evaluation of Schema.org for Aggregation of Cultural Heritage MetadataEvaluation of Schema.org for Aggregation of Cultural Heritage Metadata
Evaluation of Schema.org for Aggregation of Cultural Heritage Metadata
Nuno Freire
 

Similar to Data scale and diversity issues at Europeana (20)

Europeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseEuropeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) case
 
The Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage DataThe Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage Data
 
Why join Europeana?
Why join Europeana?Why join Europeana?
Why join Europeana?
 
Europeana essentials June 2013
Europeana essentials June 2013Europeana essentials June 2013
Europeana essentials June 2013
 
Europeana Essentials (updated June 2014)
Europeana Essentials (updated June 2014)Europeana Essentials (updated June 2014)
Europeana Essentials (updated June 2014)
 
Linked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approachLinked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approach
 
Europeana essentials August 2013
Europeana essentials August 2013Europeana essentials August 2013
Europeana essentials August 2013
 
3D content in Europeana: the challenges of providing access
3D content in Europeana: the challenges of providing access3D content in Europeana: the challenges of providing access
3D content in Europeana: the challenges of providing access
 
Europeana creating the backbone
Europeana creating the backboneEuropeana creating the backbone
Europeana creating the backbone
 
European databases in cultural heritage: making connections
European databases in cultural heritage: making connectionsEuropean databases in cultural heritage: making connections
European databases in cultural heritage: making connections
 
The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018
 
Sharing 3D Cultural Heritage
Sharing 3D Cultural HeritageSharing 3D Cultural Heritage
Sharing 3D Cultural Heritage
 
Building a Framework for Semantic Cultural Heritage Data
Building a Framework for Semantic Cultural Heritage DataBuilding a Framework for Semantic Cultural Heritage Data
Building a Framework for Semantic Cultural Heritage Data
 
Europeana 2019 - Connect Communities - Pitch your project
Europeana 2019 - Connect Communities - Pitch your projectEuropeana 2019 - Connect Communities - Pitch your project
Europeana 2019 - Connect Communities - Pitch your project
 
Europeana Essentials - Latest
Europeana Essentials - LatestEuropeana Essentials - Latest
Europeana Essentials - Latest
 
Copyright challenges and policy choices in European heritage projects Tools, ...
Copyright challenges and policy choices in European heritage projects Tools, ...Copyright challenges and policy choices in European heritage projects Tools, ...
Copyright challenges and policy choices in European heritage projects Tools, ...
 
53 million objects! Now what?
53 million objects! Now what?53 million objects! Now what?
53 million objects! Now what?
 
IIIF and the Europeana mission
IIIF and the Europeana missionIIIF and the Europeana mission
IIIF and the Europeana mission
 
Connecting Cuture with Europeana, Barcelona, 12 june 2018
Connecting Cuture with Europeana, Barcelona, 12 june 2018Connecting Cuture with Europeana, Barcelona, 12 june 2018
Connecting Cuture with Europeana, Barcelona, 12 june 2018
 
Evaluation of Schema.org for Aggregation of Cultural Heritage Metadata
Evaluation of Schema.org for Aggregation of Cultural Heritage MetadataEvaluation of Schema.org for Aggregation of Cultural Heritage Metadata
Evaluation of Schema.org for Aggregation of Cultural Heritage Metadata
 

More from Antoine Isaac

More from Antoine Isaac (20)

Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021
 
Le Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaLe Cadre de publication d'Europeana
Le Cadre de publication d'Europeana
 
Metadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansMetadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plans
 
Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...
 
Europeana et IIIF
Europeana et IIIFEuropeana et IIIF
Europeana et IIIF
 
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesIsaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
 
Europeana APIs
Europeana APIsEuropeana APIs
Europeana APIs
 
Modelling and exchanging annotations
Modelling and exchanging annotationsModelling and exchanging annotations
Modelling and exchanging annotations
 
Modelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSModelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WS
 
Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...
 
Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15
 
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
 
AAC Education Session
AAC Education Session AAC Education Session
AAC Education Session
 
Europeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E resultsEuropeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E results
 
Europeana DSI - LT-Accelerate 14
Europeana DSI -  LT-Accelerate 14Europeana DSI -  LT-Accelerate 14
Europeana DSI - LT-Accelerate 14
 
EIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open DataEIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open Data
 
Enrichment and Europeana
Enrichment and EuropeanaEnrichment and Europeana
Enrichment and Europeana
 
A portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseA portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data case
 
Challenges for the Language Technology Industry
Challenges for the Language Technology IndustryChallenges for the Language Technology Industry
Challenges for the Language Technology Industry
 
Semantic Web, Linked Data: the Europeana case(s)
Semantic Web, Linked Data: the Europeana case(s)Semantic Web, Linked Data: the Europeana case(s)
Semantic Web, Linked Data: the Europeana case(s)
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 

Data scale and diversity issues at Europeana

  • 1. Data scale and diversity issues at Europeana Antoine Isaac Collections – storing and using the metadata Finnish National Museum, 16 November 2017
  • 2. Title here CC BY-SA What is Europeana? CC BY-SA Europeana’s markets and services
  • 3. Title here CC BY-SA Access to Europe’s cultural heritage Europeana Essentials CC BY-SACC BY-SA 53 million digitized objects, from 3,700 institutions in 44 countries
  • 4. Engage people ● Place the digital cultural heritage where people are: Wikipedia, Pinterest, online but also in the street, participating via collection days, or transcribathons. ● And develop more browse entry points for specific interest areas
  • 5. Re-use with Partners Make content available to: ●Educators, Teacher Networks and Ministries of Education ●Research Networks ●the Creative industries/innovators So it they can make it work for their audiences
  • 6. Title here CC BY-SACC BY-SA A network of data partners ●Data providers: Cultural heritage institutions providing content and metadata to Europeana ●"Intermediate” Aggregators: organizations or projects gathering metadata and content for institutions from a specific country, sector, or on a specific domain (music, archaeology, theater…) and making it available for Europeana and other data consumers
  • 7. Title here CC BY-SA What data does Europeana hold? Europeana Essentials CC BY-SACC BY-SA ● Descriptive and technical metadata ● Thumbnails As a rule, content is still served from our data partners ● Some content for specific projects ● newspapers text and images ● user-generated content (Europeana 1914-1918)
  • 8. Title here CC BY-SA Big data? Europeana Essentials CC BY-SACC BY-SA Volume – not so big ● Metadata <1TB ● Thumbnails <4TB ● Some content for specific projects <10TB 53 million digitized objects, from 3,700 institutions in 44 countries
  • 9. Title here CC BY-SA Big data? Europeana Essentials CC BY-SACC BY-SA Velocity - manageable ● Approx. 2M records added or removed between May 2017 and September 2017 ● Updates are done continuously, but not in real time (our data ingestion team pushes the buttons)
  • 10. Title here CC BY-SA Big data? Europeana Essentials CC BY-SACC BY-SA Veracity - problematic ● We have big data quality issues
  • 11. Title here CC BY-SA Big data? Europeana Essentials CC BY-SACC BY-SA Variety – huge 53 million digitized objects, from 3,700 institutions in 44 countries ● Many different themes and types of objects Books, newspapers, journals, letters, diaries, archival papers, paintings, maps, drawings, photographs, music, spoken word, radio broadcasts, film, newsreels, television, fashion, sculpture, 3D objects, and more ● Libraries, archives, museums have different ways to describe objects. Even within a sector, big differences can be observed ● Heterogeneity makes quality issues even harder to cope with
  • 12. Title here CC BY-SA Multilinguism Europeana Essentials CC BY-SACC BY-SA ● Officially we get metadata in 39 languages ● But there are more languages used in individual metadata fields
  • 13. Title here CC BY-SA Europeana Essentials CC BY-SACC BY-SA Work by Péter Kiraly (Göttingen Research alliance) http://144.76.218.178/europeana-qa/languages.php?collectionId=
  • 14. Title here CC BY-SA Europeana Essentials CC BY-SACC BY-SA Work by Péter Kiraly (Göttingen Research alliance) http://144.76.218.178/europeana-qa/languages.php?collectionId=
  • 15. Title here CC BY-SA Multilinguism Europeana Essentials CC BY-SACC BY-SA ● Officially we get metadata in 39 languages ● But there are more languages used in individual metadata fields • Over 400 language codes • E.g., 6 values in x-aramaic-latn - not a valid code by the way • But the most common case is lack of language information!
  • 16. What do we do with it? France, Public Domain 1914, National Library of France Agence de presse Meurisse Concours de cycles nautiques sur le lac d’Enghien : Berregent piloté par Austerling
  • 18. Title here CC BY-SA Europeana Collections Europeana Essentials CC BY-SACC BY-SA
  • 19. Title here CC BY-SACC BY-SA Thematic collections
  • 20. Title here CC BY-SA Contributing to third party sites - Wikimedia Commons Europeana Essentials CC BY-SACC BY-SA Europeana 1914-1918 content on Wikimedia Commons Wikimedia Commons| CC BY-SA
  • 22. France, Public Domain 1932, National Library of France Agence de presse Mondial Photo-Presse. Tournoi royal de motos à Londres : changement d'une roue de side-car en marche Still, we need to work on data
  • 23. Title here CC BY-SA Title here CC BY-SA Europeana Essentials CC BY-SA Data modelling for interoperability and richer data CC BY-SA Clavecin, Bartolomeo Cristofori Cite de la Musique, MIMO - Musical Instruments Museums Online|CC BY-NC-SA Europeana Data Model example
  • 24. Title here CC BY-SACC BY-SA A community driven model • Involving experts from libraries, archives and museums, as well as academic partners • The input from the different communities makes the model stronger Bible Translators Anatolia College, Greece | Public Domain
  • 25. Title here CC BY-SACC BY-SA The Europeana Data Model (EDM) OAI- ORE OWL
  • 26. Title here CC BY-SACC BY-SA Following the Linked Open Data principles http://vimeo.com/36752317
  • 27. Title here CC BY-SA Title here CC BY-SA Europeana Essentials CC BY-SA Enriching data CC BY-SA Several processes produce richer data that we and others can use to build new and innovative services • Harvesting richer Linked Open Data from data partners • Crowdsourcing •Automatic semantic enrichment Latvijas dzelzce u karteļ 1937, National Library of Latvia, Latvia | Public Domain
  • 28. CC BY-SA Thumbnail Metadata Link to data provider Rights
  • 29. CC BY-SA Links to contextual entities
  • 30. Title here CC BY-SACC BY-SA The Entity Collection Contribution to multilingual coverage Entities effectively used to enrich Europeana Objects Entities present in the Entity Collection
  • 31. Title here CC BY-SACC BY-SA Multilingual enrichment is not easy Poisonous India or the Importance of a Semantic and Multilingual Enrichment Strategy Marlies Olensky, Juliane Stiller, Evelyn Dröge, MTSR 2012 http://link.springer.com/chapter/10.1007%2F978-3-642-35233-1_2
  • 32. Title here CC BY-SACC BY-SA Data Quality Committee Working on recommendations for the community on: ○ Mandatory metadata elements for ingestion of EDM data ○ Metadata checking and normalization ○ Meaningful metadata values (in the context of use) ○ Quality of content (digital media) ○ Coordination with other quality-related initiatives http://pro.europeana.eu/get-involved/europeana-tech/data-quality-committee
  • 33. Title here CC BY-SACC BY-SA Title here CC BY-SA Name of image | Creator Providing organization| Country, licence Name of image | Creator Providing organization| Country, licence antoine.isaac@europeana.eu @antoine_isaac

Editor's Notes

  1. Image is: http://www.europeana.eu/portal/record/9200365/BibliographicResource_1000055421061.html Copyright url: http://creativecommons.org/publicdomain/mark/1.0/
  2. Europeana gives you access to: Books, newspapers, journals, letters, diaries, archival papers, paintings, maps, drawings, photographs, music, spoken word, radio broadcasts, film, newsreels, television, fashion, sculpture, 3D objects, and more Previews and information (metadata) about each object with, where possible, (direct) access to full texts, high-quality imagery, sound and video files either on Europeana or on the provider’s own website. Europeana Collections was updated and given its new name in December 2015. Its fresh, clean feel, designed in response to stakeholder and audience feedback, makes it easy to navigate. Improved search and filters, including innovative colour searches (try this one for ‘Dark slate blue’) and quality searches (find our very best maps of Paris), mean finding what you’re looking for is simple. Better, bigger previews, a zoom function for high resolution images and documents (get really close up to this Monet), direct play for video (try this silent film) and audio (listen to this Maria Callas aria) and a new download option mean less time clicking and more time engrossed in cultural content. Clear copyright information shows you what you can and can’t do with the treasures you find. All this together gives you trustworthy and relevant information and content to use, to learn, to share or to contribute to research. See for yourself with our quick demo. Explore Europeana Collections at http://europeana.eu
  3. 3. Engaging and involving people more We also need to continue to deliver content to Wikipedia and Pinterest so that the more casual user gets to experience a broader spectrum of our heritage in the places they already use. And we undertake broad engagement campaigns such as Europeana 1914-1918 which I will come to and we are building one on migration to play out next year because we know that as a citizen of Europe, taking part in our thematic campaigns – like adding your family memorabilia to Europeana 1914-1918 – helps increase appreciation of our shared history and reinforces the importance of our shared future.
  4. 53 million pieces of cultural heritage - search, browse, filter From 3,500 institutions in 35 countries Europeana gives you access to: Books, newspapers, journals, letters, diaries, archival papers, paintings, maps, drawings, photographs, music, spoken word, radio broadcasts, film, newsreels, television, fashion, sculpture, 3D objects, and more Previews and information (metadata) about each object with, where possible, (direct) access to full texts, high-quality imagery, sound and video files either on Europeana or on the provider’s own website. Europeana Collections was updated and given its new name in December 2015. Its fresh, clean feel, designed in response to stakeholder and audience feedback, makes it easy to navigate. Improved search and filters, including innovative colour searches (try this one for ‘Dark slate blue’) and quality searches (find our very best maps of Paris), mean finding what you’re looking for is simple. Better, bigger previews, a zoom function for high resolution images and documents (get really close up to this Monet), direct play for video (try this silent film) and audio (listen to this Maria Callas aria) and a new download option mean less time clicking and more time engrossed in cultural content. Clear copyright information shows you what you can and can’t do with the treasures you find. All this together gives you trustworthy and relevant information and content to use, to learn, to share or to contribute to research. See for yourself with our quick demo. Explore Europeana Collections at http://europeana.eu
  5. Europeana isn’t content with a service in which audiences simply come to us. We take Europe’s collections to the online places in which our audiences congregate. We share openly licensed collections in as many places as we can. Here are some examples. Europeana 1914-1918 The Europeana blog Europeana virtual exhibitions, curated by our expert partners Facebook Pinterest Twitter Google+ Wikimedia Europeana newsletter We’re always looking for ways to improve how we connect with our audiences, so our work on things like the portal, blog, virtual exhibitions and our newsletter is never done! Working with partners and projects is really important for our end-user services. It’s their expertise in specific areas that helps us create fascinating exhibitions and take people on great cultural journeys with our channels.
  6. 2. Scaling with partners We have 5 markets we are trying to reach with Europeana: Lovers of the various aspects of culture from music to art to fashion to maps to natural history; whether they are Cultural Professionals; Educators, Researchers or from the Creative Industries and Innovators. We cannot reach these markets by ourselves we dont have the subject knowledge or capability of reaching the audiences so need to rely on partners. But to do this we must be able to provide educator, researcher or creative, with clearly labelled, high-quality material content for use. So they can make it work in their systems and users get rich, trusted cultural heritage in their workflow, tailored to their specific need.
  7. Image url:http://www.europeana.eu/portal/record/90402/SK_A_3899.html Copyright url:http://creativecommons.org/publicdomain/mark/1.0/