SlideShare a Scribd company logo
Europeana Semantic Data in
         OWLIM
               Mariana Damova, PhD


      The Bulgarian Participation in Europeana.
           Cooperation and Development
                       Varna
                    October 2012
Ontotext
   – Top-5 provider of core Semantic Technology
   – Established in year 2000; offices in Bulgaria, UK, USA
   – Active both in research and commercial projects (FP7 funding for 10 years)

• 360° semantic technology – unique portfolio:
   – Semantic Databases: high-performance RDF DBMS, scalable reasoning
   – Semantic Search: text-mining (IE), metadata generation, Information Retrieval (IR)
   – Web Mining: focused crawling, screen scraping, data fusion
   – Linked Data Management and Data Integration

   Good recognition in the SemTech community
   – Ontotext pages are ranked #1 for “semantic annotation” and “semantic repository” at
     GYM, #3 for “linked data management” at Google

   Several joint ventures and subsidiaries
   – Innovantage: leading online recruitment intelligence provider in UK
Ontotext Clients (selected)

          British Broadcasting Corporation (BBC)
                – Run its World Cup 2010 sites on top of OWLIM
                – Since Mar’12 BBC Sports
                – 2012 Olympics sections are driven
                  by OWLIM and a Concept Extraction service developed by Ontotext
          Press Association (UK)
                – Analysis of Sports news
                – Concept extraction
                – Linked data generation
          Top-3 USA media (not allowed to name)
          The National Archives (UK) contracted Ontotext to implement
          semantic KB and semantic search for the Government Web Archive
          British Museum (UK) Ontotext leads the development of Phase 3 of
          ResearchSpace project on collaborative research in cultural heritage;
          British Museum’s public SPARQL end-point is powered by OWLIM
          de Bibliothek (Holland) aggregation of data from 150 library databases
Outline



• Europeana
• bulgariana.eu
• Collections
• Europeana Data Standards
• Metadata mapping, conversion and ingestion
• Digital repository
• Conclusion



                             4
Europeana

                                   http://www.europeana.eu




•   Launched in 2008
•   Project funded by the European Commission
•   Based in the National Library of the Netherlands, the Koninklijke Bibliotheek
•   Goal to make Europe's cultural and scientific heritage accessible to the public
•   Over 180 heritage and knowledge organzations and IT experts across Europe
•   Europeana Collection: 5M objects in 2009, 10M in 2010, 20M at present
•   Endorsed by the European parliament in 2010
•   2011 "Comité des Sages" makes recommendations about Europeana
       to put online the collections held by Europe's libraries, archives, museums and
       audiovisual archives – vast numbers of books and periodicals (there are some 2.5bn
       items in Europe's libraries alone), and millions of hours of film and video covering the
       whole of Europe's diverse history and culture.


                                                5
Europeana

 • Collection types: Image, Sound, Video, Text
 • Present Europeana Architecture




                      Europeana         Solr       ingestion
                      Portal            DB

           visitor                                              Provider


                       system context

                                                  back office


 • Europeana data standards
 • Europeana aggregators (by country or cultural heritage sector)
 • Process of ingesting content (4-6 weeks)


                                           6
bulgariana.eu




                7
bulgariana.eu

• Main Purpose: BG
  aggregator for Europeana
• Secondary Purpose:
  networking and special
  interest group for BG
  Cultural Heritage




                             8
Collections




              9
Collections
Golden Pages from the Bulgarian Renaissance
Златни страници от Българското Възраждане
                  unique manuscripts of Bulgarian folk songs collected in 19th century
                  by Miladinov Brothers, renowned Bulgarian Folklorists
                  published in 2008 by D-r Luchia Antonova,
                  Institute of Bulgarian Language, Bulgarian Academy of Sciences



                                                      МАРКО КРАЛЕВИКИ БОЛЕН СЕ КАИТ И СЕ
                                                      ИСПОВЕДВИТ

                                                      Поболил се Марко Кралевике,
                                                      що си лежал токму три години,
                                                      от нищо се иляч (1) не на’ож’ал.
                                                      И му рече негва стара майќа:
                                                      “Ай ти, Марко, ай ти, синко милий;
                                                      не си болен, синко, от господа,
                                                      тук си болен, синко, от гре’о’и,
                                                      да ти викна попой (2), ду’овници,
                                                      лепо да се синко исповедиш,
                                                      да си кажиш твоите гре’о’и!”
                                                      ….


                                                10
Collections
Pra-historic and Thracian Civilizations
Праисторическа и Тракийска цивилизация
                  Unpublished Thracian archeological objects collected by Prof.
                  Valeria Fol, Center of Thracology at the Institute for Balkan Studies
                  at the Bulgarian Academy of Sciences




                                                 11
Links



• http://bulgariana.eu
• http://bulgarianheritage.bulgariana.eu
• http://www.europeana.eu
   – europeana_collectionName: 20215*
   – for the individual sets use europeana_collectionName: 2021501* (or
     2021502*)

• http://britishmuseum.ontotext.com




                                      12
Europeana Data Standards




                           13
Europeana Data Standards
• Unified metadata
     • ESE – Europeana Semantic Elements
               • DublinCore & Europeana fields
               • 36 fields: flat, limited ability semantic links
                         dc:title                      europeana:provider
                         dc:creator                    europeana:dataProvider
                         dc:subject                    europeana:rights
                         dc:description                europeana:type
                         dc:publisher                  europeana:isShownBy and/or europeana:isShownAt
                         …                             …


     • EDM - Europeana Data Model




      Basic data model                                                             Two contextual classes


                                                     14
Linking Open Data

• Linking Open Data (LOD) W3C SWEO Community project
  http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData




• Initiative for publishing “linked data” – a set of principles,
  which allows browsing of RDF data, spread across different
  servers, in the way HTML is browsed
Semantic Repositories: Major Characteristics

• Easy integration of multiple data-sources
   – once the schemata of these sources is semantically aligned, the
     inference capabilities of the engine supports the interlinking and
     combination of the facts from the different sources;

• Easy querying against rich or diverse data schemata
   – inference is applied to match semantics of the query to the
     semantics of the data, regardless of the vocabulary and the data
     modeling patterns used for encoding of the data;
Physical data representation: RDF vs. RDBMS

                                                                       Statement
                     Person                              Subject      Predicate    Object
         ID     Name              Gender                 myo:Person   rdf:type     rdfs:Class
         1      Maria P.          F                      myo:gender   rdfs:type    rdfs:Property
         2      Ivan Jr.          M                      myo:parent   rdfs:range   myo:Person
         3      …                                        myo:spouse   rdfs:range   myo:Person
                                                         myd:Maria    rdf:type     myo:Person
                                                         myd:Maria    rdf:label    “Maria P.”
                                                         myd:Maria    myo:gender   “F”
    Parent                                Spouse
                                                         myd:Maria    rdf:label    “Ivan Jr.”
ParID   ChiID              S1ID       S2ID   From   To
                                                         myd:Ivan     myo:gender   “M”
1       2                  1          3
                                                         myd:Maria    myo:parent   Myd:Ivan
…                          …
                                                         myd:Maria    myo:spouse   myd:John
                                                         …
OWLIM


OWLIM is a family of semantic repositories, or RDF database management systems, with
the following characteristics:

•   native RDF engines, implemented in Java
•   delivering full performance through both Sesame and Jena
•   robust support for the semantics of RDFS, OWL 2 RL and OWL 2 QL
•   best scalability, loading and query evaluation performance

OWLIM is used in a large number of research projects and software tools. Independent
opinions justifying our bold claims are referred to here.

The presentation Lowering the Cost of Data and Content Integration and enabling Searching
and Querying of Billions of Facts on the Web presents the key features of OWLIM alongside
an introduction to the benefits of using RDF databases for data integration and a discussion
on linked data management.
OWLIM Replication Cluster

• Distribution through data replication is used to ensure:
   – Better handling of concurrent user requests
   – Failover support
• How does it work?
   – Every user request is pushed in a transaction queue
   – Each data write request is are multiplexed to all repository instances
   – Each read request is dispatched to one of the
     instance only
   – To ensure load-balancing, each
     read requests is send to the
     instance with smallest execution
     queue at this point in time
Europeana Data in EDM

      • 268GB of data
      • cultural objects data and linkages to other datasets


      Dataset size:
                   NumberOfStatements=3,899,531,218
                   NumberOfExplicitStatements=993,332,911
                   NumberOfEntities=264,523,842

      EDM model
      SKOS




Sofia, 13 March 2012                       20
Prototype available at




http://europeana.ontotext.com


to become
             -> http://data.europeana.eu
Upcoming …


Europeana Creative - PSP project
      lead by the Austrian National Library
      26 partners
      Objective: experimenting with re-use of cultural
                   content for creativity
      Project: Europeana re-use framework and 6 pilots in
               different domains such as education,
               tourism, etc.
     Ontotext: participate in the infrastructure for re-use with
                 the semantic repository OWLIM, and data
                 integration



Sofia, 13 March 2012             22
Upcoming …




  Round table organized by the Ministry of Culture and Ontotext
  where Europeana officials will explain the organizational
  principles of Europeana data collection and aggregation and
  will share experience with setting up national aggregator’s to
  be held in November 2012 in Sofia




Sofia, 13 March 2012             23
Thank you for your attention!




mariana.damova@ontotext.com




            24

More Related Content

Similar to Europeana datainowlim oct2012

Mapping the European(a) metadata landscape
Mapping the European(a) metadata landscapeMapping the European(a) metadata landscape
Mapping the European(a) metadata landscape
Sally Chambers
 
Europeana bergen may2010_dovwiner
Europeana bergen may2010_dovwinerEuropeana bergen may2010_dovwiner
Europeana bergen may2010_dovwiner
Dov Winer
 
LDBC 19 November 2013
LDBC 19 November 2013  LDBC 19 November 2013
LDBC 19 November 2013
Europeana
 
Linking Data the ALM way (Boris Zetterlund)
Linking Data the ALM way (Boris Zetterlund)Linking Data the ALM way (Boris Zetterlund)
Linking Data the ALM way (Boris Zetterlund)
Národní technická knihovna (NTK)
 
20140521 sem-tech-biz-guest-lecture
20140521 sem-tech-biz-guest-lecture20140521 sem-tech-biz-guest-lecture
20140521 sem-tech-biz-guest-lecture
Vladimir Alexiev, PhD, PMP
 
EuropeanaLocal, its contribution
EuropeanaLocal, its contributionEuropeanaLocal, its contribution
EuropeanaLocal, its contribution
EuropeanaLocal Project
 
in Europeana and the projects
in Europeana and the projectsin Europeana and the projects
in Europeana and the projects
EuropeanaConnect
 
Europeana and open data
Europeana and open dataEuropeana and open data
Europeana and open data
RobinaClayphan
 
Linked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approachLinked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approach
Valentine Charles
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
Jon Voss
 
Linking data for Europeana
Linking data for EuropeanaLinking data for Europeana
Linking data for Europeana
Antoine Isaac
 
Museums and Europeana
Museums and EuropeanaMuseums and Europeana
Museums and Europeana
Museums Computer Group
 
Digitised Content: How we Make It Relevant to Researchers, Teachers and Students
Digitised Content: How we Make It Relevant to Researchers, Teachers and StudentsDigitised Content: How we Make It Relevant to Researchers, Teachers and Students
Digitised Content: How we Make It Relevant to Researchers, Teachers and Students
LIBER Europe
 
Multilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at EuropeanaMultilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at Europeana
Antoine Isaac
 
Semantic web in Cultural Heritage and Archaeology
Semantic web in Cultural Heritage and ArchaeologySemantic web in Cultural Heritage and Archaeology
Semantic web in Cultural Heritage and Archaeology
Monika Solanki
 
Europeana update, Aggregation, Collections and Project Shift - Strategies and...
Europeana update, Aggregation, Collections and Project Shift - Strategies and...Europeana update, Aggregation, Collections and Project Shift - Strategies and...
Europeana update, Aggregation, Collections and Project Shift - Strategies and...
Europeana
 
Ontologies and thesauri. How to answer complex questions using interoperability?
Ontologies and thesauri. How to answer complex questions using interoperability?Ontologies and thesauri. How to answer complex questions using interoperability?
Ontologies and thesauri. How to answer complex questions using interoperability?
Equipex Biblissima
 
Short Presentation on Europeana Cloud at Europeana AGM 2013
Short Presentation on Europeana Cloud at Europeana AGM 2013Short Presentation on Europeana Cloud at Europeana AGM 2013
Short Presentation on Europeana Cloud at Europeana AGM 2013
TU Delft, Netherlands
 
Breaking the Waves - Alastair Dunning
Breaking the Waves - Alastair DunningBreaking the Waves - Alastair Dunning
Breaking the Waves - Alastair Dunning
Jisc
 
Climbing the Tower of Babel: Challenges and Opportunities in Multilingual Dat...
Climbing the Tower of Babel: Challenges and Opportunities in Multilingual Dat...Climbing the Tower of Babel: Challenges and Opportunities in Multilingual Dat...
Climbing the Tower of Babel: Challenges and Opportunities in Multilingual Dat...
cneudecker
 

Similar to Europeana datainowlim oct2012 (20)

Mapping the European(a) metadata landscape
Mapping the European(a) metadata landscapeMapping the European(a) metadata landscape
Mapping the European(a) metadata landscape
 
Europeana bergen may2010_dovwiner
Europeana bergen may2010_dovwinerEuropeana bergen may2010_dovwiner
Europeana bergen may2010_dovwiner
 
LDBC 19 November 2013
LDBC 19 November 2013  LDBC 19 November 2013
LDBC 19 November 2013
 
Linking Data the ALM way (Boris Zetterlund)
Linking Data the ALM way (Boris Zetterlund)Linking Data the ALM way (Boris Zetterlund)
Linking Data the ALM way (Boris Zetterlund)
 
20140521 sem-tech-biz-guest-lecture
20140521 sem-tech-biz-guest-lecture20140521 sem-tech-biz-guest-lecture
20140521 sem-tech-biz-guest-lecture
 
EuropeanaLocal, its contribution
EuropeanaLocal, its contributionEuropeanaLocal, its contribution
EuropeanaLocal, its contribution
 
in Europeana and the projects
in Europeana and the projectsin Europeana and the projects
in Europeana and the projects
 
Europeana and open data
Europeana and open dataEuropeana and open data
Europeana and open data
 
Linked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approachLinked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approach
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
 
Linking data for Europeana
Linking data for EuropeanaLinking data for Europeana
Linking data for Europeana
 
Museums and Europeana
Museums and EuropeanaMuseums and Europeana
Museums and Europeana
 
Digitised Content: How we Make It Relevant to Researchers, Teachers and Students
Digitised Content: How we Make It Relevant to Researchers, Teachers and StudentsDigitised Content: How we Make It Relevant to Researchers, Teachers and Students
Digitised Content: How we Make It Relevant to Researchers, Teachers and Students
 
Multilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at EuropeanaMultilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at Europeana
 
Semantic web in Cultural Heritage and Archaeology
Semantic web in Cultural Heritage and ArchaeologySemantic web in Cultural Heritage and Archaeology
Semantic web in Cultural Heritage and Archaeology
 
Europeana update, Aggregation, Collections and Project Shift - Strategies and...
Europeana update, Aggregation, Collections and Project Shift - Strategies and...Europeana update, Aggregation, Collections and Project Shift - Strategies and...
Europeana update, Aggregation, Collections and Project Shift - Strategies and...
 
Ontologies and thesauri. How to answer complex questions using interoperability?
Ontologies and thesauri. How to answer complex questions using interoperability?Ontologies and thesauri. How to answer complex questions using interoperability?
Ontologies and thesauri. How to answer complex questions using interoperability?
 
Short Presentation on Europeana Cloud at Europeana AGM 2013
Short Presentation on Europeana Cloud at Europeana AGM 2013Short Presentation on Europeana Cloud at Europeana AGM 2013
Short Presentation on Europeana Cloud at Europeana AGM 2013
 
Breaking the Waves - Alastair Dunning
Breaking the Waves - Alastair DunningBreaking the Waves - Alastair Dunning
Breaking the Waves - Alastair Dunning
 
Climbing the Tower of Babel: Challenges and Opportunities in Multilingual Dat...
Climbing the Tower of Babel: Challenges and Opportunities in Multilingual Dat...Climbing the Tower of Babel: Challenges and Opportunities in Multilingual Dat...
Climbing the Tower of Babel: Challenges and Opportunities in Multilingual Dat...
 

More from Mariana Damova, Ph.D

ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамоваИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
Mariana Damova, Ph.D
 
Geography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic MemoryGeography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic Memory
Mariana Damova, Ph.D
 
Startup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - IntroductionStartup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - Introduction
Mariana Damova, Ph.D
 
IndustryInform Service of Mozaika
IndustryInform Service of MozaikaIndustryInform Service of Mozaika
IndustryInform Service of Mozaika
Mariana Damova, Ph.D
 
Семантични технологии основи
Семантични технологии   основи Семантични технологии   основи
Семантични технологии основи
Mariana Damova, Ph.D
 
IndustryInform Demo March 2016
IndustryInform Demo March 2016IndustryInform Demo March 2016
IndustryInform Demo March 2016
Mariana Damova, Ph.D
 
Startup Europe Week Sofia introduction
Startup Europe Week Sofia introductionStartup Europe Week Sofia introduction
Startup Europe Week Sofia introduction
Mariana Damova, Ph.D
 
Mozaika-Jan2016a
Mozaika-Jan2016aMozaika-Jan2016a
Mozaika-Jan2016a
Mariana Damova, Ph.D
 
Concordia july2015
Concordia july2015Concordia july2015
Concordia july2015
Mariana Damova, Ph.D
 
Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Mariana Damova, Ph.D
 
Communication channels for the european single digital market
Communication channels for the european single digital marketCommunication channels for the european single digital market
Communication channels for the european single digital market
Mariana Damova, Ph.D
 
Bulgariana europeana27112013 ним
Bulgariana europeana27112013 нимBulgariana europeana27112013 ним
Bulgariana europeana27112013 ним
Mariana Damova, Ph.D
 
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
Mariana Damova, Ph.D
 
Mozaika june2014
Mozaika june2014Mozaika june2014
Mozaika june2014
Mariana Damova, Ph.D
 
Europeana in Bulgaria
Europeana in BulgariaEuropeana in Bulgaria
Europeana in Bulgaria
Mariana Damova, Ph.D
 
Bulgariana europeana02112013
Bulgariana europeana02112013Bulgariana europeana02112013
Bulgariana europeana02112013
Mariana Damova, Ph.D
 
проектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологиипроектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологии
Mariana Damova, Ph.D
 
семантични технологии основи
семантични технологии   основисемантични технологии   основи
семантични технологии основи
Mariana Damova, Ph.D
 
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Mariana Damova, Ph.D
 
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Mariana Damova, Ph.D
 

More from Mariana Damova, Ph.D (20)

ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамоваИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
 
Geography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic MemoryGeography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic Memory
 
Startup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - IntroductionStartup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - Introduction
 
IndustryInform Service of Mozaika
IndustryInform Service of MozaikaIndustryInform Service of Mozaika
IndustryInform Service of Mozaika
 
Семантични технологии основи
Семантични технологии   основи Семантични технологии   основи
Семантични технологии основи
 
IndustryInform Demo March 2016
IndustryInform Demo March 2016IndustryInform Demo March 2016
IndustryInform Demo March 2016
 
Startup Europe Week Sofia introduction
Startup Europe Week Sofia introductionStartup Europe Week Sofia introduction
Startup Europe Week Sofia introduction
 
Mozaika-Jan2016a
Mozaika-Jan2016aMozaika-Jan2016a
Mozaika-Jan2016a
 
Concordia july2015
Concordia july2015Concordia july2015
Concordia july2015
 
Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23
 
Communication channels for the european single digital market
Communication channels for the european single digital marketCommunication channels for the european single digital market
Communication channels for the european single digital market
 
Bulgariana europeana27112013 ним
Bulgariana europeana27112013 нимBulgariana europeana27112013 ним
Bulgariana europeana27112013 ним
 
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
 
Mozaika june2014
Mozaika june2014Mozaika june2014
Mozaika june2014
 
Europeana in Bulgaria
Europeana in BulgariaEuropeana in Bulgaria
Europeana in Bulgaria
 
Bulgariana europeana02112013
Bulgariana europeana02112013Bulgariana europeana02112013
Bulgariana europeana02112013
 
проектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологиипроектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологии
 
семантични технологии основи
семантични технологии   основисемантични технологии   основи
семантични технологии основи
 
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
 
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
 

Recently uploaded

一比一原版(AUT毕业证)奥克兰理工大学毕业证如何办理
一比一原版(AUT毕业证)奥克兰理工大学毕业证如何办理一比一原版(AUT毕业证)奥克兰理工大学毕业证如何办理
一比一原版(AUT毕业证)奥克兰理工大学毕业证如何办理
etycev
 
Party Photo Booth Prop Trends to Unleash Your Inner Style
Party Photo Booth Prop Trends to Unleash Your Inner StyleParty Photo Booth Prop Trends to Unleash Your Inner Style
Party Photo Booth Prop Trends to Unleash Your Inner Style
Birthday Galore
 
Gladiator 2 (Action, Adventure, Drama Movie)
Gladiator 2 (Action, Adventure, Drama Movie)Gladiator 2 (Action, Adventure, Drama Movie)
Gladiator 2 (Action, Adventure, Drama Movie)
roohifaiza
 
VR Economy
VR EconomyVR Economy
Clyde the cat and Space Poems by Basak Serin
Clyde the cat and Space Poems by Basak SerinClyde the cat and Space Poems by Basak Serin
Clyde the cat and Space Poems by Basak Serin
Basak24
 
The Evolution and Impact of Tom Cruise Long Hair
The Evolution and Impact of Tom Cruise Long HairThe Evolution and Impact of Tom Cruise Long Hair
The Evolution and Impact of Tom Cruise Long Hair
greendigital
 
欧洲杯赌球-欧洲杯赌球竞猜官网-欧洲杯赌球竞猜网站|【​网址​🎉ac10.net🎉​】
欧洲杯赌球-欧洲杯赌球竞猜官网-欧洲杯赌球竞猜网站|【​网址​🎉ac10.net🎉​】欧洲杯赌球-欧洲杯赌球竞猜官网-欧洲杯赌球竞猜网站|【​网址​🎉ac10.net🎉​】
欧洲杯赌球-欧洲杯赌球竞猜官网-欧洲杯赌球竞猜网站|【​网址​🎉ac10.net🎉​】
juliancopeman444
 
SERV - Fun Things To Do In Overland Park
SERV - Fun Things To Do In Overland ParkSERV - Fun Things To Do In Overland Park
SERV - Fun Things To Do In Overland Park
SERV
 
HD Video Player All Format - 4k & live stream
HD Video Player All Format - 4k & live streamHD Video Player All Format - 4k & live stream
HD Video Player All Format - 4k & live stream
HD Video Player
 
一比一原版(mcmaste毕业证书)加拿大麦克马斯特大学毕业证如何办理
一比一原版(mcmaste毕业证书)加拿大麦克马斯特大学毕业证如何办理一比一原版(mcmaste毕业证书)加拿大麦克马斯特大学毕业证如何办理
一比一原版(mcmaste毕业证书)加拿大麦克马斯特大学毕业证如何办理
abqenm
 
Taylor Swift: Conquering Fame, Feuds, and Unmatched Success | CIO Women Magazine
Taylor Swift: Conquering Fame, Feuds, and Unmatched Success | CIO Women MagazineTaylor Swift: Conquering Fame, Feuds, and Unmatched Success | CIO Women Magazine
Taylor Swift: Conquering Fame, Feuds, and Unmatched Success | CIO Women Magazine
CIOWomenMagazine
 
The Enigma of the Midnight Canvas, In the heart of Paris
The Enigma of the Midnight Canvas, In the heart of ParisThe Enigma of the Midnight Canvas, In the heart of Paris
The Enigma of the Midnight Canvas, In the heart of Paris
John Emmett
 
Sara Saffari: Turning Underweight into Fitness Success at 23
Sara Saffari: Turning Underweight into Fitness Success at 23Sara Saffari: Turning Underweight into Fitness Success at 23
Sara Saffari: Turning Underweight into Fitness Success at 23
get joys
 
欧洲杯足彩-欧洲杯足彩下注网站-欧洲杯足彩投注网站|【​网址​🎉ac99.net🎉​】
欧洲杯足彩-欧洲杯足彩下注网站-欧洲杯足彩投注网站|【​网址​🎉ac99.net🎉​】欧洲杯足彩-欧洲杯足彩下注网站-欧洲杯足彩投注网站|【​网址​🎉ac99.net🎉​】
欧洲杯足彩-欧洲杯足彩下注网站-欧洲杯足彩投注网站|【​网址​🎉ac99.net🎉​】
humbertogarsia692
 
How OTT Players Are Transforming Our TV Viewing Experience.pdf
How OTT Players Are Transforming Our TV Viewing Experience.pdfHow OTT Players Are Transforming Our TV Viewing Experience.pdf
How OTT Players Are Transforming Our TV Viewing Experience.pdf
Genny Knight
 
The Midnight Sculptor.pdf writer by Ali alsiad
The Midnight Sculptor.pdf writer by Ali alsiadThe Midnight Sculptor.pdf writer by Ali alsiad
The Midnight Sculptor.pdf writer by Ali alsiad
ali345alghlay
 
一比一原版(uw毕业证书)美国威斯康星大学麦迪逊分校毕业证如何办理
一比一原版(uw毕业证书)美国威斯康星大学麦迪逊分校毕业证如何办理一比一原版(uw毕业证书)美国威斯康星大学麦迪逊分校毕业证如何办理
一比一原版(uw毕业证书)美国威斯康星大学麦迪逊分校毕业证如何办理
sbewyav
 
ℂall Girls Lucknow (india) +91-7426014248 Lucknow ℂall Girls
ℂall Girls Lucknow (india) +91-7426014248 Lucknow ℂall Girlsℂall Girls Lucknow (india) +91-7426014248 Lucknow ℂall Girls
ℂall Girls Lucknow (india) +91-7426014248 Lucknow ℂall Girls
meherkumarescorts
 
Leonardo DiCaprio Super Bowl: Hollywood Meets America’s Favorite Game
Leonardo DiCaprio Super Bowl: Hollywood Meets America’s Favorite GameLeonardo DiCaprio Super Bowl: Hollywood Meets America’s Favorite Game
Leonardo DiCaprio Super Bowl: Hollywood Meets America’s Favorite Game
greendigital
 
Audio Video equipment supplier in Gurgaon
Audio Video equipment supplier in GurgaonAudio Video equipment supplier in Gurgaon
Audio Video equipment supplier in Gurgaon
demoacsindia
 

Recently uploaded (20)

一比一原版(AUT毕业证)奥克兰理工大学毕业证如何办理
一比一原版(AUT毕业证)奥克兰理工大学毕业证如何办理一比一原版(AUT毕业证)奥克兰理工大学毕业证如何办理
一比一原版(AUT毕业证)奥克兰理工大学毕业证如何办理
 
Party Photo Booth Prop Trends to Unleash Your Inner Style
Party Photo Booth Prop Trends to Unleash Your Inner StyleParty Photo Booth Prop Trends to Unleash Your Inner Style
Party Photo Booth Prop Trends to Unleash Your Inner Style
 
Gladiator 2 (Action, Adventure, Drama Movie)
Gladiator 2 (Action, Adventure, Drama Movie)Gladiator 2 (Action, Adventure, Drama Movie)
Gladiator 2 (Action, Adventure, Drama Movie)
 
VR Economy
VR EconomyVR Economy
VR Economy
 
Clyde the cat and Space Poems by Basak Serin
Clyde the cat and Space Poems by Basak SerinClyde the cat and Space Poems by Basak Serin
Clyde the cat and Space Poems by Basak Serin
 
The Evolution and Impact of Tom Cruise Long Hair
The Evolution and Impact of Tom Cruise Long HairThe Evolution and Impact of Tom Cruise Long Hair
The Evolution and Impact of Tom Cruise Long Hair
 
欧洲杯赌球-欧洲杯赌球竞猜官网-欧洲杯赌球竞猜网站|【​网址​🎉ac10.net🎉​】
欧洲杯赌球-欧洲杯赌球竞猜官网-欧洲杯赌球竞猜网站|【​网址​🎉ac10.net🎉​】欧洲杯赌球-欧洲杯赌球竞猜官网-欧洲杯赌球竞猜网站|【​网址​🎉ac10.net🎉​】
欧洲杯赌球-欧洲杯赌球竞猜官网-欧洲杯赌球竞猜网站|【​网址​🎉ac10.net🎉​】
 
SERV - Fun Things To Do In Overland Park
SERV - Fun Things To Do In Overland ParkSERV - Fun Things To Do In Overland Park
SERV - Fun Things To Do In Overland Park
 
HD Video Player All Format - 4k & live stream
HD Video Player All Format - 4k & live streamHD Video Player All Format - 4k & live stream
HD Video Player All Format - 4k & live stream
 
一比一原版(mcmaste毕业证书)加拿大麦克马斯特大学毕业证如何办理
一比一原版(mcmaste毕业证书)加拿大麦克马斯特大学毕业证如何办理一比一原版(mcmaste毕业证书)加拿大麦克马斯特大学毕业证如何办理
一比一原版(mcmaste毕业证书)加拿大麦克马斯特大学毕业证如何办理
 
Taylor Swift: Conquering Fame, Feuds, and Unmatched Success | CIO Women Magazine
Taylor Swift: Conquering Fame, Feuds, and Unmatched Success | CIO Women MagazineTaylor Swift: Conquering Fame, Feuds, and Unmatched Success | CIO Women Magazine
Taylor Swift: Conquering Fame, Feuds, and Unmatched Success | CIO Women Magazine
 
The Enigma of the Midnight Canvas, In the heart of Paris
The Enigma of the Midnight Canvas, In the heart of ParisThe Enigma of the Midnight Canvas, In the heart of Paris
The Enigma of the Midnight Canvas, In the heart of Paris
 
Sara Saffari: Turning Underweight into Fitness Success at 23
Sara Saffari: Turning Underweight into Fitness Success at 23Sara Saffari: Turning Underweight into Fitness Success at 23
Sara Saffari: Turning Underweight into Fitness Success at 23
 
欧洲杯足彩-欧洲杯足彩下注网站-欧洲杯足彩投注网站|【​网址​🎉ac99.net🎉​】
欧洲杯足彩-欧洲杯足彩下注网站-欧洲杯足彩投注网站|【​网址​🎉ac99.net🎉​】欧洲杯足彩-欧洲杯足彩下注网站-欧洲杯足彩投注网站|【​网址​🎉ac99.net🎉​】
欧洲杯足彩-欧洲杯足彩下注网站-欧洲杯足彩投注网站|【​网址​🎉ac99.net🎉​】
 
How OTT Players Are Transforming Our TV Viewing Experience.pdf
How OTT Players Are Transforming Our TV Viewing Experience.pdfHow OTT Players Are Transforming Our TV Viewing Experience.pdf
How OTT Players Are Transforming Our TV Viewing Experience.pdf
 
The Midnight Sculptor.pdf writer by Ali alsiad
The Midnight Sculptor.pdf writer by Ali alsiadThe Midnight Sculptor.pdf writer by Ali alsiad
The Midnight Sculptor.pdf writer by Ali alsiad
 
一比一原版(uw毕业证书)美国威斯康星大学麦迪逊分校毕业证如何办理
一比一原版(uw毕业证书)美国威斯康星大学麦迪逊分校毕业证如何办理一比一原版(uw毕业证书)美国威斯康星大学麦迪逊分校毕业证如何办理
一比一原版(uw毕业证书)美国威斯康星大学麦迪逊分校毕业证如何办理
 
ℂall Girls Lucknow (india) +91-7426014248 Lucknow ℂall Girls
ℂall Girls Lucknow (india) +91-7426014248 Lucknow ℂall Girlsℂall Girls Lucknow (india) +91-7426014248 Lucknow ℂall Girls
ℂall Girls Lucknow (india) +91-7426014248 Lucknow ℂall Girls
 
Leonardo DiCaprio Super Bowl: Hollywood Meets America’s Favorite Game
Leonardo DiCaprio Super Bowl: Hollywood Meets America’s Favorite GameLeonardo DiCaprio Super Bowl: Hollywood Meets America’s Favorite Game
Leonardo DiCaprio Super Bowl: Hollywood Meets America’s Favorite Game
 
Audio Video equipment supplier in Gurgaon
Audio Video equipment supplier in GurgaonAudio Video equipment supplier in Gurgaon
Audio Video equipment supplier in Gurgaon
 

Europeana datainowlim oct2012

  • 1. Europeana Semantic Data in OWLIM Mariana Damova, PhD The Bulgarian Participation in Europeana. Cooperation and Development Varna October 2012
  • 2. Ontotext – Top-5 provider of core Semantic Technology – Established in year 2000; offices in Bulgaria, UK, USA – Active both in research and commercial projects (FP7 funding for 10 years) • 360° semantic technology – unique portfolio: – Semantic Databases: high-performance RDF DBMS, scalable reasoning – Semantic Search: text-mining (IE), metadata generation, Information Retrieval (IR) – Web Mining: focused crawling, screen scraping, data fusion – Linked Data Management and Data Integration Good recognition in the SemTech community – Ontotext pages are ranked #1 for “semantic annotation” and “semantic repository” at GYM, #3 for “linked data management” at Google Several joint ventures and subsidiaries – Innovantage: leading online recruitment intelligence provider in UK
  • 3. Ontotext Clients (selected) British Broadcasting Corporation (BBC) – Run its World Cup 2010 sites on top of OWLIM – Since Mar’12 BBC Sports – 2012 Olympics sections are driven by OWLIM and a Concept Extraction service developed by Ontotext Press Association (UK) – Analysis of Sports news – Concept extraction – Linked data generation Top-3 USA media (not allowed to name) The National Archives (UK) contracted Ontotext to implement semantic KB and semantic search for the Government Web Archive British Museum (UK) Ontotext leads the development of Phase 3 of ResearchSpace project on collaborative research in cultural heritage; British Museum’s public SPARQL end-point is powered by OWLIM de Bibliothek (Holland) aggregation of data from 150 library databases
  • 4. Outline • Europeana • bulgariana.eu • Collections • Europeana Data Standards • Metadata mapping, conversion and ingestion • Digital repository • Conclusion 4
  • 5. Europeana http://www.europeana.eu • Launched in 2008 • Project funded by the European Commission • Based in the National Library of the Netherlands, the Koninklijke Bibliotheek • Goal to make Europe's cultural and scientific heritage accessible to the public • Over 180 heritage and knowledge organzations and IT experts across Europe • Europeana Collection: 5M objects in 2009, 10M in 2010, 20M at present • Endorsed by the European parliament in 2010 • 2011 "Comité des Sages" makes recommendations about Europeana to put online the collections held by Europe's libraries, archives, museums and audiovisual archives – vast numbers of books and periodicals (there are some 2.5bn items in Europe's libraries alone), and millions of hours of film and video covering the whole of Europe's diverse history and culture. 5
  • 6. Europeana • Collection types: Image, Sound, Video, Text • Present Europeana Architecture Europeana Solr ingestion Portal DB visitor Provider system context back office • Europeana data standards • Europeana aggregators (by country or cultural heritage sector) • Process of ingesting content (4-6 weeks) 6
  • 8. bulgariana.eu • Main Purpose: BG aggregator for Europeana • Secondary Purpose: networking and special interest group for BG Cultural Heritage 8
  • 10. Collections Golden Pages from the Bulgarian Renaissance Златни страници от Българското Възраждане unique manuscripts of Bulgarian folk songs collected in 19th century by Miladinov Brothers, renowned Bulgarian Folklorists published in 2008 by D-r Luchia Antonova, Institute of Bulgarian Language, Bulgarian Academy of Sciences МАРКО КРАЛЕВИКИ БОЛЕН СЕ КАИТ И СЕ ИСПОВЕДВИТ Поболил се Марко Кралевике, що си лежал токму три години, от нищо се иляч (1) не на’ож’ал. И му рече негва стара майќа: “Ай ти, Марко, ай ти, синко милий; не си болен, синко, от господа, тук си болен, синко, от гре’о’и, да ти викна попой (2), ду’овници, лепо да се синко исповедиш, да си кажиш твоите гре’о’и!” …. 10
  • 11. Collections Pra-historic and Thracian Civilizations Праисторическа и Тракийска цивилизация Unpublished Thracian archeological objects collected by Prof. Valeria Fol, Center of Thracology at the Institute for Balkan Studies at the Bulgarian Academy of Sciences 11
  • 12. Links • http://bulgariana.eu • http://bulgarianheritage.bulgariana.eu • http://www.europeana.eu – europeana_collectionName: 20215* – for the individual sets use europeana_collectionName: 2021501* (or 2021502*) • http://britishmuseum.ontotext.com 12
  • 14. Europeana Data Standards • Unified metadata • ESE – Europeana Semantic Elements • DublinCore & Europeana fields • 36 fields: flat, limited ability semantic links dc:title europeana:provider dc:creator europeana:dataProvider dc:subject europeana:rights dc:description europeana:type dc:publisher europeana:isShownBy and/or europeana:isShownAt … … • EDM - Europeana Data Model Basic data model Two contextual classes 14
  • 15. Linking Open Data • Linking Open Data (LOD) W3C SWEO Community project http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData • Initiative for publishing “linked data” – a set of principles, which allows browsing of RDF data, spread across different servers, in the way HTML is browsed
  • 16. Semantic Repositories: Major Characteristics • Easy integration of multiple data-sources – once the schemata of these sources is semantically aligned, the inference capabilities of the engine supports the interlinking and combination of the facts from the different sources; • Easy querying against rich or diverse data schemata – inference is applied to match semantics of the query to the semantics of the data, regardless of the vocabulary and the data modeling patterns used for encoding of the data;
  • 17. Physical data representation: RDF vs. RDBMS Statement Person Subject Predicate Object ID Name Gender myo:Person rdf:type rdfs:Class 1 Maria P. F myo:gender rdfs:type rdfs:Property 2 Ivan Jr. M myo:parent rdfs:range myo:Person 3 … myo:spouse rdfs:range myo:Person myd:Maria rdf:type myo:Person myd:Maria rdf:label “Maria P.” myd:Maria myo:gender “F” Parent Spouse myd:Maria rdf:label “Ivan Jr.” ParID ChiID S1ID S2ID From To myd:Ivan myo:gender “M” 1 2 1 3 myd:Maria myo:parent Myd:Ivan … … myd:Maria myo:spouse myd:John …
  • 18. OWLIM OWLIM is a family of semantic repositories, or RDF database management systems, with the following characteristics: • native RDF engines, implemented in Java • delivering full performance through both Sesame and Jena • robust support for the semantics of RDFS, OWL 2 RL and OWL 2 QL • best scalability, loading and query evaluation performance OWLIM is used in a large number of research projects and software tools. Independent opinions justifying our bold claims are referred to here. The presentation Lowering the Cost of Data and Content Integration and enabling Searching and Querying of Billions of Facts on the Web presents the key features of OWLIM alongside an introduction to the benefits of using RDF databases for data integration and a discussion on linked data management.
  • 19. OWLIM Replication Cluster • Distribution through data replication is used to ensure: – Better handling of concurrent user requests – Failover support • How does it work? – Every user request is pushed in a transaction queue – Each data write request is are multiplexed to all repository instances – Each read request is dispatched to one of the instance only – To ensure load-balancing, each read requests is send to the instance with smallest execution queue at this point in time
  • 20. Europeana Data in EDM • 268GB of data • cultural objects data and linkages to other datasets Dataset size: NumberOfStatements=3,899,531,218 NumberOfExplicitStatements=993,332,911 NumberOfEntities=264,523,842 EDM model SKOS Sofia, 13 March 2012 20
  • 21. Prototype available at http://europeana.ontotext.com to become -> http://data.europeana.eu
  • 22. Upcoming … Europeana Creative - PSP project lead by the Austrian National Library 26 partners Objective: experimenting with re-use of cultural content for creativity Project: Europeana re-use framework and 6 pilots in different domains such as education, tourism, etc. Ontotext: participate in the infrastructure for re-use with the semantic repository OWLIM, and data integration Sofia, 13 March 2012 22
  • 23. Upcoming … Round table organized by the Ministry of Culture and Ontotext where Europeana officials will explain the organizational principles of Europeana data collection and aggregation and will share experience with setting up national aggregator’s to be held in November 2012 in Sofia Sofia, 13 March 2012 23
  • 24. Thank you for your attention! mariana.damova@ontotext.com 24