Over the past five years, the amount of contextual entities in Europeana’s metadata has grown considerably. These entities are provided as references as part of the metadata delivered by Europeana or selected by Europeana semantic automatic enrichment. Pursuing their efforts towards the creation of a semantic network around cultural heritage objects, Europeana and its partners providers and aggregators are investigating ways to better exchange vocabulary data and manage co-references/alignments between vocabularies. In this presentation we will explore the potential of tools such as OpenSkos and Cultuurlink for supporting the building of networked references.
Presented at the 6th DBpedia Community Meeting in The Hague 2016, see http://wiki.dbpedia.org/meetings/TheHague2016
Semantic Interoperability at Europeana - MultilingualDSIs2018Antoine Isaac
Presentation on general interoperability and multilinguality issues at Europeana, for a workshop on Semantic Interoperability for Multilingual DSIs (https://ec.europa.eu/cefdigital/wiki/display/ETCOMMUNITY/Semantic+Interoperability+for+Multilingual+DSIs)
Slides for a presentation on recent work with Web Archives at the Oxford Internet Institute (http://www.oii.ox.ac.uk/) given at WIRE2014 (http://wp.comminfo.rutgers.edu/nsfia/schedule/)
Digital Cultural Heritage: Experiences from British LibraryNora McGregor
Slides from seminar on Digital Cultural Heritage given to UCL Institute of Sustainable Heritage's two programmes: the MSc Sustainable Heritage and the MRes Science and Engineering in Arts, Heritage and Archaeology.
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015Antoine Isaac
"Wikidata, a target for Europeana's semantic strategy"/ Presentation at the GLAM-Wiki conference with Valentine Charles, Hugo Manguinhas, Antoine Isaac, Vladimir Alexiev http://nl.wikimedia.org/wiki/GLAM-WIKI_2015/
Open Science, Open Data: towards a new transparent and reproducible ecosystemLIBER Europe
Presented at the Preforma Open Source Workshop 8 April 2016
As a library membership organization, LIBER works on addressing Open Science barriers. Standardisation of file formats can really help in overcoming some of these barriers: it enables us to process and preserve data in a controlled way, it helps ensure that outputs are really open and accessible in the long term and it improves interoperability of new tools and services. Making sure data is stored in a controlled way and can be (re) used today and in the future is an important element in Open Science. We see this as not only a technical challenge but also a social one: awareness, trust and community building is needed in order to ensure uptake of these standards. Libraries therefore have a valuable role to play in the development of good research data management throughout all phases of the Open Data lifecycle.
Increasing Visibility of Cultural Heritage Objects: A Case of Turkish Conten...locloud
Presentation given by Bülent Yılmaz, Özgür Külcü, Tolga Çakmak
Hacettepe University Department of Information Management. Turkey
LoCloud Conference
Sharing local cultural heritage online with LoCloud services
Amersfoort, Netherlands
5 February 2016
EuropeanaTech update - Europeana AGM 2015Antoine Isaac
Update on the EuropeanaTech community activities. Presentation with Greg Markus, Sound and Vision. Europeana general Assembly Meeting 2015, November 2-4 2015. http://pro.europeana.eu/event/europeana-annual-general-meeting-2015
Europeana - American Art Collaborative LOD MeetingAntoine Isaac
Presentation at a seminar on linked data and art museums at the Smithsonian Institute, April 29 2013.
Other presentations at http://lodlam.net/2013/05/07/linked-open-data-in-art/
Europeana as a Linked Data (Quality) caseAntoine Isaac
Presentation for the 3rd Workshop on Humanities in the Semantic Web (WHiSe), co-located with the 15th Extended Semantic Web Conference (ESWC 2020)
June 2, 2020, online
http://whise.cc/2020/
Linux Performance Analysis: New Tools and Old SecretsBrendan Gregg
Talk for USENIX/LISA2014 by Brendan Gregg, Netflix. At Netflix performance is crucial, and we use many high to low level tools to analyze our stack in different ways. In this talk, I will introduce new system observability tools we are using at Netflix, which I've ported from my DTraceToolkit, and are intended for our Linux 3.2 cloud instances. These show that Linux can do more than you may think, by using creative hacks and workarounds with existing kernel features (ftrace, perf_events). While these are solving issues on current versions of Linux, I'll also briefly summarize the future in this space: eBPF, ktap, SystemTap, sysdig, etc.
Broken benchmarks, misleading metrics, and terrible tools. This talk will help you navigate the treacherous waters of Linux performance tools, touring common problems with system tools, metrics, statistics, visualizations, measurement overhead, and benchmarks. You might discover that tools you have been using for years, are in fact, misleading, dangerous, or broken.
The speaker, Brendan Gregg, has given many talks on tools that work, including giving the Linux PerformanceTools talk originally at SCALE. This is an anti-version of that talk, to focus on broken tools and metrics instead of the working ones. Metrics can be misleading, and counters can be counter-intuitive! This talk will include advice for verifying new performance tools, understanding how they work, and using them successfully.
Semantic Interoperability at Europeana - MultilingualDSIs2018Antoine Isaac
Presentation on general interoperability and multilinguality issues at Europeana, for a workshop on Semantic Interoperability for Multilingual DSIs (https://ec.europa.eu/cefdigital/wiki/display/ETCOMMUNITY/Semantic+Interoperability+for+Multilingual+DSIs)
Slides for a presentation on recent work with Web Archives at the Oxford Internet Institute (http://www.oii.ox.ac.uk/) given at WIRE2014 (http://wp.comminfo.rutgers.edu/nsfia/schedule/)
Digital Cultural Heritage: Experiences from British LibraryNora McGregor
Slides from seminar on Digital Cultural Heritage given to UCL Institute of Sustainable Heritage's two programmes: the MSc Sustainable Heritage and the MRes Science and Engineering in Arts, Heritage and Archaeology.
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015Antoine Isaac
"Wikidata, a target for Europeana's semantic strategy"/ Presentation at the GLAM-Wiki conference with Valentine Charles, Hugo Manguinhas, Antoine Isaac, Vladimir Alexiev http://nl.wikimedia.org/wiki/GLAM-WIKI_2015/
Open Science, Open Data: towards a new transparent and reproducible ecosystemLIBER Europe
Presented at the Preforma Open Source Workshop 8 April 2016
As a library membership organization, LIBER works on addressing Open Science barriers. Standardisation of file formats can really help in overcoming some of these barriers: it enables us to process and preserve data in a controlled way, it helps ensure that outputs are really open and accessible in the long term and it improves interoperability of new tools and services. Making sure data is stored in a controlled way and can be (re) used today and in the future is an important element in Open Science. We see this as not only a technical challenge but also a social one: awareness, trust and community building is needed in order to ensure uptake of these standards. Libraries therefore have a valuable role to play in the development of good research data management throughout all phases of the Open Data lifecycle.
Increasing Visibility of Cultural Heritage Objects: A Case of Turkish Conten...locloud
Presentation given by Bülent Yılmaz, Özgür Külcü, Tolga Çakmak
Hacettepe University Department of Information Management. Turkey
LoCloud Conference
Sharing local cultural heritage online with LoCloud services
Amersfoort, Netherlands
5 February 2016
EuropeanaTech update - Europeana AGM 2015Antoine Isaac
Update on the EuropeanaTech community activities. Presentation with Greg Markus, Sound and Vision. Europeana general Assembly Meeting 2015, November 2-4 2015. http://pro.europeana.eu/event/europeana-annual-general-meeting-2015
Europeana - American Art Collaborative LOD MeetingAntoine Isaac
Presentation at a seminar on linked data and art museums at the Smithsonian Institute, April 29 2013.
Other presentations at http://lodlam.net/2013/05/07/linked-open-data-in-art/
Europeana as a Linked Data (Quality) caseAntoine Isaac
Presentation for the 3rd Workshop on Humanities in the Semantic Web (WHiSe), co-located with the 15th Extended Semantic Web Conference (ESWC 2020)
June 2, 2020, online
http://whise.cc/2020/
Linux Performance Analysis: New Tools and Old SecretsBrendan Gregg
Talk for USENIX/LISA2014 by Brendan Gregg, Netflix. At Netflix performance is crucial, and we use many high to low level tools to analyze our stack in different ways. In this talk, I will introduce new system observability tools we are using at Netflix, which I've ported from my DTraceToolkit, and are intended for our Linux 3.2 cloud instances. These show that Linux can do more than you may think, by using creative hacks and workarounds with existing kernel features (ftrace, perf_events). While these are solving issues on current versions of Linux, I'll also briefly summarize the future in this space: eBPF, ktap, SystemTap, sysdig, etc.
Broken benchmarks, misleading metrics, and terrible tools. This talk will help you navigate the treacherous waters of Linux performance tools, touring common problems with system tools, metrics, statistics, visualizations, measurement overhead, and benchmarks. You might discover that tools you have been using for years, are in fact, misleading, dangerous, or broken.
The speaker, Brendan Gregg, has given many talks on tools that work, including giving the Linux PerformanceTools talk originally at SCALE. This is an anti-version of that talk, to focus on broken tools and metrics instead of the working ones. Metrics can be misleading, and counters can be counter-intuitive! This talk will include advice for verifying new performance tools, understanding how they work, and using them successfully.
Talk for PerconaLive 2016 by Brendan Gregg. Video: https://www.youtube.com/watch?v=CbmEDXq7es0 . "Systems performance provides a different perspective for analysis and tuning, and can help you find performance wins for your databases, applications, and the kernel. However, most of us are not performance or kernel engineers, and have limited time to study this topic. This talk summarizes six important areas of Linux systems performance in 50 minutes: observability tools, methodologies, benchmarking, profiling, tracing, and tuning. Included are recipes for Linux performance analysis and tuning (using vmstat, mpstat, iostat, etc), overviews of complex areas including profiling (perf_events), static tracing (tracepoints), and dynamic tracing (kprobes, uprobes), and much advice about what is and isn't important to learn. This talk is aimed at everyone: DBAs, developers, operations, etc, and in any environment running Linux, bare-metal or the cloud."
Talk for SCaLE13x. Video: https://www.youtube.com/watch?v=_Ik8oiQvWgo . Profiling can show what your Linux kernel and appliacations are doing in detail, across all software stack layers. This talk shows how we are using Linux perf_events (aka "perf") and flame graphs at Netflix to understand CPU usage in detail, to optimize our cloud usage, solve performance issues, and identify regressions. This will be more than just an intro: profiling difficult targets, including Java and Node.js, will be covered, which includes ways to resolve JITed symbols and broken stacks. Included are the easy examples, the hard, and the cutting edge.
Video: https://www.youtube.com/watch?v=FJW8nGV4jxY and https://www.youtube.com/watch?v=zrr2nUln9Kk . Tutorial slides for O'Reilly Velocity SC 2015, by Brendan Gregg.
There are many performance tools nowadays for Linux, but how do they all fit together, and when do we use them? This tutorial explains methodologies for using these tools, and provides a tour of four tool types: observability, benchmarking, tuning, and static tuning. Many tools will be discussed, including top, iostat, tcpdump, sar, perf_events, ftrace, SystemTap, sysdig, and others, as well observability frameworks in the Linux kernel: PMCs, tracepoints, kprobes, and uprobes.
This tutorial is updated and extended on an earlier talk that summarizes the Linux performance tool landscape. The value of this tutorial is not just learning that these tools exist and what they do, but hearing when and how they are used by a performance engineer to solve real world problems — important context that is typically not included in the standard documentation.
Video: https://www.youtube.com/watch?v=JRFNIKUROPE . Talk for linux.conf.au 2017 (LCA2017) by Brendan Gregg, about Linux enhanced BPF (eBPF). Abstract:
A world of new capabilities is emerging for the Linux 4.x series, thanks to enhancements that have been included in Linux for to Berkeley Packet Filter (BPF): an in-kernel virtual machine that can execute user space-defined programs. It is finding uses for security auditing and enforcement, enhancing networking (including eXpress Data Path), and performance observability and troubleshooting. Many new open source tools that have been written in the past 12 months for performance analysis that use BPF. Tracing superpowers have finally arrived for Linux!
For its use with tracing, BPF provides the programmable capabilities to the existing tracing frameworks: kprobes, uprobes, and tracepoints. In particular, BPF allows timestamps to be recorded and compared from custom events, allowing latency to be studied in many new places: kernel and application internals. It also allows data to be efficiently summarized in-kernel, including as histograms. This has allowed dozens of new observability tools to be developed so far, including measuring latency distributions for file system I/O and run queue latency, printing details of storage device I/O and TCP retransmits, investigating blocked stack traces and memory leaks, and a whole lot more.
This talk will summarize BPF capabilities and use cases so far, and then focus on its use to enhance Linux tracing, especially with the open source bcc collection. bcc includes BPF versions of old classics, and many new tools, including execsnoop, opensnoop, funcccount, ext4slower, and more (many of which I developed). Perhaps you'd like to develop new tools, or use the existing tools to find performance wins large and small, especially when instrumenting areas that previously had zero visibility. I'll also summarize how we intend to use these new capabilities to enhance systems analysis at Netflix.
Designing a multilingual knowledge graph - DCMI2018Antoine Isaac
Presentation for the paper "Designing a multilingual knowledge graph as service for cultural heritage" at the DCMI2018 conference https://www.dublincore.org/conferences/2018/abstracts/#559
Linked Data for EuropeanaCultural Heritage: the Europeana approachValentine Charles
Presentation given on April 28th in Paris at International Conference organised by ISSN IC
http://www.issn.org/international-conference-organised-by-issn-ic-bibliographic-metadata-getting-linked/
Presentation at the Education Session of the American Art Collaborative (AAC) Linked Open Data Initiative, 31 March 2015. http://americanartcollaborative.org/
CLARIAH Toogdag 2018: A distributed network of digital heritage informationEnno Meijers
Slides of my keynote at the CLARIAH Toogdag 2018 on 9 March at the National Library of the Netherlands. The main topics were the development of the distributed digital heritage network and the alignment to and cooperation with the CLARIAH infrastructure and data. It also points at some of the current limitations of the semantic web technology.
This paper surveys the landscape of linked open data projects in cultural heritage, exam- ining the work of groups from around the world. Traditionally, linked open data has been ranked using the five star method proposed by Tim Berners-Lee. We found this ranking to be lacking when evaluating how cultural heritage groups not merely develop linked open datasets, but find ways to used linked data to augment user experience. Building on the five-star method, we developed a six-stage life cycle describing both dataset development and dataset usage. We use this framework to describe and evaluate fifteen linked open data projects in the realm of cultural heritage.
Manage it locally to share it globally: RDM and Wikimedia CommonsNick Sheppard
Slides for Internet Librarian International 2018 about the Data Management Engagement Award, a first-ever competition launched to elicit new and imaginative ideas for engaging researchers in the practices of good Research Data Management (RDM) - http://www.rdmengagementaward.org/
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank
Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Communications Mining Series - Zero to Hero - Session 1DianaGray10
This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfPeter Spielvogel
Building better applications for business users with SAP Fiori.
• What is SAP Fiori and why it matters to you
• How a better user experience drives measurable business benefits
• How to get started with SAP Fiori today
• How SAP Fiori elements accelerates application development
• How SAP Build Code includes SAP Fiori tools and other generative artificial intelligence capabilities
• How SAP Fiori paves the way for using AI in SAP apps
Elevating Tactical DDD Patterns Through Object CalisthenicsDorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Climate Impact of Software Testing at Nordic Testing DaysKari Kakkonen
My slides at Nordic Testing Days 6.6.2024
Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
1. Building an ecosystem of
networked references
Hugo Manguinhas | DBpedia Community Meeting 2016
2. Europeana has many data challenges
Building an ecosystem of networked references
CC BY-SA
We aggregate very heterogeneous metadata:
• More than 48M objects
• 3,500 galleries, libraries, archives and museums
• 50 languages
• From all EU countries
• Level of quality varies greatly
• Huge amount of references to places, agents, concepts, time
3. Europeana Linked Data Strategy
Motivation
Building an ecosystem of networked references
CC BY-SA
• Improve user experience
• support better ways of searching and navigating through the
collections, eliminating ambiguity and clarifying the meaning of
descriptions
• better adapt to the language of the user
• by improving the interlinking of data
• brings more context to the objects
• alleviates polysemy issues
• better language coverage
• Contributes to build a web of data ('knowledge graph') that
third parties can use to improve their users' experience
4. Europeana Linked Data Strategy
Our efforts and lines of work
Building an ecosystem of networked references
CC BY-SA
• Europeana Data Model (EDM) offers a base for linking
data
• We apply an enrichment strategy to link source data to
reference data, incl. DBpedia
• Encourage data providers to contribute their own
vocabularies so that we can benefit from data links made
at data providers’ level
• We encourage alignment activities between domain
vocabularies
5. Europeana Linked Data Strategy
Vocabularies currently provided to Europeana
CC BY-SA
Building an ecosystem of networked references
6. Europeana Linked Data Strategy
Europeana also hosts vocabularies
CC BY-SA
Building an ecosystem of networked references
7. Europeana Linked Data Strategy
A strategy for Entities
Building an ecosystem of networked references
CC BY-SA
• As a cornerstone for our strategy we are building an
"Entity Collection"
• A service that acts as a centralized point of reference and
access to data about contextual entities
• Caching and curating data from the wider Linked Open Data
cloud
• A sort of Europeana "knowledge graph"
8. The Entity Collection
Use Cases
CC BY-SA
Building an ecosystem of networked references
Europeana Collections Portal
● Findability: users can look for entities, not
only records (Entity-Based Search)
● Understandability: Entity Pages group and
present all assertions about an entity
● Exploration: Navigation along relationships
becomes possible
Crowdsourcing
● Objects can be annotated with references to
entities
● A controlled vocabulary for client applications
Enrichment of Provider’s Data
● A controlled vocabulary to help identify
named references to entities
Republication for Re-use
● Entities can be republished as an open
source to the community
Entity Collection
9. The Entity Collection
Why DBpedia?
CC BY-SA
Building an ecosystem of networked references
• It offers labels in about 124 languages through all its
language editions of which 48 match the languages that
Europeana supports
• It gives fairly complete and accurate descriptive metadata
about entities
• Works great as a “pivot” vocabulary, providing further links to
other vocabularies such as Wikidata and Freebase
10. Entity Collection
The Entity Collection
Integrating DBpedia resources
CC BY-SA
Building an ecosystem of networked references
RDF
dumps
48 Language
Editions
(~3.6GB/GZ)
http://data.dws.informatik.uni-
mannheim.de/dbpedia/2014/
Triple
Store
MongoDB
SPARQL
Dumps were carefully selected and
downloaded for all languages that
Europeana supports...
SPARQL queries select
DBpedia resources for each
EDM Contextual Class
Each DBpedia resource is converted
to EDM using XSLT and further
filtered
loaded...
11. The Entity Collection
Some statistics for DBpedia
CC BY-SA
Building an ecosystem of networked references
Entity Class Target vocabulary Size
Places GeoNames 140,097
Concepts DBpedia 5,284
GEMET 280
Agents DBpedia 161,209
Time Semium Time 2,566
13. The Entity Collection
Is DBpedia enough?
CC BY-SA
• Not enough coreferencing information to other vocabularies
• particularly to the ones we receive from data providers (e.g.
MIMO)
• Labels and values are not always accurate and normalized
• need for better reference data (e.g. VIAF)
Building an ecosystem of networked references
14. The Entity Collection
Our roadmap for the next years
CC BY-SA
Building an ecosystem of networked references
• Generate Europeana URIs for Entities
• Make entity services and data available via an API, and
further integrate existing components
• Integrate vocabularies that can further improve
• entity descriptions and multilingual coverage (e.g. VIAF)
• linking between entities (e.g. Wikidata)
• Integrate alignments
• particularly, links between local/domain vocabularies to pivot
vocabularies
15. Linking Metadata to MIMO with CultuurLink
About CultuurLink
CC BY-SA
Building an ecosystem of networked references
• Online tool for aligning SKOS vocabularies,
http://cultuurlink.beeldengeluid.nl
• Successor to EuropeanaConnect's Amalgame
• Developed by Spinque
• With support of the Network Digital Heritage
• Semi-automatic discovery of alignments
• Ability to define complex strategies
• Manual assessment and export of the results
16. Linking Metadata to MIMO with CultuurLink
The Europeana Sounds Experiment
CC BY-SA
Building an ecosystem of networked references
• Scope
• Evaluate MIMO as target vocabulary for enrichment of subject fields
• Evaluate the potential of CultuurLink for alignment of vocabularies
• Participants
• British Library, CREM, MMSH, NISV (6 collections in total)
• Work done
• A SKOS vocabulary was generated for each collection from the labels
found within subject fields
• Each participant used CultuurLink to discover alignments by designing
and testing different strategies
17. Linking Metadata to MIMO with CultuurLink
Example from CNRS
CC BY-SA
Building an ecosystem of networked references
see http://cultuurlink.beeldengeluid.nl
18. Linking Metadata to MIMO with CultuurLink
Results of the experiment
CC BY-SA
Building an ecosystem of networked references
• There were a couple of issues with metadata quality
• matching previous observations: enrichment works better when
metadata quality is good
• The feedback was positive for CultuurLink
• Applying different strategies showed to be crucial for discovering
alignments between concepts
• ... taking into account different features of the data (prefered /
alternative labels, different languages, vernacular labels, etc.)
• Providers have realized the interest and feasibility of linking to a
richer, more multilingual pivot vocabulary such as MIMO
19. Linking Metadata to MIMO with CultuurLink
Demo
CC BY-SA
Building an ecosystem of networked references
http://cultuurlink.beeldengeluid.nl
20. Conclusion
CC BY-SA
Building an ecosystem of networked references
• A Strategy for Entities is a “must” for Europeana
• There is no “one fits all” vocabulary
• We have a long way to go…
• ...but we are making progress