Basic introductory talk about the Web of Linked Data, given to undergraduate and posgraduate students of Universidad del Valle (Cali, Colombia) in September 2010. Knowledge about Semantic Web is required
Ojo Al Data 100 - Call for sharing session at IODC 2016Oscar Corcho
This is the presentation of the #ojoaldata100 initiative (http://ojoaldata100.okfn.es) for the selection of 100 datasets that every city should be publishing in their open data portal. This presentation was used in a call for sharing session at the 4th International Open Data Conference (IODC2016).
Very basic introductory talk about the Semantic Web, given to undergraduate and posgraduate students of Universidad del Valle (Cali, Colombia) in September 2010
EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already ...Oscar Corcho
Presentation for one of the keynotes at EKAW2014, where I talked about the need to lower the barrier for ontology development for those who have no experience with ontologies.
Estermann ENICPA Wiki Loves Performing Arts 20191022Beat Estermann
Presentation at the ENICPA Round Table on 22 October 2019 in Prague on Wikidata and performing arts. Author: Beat Estermann, Bern University of Applied Sciences.
Basic introductory talk about the Web of Linked Data, given to undergraduate and posgraduate students of Universidad del Valle (Cali, Colombia) in September 2010. Knowledge about Semantic Web is required
Ojo Al Data 100 - Call for sharing session at IODC 2016Oscar Corcho
This is the presentation of the #ojoaldata100 initiative (http://ojoaldata100.okfn.es) for the selection of 100 datasets that every city should be publishing in their open data portal. This presentation was used in a call for sharing session at the 4th International Open Data Conference (IODC2016).
Very basic introductory talk about the Semantic Web, given to undergraduate and posgraduate students of Universidad del Valle (Cali, Colombia) in September 2010
EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already ...Oscar Corcho
Presentation for one of the keynotes at EKAW2014, where I talked about the need to lower the barrier for ontology development for those who have no experience with ontologies.
Estermann ENICPA Wiki Loves Performing Arts 20191022Beat Estermann
Presentation at the ENICPA Round Table on 22 October 2019 in Prague on Wikidata and performing arts. Author: Beat Estermann, Bern University of Applied Sciences.
The Galleries, Libraries, Archives and Museums (GLAM) sector deals with complex and varied data. Integrating that data, especially across institutions, has always been a challenge. Semantic data integration is the best approach to deal with such challenges. Linked Open Data (LOD) enable large-scale Digital Humanities (DH) research, collaboration and aggregation, allowing DH researchers to make connections between (and make sense of) the multitude of digitized Cultural Heritage (CH) available on the web. An upsurge of interest in semtech and LOD has swept the CH and DH communities. An active Linked Open Data for Libraries, Archives and Museums (LODLAM) community exists, CH data is published as LOD, and international collaborations have emerged. The value of LOD is especially high in the GLAM sector, since culture by its very nature is cross-border and interlinked. We present interesting LODLAM projects, datasets, and ontologies, as well as Ontotext's experience in this domain. An extended paper on these topics is also available. It has 77 pages, 67 figures, detailed info about CH content and XML standards, Wikidata and global authority control.
A presentation by Susanne Thorbord, Bibliographic Consultant at the Danish Bibliographic Centre (DBC).
Delivered at the Cataloguing and Indexing Group Scotland (CIGS) Linked Open Data (LOD) Conference which took place Fri 21 September 2012 at the Edinburgh Centre for Carbon Innovation.
A presentation by Daniel Lewis of the Open Knowledge Foundation.
Delivered at the Cataloguing and Indexing Group Scotland (CIGS) Linked Open Data (LOD) Conference which took place Fri 21 September 2012 at the Edinburgh Centre for Carbon Innovation.
"Semantic Integration Is What You Do Before The Deep Learning". dev.bg Machine Learning seminar, 13 May 2019.
It's well known that 80\% of the effort of a data scientist is spent on data preparation. Semantic integration is arguably the best way to spend this effort more efficiently and to reuse it between tasks, projects and organizations. Knowledge Graphs (KG) and Linked Open Data (LOD) have become very popular recently. They are used by Google, Amazon, Bing, Samsung, Springer Nature, Microsoft Academic, AirBnb… and any large enterprise that would like to have a holistic (360 degree) view of its business. The Semantic Web (web 3.0) is a way to build a Giant Global Graph, just like the normal web is a Global Web of Documents. IEEE already talks about Big Data Semantics. We review the topic of KGs and their applicability to Machine Learning.
Introduction: The Big Data Europe Project at the: CMG-AE Event: Big Data: Strategien, Technologien und Nutzen
19th of May 2015, Expat Center der Wirtschaftsagentur, Vienna, Austria
See: http://www.big-data-europe.eu
Open Data Institute Course - Open Data in a Day conducted by Registered ODI Trainer Ian Henshaw on October 14, 2015 in RTP, NC USA - Deck #1 Introduction to Open Data
Valentine Charles, Hugo Manguinhas, Vladimir Alexiev. Glam-Wiki 2015, 11 Apr 2015, The Hague
For Europeana, the platform for Europe’s digital cultural heritage from libraries, museums and archives, getting richer (semantic and multilingual) metadata is a priority. It improves access to the 40 million cultural heritage objects, notably enabling the multilingual retrieval of documents and creates relations between objects. To enhance data and enable retrieval across languages, Europeana performs automatic enrichment by selecting source metadata field(s) in the Europeana data and creating links to a selected target vocabulary or dataset representing contextual resources such as places, concepts, agents and time periods. Wikidata is since a while on Europeana’s radar as a potential new target for enrichment but how can it be integrated with cultural heritage data?
https://nl.wikimedia.org/wiki/GLAM-WIKI_2015/Programme/Discussions/Strategy#Presentation:_Wikidata.2C_a_target_for_Europeana.E2.80.99s_semantic_strategy.3F
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...Stefan Dietze
Presentation from mentoring event of Open Education Europa Challenge (http://www.openeducationchallenge.eu/) about using Linked Data in educational applications.
Invited report at Digital Presentation and Preservation of Cultural and Scientific Heritage (DIPP 2018), Burgas, Bulgaria. Report: http://dipp.math.bas.bg/images/2018/019-050_32_11-iDiPP2018-34.pdf
Talk delivered at London Natural History Museum's "Informatics Horizons for the Natural History Museum" video and programme here
http://scratchpads.eu/NHMInformaticsday
A presentation by Gill Hamilton, Digital Access Manager at the National Library of Scotland (NLS).
Delivered at the Cataloguing and Indexing Group Scotland (CIGS) Linked Open Data (LOD) Conference which took place Fri 21 September 2012 at the Edinburgh Centre for Carbon Innovation.
How to Build Linked Data Sites with Drupal 7 and RDFascorlosquet
Slides of the tutorial Stéphane Corlosquet, Lin Clark and Alexandre Passant presented at SemTech 2010 in San Francisco http://semtech2010.semanticuniverse.com/sessionPop.cfm?confid=42& proposalid=2889
The Galleries, Libraries, Archives and Museums (GLAM) sector deals with complex and varied data. Integrating that data, especially across institutions, has always been a challenge. Semantic data integration is the best approach to deal with such challenges. Linked Open Data (LOD) enable large-scale Digital Humanities (DH) research, collaboration and aggregation, allowing DH researchers to make connections between (and make sense of) the multitude of digitized Cultural Heritage (CH) available on the web. An upsurge of interest in semtech and LOD has swept the CH and DH communities. An active Linked Open Data for Libraries, Archives and Museums (LODLAM) community exists, CH data is published as LOD, and international collaborations have emerged. The value of LOD is especially high in the GLAM sector, since culture by its very nature is cross-border and interlinked. We present interesting LODLAM projects, datasets, and ontologies, as well as Ontotext's experience in this domain. An extended paper on these topics is also available. It has 77 pages, 67 figures, detailed info about CH content and XML standards, Wikidata and global authority control.
A presentation by Susanne Thorbord, Bibliographic Consultant at the Danish Bibliographic Centre (DBC).
Delivered at the Cataloguing and Indexing Group Scotland (CIGS) Linked Open Data (LOD) Conference which took place Fri 21 September 2012 at the Edinburgh Centre for Carbon Innovation.
A presentation by Daniel Lewis of the Open Knowledge Foundation.
Delivered at the Cataloguing and Indexing Group Scotland (CIGS) Linked Open Data (LOD) Conference which took place Fri 21 September 2012 at the Edinburgh Centre for Carbon Innovation.
"Semantic Integration Is What You Do Before The Deep Learning". dev.bg Machine Learning seminar, 13 May 2019.
It's well known that 80\% of the effort of a data scientist is spent on data preparation. Semantic integration is arguably the best way to spend this effort more efficiently and to reuse it between tasks, projects and organizations. Knowledge Graphs (KG) and Linked Open Data (LOD) have become very popular recently. They are used by Google, Amazon, Bing, Samsung, Springer Nature, Microsoft Academic, AirBnb… and any large enterprise that would like to have a holistic (360 degree) view of its business. The Semantic Web (web 3.0) is a way to build a Giant Global Graph, just like the normal web is a Global Web of Documents. IEEE already talks about Big Data Semantics. We review the topic of KGs and their applicability to Machine Learning.
Introduction: The Big Data Europe Project at the: CMG-AE Event: Big Data: Strategien, Technologien und Nutzen
19th of May 2015, Expat Center der Wirtschaftsagentur, Vienna, Austria
See: http://www.big-data-europe.eu
Open Data Institute Course - Open Data in a Day conducted by Registered ODI Trainer Ian Henshaw on October 14, 2015 in RTP, NC USA - Deck #1 Introduction to Open Data
Valentine Charles, Hugo Manguinhas, Vladimir Alexiev. Glam-Wiki 2015, 11 Apr 2015, The Hague
For Europeana, the platform for Europe’s digital cultural heritage from libraries, museums and archives, getting richer (semantic and multilingual) metadata is a priority. It improves access to the 40 million cultural heritage objects, notably enabling the multilingual retrieval of documents and creates relations between objects. To enhance data and enable retrieval across languages, Europeana performs automatic enrichment by selecting source metadata field(s) in the Europeana data and creating links to a selected target vocabulary or dataset representing contextual resources such as places, concepts, agents and time periods. Wikidata is since a while on Europeana’s radar as a potential new target for enrichment but how can it be integrated with cultural heritage data?
https://nl.wikimedia.org/wiki/GLAM-WIKI_2015/Programme/Discussions/Strategy#Presentation:_Wikidata.2C_a_target_for_Europeana.E2.80.99s_semantic_strategy.3F
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...Stefan Dietze
Presentation from mentoring event of Open Education Europa Challenge (http://www.openeducationchallenge.eu/) about using Linked Data in educational applications.
Invited report at Digital Presentation and Preservation of Cultural and Scientific Heritage (DIPP 2018), Burgas, Bulgaria. Report: http://dipp.math.bas.bg/images/2018/019-050_32_11-iDiPP2018-34.pdf
Talk delivered at London Natural History Museum's "Informatics Horizons for the Natural History Museum" video and programme here
http://scratchpads.eu/NHMInformaticsday
A presentation by Gill Hamilton, Digital Access Manager at the National Library of Scotland (NLS).
Delivered at the Cataloguing and Indexing Group Scotland (CIGS) Linked Open Data (LOD) Conference which took place Fri 21 September 2012 at the Edinburgh Centre for Carbon Innovation.
How to Build Linked Data Sites with Drupal 7 and RDFascorlosquet
Slides of the tutorial Stéphane Corlosquet, Lin Clark and Alexandre Passant presented at SemTech 2010 in San Francisco http://semtech2010.semanticuniverse.com/sessionPop.cfm?confid=42& proposalid=2889
I used these slides for an introductory lecture (90min) to a seminar on SPARQL. This slideset introduces the RDF query language SPARQL from a user's perspective.
"SPARQL Cheat Sheet" is a short collection of slides intended to act as a guide to SPARQL developers. It includes the syntax and structure of SPARQL queries, common SPARQL prefixes and functions, and help with RDF datasets.
The "SPARQL Cheat Sheet" is intended to accompany the SPARQL By Example slides available at http://www.cambridgesemantics.com/2008/09/sparql-by-example/ .
This is one out of a series of presentations which I have given during a recent trip to the United States. I will make them all public, but content does not vary a lot between some of them
Putting the L in front: from Open Data to Linked Open DataMartin Kaltenböck
Keynote presentation of Martin Kaltenböck (LOD2 project, Semantic Web Company) at the Government Linked Data Workshop in the course of the OGD Camp 2011 in Warsaw, Poland: Putting the L in front: from Open Data to Linked Open Data
I Linked Open Data nei Beni Culturali, alcuni progetti e casi di studioCulturaItalia
Maria Emilia Masci, Scuola Normale Superiore, Linked Open Data (LOD): Un’Opportunità per il Patrimonio Culturale Digitale, Roma, ICCU, 29 novembre 2013
for getting the library resources fro the libraries entire world, the important tool is Library catalogues. every can browse all most all the world literature through WorldCat fro the INTERNET.
There are high expectations for Linked Government Data—the practice of publishing public sector information on the Web using Linked Data formats. This slideset reviews some of the ongoing work in the US, UK, and within W3C, as well as activities within my institute (DERI, National University of Ireland, Galway).
Organisational Interoperability in Practice at Universidad Politécnica de MadridOscar Corcho
Presentation on EOSC Interoperability Framework in relation to Organisational Interoperability, and how it can be applied to a Research Performing Organisation such as UPM
Open Data (and Software, and other Research Artefacts) -A proper managementOscar Corcho
Presentation at the event "Let's do it together: How to implement Open Science Practices in Research Projects" (29/11/2019), organised by Universidad Politécnica de Madrid, where we discuss on the need to take into account not only open access or open research data, but also all the other artefacts that are a result of our research processes.
Adiós a los ficheros, hola a los grafos de conocimientos estadísticosOscar Corcho
Esta presentación se ha realizado en el contexto de la Jornada sobre difusión, accesibilidad y reutilización de la estadística y cartografía oficial (http://www.juntadeandalucia.es/institutodeestadisticaycartografia/blog/2019/11/jornada-plan/), organizada por el Instituto de Estadística y Cartografía de Andalucía.
Ontology Engineering at Scale for Open City Data SharingOscar Corcho
Seminar at the School of Informatics, The University of Edinburgh.
In this talk we will present how we are applying ontology engineering principles and tools for the development of a set of shared vocabularies across municipalities in Spain, so that they can start homogenising the generation and publication of open data that may be useful for their own internal reuse as well as for third parties who want to develop applications reusing open data once and deploy them for all municipalities. We will discuss on the main challenges for ontology engineering that arise in this setting, as well as present the work that we have done to integrate ontology development tools into common software development infrastructure used by those who are not experts in Ontology Engineering.
Situación de las iniciativas de Open Data internacionales (y algunas recomen...Oscar Corcho
Presentación sobre iniciativas de Open Data Internacionales y nacionales, realizada en el contexto del Curso de Verano de la Universidad de Extremadura "BigData y Machine Learning junto a fuentes de datos abiertos para especializar el sector agroganadero", el 25/09/2018
Presentación general sobre contaminación lumínica, en español, del proyecto STARS4ALL (www.stars4all.eu). Generada por el consorcio del proyecto, con especial agradecimiento a Lucía García (@shekda) por generar la primera versión en inglés, y Miquel Serra-Ricart, por realizar su traducción inicial.
Towards Reproducible Science: a few building blocks from my personal experienceOscar Corcho
Invited keynote given at the Second International Workshop on Semantics for BioDiversity (http://fusion.cs.uni-jena.de/s4biodiv2017/), held in conjunction with ISWC2017 (https://iswc2017.semanticweb.org/)
Publishing Linked Statistical Data: Aragón, a case studyOscar Corcho
Presentation at the Semstats2017 workshop (http://semstats.org/2017/) for the paper "Publishing Linked Statistical Data: Aragón, a Case Study", by Oscar Corcho, Idafen Santana-Pérez, Hugo Lafuente, David Portolés, César Cano, Alfredo Peris, José María Subero.
An initial analysis of topic-based similarity among scientific documents base...Oscar Corcho
Presentation given at the SemSci2017 workshop (https://semsci.github.io/semSci2017/), for the paper "An Initial Analysis of Topic-based Similarity among Scientific Documents Based on their Rhetorical Discourse Parts" http://ceur-ws.org/Vol-1931/paper-03.pdf
Introductory talk on the usage of Linked Data for official statistics, given at the ESS (Linked) Open Data Workshop 2017, in Malta, January 2017.
In this introductory talk we will discuss the main foundations for the application of Linked Data principles into official statistics. We will briefly introduce what Linked Data is, as well as the main principles, languages and technologies behind it (URIs, RDF, SPARQL). We will also discuss about the different formats in which data can be made available on the Web (e.g., RDF Turtle, JSON-LD, CSV on the Web). We will then move into providing a detailed presentation, with step by step examples based on existing Linked Statistical Data sources, of the W3C recommendation RDF DataCube, which is the basis for the dissemination of statistical data as Linked Data. Finally, we will provide some examples of applications, and the opportunities that this approach offers for the development of the proofs of concepts selected by Eurostat and to be discussed during the meeting.
Aplicando los principios de Linked Data en AEMETOscar Corcho
Presentación realizada en uno de los paneles de la jornada sobre datos abiertos organizada por AEMET el 13 de diciembre del 2016, sobre la aplicación de los principios de Linked Data la API REST de AEMET
Educando sobre datos abiertos: desde el colegio a la universidadOscar Corcho
Presentación realizada en la mesa 3 del evento Aporta 2016, uno de los pre-eventos de la semana de los datos abiertos en Madrid. Realizada el 3 de octubre del 2016.
http://datos.gob.es/encuentro-aporta?q=node/654503
Generación de datos estadísticos enlazados del Instituto Aragonés de EstadísticaOscar Corcho
En esta presentación mostramos el trabajo realizado para la generación y publicación de datos enlazados a partir de los datos de estadística local del Instituto Aragonés de Estadística
Presentación de la red de excelencia de Open Data y Smart CitiesOscar Corcho
Presentación general de la red de excelencia de Open Data y Smart Cities (http://www.opencitydata.es), realizada en Medialab-Prado el 18 de febrero de 2016
Why do they call it Linked Data when they want to say...?Oscar Corcho
The four Linked Data publishing principles established in 2006 seem to be quite clear and well understood by people inside and outside the core Linked Data and Semantic Web community. However, not only when discussing with outsiders about the goodness of Linked Data but also when reviewing papers for the COLD workshop series, I find myself, in many occasions, going back again to the principles in order to see whether some approach for Web data publication and consumption is actually Linked Data or not. In this talk we will review some of the current approaches that we have for publishing data on the Web, and we will reflect on why it is sometimes so difficult to get into an agreement on what we understand by Linked Data. Furthermore, we will take the opportunity to describe yet another approach that we have been working on recently at the Center for Open Middleware, a joint technology center between Banco Santander and Universidad Politécnica de Madrid, in order to facilitate Linked Data consumption.
Linked Statistical Data: does it actually pay off?Oscar Corcho
Invited keynote at the ISWC2015 Workshop on Semantics and Statistics (SemStats 2015). http://semstats.github.io/2015/
The release of the W3C RDF Data Cube recommendation was a significant milestone towards improving the maturity of the area of Linked Statistical Data. Many Data Cube-based datasets have been released since then. Tools for the generation and exploitation of such datasets have also appeared. While the benefits for the usage of RDF Data Cube and the generation of Linked Data in this area seem to be clear, there are still many challenges associated to the generation and exploitation of such data. In this talk we will reflect about them, based on our experience on generating and exploiting such type of data, and hopefully provoke some discussion about what the next steps should be.
Research Objects for improved sharing and reproducibilityOscar Corcho
Presentation about the usage of Research Objects to improve scientific experiment sharing and reproducibility, given at the Dagstuhl Perspective Workshop on the intersection between Computer Sciences and Psychology (July 2015)
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionAggregage
Join Maher Hanafi, VP of Engineering at Betterworks, in this new session where he'll share a practical framework to transform Gen AI prototypes into impactful products! He'll delve into the complexities of data collection and management, model selection and optimization, and ensuring security, scalability, and responsible use.
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
A tale of scale & speed: How the US Navy is enabling software delivery from l...sonjaschweigert1
Rapid and secure feature delivery is a goal across every application team and every branch of the DoD. The Navy’s DevSecOps platform, Party Barge, has achieved:
- Reduction in onboarding time from 5 weeks to 1 day
- Improved developer experience and productivity through actionable findings and reduction of false positives
- Maintenance of superior security standards and inherent policy enforcement with Authorization to Operate (ATO)
Development teams can ship efficiently and ensure applications are cyber ready for Navy Authorizing Officials (AOs). In this webinar, Sigma Defense and Anchore will give attendees a look behind the scenes and demo secure pipeline automation and security artifacts that speed up application ATO and time to production.
We will cover:
- How to remove silos in DevSecOps
- How to build efficient development pipeline roles and component templates
- How to deliver security artifacts that matter for ATO’s (SBOMs, vulnerability reports, and policy evidence)
- How to streamline operations with automated policy checks on container images
Free Complete Python - A step towards Data Science
Linked Data Tutorial (Florianópolis)
1. Mini-curso sobre LinkedData Oscar Corcho, Asunción Gómez Pérez ({ocorcho, asun}@fi.upm.es) Universidad Politécnica de Madrid Florianópolis, September 1st 2010(3º OntoBras 2010) Credits: Raúl García Castro, Oscar Muñoz, Jose Angel Ramos Gargantilla, María del Carmen Suárez de Figueroa, Boris Villazón, Alex de León, Víctor Saquicela, Luis Vilches, Miguel Angel García, Manuel Salvadores, Juan Sequeda, Carlos Ruiz Moreno and manyothers WorkdistributedunderthelicenseCreativeCommonsAttribution-Noncommercial-Share Alike 3.0
2. Contents IntroductiontoLinked Data Linked Data Foundations: RDF, RDF Schema, SPARQL and OWL Coffee break Linked Data publication MethodologicalguidelinesforLinked Data publication RDB2RDF tools Technicalaspects of Linked Data publication Linked Data consumption 2
3. Whatisthe Web of Linked Data? An extension of the current Web… … where information and services are given well-defined and explicitly represented meaning, … … so that it can be shared and used by humans and machines, ... ... better enabling them to work in cooperation How? Promoting information exchange by tagging web content with machineprocessable descriptions of its meaning. And technologies and infrastructure to do this And clear principles on how to publish data data
4. What is Linked Data? Linked Data is a term used to describe a recommended best practice for exposing, sharing, and connecting pieces of data, information, and knowledge on the Semantic Web using URIs and RDF. Part of the Semantic Web Exposing, sharing and connecting data Technologies: URIs and RDF (although others are also important)
5. The fourprinciples (Tim Berners Lee, 2006) Use URIs as names for things Use HTTP URIs so that people can look up those names. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL) Include links to other URIs, so that they can discover more things. http://www.w3.org/DesignIssues/LinkedData.html 5 http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html
24. Howshouldwepublish data? Formats in which data ispublishednowadays… XML HTML DBs APIs CSV XLS … However, mainlimitationsfrom a Web of Data point of view Difficulttointegrate Data isnotlinkedtoeachother, as ithappenswith Web documents.
25. Which format do we use then? RDF (ResourceDescription Framework) Data model Basedon triples: subject, predicate, object <Oscar> <vive en> <Madrid> <Madrid> <es la capital de> <España> <España> <es campeona de> <Mundial de Fútbol> … Serialised in differentformats RDF/XML, RDFa, N3, Turtle, JSON…
26. URIs (Universal-UniformResourceIdentifer) Two types of identifiers can be used to identify Linked Data resources URIRefs(Unique Resource IdentifiersReferences) A URI and an optional FragmentIdentifier separated from the URI by the hash symbol ‘#’ http://www.ontology.org/people#Person people:Person Plain URIs can also be used, as in FOAF: http://xmlns.com/foaf/0.1/Person 17
27. How do wepublishLinked Data? ExposingRelationalDatabasesorother similar formatsintoLinked Data D2R Triplify R2O NOR2O Virtuoso Ultrawrap … Usingnative RDF triplestores Sesame Jena Owlim Talisplatform … Incorporatingit in theform of RDFa in CMSslikeDrupal 18
28. How do we consume Linked Data? Linked Data browsers To explore things and datasets and to navigate between them. Tabulator Browser (MIT, USA), Marbles (FU Berlin, DE), OpenLink RDF Browser (OpenLink, UK), Zitgist RDF Browser (Zitgist, USA), Disco Hyperdata Browser (FU Berlin, DE), Fenfire (DERI, Ireland) Linked Data mashups Sites that mash up (thus combine Linked data) Revyu.com (KMI, UK), DBtune Slashfacet (Queen Mary, UK), DBPedia Mobile (FU Berlin, DE), Semantic Web Pipes (DERI, Ireland) Search engines To search for Linked Data. Falcons (IWS, China), Sindice (DERI, Ireland), MicroSearch (Yahoo, Spain), Watson (Open University, UK), SWSE (DERI, Ireland), Swoogle (UMBC, USA) Listing on this slide by T. Heath, M. Hausenblas, C. Bizer, R. Cyganiak, O. Hartig 19
41. Linked Data Mashup (data.gov) Clean Air Status and Trends (CASTNET) http://data-gov.tw.rpi.edu/demo/exhibit/demo-8-castnet.php
42. Linked Data in the UK Education http://education.data.gov.uk/id/school/106661 Parliament http://parliament.psi.enakting.org/id/member/1227 Maps E.g., London: http://data.ordnancesurvey.co.uk/id/7000000000041428 http://map.psi.enakting.org Transport http://www.dft.gov.uk/naptan/ SameAs service http://www.sameas.org Challenges http://gov.tso.co.uk/openup/sparql/gov-transport 29
43. Linked Data Mashup (data.gov.uk) Research Funding Explorer http://bis.clients.talis.com/
48. Linked Data Mashup (Waterquality) Water quality in Asturias’ beaches http://datos.fundacionctic.org/sandbox/asturias/playas/
49. Contents IntroductiontoLinked Data Linked Data Foundations: RDF, RDF Schema, SPARQL and OWL Coffee break Linked Data publication MethodologicalguidelinesforLinked Data publication RDB2RDF tools Technicalaspects of Linked Data publication Linked Data consumption 36
51. RDF: Resource Description Framework W3C recommendation RDF is graphical formalism ( + XML syntax + semantics) For representing metadata For describing the semantics of information in a machine- accessible way Resources are described in terms of properties andproperty values using RDF statements Statements are represented as triples, consisting of asubject, predicate and object. [S, P, O] “Oscar Corcho García” person:hasName person:hasColleague oeg:Oscar oeg:Asun person:hasHomePage person:hasColleague “http://www.fi.upm.es/” oeg:Raul 38
52. RDF and URIs RDF uses URIRefs(Unique Resource IdentifiersReferences) toidentify resources A URIRef consists of a URI and an optional FragmentIdentifier separated from the URI by the hash symbol ‘#’ Examples http://www.co-ode.org/people#hasColleague coode:hasColleague A set of URIRefs is known as a vocabulary E.g., the RDF Vocabulary The set of URIRefs used indescribing the RDF concepts:rdf:Property,rdf:Resource, rdf:type, etc. The RDFS Vocabulary The set of URIRefs used indescribing the RDF Schema language: rdfs:Class, rdfs:domain, etc. The ‘Pizza Ontology’ Vocabulary pz:hasTopping,pz:Pizza, pz:VegetarianPizza, etc. 39
53. RDF Serialisations Normative RDF/XML (www.w3.org/TR/rdf-syntax-grammar/) Alternative (for human consumption) N3 (http://www.w3.org/DesignIssues/Notation3.html) Turtle (http://www.dajobe.org/2004/01/turtle/) TriX (http://www.w3.org/2004/03/trix/) … Important: the RDF serializations allow different syntactic variants. E.g., the order of RDF statements has no meaning 40
61. Exercise 1.a. Create a graph from a file Open the file StickyNote_PureRDF.rdf Create the corresponding graph from it Compare your graph with those of your colleagues 44
63. Exercise 1.b. Create files from a graph Transform the following graph into N3 syntax 46 hasMeasurement Measurement8401 Sensor029 includes hasTemperature atTime Class01 includes 2010-06-12T12:00:12 29 Computer101 hasOwner User10A hasName Pedro
64. Blank nodes: structured property values Most real-world data involves structures that are more complicated than sets of RDF triple statements In RDF/XML, it is an <rdf:Description> node with no rdf:about In N3, it is a resource identifier that starts with ‘_’ E.g., “_:nodeX” Thisintermediate URI doesnotneedtohave a name “Oscar Corcho García” person:hasName person:hasPostalAddress oeg:Oscar address:hasStreetName address:city Campus de Montegancedo s/n city:BoadillaDelMonte 47
65. Typed literals So far, all values have been presented as strings XML Schema datatypes can be used to specify values (objects in some RDF triple statements) In RDF/XML, this is expressed as: <rdf:Description rdf:about=”#Oscar”> <person:hasBirthDate rdf:datatype="http://www.w3.org/2001/XMLSchema#date">1976-02-02 </person:hasBirthDate></rdf:Description> In N3, this is expressed as: oeg:Oscar person:hasBirthDate ”1976-02-02”^^xsd:date . person:hasBirthDate oeg:Oscar 1976-02-02 48
66. RDF Containers There is often the need to describe groups of things A book was created by several authors A lesson is taught by several persons etc. RDF provides a container vocabulary rdf:Bag A group of resources or literals, possibly including duplicate members, where the order of members is not significant rdf:Seq A group of resources or literals, possibly including duplicate members, where the order of members is significant rdf:Alt A group of resources or literals that are alternatives (typically for a single value of a property) rdf:type person:hasEmailAddress oeg:Oscar rdf:Seq rdf:_2 rdf:_1 “oscar.corcho@upm.es” “ocorcho@fi.upm.es” 49
67. RDF Reification RDF statements about other RDF statements “Raúl believes that Oscar’s birthdate is on Feb 2nd, 1976 and that his e-mail address is ocorcho@fi.upm.es” RDF Reification Allows expressing beliefs (and other modalities) Allows expressing trust models, digital signatures, etc. Allows expressing metadata about metadata modal:believes oeg:Raúl oeg:Oscar person:hasBirthDate person:hasEmailAddress 02/02/1976 “ocorcho@fi.upm.es” 50
68. Main value of a structured value Sometimes one of the values of a structured value is the main one The weight of an item is 2.4 kilograms The most important value is 2.4, which is expressed with rdf:value Scarcely used product:hasWeight product:Item1 rdf:value units:hasWeightUnit units:kilogram 2.4 51
70. RDF inference. Graph matching techniques RDF inference is based on graph matching techniques Basically, the RDF inference process consists of the following steps: Transform an RDF query into a template graph that has to be matched against the RDF graph It contains constant and variable nodes, and constant and variable edges between nodes Match against the RDF graph, taking into account constant nodes and edges Provide a solution for variable nodes and edges 53
71. RDF inference. Examples (I) Sample RDF graph Query: “Tell me who are the persons who have Asun as a colleague” Result: oeg:Oscar and oeg:Raúl “Oscar Corcho García” person:hasName person:hasColleague oeg:Oscar oeg:Asun person:hasHomePage person:hasColleague “http://www.fi.upm.es/” oeg:Raúl person:hasColleague ? oeg:Asun 54
72. RDF inference. Examples (II) Query: “Tell me which are the relationships between Oscar and Asun” Result: oeg:hasColleague Query: “Tell me the homepage of Oscar colleagues” Result: “http://www.fi.upm.es/” ? oeg:Oscar oeg:Asun person:hasColleague oeg:Oscar person:hasHomePage ? 55
75. RDFS: RDF Schema W3C Recommendation RDF Schema extends RDF to enable talking about classes of resources, and the properties to be used with them Class definition: rdfs:Class, rdfs:subClassOf Property definition: rdfs:subPropertyOf, rdfs:range, rdfs:domain Other primitives: rdfs:comment, rdfs:label, rdfs:seeAlso, rdfs:isDefinedBy RDFS vocabulary adds constraints on models, e.g.: x,y,ztype(x,y) and subClassOf(y,z) type(x,z) ex:Animal rdfs:subClassOf rdf:type ex:Oscar ex:Person 58
85. Exercise 2.a. Create a graph from a file Open the files StickyNote.rdf and StickyNote.rdfs Create the corresponding graph from them Compare your graph with those of your colleagues 64
88. Exercise 2.b. Create files from a graph Transform the following graph into N3 syntax 67 Room Person Measurement Object hasMeasurement Sensor029 includes hasTemperature atTime Class01 includes 2010-06-12T12:00:12 29 Computer101 hasOwner User10A hasName Pedro
92. RDF(S) limitations RDFS too weak to describe resources in sufficient detail No localised range and domain constraints Can’t say that the range of hasChild is person when applied to persons and elephant when applied to elephants No existence/cardinality constraints Can’t say that all instances of person have a mother that is also a person, or that persons have exactly 2 parents No boolean operators Can’t say or, not, etc. No transitive, inverse or symmetrical properties Can’t say that isPartOf is a transitive property, that hasPart is the inverse of isPartOf or that touches is symmetrical Difficult to provide reasoning support No “native” reasoners for non-standard semantics May be possible to reason via FOL axiomatisation 71
96. Given a scenario description, build a simple ontology in RDF Schema72
97. Exercise 3. Domain description Un lugar puede ser un lugar de interés. Los lugares de interés pueden ser lugares turísticos o establecimientos, pero no las dos cosas a la vez. Los lugares turísticos pueden ser palacios, iglesias, ermitas y catedrales. Los establecimientos pueden ser hoteles, hostales o albergues. Un lugar está situado en una localidad, la cual a su vez puede ser una villa, un pueblo o una ciudad. Un lugar de interés tiene una dirección postal que incluye su calle y su número. Las localidades tienen un número de habitantes. Las localidades se encuentran situadas en provincias. Covarrubias es un pueblo con 634 habitantes de la provincia de Burgos. El restaurante “El Galo” está situado en Covarrubias, en la calle Mayor, número 5. Una de las iglesias de Covarrubias está en la calle de Santo Tomás. 73
100. Sample RDF APIs RDF libraries for different languages: Java, Python, C, C++, C#, .Net, Javascript, Tcl/Tk, PHP, Lisp, Obj-C, Prolog, Perl, Ruby, Haskell List in http://esw.w3.org/topic/SemanticWebTools Usually related to a RDF repository Multilanguage: Redland RDF Application Framework (C, Perl, PHP, Python and Ruby): http://www.redland.opensource.ac.uk/ Java: Jena: http://jena.sourceforge.net/ Sesame: http://www.openrdf.org/ PHP: RAP - RDF API for PHP: http://www4.wiwiss.fu-berlin.de/bizer/rdfapi/ Python: RDFLib: http://rdflib.net/ Pyrple: http://infomesh.net/pyrple/ 76
101. Jena Java framework for building Semantic Web applications Open source software from HP Labs The Jena framework includes: A RDF API An OWL API Reading and writing RDF in RDF/XML, N3 and N-Triples In-memory and persistent storage A rule based inference engine SPARQL query engine 77
102. Sesame A framework for storage, querying and inferencing of RDF and RDF Schema A Java Library for handling RDF A Database Server for (remote) access to repositories of RDF data Highly expressive query and transformation languages SeRQL, SPARQL Various backends Native Store RDBMS (MySQL, Oracle 10, DB2, PostgreSQL) main memory Reasoning support RDF Schema reasoner OWL DLP (OWLIM) domain reasoning (custom rule engine) 78
103. Jena example. Graph creation http://.../JohnSmith vcard:FN vcard:N John Smith vcard:Given vcard:Family Smith John // some definitions String personURI = "http://somewhere/JohnSmith"; String givenName = "John"; String familyName = "Smith"; String fullName = givenName + " " + familyName; // create an empty Model Model model = ModelFactory.createDefaultModel(); // create the resource // and add the properties cascading style Resource johnSmith = model.createResource(personURI) .addProperty(VCARD.FN, fullName) .addProperty(VCARD.N, model.createResource() .addProperty(VCARD.Given, givenName) .addProperty(VCARD.Family, familyName)); 79
104. Jena example. Read and write // create an empty model Model model = ModelFactory.createDefaultModel(); // use the FileManager to find the input file InputStream in = FileManager.get().open( inputFileName ); if (in == null) { throw new IllegalArgumentException("File not found"); } // read the RDF/XML file model.read(in, ""); // write it to standard out model.write(System.out); <rdf:RDF xmlns:rdf='http://www.w3.org/1999/02/22-rdf-syntax-ns#' xmlns:vcard='http://www.w3.org/2001/vcard-rdf/3.0#' > <rdf:Description rdf:nodeID="A0"> <vcard:Family>Smith</vcard:Family> <vcard:Given>John</vcard:Given> </rdf:Description> <rdf:Description rdf:about='http://somewhere/JohnSmith/'> <vcard:FN>John Smith</vcard:FN> <vcard:N rdf:nodeID="A0"/> </rdf:Description> ... </rdf:RDF> 80
108. RDF(S) query languages Languages developed to allow accessing datasets expressed in RDF(S) (and in some cases OWL) Supported by the most important language APIs Jena (HP labs) Sesame (Aduna) Boca (IBM) ... There are some differences wrt. languages like SQL, such as Combination of different sources Trust management Open World Assumption 84 Application Application SQL queries SPARQL, RQL, etc., queries RelationalDB RDF(S)OWL
109. Query types Selection and extraction “Select all the essays, together with their authors and their authors’ names” “Select everything that is related to the book ‘Bellum Civille’” Reduction: we specify what it should not be returned “Select everything except for the ontological information and the book translators” Restructuring: the original structure is changed in the final result “Invert the relationship ‘author’ by ‘is author of’” Aggregation “Return all the essays together with the mean number of authors per essay” Combination and inferences “Combine the information of a book called ‘La guerra civil’ and whose author is Julius Caesar with the book whose identifier is ‘Bellum Civille’” “Select all the essays, together with its authors and author names”, including also the instances of the subclasses of Essay “Obtain the relationship ‘coauthor’ among persons who have written the same book” 85
110. RDF(S) query language families SPARQL W3C Recommendation 15 January 2008 Description graphs Query semantics Triple database Query structure SquishQLFamily SquishQL rdfDB Query Language RDQL BRQL TriQL XPath, XSLT, XQuery XQueryfor RDF XsRQL TreeHuggerandRDFTwig RDFT, Nexus Query Language RDFPath, RpathandRXPath Versa RQL Family RQL SeRQL eRQL Controlled natural language Metalog Other Algae iTQL N3QL PerlRDF Query Language RDEVICE DeductiveLanguage RDFQBE RDFQL TRIPLE WQL XML repository Query syntax 86
111. SPARQL SPARQL Protocol and RDF Query Language Supported by: Jena, Sesame, IBM Boca, etc. Features Itsupportsmostoftheaforementionedqueries Itsupportsdatatypereasoning(datatypes can be requestedinsteadof actual values) Thedomainvocabularyandtheknowledgerepresentationvocabularyare treateddifferently by thequeryinterpreters Itallowsmakingqueriesoverpropertieswithmultiplevalues, overmultiplepropertiesof a resourceandoverreifications Queries can containoptionalstatements Someimplementationssupportaggregationqueries Limitations Neitherset operationsnorexistentialor universal quantifierscan be included in thequeries Itdoesnotsupportrecursivequeries 87
112. SPARQL is also a protocol SPARQL is a Query Language …Find names and websites of contributors to PlanetRDF: PREFIX foaf: <http://xmlns.com/foaf/0.1/> SELECT ?name ?website FROM <http://planetrdf.com/bloggers.rdf> WHERE { ?person foaf:weblog ?website . ?person foaf:name ?name . ?website a foaf:Document } ... and a Protocolhttp://.../qps?query-lang=http://www.w3.org/TR/rdf-sparql-query/ &graph-id=http://planetrdf.com/bloggers.rdf&query=PREFIXfoaf: <http://xmlns.com/foaf/0.1/... Services running SPARQL queries over a set of graphs A transport protocol for invoking the service Based on ideas from earlier protocol work such as Joseki Describing the service with Web Service technologies 88
113. SPARQL Endpoints SPARQL protocol services Enables users (human or other) to query a knowledge base using SPARQL Results are typically returned in one or more machine-processable formats List of SPARQL Endpoints http://esw.w3.org/topic/SparqlEndpoints Programmatic access using libraries: ARC, RAP, Jena, Sesame, Javascript SPARQL, PySPARQL, etc. Examples: 89
116. Each way a pattern can be matched yields a solution
117. The sequence of solutions is filtered by: Project, distinct, order, limit/offset
118. One of the result forms is applied: SELECT, CONSTRUCT, DESCRIBE, ASK91
119. Graph patterns Basic Graph Patterns, where a set of triple patterns must match Group Graph Pattern, where a set of graph patterns must all match Optional Graph patterns, where additional patterns may extend the solution Alternative Graph Pattern, where two or more possible patterns are tried Patterns on Named Graphs, where patterns are matched against named graphs 92
128. Patterns on named graphs II PREFIX foaf: <http://xmlns.com/foaf/0.1/> SELECT ?src ?bobNick FROM NAMED <http://example.org/foaf/aliceFoaf> FROM NAMED <http://example.org/foaf/bobFoaf> WHERE { GRAPH ?src { ?xfoaf:mbox <mailto:bob@work.example> . ?xfoaf:nick ?bobNick } } PREFIX foaf: <http://xmlns.com/foaf/0.1/> PREFIX data: <http://example.org/foaf/> SELECT ?nick FROM NAMED <http://example.org/foaf/aliceFoaf> FROM NAMED <http://example.org/foaf/bobFoaf> WHERE { GRAPH data:bobFoaf { ?x foaf:mbox <mailto:bob@work.example> . ?x foaf:nick ?nick } } 101
130. Value tests Based on XQuery 1.0 and XPath 2.0 Function and Operators XSD boolean, string, integer, decimal, float, double, dateTime Notation <, >, =, <=, >= and != for value comparisonApply to any type BOUND, isURI, isBLANK, isLITERAL REGEX, LANG, DATATYPE, STR (lexical form) Function call for casting and extensions functions 103
131. Solution sequences and modifiers SELECT ?name WHERE { ?x foaf:name ?name ; :empId ?emp } ORDER BY ?name DESC(?emp) Order modifier: put the solutions in order Projection modifier: choose certain variables Distinct modifier: ensure solutions in the sequence are unique Reduced modifier: permit elimination of some non-unique solutions Limit modifier: restrict the number of solutions Offset modifier: control where the solutions start from in the overall sequence of solutions SELECT ?name WHERE { ?x foaf:name ?name } SELECT DISTINCT ?name WHERE { ?x foaf:name ?name } SELECT REDUCED ?name WHERE { ?x foaf:name ?name } SELECT ?name WHERE { ?x foaf:name ?name } LIMIT 20 SELECT ?name WHERE { ?x foaf:name ?name } ORDER BY ?name LIMIT 5 OFFSET 10 104
132. SPARQL query forms SELECT Returns all, or a subset of, the variables bound in a query pattern match CONSTRUCT Returns an RDF graph constructed by substituting variables in a set of triple templates ASK Returns a boolean indicating whether a query pattern matches or not DESCRIBE Returns an RDF graph that describes the resources found 105
137. Main References Prud’hommeaux E, Seaborne A (2008) SPARQL Query Language for RDF. W3C Recommendation http://www.w3.org/TR/rdf-sparql-query/ SPARQL validator: http://www.sparql.org/validator.html SPARQL implementations: http://esw.w3.org/topic/SparqlImplementations SPARQL Endpoints http://esw.w3.org/topic/SparqlEndpoints SPARQL in Dbpedia http://dbpedia.org/sparql 110
139. Description Logics A family of logic based Knowledge Representation formalisms Descendants of semantic networks and KL-ONE Describe domain in terms of concepts (classes), roles (relationships) and individuals Specificlanguagescharacterisedbytheconstructors and axiomsusedtoassertknowledgeaboutclasses, roles and individuals. Example: ALC (theleastexpressivelanguage in DL thatispropositionallyclosed) Constructors: boolean(and, or, not) Role restrictions Distinguished by: Model theoretic semantics Decidable fragments of FOL Closely related to Propositional Modal & Dynamic Logics Provision of inference services Sound and complete decision procedures for key problems Implemented systems (highly optimised)
140. Structure of DL Ontologies A DL ontology can be divided into two parts: Tbox (Terminological KB): a set of axioms that describe the structure of a domain : Doctor Person Person Man Woman HappyFather Man hasDescendant.(Doctor hasDescendant.Doctor) Abox (Assertional KB): a set of axioms that describe a specific situation : John HappyFather hasDescendant (John, Mary)
141. Mostcommonconstructors in classdefinitions Intersection: C1 ... Cn Human Male Union: C1 ... Cn Doctor Lawyer Negation: C Male Nominals: {x1} ... {xn} {john} ... {mary} Universal restriction: P.C hasChild.Doctor Existentialrestriction: P.C hasChild.Lawyer Maximumcardinality: nP.C 3hasChild.Doctor Minimumcardinality: nP.C 1hasChild.Male SpecificValue: P.{x} hasColleague.{Matthew} Nesting of constructors can bearbitrarilycomplex Person hasChild.(Doctor hasChild.Doctor) Lots of redundancy AB is equivalent to ( A B) P.C is equivalent to P. C
152. OWL 2 (II). Three new profiles OWL2 EL Ontologies that define very large numbers of classes and/or properties, Ontology consistency, class expression subsumption, and instance checking can be decided in polynomial time. OWL2 QL Sound and complete query answering is in LOGSPACE (more precisely, in AC0) with respect to the size of the data (assertions), Provides many of the main features necessary to express conceptual models (UML class diagrams and ER diagrams). It contains the intersection of RDFS and OWL 2 DL. OWL2 RL Inspired by Description Logic Programs and pD*. Syntactic subset of OWL 2 which is amenable to implementation using rule-based technologies, and presenting a partial axiomatization of the OWL 2 RDF-Based Semantics in the form of first-order implications that can be used as the basis for such an implementation. Scalable reasoning without sacrificing too much expressive power. Designed for OWL applications trading the full expressivity of the language for efficiency, RDF(S) applications that need some added expressivity from OWL 2.
153. OWL: Most common constructors Intersection: C1 ... CnintersectionOf Human Male Union: C1 ... CnunionOf Doctor Lawyer Negation: C complementOf Male Nominals: {x1} ... {xn} oneOf {john} ... {mary} Universal restriction: P.C allValuesFrom hasChild.Doctor Existentialrestriction: P.C someValuesFrom hasChild.Lawyer Maximumcardinality: nP[.C] maxCardinality (qualifiedornot) 3hasChild[.Doctor] Minimumcardinality: nP[.C] minCardinality (qualifiedornot) 1hasChild[.Male] Exactcardinality: =nP[.C] exactCardinality (qualifiedornot) =1hasMother[.Female] SpecificValue: P.{x} hasValue hasColleague.{Matthew} Local reflexivity: -- hasSelfNarcisist Person hasSelf(loves) Keys -- hasKeyhasKey(Person, passportNumber, country) Subclass C1 C2 subClassOf Human Animal Biped Equivalence C1 C2 equivalentClass Man Human Male Disjointness C1 C2 disjointWith, AllDisjointClasses Male Female DisjointUnion C C1 ... Cn and Ci Cj foralli≠jdisjointUnionOf Person DisjointUnionOf (Man, Woman) Metaclasses and annotations on axioms are also valid in OWL2, and declarations of classes have to provided. Full list available in reference specs and in the Quick Reference Guide: http://www.w3.org/2007/OWL/refcard
154. OWL: Most common constructors Subproperty P1 P2 subPropertyOf hasDaughter hasChild Equivalence P1 P2 equivalentProperty cost price DisjointProperties P1 ... Pn disjointObjectProperties hasDaughter hasSon Inverse P1 P2- inverseOf hasChild hasParent- Transitive P+ P TransitiveProperty ancestor+ ancestor Functional 1P FunctionalProperty T 1hasMother InverseFunctional 1P- InverseFunctionalProperty T 1hasPassportID- Reflexive ReflexiveProperty Irreflexive IrreflexiveProperty Asymmetric AsymmetricProperty Property chains P P1 o ... o Pn propertyChainAxiom hasUncle hasFather o hasBrother Equivalence {x1} {x2} sameIndividualAs {oeg:OscarCorcho}{img:Oscar} Different {x1} {x2} differentFrom, AllDifferent {john} {peter} NegativePropertyAssertion NegativeDataPropertyAssertion {hasAge john 35} NegativeObjectPropertyAssertion {hasChild john peter} Besides, top and bottom object and datatype properties exist
155. Basic Inference Tasks Subsumption – check knowledge is correct (captures intuitions) Does C subsume D w.r.t. ontology O? (in every modelI of O, CIDI ) Equivalence – check knowledge is minimally redundant (no unintended synonyms) Is C equivalent to D w.r.t. O? (in every modelI of O, CI = DI ) Consistency – check knowledge is meaningful (classes can have instances) Is C satisfiable w.r.t. O? (there exists some modelI of O s.t. CI) Instantiation and querying Is x an instance of C w.r.t. O? (in every modelI of O, xICI ) Is (x,y) an instance of R w.r.t. O? (in every modelI of O, (xI,yI) RI ) All reducible to KB satisfiability or concept satisfiability w.r.t. a KB Can be decided using highly optimised tableaux reasoners
156. MainReferences W3C OWL Working Group (2009) OWL2 Web Ontology Language Document Overview. http://www.w3.org/TR/2009/REC-owl2-overview-20091027/ Dean M, Schreiber G (2004) OWL Web Ontology Language Reference. W3C Recommendation. http://www.w3.org/TR/owl-ref/ Gómez-Pérez, A.; Fernández-López, M.; Corcho, O. Ontological Engineering. Springer Verlag. 2003 Capítulo 4: Ontology languages Baader F, McGuinness D, Nardi D, Patel-Schneider P (2003) The Description Logic Handbook: Theory, implementation and applications. Cambridge University Press, Cambridge, United Kingdom Jena web site:http://jena.sourceforge.net/ Jena API: http://jena.sourceforge.net/tutorial/RDF_API/ Jena tutorials:http://www.ibm.com/developerworks/xml/library/j-jena/index.html http://www.xml.com/pub/a/2001/05/23/jena.html Pellet: http://clarkparsia.com/pellet RACER: http://www.racer-systems.com/ FaCT++: http://owl.man.ac.uk/factplusplus/ HermIT: http://hermit-reasoner.com/
157. Contents IntroductiontoLinked Data Linked Data Foundations: RDF, RDF Schema, SPARQL and OWL Coffee break Linked Data publication MethodologicalguidelinesforLinked Data publication RDB2RDF tools Technicalaspects of Linked Data publication Linked Data consumption 122
158. MethodologicalguidelinesforLinked Data publication Motivation Related Work GeoLinkedData Identification of the data sources Vocabulary Development Generation of the RDF data Publication of the RDF data Data cleansing Linking the RDF data Enable effective discovery Future Work
159. GeoLinkedData It is an open initiative whose aim is to enrich the Web of Data with Spanish geospatial data. This initiative has started off by publishing diverse information sources, such as National Geographic Institute of Spain (IGN-E) and National Statistics Institute (INE) http://geo.linkeddata.es
160. Motivation 99.171 % English 0.019 % Spanish The Web of Data ismainlyfor Englishspeakers Poorpresence of Spanish Source:Billion Triples dataset at http://km.aifb.kit.edu/projects/btc-2010/ Thanks to Aidan and Richard
162. Impact of Geo.linkeddata.es Número de tripletas en Español (July): 1.412.248 Número de tripletas en Español (End august): 21.463.088 127 Asunción Gómez Pérez
163. Processfor Publishing Linked Data onthe Web Identification of the data sources Vocabulary development Generation of the RDF Data Publication of the RDF data Data cleansing Linking the RDF data Enable effective discovery
164. 1. Identification and selection of the data sources Identification of the data sources Instituto GeográficoNacional Vocabulary development Generation of the RDF Data Publication of the RDF data Data cleansing Linking the RDF data Instituto Nacionalde Estadística Enable effective discovery
170. 1. Identification and selection of the data sources IndustryProductionIndex Year Province
171. 2. Vocabulary development http://www4.wiwiss.fu-berlin.de/bizer/pub/LinkedDataTutorial/#whichvocabs Identification of the data sources Vocabulary development Generation of the RDF Data Thisisnotenough Publication of the RDF data Data cleansing Linking the RDF data Enable effective discovery
172. 2. Vocabularydevelopment Features Lightweight : Taxonomies and a fewproperties Consensuatedvocabularies Toavoidthemappingproblems Multilingual Linked data are multilingual TheNeOnmethodology can helpto Re-enginer Non ontologicalresourcesintoontologies Pros: use domainterminologyalreadyconsensuatedbydomainexperts Withdraw in heavyweightontologiesthosefeaturesthatyoudon’tneed Reuseexistingvocabularies 134 Identification of the data sources Vocabulary development Generation of the RDF Data Publication of the RDF data Data cleansing Linking the RDF data Enable effective discovery Asunción Gómez Pérez
173. Knowledge Resources Ontological Resources O. Design Patterns 3 4 O. Repositories and Registries 5 6 Flogic RDF(S) OWL OntologicalResource Reuse O. Aligning O. Merging 5 6 2 Ontology Design Pattern Reuse Non Ontological Resource Reuse 4 3 6 Non Ontological Resources 2 Ontological Resource Reengineering 7 Glossaries Dictionaries Lexicons 5 Non Ontological Resource Reengineering 4 6 Classification Schemas Thesauri Taxonomies Alignments 2 RDF(S) 1 Flogic O. Conceptualization O. Implementation O. Formalization O. Specification Scheduling OWL 8 Ontology Restructuring (Pruning, Extension, Specialization, Modularization) 9 O. Localization 1,2,3,4,5,6,7,8, 9 Ontology Support Activities: Knowledge Acquisition (Elicitation); Documentation; Configuration Management; Evaluation (V&V); Assessment 135
174. Vocabularydevelopment: Specification Content requirements: Identifythe set of questionsthattheontologyshouldanswer Whichone are theprovinces in Spain? Where are thebeaches? Where are thereservoirs? Identifytheproductionindex in Madrid Whichoneisthecitywithhigherproductionindex? Give me Madrid latitude and altitude …. Non-contentrequirements Theontologymustbe in thefourofficialSpanishlanguages 136 Asunción Gómez Pérez
175. 2. Lightweight Ontology Development WGS84 Geo Positioning: an RDF vocabulary scv:Dimension scv:Item scv:Dataset hydrographical phenomena (rivers, lakes, etc.) Vocabulary for instants, intervals, durations, etc. Names and international code systems for territories and groups Ontology for OGC Geography Markup Language reused Following the INSPIRE (INfrastructure for SPatial InfoRmation in Europe) recommendation. hydrOntology,SCOVO, FAO Geopolitcal, WGS84, GML, and Time
176. Objetivos: INSPIRE intenta conseguir fuentes armonizadas de Información Geográfica para dar soporte a la formulación, implementación y evaluación de políticas comunitarias (Medio Ambiente, etc). Fuentes de Información Geográfica: Bases de datos de los Estados Miembros (UE) a nivel local, regional, nacional e internacional. Contexto – Directiva INSPIRE Luis Manuel Vilches Blázquez
178. hydrOntology Existencia de gran diversidad de problemas (múltiples fuentes, heterogeneidad de contenido y estructuración, ambigüedad del lenguaje natural, etc.) en la información geográfica. Necesidad de un modelo compartido para solventar los problemas de armonización y estructuración de la información hidrográfica. hydrOntology es una ontología global de dominio desarrollada conforme a un acercamiento top-down. Recubrir la mayoría de los fenómenos representables cartográficamente asociados al dominio hidrográfico. Servir como marco de armonización entre los diferentes productores de información geo-espacial en el entorno nacional e internacional. Comenzar con los pasos necesarios para obtener una mejor organización y gestión de la información geográfica (hidrográfica). Luis Manuel Vilches Blázquez
179. Fuentes Tesauros y Bibliografía Catálogos de fenómenos Getty FTT ADL BCN25 GEMET WFD CC.AA. EGM & ERM Diccionarios y Monografías BCN200 Nomenclátor Geográfico Nacional Nomenclátor Conciso Luis Manuel Vilches Blázquez
180. Criterios de estructuración Directiva Marco del Agua Propuesta por Parlamento y Consejo de la UE Lista de definiciones de fenómenos hidrográficos Proyecto SDIGER Proyecto piloto INSPIRE Dos cuencas, países e idiomas Criterios semánticos Diccionarios geográficos Diccionario de la Real Academia de la Lengua WordNet Wikipedia Bibliografía de varias áreas de conocimiento Herencia: Estructuración actual de catálogos Asesoramiento expertos en toponimia del IGN Luis Manuel Vilches Blázquez
184. 3. Generation of RDF From the Data sources Geographic information (Databases) Statistic information (.xsl) Geospatial information Different technologies for RDF generation Reengineering patterns R20 and ODEMapster Annotation tools Geometry generation Identification of the data sources Vocabulary development Generation of the RDF Data Publication of the RDF data Data cleansing Linking the RDF data Enable effective discovery
185. 3. Generation of the RDF Data NOR2O INE ODEMapster IGN Geometry2RDF Geospatial column IGN
186. 3. Generation of the RDF Data / instances NOR2O is a software librarythatimplementsthetransformationsproposedbythePatternsfor Re-engineering Non-OntologicalResources (PR-NOR). Currentlywehave 16 PR-NORs. PR-NORs define a procedurethattransforms a Non-OntologicalResource (NOR) componentsintoontologyelements. http://ontologydesignpatterns.org/ · Classification schemes NOR2O · Thesauri · Lexicons NOR2O FAO Water classification · Classification scheme · Path enumeration data model · Implemented in a database
191. 3. Generation of the RDF Data – R2O & ODEMapster Creation of the R2O Mappings
192. 3. Generation of the RDF Data – Geometry2RDF Oracle STO UTIL package SELECT TO_CHAR(SDO_UTIL.TO_GML311GEOMETRY(geometry)) AS Gml311Geometry FROM "BCN200"."BCN200_0301L_RIO" c WHERE c.Etiqueta='Arroyo'
195. 3. Generation of the RDF data – RDF graphs IGN INE So far 7 RDF Named Graphs 1.412.248 triples BTN25 BCN200 IPI …. http://geo.linkeddata.es/dataset/IGN/BTN25 http://geo.linkeddata.es/dataset/IGN/BCN200 http://geo.linkeddata.es/dataset/INE/IPI
196. 4. Publication of the RDF Data Identification of the data sources Vocabulary development SPARQL Linked Data HTML Generation of the RDF Data IncludingProvenance Support Publication of the RDF data Pubby Pubby 0.3 Data cleansing Linking the RDF data Enable effective discovery Virtuoso 6.1.0
198. 4. Publication of the RDF Data - License License for GeoLinkedData Creative Commons Attribution-ShareAlike 3.0 GNU Free Documentation License Each dataset will have its own specific license, IGN, INE, etc.
199. 5. Data cleansing Identification of the data sources Lack of documentation of the IGN datasets Broken links: Spain, IGN resources Lack of documentation of theontology Missingenglish and spanishlabels Building a spanish ontology and importing some concepts of other ontology (in English): Importing the English ontology. Add annotations like a Spanish label to them. Importing the English ontology, creating new concepts and properties with a Spanish name and map those to the English equivalents. Re-declaring the terms of the English ontology that we need (using the same URI as in the English ontology), and adding a Spanish label. Creating your own class and properties that model the same things as the English ontology. Vocabulary development Generation of the RDF Data Publication of the RDF data Data cleansing Linking the RDF data Enable effective discovery
200. 5. Data cleansing URIs in Spanish http://geo.linkeddata.es/ontology/Río RDF allows UTF-8 characters for URIs But, Linked Data URIs has to be URLs as well So, non ASCII-US characters have to be %code http://geo.linkeddata.es/ontology/R%C3%ADo
201. 6. Linking of the RDF Data Identification of the data sources Silk - A Link Discovery Framework for the Web of Data First set of links: Provinces of Spain 86% accuracy Vocabulary development Geonames Generation of the RDF Data GeoLinkedData DBPedia Publication of the RDF data Data cleansing Linking the RDF data Enable effective discovery
202. 6. Linking of the RDF Data http://geo.linkeddata.es/page/Provincia/Granada 164 Asunción Gómez Pérez
203. 7. Enable effective discovery Identification of the data sources Vocabulary development Generation of the RDF Data Publication of the RDF data Data cleansing Linking the RDF data Enable effective discovery
209. Future Work Generate more datasets from other domains, e.g. universities in Spain. Identify more links to DBPedia and Geonames. Cover complex geometrical information, i.e. not only Point and LineString-like data; we will also treat information representation through polygons.
210. Contents IntroductiontoLinked Data Linked Data Foundations: RDF, RDF Schema, SPARQL and OWL Coffee break Linked Data publication MethodologicalguidelinesforLinked Data publication RDB2RDF tools Technicalaspects of Linked Data publication Linked Data consumption 172
211. Ontology-based Access to DBs 1 3 2 4 Build a new ontology from 1 DB schema and 1 DB Align the ontology built with approach 1 with a legacy ontology Align an existing DB with a legacy ontology a) Massive dump (semantic data warehouse) b) Query-driven Align an ontology network with n DB schemas and other data sources a) Massive dump (semantic data warehouse) b) Query-driven new ontology existing ontology
212. Ontology-based Access to Databases Universidad Profesor Doctorando Ontología ? Organización Personal BDR Modelo Relacional Pregunta: Nombre de los profesores de la universidad UPM * Un profesor es una persona cuyo puesto es “docente” * Una universidad es una organización de tipo “3” Procesador Procesado de la consulta de acuerdo a la descripción formal de correspondencia Consulta: valores de la columna nombre de los registros de la tabla Personal para los que el valor de la columna puesto is “docente” que estén relacionados con al menos un registro de la tabla Organización con el valor “3” en la columna tipo y “UPM” en la columna nombre.
213. Align data sourceswithlegacyontologies Aeropuertos Ontología O2 Ontología O1 Centro Comunicaciones PuntoGPS Estación Punto Europeo Aeropuerto PuntoAsiatico PuntoEspañol Aeropuerto f (Aeropuertos) = PuntoEuropeo f (Aeropuertos) = RC(O2,M1) RC(O1,M1) Modelo Relacional M1
214. R2O is a declarative language to specify mappings between relational data sources and ontologies. <xml> R2O Mapping </xml> Organization Persons University RDB Professor Student Relational Model Ontology
215. Example: types of mappingsneeded Attibute Mapping with transformation (Regular Expression) Attibute Direct Mapping Relation Mapping w. Transformation (Regular Expression) Relation Mapping w. Transformation (Keyword search)
216. Population example (II) Population example (II) The Operation element defines a transformation based on a regular expression to be applied to the database column for extracting property values
217. For concepts... One or more concepts can be extracted from a single data field (not in 1NF). A view maps exactly one concept in the ontology. For attributes... A column in a database view maps directly an attribute or a relation. A subset of the columns in the view map a concept in the ontology. A subset (selection) of the records of a database view map a concept in the ontology. A column in a database view maps an attribute or a relation after some transformation. A subset of the records of a database view map a concept in the onto. but the selection cannot be made using SQL. A set of columns in a database view map an attribute or a relation. R2O (Relational-to-Ontology) Language
227. Using an RDF repository Itallowsstoring and accessing RDF data Forexample, SESAME (http://www.openrdf.org/) Downloaditfromhttp://www.openrdf.org/download.jsp openrdf-sesame-2.3.0-sdk.zip Deploythe .war in Tomcat (JDK and Tomcatneeded) Create a repository at http://localhost:8080/openrdf-sesame Check: http://localhost:8080/openrdf-sesame/repositories/XXXX http://localhost:8080/openrdf-sesame/repositories/XXX/statements
228. Linked Data frontend Toexpose data as Linked Data Includingcontentnegotiation, etc. Forexample, Pubby http://www4.wiwiss.fu-berlin.de/pubby/ Installation Use pubby-0.3.zip Deploythewebapp folder (and rename)in Tomcat Modify config.n3 Restarttomcat Check: http://localhost:8080/XXX/
233. Contents IntroductiontoLinked Data Linked Data Foundations: RDF, RDF Schema, SPARQL and OWL Coffee break Linked Data publication MethodologicalguidelinesforLinked Data publication RDB2RDF tools Technicalaspects of Linked Data publication Linked Data consumption 189
234. RelFinder: finding relations in Linked Data E.g., relations between films “Pulp Fiction”, “Kill Bill” y “Reservoir Dogs”
235.
236. Designing URI sets forthePublic Sector (UK) http://www.cabinetoffice.gov.uk/media/301253/puiblic_sector_uri.pdf 193
238. Mini-curso sobre LinkedData Oscar Corcho, Asunción Gómez Pérez ({ocorcho, asun}@fi.upm.es) Universidad Politécnica de Madrid Florianópolis, September 1st 2010(3º OntoBras 2010) Credits: Raúl García Castro, Oscar Muñoz, Jose Angel Ramos Gargantilla, María del Carmen Suárez de Figueroa, Boris Villazón, Alex de León, Víctor Saquicela, Luis Vilches, Miguel Angel García, Manuel Salvadores, Juan Sequeda, Carlos Ruiz Moreno and manyothers WorkdistributedunderthelicenseCreativeCommonsAttribution-Noncommercial-Share Alike 3.0