Presentation for one of the keynotes at EKAW2014, where I talked about the need to lower the barrier for ontology development for those who have no experience with ontologies.
Details of the tiger conservation in India, including tiger mortality from Ministry of Environment and Forests.
http://indianbiodiversitytalk.blogspot.in/2012/08/19-tiger-deaths-in-corbett-national.html
Basic introductory talk about the Web of Linked Data, given to undergraduate and posgraduate students of Universidad del Valle (Cali, Colombia) in September 2010. Knowledge about Semantic Web is required
Ojo Al Data 100 - Call for sharing session at IODC 2016Oscar Corcho
This is the presentation of the #ojoaldata100 initiative (http://ojoaldata100.okfn.es) for the selection of 100 datasets that every city should be publishing in their open data portal. This presentation was used in a call for sharing session at the 4th International Open Data Conference (IODC2016).
Android Services Black Magic by Aleksandar GargentaMarakana Inc.
Presented at Android Builders Summit on February 14th in Redwood Shores, CA by Aleksandar (Saša) Gargenta, from Marakana Inc.
For the complete slides from this talk go to http://mrkn.co/munz7
"The most interesting part of Android stack are the Android System Services. The 60+ such services expose the low level functionality, such as Power Management, Wifi, Camera, Sensors, GPS, Display, Audio, Media, and so on, all the hardware all the way up to the application layer. While each one is different, the all have certain similarities, namely the way they rely on Binder (Android's IPC mechanism), use JNI to cross Java-C boundary, and use of shared libraries to abstract the Linux drivers. In this talk, we'll explore the common system services in Android and discuss their architecture. You will get to see the diagrams of the inner workings of some of the previously undocumented parts of the Android stack. By the end of the talk, you should have a better understanding of the underpinnings of the backbone of Android OS."
https://events.linuxfoundation.org/events/android-builders-summit/gargentaa
Very basic introductory talk about the Semantic Web, given to undergraduate and posgraduate students of Universidad del Valle (Cali, Colombia) in September 2010
Digital Tools, Trends and Methodologies in the Humanities and Social SciencesShawn Day
This interactive seminar will explore trends and initiatives in the digital community of practice in the humanities and the social sciences. Participants will come away with a appreciation of from where the field has emerged and how it interacts with traditional disciplines. This seminar will be of interest to those in traditional disciplines as well as the wider academy as digital humanities is both collaborative and multidisciplinary in practise. It is intended to form a broad and easy introduction to the practise of digital humanities and will appeal especially to new scholar who is open to the potential to combine their traditional scholarship with digital tools and methodologies. It is *introductory* in nature.
SLE/GPCE Keynote: What's the value of an end user? Platforms and Research: Th...Stéphane Ducasse
This talk will present the synergy arising from building platforms on top of which do our research. RMOD our team [1] is developing two platforms: Pharo (a dynamic reflective object-oriented language supporting live programming) and Moose (an open-source software analysis platform [2]). Developing platforms forces us to develop really usable systems. While some activities are more engineering than research per se, it is really interesting to deeply understand problems or impacts of certain design decisions. Developing platforms is rewarding because it is more a long term effort and ensures a degree of stability. Platforms also often exhibit non-linear growth that is really exciting. Finally this setup raises many interesting questions such as “What is the value in terms of citations or published papers of a couple of end-users”, or “Is it not really stupid not to work on latest hype language?” To try to open our minds, I will draw parallels with the notion of wealth of an ecosystem in biology. In the second part of the talk I will present some selected results around Pharo and Moose such as: automatic minimal system core generation, dynamic core updates, selector namespace, dependencies in past commit branches and automatic migration rule generation.
[1] http://rmod.lille.inria.fr/
[2] http://www.moosetechnology.org/
[3] http://www.pharo.org/
Presentation at 2013 World Summit on the Information Society multistakeholder review event (WSIS+10)
UNESCO, Paris, 25-27 February 2013
ISSC Session: Critical Social Sciences in the Digital Age
How communities curate knowledge & how ontologists can help -Eurecom--2015-01-19jodischneider
Invited talk 2015-01-19 at EURCOM.
Two themes:
How do communities curate knowledge?
and
How can information technology help?
Q: How do communities curate knowledge?
A: Communities curate knowledge by discussing evidence and applying community standards to it.
In Wikipedia, 4 questions are used to evaluate borderline articles:
Notability – Is the topic appropriate for our encyclopedia?
Sources – Is the article well-sourced?
Maintenance – Can we maintain this article?
Bias – Is the article neutral? POV appropriately weighted?
Q: How can information technology help?
A: Information technology can organize evidence based on the criteria communities use.
In Wikipedia, we developed an alternate interface for deletion discussions.
Intelligent Web Applications guest lecture about LOD and how to use it for applications. Includes pointers and demos for MultimediaN Eculture, Verrijkt Koninkrijk, Dutch Ships and Sailors, IATI 2LOD and RadioMarche
This presentation is delivered as part of the Faculty training program at Kristu Jayanthi College, Bangalore. The intent was to help students build competency and contribute to open source projects. Also which will eventually help them to build professional career in open source connected domains.
This event was organized by the SODA Foundation and lots of fabulous speakers delivered the series. Thank you SODA!!!!
Details of the tiger conservation in India, including tiger mortality from Ministry of Environment and Forests.
http://indianbiodiversitytalk.blogspot.in/2012/08/19-tiger-deaths-in-corbett-national.html
Basic introductory talk about the Web of Linked Data, given to undergraduate and posgraduate students of Universidad del Valle (Cali, Colombia) in September 2010. Knowledge about Semantic Web is required
Ojo Al Data 100 - Call for sharing session at IODC 2016Oscar Corcho
This is the presentation of the #ojoaldata100 initiative (http://ojoaldata100.okfn.es) for the selection of 100 datasets that every city should be publishing in their open data portal. This presentation was used in a call for sharing session at the 4th International Open Data Conference (IODC2016).
Android Services Black Magic by Aleksandar GargentaMarakana Inc.
Presented at Android Builders Summit on February 14th in Redwood Shores, CA by Aleksandar (Saša) Gargenta, from Marakana Inc.
For the complete slides from this talk go to http://mrkn.co/munz7
"The most interesting part of Android stack are the Android System Services. The 60+ such services expose the low level functionality, such as Power Management, Wifi, Camera, Sensors, GPS, Display, Audio, Media, and so on, all the hardware all the way up to the application layer. While each one is different, the all have certain similarities, namely the way they rely on Binder (Android's IPC mechanism), use JNI to cross Java-C boundary, and use of shared libraries to abstract the Linux drivers. In this talk, we'll explore the common system services in Android and discuss their architecture. You will get to see the diagrams of the inner workings of some of the previously undocumented parts of the Android stack. By the end of the talk, you should have a better understanding of the underpinnings of the backbone of Android OS."
https://events.linuxfoundation.org/events/android-builders-summit/gargentaa
Very basic introductory talk about the Semantic Web, given to undergraduate and posgraduate students of Universidad del Valle (Cali, Colombia) in September 2010
Digital Tools, Trends and Methodologies in the Humanities and Social SciencesShawn Day
This interactive seminar will explore trends and initiatives in the digital community of practice in the humanities and the social sciences. Participants will come away with a appreciation of from where the field has emerged and how it interacts with traditional disciplines. This seminar will be of interest to those in traditional disciplines as well as the wider academy as digital humanities is both collaborative and multidisciplinary in practise. It is intended to form a broad and easy introduction to the practise of digital humanities and will appeal especially to new scholar who is open to the potential to combine their traditional scholarship with digital tools and methodologies. It is *introductory* in nature.
SLE/GPCE Keynote: What's the value of an end user? Platforms and Research: Th...Stéphane Ducasse
This talk will present the synergy arising from building platforms on top of which do our research. RMOD our team [1] is developing two platforms: Pharo (a dynamic reflective object-oriented language supporting live programming) and Moose (an open-source software analysis platform [2]). Developing platforms forces us to develop really usable systems. While some activities are more engineering than research per se, it is really interesting to deeply understand problems or impacts of certain design decisions. Developing platforms is rewarding because it is more a long term effort and ensures a degree of stability. Platforms also often exhibit non-linear growth that is really exciting. Finally this setup raises many interesting questions such as “What is the value in terms of citations or published papers of a couple of end-users”, or “Is it not really stupid not to work on latest hype language?” To try to open our minds, I will draw parallels with the notion of wealth of an ecosystem in biology. In the second part of the talk I will present some selected results around Pharo and Moose such as: automatic minimal system core generation, dynamic core updates, selector namespace, dependencies in past commit branches and automatic migration rule generation.
[1] http://rmod.lille.inria.fr/
[2] http://www.moosetechnology.org/
[3] http://www.pharo.org/
Presentation at 2013 World Summit on the Information Society multistakeholder review event (WSIS+10)
UNESCO, Paris, 25-27 February 2013
ISSC Session: Critical Social Sciences in the Digital Age
How communities curate knowledge & how ontologists can help -Eurecom--2015-01-19jodischneider
Invited talk 2015-01-19 at EURCOM.
Two themes:
How do communities curate knowledge?
and
How can information technology help?
Q: How do communities curate knowledge?
A: Communities curate knowledge by discussing evidence and applying community standards to it.
In Wikipedia, 4 questions are used to evaluate borderline articles:
Notability – Is the topic appropriate for our encyclopedia?
Sources – Is the article well-sourced?
Maintenance – Can we maintain this article?
Bias – Is the article neutral? POV appropriately weighted?
Q: How can information technology help?
A: Information technology can organize evidence based on the criteria communities use.
In Wikipedia, we developed an alternate interface for deletion discussions.
Intelligent Web Applications guest lecture about LOD and how to use it for applications. Includes pointers and demos for MultimediaN Eculture, Verrijkt Koninkrijk, Dutch Ships and Sailors, IATI 2LOD and RadioMarche
This presentation is delivered as part of the Faculty training program at Kristu Jayanthi College, Bangalore. The intent was to help students build competency and contribute to open source projects. Also which will eventually help them to build professional career in open source connected domains.
This event was organized by the SODA Foundation and lots of fabulous speakers delivered the series. Thank you SODA!!!!
Openness is the becoming more crucial feature of science.
Science is to be open because of healthiness of science itself and its public relationship. Openness in science is not only papers but also data now. Scholarly communication should adapt to the openness of papers and data in science. It is a long experience for openness of papers, i.e., open access of papers, but openness of data is relatively new and more challenging since data is diverse in contents and quantity and manifold in use. One of the key technology for sharing papers and data is identifier. Identifiers can remove ambiguity of digital objects and their attributions, realize smarter use of data, and make more linkages to other information. DOI (Digital Object Identifier) is the most successful identifiers in this area, which is now extending their target to data. Japan Link Center (JaLC) now starts the experimental use of DOI for data. The other identifier to be noted is ORCID (Open Researcher and Contributor Identifier), which is designed and used for identifying researchers worldwide. Embeding these idenfiers in metadata can make more suitable scholarly commucation to openness of science.
At my first visit to SciPy in Latin America, I was able to review the history of PyData, SciPy, and NumFOCUS, and discuss how to grow its communities and cooperate in the future. I also introduce OpenTeams as a way for open-source contributors to grow their reputation and build businesses.
Ontology Engineering at Scale for Open City Data SharingOscar Corcho
Seminar at the School of Informatics, The University of Edinburgh.
In this talk we will present how we are applying ontology engineering principles and tools for the development of a set of shared vocabularies across municipalities in Spain, so that they can start homogenising the generation and publication of open data that may be useful for their own internal reuse as well as for third parties who want to develop applications reusing open data once and deploy them for all municipalities. We will discuss on the main challenges for ontology engineering that arise in this setting, as well as present the work that we have done to integrate ontology development tools into common software development infrastructure used by those who are not experts in Ontology Engineering.
ArCo: the Knowledge Graph of Italian Cultural HeritageValentina Presutti
ArCo is a very ambitious ontology project. Starting from the official central catalogue of Italian Cultural Heritage (maintained by the Ministry) as its main source, its goal is to release an open knowledge graph encoding knowledge about the entities described in catalogue records. This means going beyond the mere representation of their metadata. Although there's still a long way to go, ArCo reached its first 'stable' version (https://w3id.org/arco). The experience in developing this project has taught us important lessons both in knowledge engineering in general, and on its application to Cultural Heritage. In this talk I will tell ArCo's story and lessons learned focusing on methodological, social and ontological perspectives.
Similar to EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already there? (20)
Organisational Interoperability in Practice at Universidad Politécnica de MadridOscar Corcho
Presentation on EOSC Interoperability Framework in relation to Organisational Interoperability, and how it can be applied to a Research Performing Organisation such as UPM
Open Data (and Software, and other Research Artefacts) -A proper managementOscar Corcho
Presentation at the event "Let's do it together: How to implement Open Science Practices in Research Projects" (29/11/2019), organised by Universidad Politécnica de Madrid, where we discuss on the need to take into account not only open access or open research data, but also all the other artefacts that are a result of our research processes.
Adiós a los ficheros, hola a los grafos de conocimientos estadísticosOscar Corcho
Esta presentación se ha realizado en el contexto de la Jornada sobre difusión, accesibilidad y reutilización de la estadística y cartografía oficial (http://www.juntadeandalucia.es/institutodeestadisticaycartografia/blog/2019/11/jornada-plan/), organizada por el Instituto de Estadística y Cartografía de Andalucía.
Situación de las iniciativas de Open Data internacionales (y algunas recomen...Oscar Corcho
Presentación sobre iniciativas de Open Data Internacionales y nacionales, realizada en el contexto del Curso de Verano de la Universidad de Extremadura "BigData y Machine Learning junto a fuentes de datos abiertos para especializar el sector agroganadero", el 25/09/2018
Presentación general sobre contaminación lumínica, en español, del proyecto STARS4ALL (www.stars4all.eu). Generada por el consorcio del proyecto, con especial agradecimiento a Lucía García (@shekda) por generar la primera versión en inglés, y Miquel Serra-Ricart, por realizar su traducción inicial.
Towards Reproducible Science: a few building blocks from my personal experienceOscar Corcho
Invited keynote given at the Second International Workshop on Semantics for BioDiversity (http://fusion.cs.uni-jena.de/s4biodiv2017/), held in conjunction with ISWC2017 (https://iswc2017.semanticweb.org/)
Publishing Linked Statistical Data: Aragón, a case studyOscar Corcho
Presentation at the Semstats2017 workshop (http://semstats.org/2017/) for the paper "Publishing Linked Statistical Data: Aragón, a Case Study", by Oscar Corcho, Idafen Santana-Pérez, Hugo Lafuente, David Portolés, César Cano, Alfredo Peris, José María Subero.
An initial analysis of topic-based similarity among scientific documents base...Oscar Corcho
Presentation given at the SemSci2017 workshop (https://semsci.github.io/semSci2017/), for the paper "An Initial Analysis of Topic-based Similarity among Scientific Documents Based on their Rhetorical Discourse Parts" http://ceur-ws.org/Vol-1931/paper-03.pdf
Introductory talk on the usage of Linked Data for official statistics, given at the ESS (Linked) Open Data Workshop 2017, in Malta, January 2017.
In this introductory talk we will discuss the main foundations for the application of Linked Data principles into official statistics. We will briefly introduce what Linked Data is, as well as the main principles, languages and technologies behind it (URIs, RDF, SPARQL). We will also discuss about the different formats in which data can be made available on the Web (e.g., RDF Turtle, JSON-LD, CSV on the Web). We will then move into providing a detailed presentation, with step by step examples based on existing Linked Statistical Data sources, of the W3C recommendation RDF DataCube, which is the basis for the dissemination of statistical data as Linked Data. Finally, we will provide some examples of applications, and the opportunities that this approach offers for the development of the proofs of concepts selected by Eurostat and to be discussed during the meeting.
Aplicando los principios de Linked Data en AEMETOscar Corcho
Presentación realizada en uno de los paneles de la jornada sobre datos abiertos organizada por AEMET el 13 de diciembre del 2016, sobre la aplicación de los principios de Linked Data la API REST de AEMET
Educando sobre datos abiertos: desde el colegio a la universidadOscar Corcho
Presentación realizada en la mesa 3 del evento Aporta 2016, uno de los pre-eventos de la semana de los datos abiertos en Madrid. Realizada el 3 de octubre del 2016.
http://datos.gob.es/encuentro-aporta?q=node/654503
Generación de datos estadísticos enlazados del Instituto Aragonés de EstadísticaOscar Corcho
En esta presentación mostramos el trabajo realizado para la generación y publicación de datos enlazados a partir de los datos de estadística local del Instituto Aragonés de Estadística
Presentación de la red de excelencia de Open Data y Smart CitiesOscar Corcho
Presentación general de la red de excelencia de Open Data y Smart Cities (http://www.opencitydata.es), realizada en Medialab-Prado el 18 de febrero de 2016
Why do they call it Linked Data when they want to say...?Oscar Corcho
The four Linked Data publishing principles established in 2006 seem to be quite clear and well understood by people inside and outside the core Linked Data and Semantic Web community. However, not only when discussing with outsiders about the goodness of Linked Data but also when reviewing papers for the COLD workshop series, I find myself, in many occasions, going back again to the principles in order to see whether some approach for Web data publication and consumption is actually Linked Data or not. In this talk we will review some of the current approaches that we have for publishing data on the Web, and we will reflect on why it is sometimes so difficult to get into an agreement on what we understand by Linked Data. Furthermore, we will take the opportunity to describe yet another approach that we have been working on recently at the Center for Open Middleware, a joint technology center between Banco Santander and Universidad Politécnica de Madrid, in order to facilitate Linked Data consumption.
Linked Statistical Data: does it actually pay off?Oscar Corcho
Invited keynote at the ISWC2015 Workshop on Semantics and Statistics (SemStats 2015). http://semstats.github.io/2015/
The release of the W3C RDF Data Cube recommendation was a significant milestone towards improving the maturity of the area of Linked Statistical Data. Many Data Cube-based datasets have been released since then. Tools for the generation and exploitation of such datasets have also appeared. While the benefits for the usage of RDF Data Cube and the generation of Linked Data in this area seem to be clear, there are still many challenges associated to the generation and exploitation of such data. In this talk we will reflect about them, based on our experience on generating and exploiting such type of data, and hopefully provoke some discussion about what the next steps should be.
Research Objects for improved sharing and reproducibilityOscar Corcho
Presentation about the usage of Research Objects to improve scientific experiment sharing and reproducibility, given at the Dagstuhl Perspective Workshop on the intersection between Computer Sciences and Psychology (July 2015)
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
Elevating Tactical DDD Patterns Through Object CalisthenicsDorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Jeffrey Haguewood
Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows.
We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases.
This video focuses on the notifications, alerts, and approval requests using Slack for Bonterra Impact Management. The solutions covered in this webinar can also be deployed for Microsoft Teams.
Interested in deploying notification automations for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
UiPath Test Automation using UiPath Test Suite series, part 4
EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already there?
1. Ontology Engineering
for and by the masses:
are we already there?
19th International Conference on Knowledge
Engineering and Knowledge Management
EKAW2014
27/11/2014
Oscar Corcho
ocorcho@fi.upm.es
@ocorcho
https://www.slideshare.com/ocorcho
2. License
• This work is licensed under the license
CC BY-NC-SA 4.0 International
• http://purl.org/NET/rdflicense/cc-by-nc-sa4.0
• You are free:
• to Share — to copy, distribute and transmit the work
• to Remix — to adapt the work
• Under the following conditions
• Attribution — You must attribute the work by inserting
• “[source Oscar Corcho]” at the footer of each reused slide
• a credits slide stating: “These slides are partially based on
“Ontology Engineering for and by the masses: are we
already there?” by O. Corcho”
• Non-commercial
• Share-Alike
3. A walk through our Brave Little World of
Ontology Engineering
The world is living at year 21 After Gruber (A.G.)
All inhabitants of this world have this motto:
“Nothing is more beautiful than a formal,
explicit specification of a shared conceptualization”
repeated to them every night while the sleep (since year 5 A.G.)
Everybody loves Gruberliness, the property of creating models that are
SHARED, FORMAL and EXPLICIT
It is written in every single building and ontology repository
4. The world is divided in 10 regions, led by 10 world controllers who have
dominated it so far (according to Google Scholar)
Oxford (Ian Horrocks, 35K cit.) Milton Keynes (Enrico Motta, 11K cit.)
Buffalo (Barry Smith, 18K cit.) Trento (Nicola Guarino, 17K cit.)
Stanford (Mark Musen, 25K cit.) Karlsruhe (Rudi Studer, unknown cit.)
Madrid (Asunción Gómez-Pérez, 13K cit.) Amsterdam (Frank van Harmelen, 25K cit.)
Toronto (Mark S. Fox, 13K cit.) Osaka (Riichiro Mizoguchi, 7K cit.)
Disclaimer: This calculation is not exact. It only considers individuals, and favours geographical
distribution
5. Several wars in these 21 years of existence, which led to the current status:
• The “Language War”. It lasted 4 years. W3C Treaty Signed in Year 11 A.G.
Description logic won over frames, first order logic and semantic networks.
• The “Tool War”. It lasted 10 years. Tools like Protégé, OilEd, SWOOP,
WebODE, OntoEdit, or NeOn Toolkit fought among each other to get
installed on the computers of our world citizens. Protégé won. No treaty
signed
7. Average age: 50+
Number of individuals: 100+
Education: Formal logic and philosophy.
Sometimes Computer Science
Contribution to the world:
Write formal upper-level ontologies
DOLCE, BFO, GFO, SUMO
Languages spoken:
They are polyglots
First order and many other logics
OWL, OBO
Secret meetings: FOIS
Daily routine:
Wake up
Write a new term in a whiteboard
Think about it carefully
Incorporate it in an upper-level ontology
Some days they don’t include new terms
Alphas
8. Betas
Average age: 40+
Number of individuals: 1,000+
Education: Computer Science, Biology,
Geography. Some courses on logic
Contribution to the world:
Domain and application ontologies
Heavyweight, with many axioms, properties
and concepts
DOLCE, BFO, GFO, SUMO
Languages spoken:
Mostly OWL
Secret meetings: EKAW, KCAP
Daily routine:
Wake up
Open ontology design methodology book
Open ontology design pattern website
Open Protégé and activate reasoning
Work on 10 new terms
9. Gammas
Average age: 30+
Number of individuals: 10,000+
Education: Mostly Computer Science
Courses on ontologies as undergrads
Contribution to the world:
Write lightweight ontologies
Call them vocabularies
Create Linked (Open) Data
Languages spoken:
Native RDF Schema, and a bit of OWL
Secret meetings: ESWC, ISWC, WWW
Daily routine:
Wake up
Open LOV or prefix.cc
Look for vocabularies for their dataset
Select, extend and upload them in LOV
Update dataset in datahub.io
10. Deltas
Average age: unknown
Number of individuals: 10,000+
Education: Library Science
Some course on Computer Science
Contribution to the world:
Write codelists and thesauri
Languages spoken:
SKOS, RDF Schema
Secret meetings: Dublin Core Conference
Daily routine:
Wake up
Write a couple of new codelists/thesauri
Submit them to metadataregistry.org
Annotate some documents with them
11. Epsilons
Average age: 20+
Number of individuals: 100,000+
Education: Web development
Contribution to the world:
Write schema.org annotations
Some contributions to schema.org classes
and structured types
Languages spoken:
HTML, RDFa, JSON-LD
Secret meetings: Webinars, Hangouts, meetups
Daily routine:
Wake up
Check positioning of their site in Google
Annotate it with more schema.org tags
Wait for Google/Yahoo! to crawl them
Go back to next step
12. Open vote
• Are you an alpha, a beta, a gamma, a delta or an
epsilon?
• Or do you think that you can belong to several social
classes?
https://es.surveymonkey.com/s/3933H7C
• There should be a recent tweet from me (@ocorcho,
#ekaw2014), with a link to the survey
13. SHARED, FORMAL and EXPLICIT
• A happy world where all
sorts of ontologies,
vocabularies and
annotations are developed,
as efficiently as possible
• Everybody is happy in their
social class
• And where Gruberliness is
everywhere
Shared
Formal
Explicit
Alphas
Betas
Gammas
Deltas
Epsilons
14. And if somebody is not happy… a few grammes of recognition
• Every social class can take drugs after (even during)
work, to get even happier.
• One gramme of drug every time that…
• Alphas: a new term included in an upper-level ontology, or a
domain ontology is cleaned with their well-founded terms
• Betas: an inconsistency is found in some data thanks to the
logical axioms of their ontologies, or when their ontology is
included in BioPortal.
• Gammas: their ontology is listed in the LOV repository; and
ten grammes when used in some Linked Data dataset
• Deltas: the same for metadataregistry.org
• Epsilons: one gramme every 10.000 new Web pages
annotated according to schema.org
15. SHARED, FORMAL and EXPLICIT
But is our happy ontology engineering world big enough?
How many other people live outside of it?
There are savage reservations, where ugly non-ontologists live…
• They use relational databases
• Some of them are not even in normal form
• And “oh-my-God” CSVs
• And they communicate in natural language, HTML and
using UML class diagrams
However, sometimes they are visited by our inhabitants…
16.
17. The savage reservation in Madrid
• AENOR PNE 178301
• Norm on Open Data for
Smart Cities
• Organised by
• Spanish Ministry of
Industry
• AENOR
• AENOR CTN 178 group
• Subcommitee 3 on
Government and Mobility
• Workgroup on
Government
• Subgroup on Open Data
18. Some of the individuals in the savage reservation
• Coordinator
• Esther Minguela (Localidata)
• 35 members who belong to…
• Medium&Large Cities (10) – mostly City Information
Managers
• Private companies working for the public sector (6)
• Regions (3) – mostly Region Information Managers
• Ministries or alike (3)
• Geographic sector (3)
• I visited them for six months (January-June 2014),
trying to show the advantages of living in our Brave
Little World of Ontology Engineering
• Did I succeed? … No vote now (don’t spoil my presentation)
20. Main objectives of the work being done
• Make open data projects from cities more systematic
• Provide a reference guidelines for local administrations to
define, document and develop open data projects
• Evaluate the maturity of open data projects (through
indicators)
• Kickstarting them
• Continuous improvement
• Sustainability
• Quality and efficiency of the project
• Improve interoperability
• Decide on the 10 top-priority datasets to be opened
• Work on common data structures and vocabularies for these
datasets
21. 37 Metrics, grouped in domains (and dimensions)
Strategic Domain
1. Strategy
2. Leadership
3. Service-level agreement
4. Sustainability
Legal Domain
5. External and internal legal norms
6. Usage and licensing conditions
Organisational Domain
7. Responsible unit
8. Skilled team
9. Inventory of data
10. Priority
11. Measurement of the process
12. Measurements of usage and impact
Technical Domain
13. Catalogue
14. Available in the public sector catalogue
15. Documented datasets
16. Categories and search facilities
17. Availability
18. Persistent and friendly references
19. Accessibility
20. Access for free
21. Access systems in place
22. Primary data
23. Completeness
24. Documentation of data
25. Correctness
26. Geo-referencing
27. Linked Data
28. Update processes
29. Update frequency
30. New dataset inclusion
31. Data quantity
32. Data format
33. Vocabularies
Economic and social domain
34. Transparency, participation and
collaboration
35. Complaint/Conflict management
36. Fostering reuse
37. Developed reuse initiatives
22. • Each metric gets a number
• And each one has a weight,
agreed by group members
• A final indicator is then calculated
Total Value 0-200 201-400 401-600 601-800 801-1000
Open data indicator 1 2 3 4 5
Weight
Strategy
Strategy 25 %
Leadership 50 %
Service-level agreements 10 %
Sustainability 15 %
Level achieved Score
Level 0 (nothing) 0
Level 1 (you have
started doing it)
1
Level 2 (you are
good)
2
Level 3 (excellent) 3
An indicator on the maturity of open data projects
23. 10 Highest-Priority Datasets for 2015
• Listing based on the
current inventories from
all cities (and regions)
• Harmonisation
• Votes according to PSI-
reuse requests
Datasets
Cultural Agenda
Traffic
Population
Streets
Public Transport
Touristic Places and POIs
Budget
Shop Census
Air Quality
Contracts
Parkings
24. And now the meat…
• All that previous work may
have been done even by
our epsilons…
• Now it’s time to start
working on common data
structures and
vocabularies…
• Did I tell you that these
people were often visited
by some of the people
from our world?
• Before continuing, let’s
see some of the
conversations that we
managed to get acess
to…
27. Cool, we have a methodology…
Knowledge Resources
Non Ontological Resource
Reuse
Non Ontological Resource
Reengineering
2
2
2
Non Ontological Resources
Thesauri
DictionariesGlossaries Lexicons
Taxonomies
Classification
Schemas
O. Localization
9
Ontological Resource
Reengineering
4
4
4
O. Aligning
O. Merging
Alignments5
5
5
6
6
6
6
3
Ontological Resource
Reuse
3
Ontological Resources
O. Repositories and Registries
RDF(S)
OWL
Ontology Design
Pattern Reuse
7
O. Design Patterns
Ontology Restructuring
(Pruning, Extension,
Specialization, Modularization)
8
O. Specification O. Conceptualization O. ImplementationO. Formalization
1
RDF(S)
OWL
Scheduling
28. However…
• Our methodologies do not explain so much to domain
experts on what they have to do at each step
• So we just gave easy indications (as most of you do
normally)
• Start with competency questions, with a few answers
• This must come from data reusers’ requests
• And we call them “user stories”
• Extract terms, and classify them in nouns, adjectives, verbs
• Organise them a little bit
• Find common data structures out there (vocabularies,
ontologies) that use those terms or synonyms
• Decide which ones to use
• …
29. Savages working on their vocabularies…
• We used an agile-like method with a “competency
question backlog” (first some questions, and go down
the whole path, then some others, etc.)
• And used “common” tools
Google Docs Excel Card-sorting
• And now, let’s build the ontology
• Deadlock!!!
30. Deadlock 1. I have been told to reuse other ontologies
• We recommend reusing other ontological and non-
ontological resources (well, except for epsilons)
• That’s one of the bases of ontological engineering
• However, savages tend to do that at an early stage of
ontology development
• It causes confusion to them
• Should I use FOAF, or the Organization Ontology, or
vCard, or schema.org?
• And prevents people from being creative
• It causes endless discussions about terms (and lots of
problems with translations)
• Rec1: tell them to forget about reuse. Let them start
providing their own (wrong?) definitions, and agree
on those
31. Deadlock 2. I want my ontology to do inferences…
• A beta told me..
• OWL is funny to teach at University (especially for betas)
• It’s nice to see reasoning, consistency checking, OWA, etc.
• It is useful in many domains
• But developing such ontologies is not a task for our savages
• Rec2: Just work with text patterns, and guide them to write
good term definitions
• A district contains only neighbourhoods and census sections
• A shop can have at most three economic activities associated
to it
Note: Rabbit may be useful here (although I did not have time to
practice with it with this group)
32. Deadlock 3. I want my ontology to be ligthweight…
• A gamma told me…
• My ontology will be used for Linked Data publishing
(so that I am 5 stars!!)
• I have been said not to put domains or ranges
• I have been said to create only light taxonomies
• I have been said to use only RDF Schema
• Rec3: again, text patterns are a good option here
• Don’t make your experts worry about languages or formal
aspects
33. Deadlock 4. Which tool should I use?
• We thought that the war had ended?
• Alphas and betas told me to use Protégé
• Some of them said that I could use a Web-based version
• A gamma told me to use Neologism
• And an epsilon called me and said that it was enough if I used
tables with attributes, as in schema.org
• And then I saw an old tool, not available
anymore, that used schema.org-like
table-like descriptions and generated
ontologies in different languages
• WebODE
• Rec4: Use simple tools (e.g., Excel) that allow discussing
easily, without weird constraints
34. Deadlock 5. But these ontologies to reuse are in English
• These developers and data reusers prefer Spanish
terms
• We all know that identifiers are just symbols
• e.g., labels and comments in different languages should be
enough
• However…
• Should we mix term identifiers in different languages?
• Do we translate all terms to our language?
• Rec5: no idea yet about what to do…
35. The results so far…
Datasets Vocabulary
General vocabularies Postal Address: http://vocab.linkeddata.es/datosabiertos/def/urbanismo-
infraestructuras/direccionPostal
Administrative: http://vocab.linkeddata.es/datosabiertos/def/sector-publico/territorio
Streets http://vocab.linkeddata.es/datosabiertos/def/urbanismo-infraestructuras/callejero
SKOS: http://vocab.linkeddata.es/datosabiertos/kos/urbanismo-infraestructuras/tipo-via
Tourism http://vocab.linkeddata.es/datosabiertos/def/turismo/lugar
Cultural Agenda http://vocab.linkeddata.es/datosabiertos/def/cultura-ocio/agenda
Shop Census http://vocab.linkeddata.es/datosabiertos/def/comercio/tejidoComercial
SKOS (NACE): http://vocab.linkeddata.es/datosabiertos/kos/comercio/cnae
Population http://www.w3.org/TR/vocab-data-cube/
SKOS:
o Age: http://eurostat.linked-statistics.org/dic/age.rdf
o Gender: http://eurostat.linked-statistics.org/dic/sex.rdf
o Geo: http://eurostat.linked-statistics.org/dic/geo.rdf
Budget http://vocab.linkeddata.es/datosabiertos/def/hacienda/presupuesto
Contracts http://contsem.unizar.es/def/sector-publico/pproc
Air Quality http://www.w3.org/2005/Incubator/ssn/ssnx/ssn
Traffic http://vocab.linkeddata.es/datosabiertos/def/transporte/trafico
Public Transport http://vocab.linkeddata.es/datosabiertos/def/transporte/transportePublico
Parkings http://vocab.linkeddata.es/datosabiertos/def/urbanismo-infraestructuras/aparcamiento
36.
37. A walk through the Brave Little World of
Ontology Engineering
Why are we still discussing about what ontologies should be used for?
(see recent thread in Google+’s LOV community, started by Bernard Vatant,
on the “intended and real usage of vocabularies in LOV”)
https://plus.google.com/u/0/+BernardVatant/posts/SDYTN3FGkEr
How will our world be at year 25 After Gruber (A.G.)? And at year 50 A.G.?
Will there be soon a revolution led by epsilons to rule the world?
Are we the ones that live in a savage reservation in a larger world?
Or will we conquer the rest of the world?
38. Which social class do EKAW2014 participants belong to?
https://es.surveymonkey.com/results/SM-F75GGL2V/
39. Acknowledgements
• First of all, my acknowledgements go to
Aldous Huxley for writing the always-
inspiring “Brave New World” book.
• I would also like to give thanks to some of those who have
helped with comments on this presentation
• Asunción Gómez-Pérez and the whole Ontology Engineering
Group team
• José Manuel Gómez-Pérez
• Those who provided some material (acknowledgements in the
corresponding slides)
• And to all those with whom I have enjoyed building ontologies
for so many years (far too many to enumerate here)
• Specially those from the savage reservation in Madrid
40. Disclaimers
• The contents of this slideset
represent my own view on this topic
• Not necessarily the views of all
members of the
Ontology Engineering Group (UPM)
• They are based on some of my own
experiences in ontology engineering
• These are not necessarily generalisable
• Specially not valid, probably, in ontology-savvy domains
• And more important…
• I was trying to be provocative here, to generate discussion
41. Ontology Engineering
for and by the masses:
are we already there?
19th International Conference on Knowledge
Engineering and Knowledge Management
EKAW2014
27/11/2014
Oscar Corcho
ocorcho@fi.upm.es
@ocorcho
https://www.slideshare.com/ocorcho
Editor's Notes
Hello, I am an alpha, nice to meet you. Do you know what we do in our brave new ontology engineering world?
No, I am a savage. I only use relational databases, Excel, UML class diagrams
Oops, then you will not understand what “unity” means… See you…
Hello, I am a beta, nice to meet you. Have you ever heard the word ontology?
Yes, somebody told me that it is useful to publish my data, but I do not know how.
Well, yes, but that’s only for gammas. What it is really useful is to do some reasoning with it.
Ohh, cool, will I be able to detect duplicates in my data?
Yes, sure, we need to create a few axioms, activate Hermit, then consider the Open World Assumption, go back and reason again…
Ufff, I do not understand a word…
Obviously, you are a savage…
Hello, I am a gamma, nice to meet you. Do you use ontologies for reasoning?
Well, somebody came to tell me about it, but I could not understand a word of what they were telling me.
Ahh, yes, that’s because you really need vocabularies for publishing, nothing else.
And how do I select those vocabularies?
Just go to LOV, look for terms that are used, and use them. And the more you import from others the better
Hello, I am a delta, nice to meet you. Have you annotated your datasets and Web pages with Dublin Core?
Yes, I do all the time. It’s done by my content management system automatically.
And have you created codelists or SKOS thesauri already?
No, I was starting to do ontologies.
That’s going to hurt…
Hello, I am an epsilon, nice to meet you.
Hey, I was told that epsilons are not allowed to leave their world and visit us.
Of course, that’s the reason why I contact you through Google Hangout. Have you done your schema.org markup already?
Yes, I want to be ranked the first in Google searches. But I do not know the exact meaning of some properties
Ahh, that doesn’t really matter. We just use tables to describe them.