Slides used in my thesis defense/viva. Relevant links:
http://oa.upm.es/39348/
https://bitbucket.org/jmora/thesis
https://bitbucket.org/jmora/kyrie
https://bitbucket.org/jmora/kyrie-aux
The document discusses Open-DO, an open source initiative for developing safety-critical software. It provides an overview of Open-DO concepts like FLOSS, agile development practices, and high-integrity certification. Updates on Open-DO include new community projects, conferences, and tools to support qualifications. Formal methods like Couverture and Hi-Lite are presented as ways to verify properties and generate verification conditions for proof.
Neha Jain seeks a position where she can contribute to an organization's success through her skills. She has a Master's in VLSI Systems and Technology with a CGPA of 8.02. Her technical skills include Verilog HDL, SystemVerilog, C programming, and tools like Cadence and Xilinx. Her M.Tech project involved power analysis of a ring-based network-on-chip architecture. She has work experience as a design and verification engineer and has taught digital electronics courses.
LIBPF: A LIBRARY FOR PROCESS FLOWSHEETING IN C++libpf
1) LIBPF is a C++ library for process flowsheeting that allows modeling of continuous processes through a directed graph approach.
2) It aims to address challenges in modeling continuous processes for industry, including customization, integration, reliability, and maintainability.
3) LIBPF provides capabilities for modeling common unit operations and properties through a graph-based approach implemented in portable, open-source C++.
This document contains the resume of Ziad Ibrahim Abdelati. It summarizes his contact information, education, work experience, technical skills, projects and graduation project. He is currently a 4th year student studying Electronics and Electrical communications at Cairo University with a GPA of 91.83%. His work experience includes internships at Mentor Graphics and Mobinil in electronics design and telecommunications. His technical skills include VHDL, Verilog, MATLAB and microcontroller programming. His projects include digital circuit and processor designs. His graduation project focuses on Software Defined Radio implementation using Partial Dynamic Reconfiguration on FPGAs.
This document summarizes an academic paper that proposes optimizing elliptic curve cryptography (ECC) through application-specific instruction set processor (ASIP) design. It applies pipelining techniques to the data path and uses complex instructions to reduce latency and the number of instructions needed for point multiplication. The paper describes applying different levels of pipelining to explore performance and find an optimal pipeline depth. It also develops a new combined algorithm to perform point doubling and addition using the specialized instructions. An FPGA implementation over GF(2163) is presented and shown to outperform previous work.
This document contains Sai Dheeraj Polagani's resume. It includes his contact information, objective, education history, industrial experience at Intel as a physical design engineer and layout engineer, previous experience at Wipro Technologies as an ASIC physical design and verification engineer, technical skills, projects completed and academic projects during his Master's program. His experience includes full-chip static timing analysis, noise analysis, cross-talk analysis, place and route, timing closure, and physical verification. He has a Master's in Electrical Engineering from San Jose State University and a Bachelors in Information and Communication Technology from DA-IICT, India.
The document discusses Open-DO, an open source initiative for developing safety-critical software. It provides an overview of Open-DO concepts like FLOSS, agile development practices, and high-integrity certification. Updates on Open-DO include new community projects, conferences, and tools to support qualifications. Formal methods like Couverture and Hi-Lite are presented as ways to verify properties and generate verification conditions for proof.
Neha Jain seeks a position where she can contribute to an organization's success through her skills. She has a Master's in VLSI Systems and Technology with a CGPA of 8.02. Her technical skills include Verilog HDL, SystemVerilog, C programming, and tools like Cadence and Xilinx. Her M.Tech project involved power analysis of a ring-based network-on-chip architecture. She has work experience as a design and verification engineer and has taught digital electronics courses.
LIBPF: A LIBRARY FOR PROCESS FLOWSHEETING IN C++libpf
1) LIBPF is a C++ library for process flowsheeting that allows modeling of continuous processes through a directed graph approach.
2) It aims to address challenges in modeling continuous processes for industry, including customization, integration, reliability, and maintainability.
3) LIBPF provides capabilities for modeling common unit operations and properties through a graph-based approach implemented in portable, open-source C++.
This document contains the resume of Ziad Ibrahim Abdelati. It summarizes his contact information, education, work experience, technical skills, projects and graduation project. He is currently a 4th year student studying Electronics and Electrical communications at Cairo University with a GPA of 91.83%. His work experience includes internships at Mentor Graphics and Mobinil in electronics design and telecommunications. His technical skills include VHDL, Verilog, MATLAB and microcontroller programming. His projects include digital circuit and processor designs. His graduation project focuses on Software Defined Radio implementation using Partial Dynamic Reconfiguration on FPGAs.
This document summarizes an academic paper that proposes optimizing elliptic curve cryptography (ECC) through application-specific instruction set processor (ASIP) design. It applies pipelining techniques to the data path and uses complex instructions to reduce latency and the number of instructions needed for point multiplication. The paper describes applying different levels of pipelining to explore performance and find an optimal pipeline depth. It also develops a new combined algorithm to perform point doubling and addition using the specialized instructions. An FPGA implementation over GF(2163) is presented and shown to outperform previous work.
This document contains Sai Dheeraj Polagani's resume. It includes his contact information, objective, education history, industrial experience at Intel as a physical design engineer and layout engineer, previous experience at Wipro Technologies as an ASIC physical design and verification engineer, technical skills, projects completed and academic projects during his Master's program. His experience includes full-chip static timing analysis, noise analysis, cross-talk analysis, place and route, timing closure, and physical verification. He has a Master's in Electrical Engineering from San Jose State University and a Bachelors in Information and Communication Technology from DA-IICT, India.
Miniaturization, cost, functionality, complexity and power dissipation are important and necessary design traits which need attention in circuit designing. There is a trade off between miniaturization and power dissipation. Smart technology is always searching for new paradigms to continue improve power dissipation. Reversible logic is one of smart computing deployed to avoid power dissipation. Researchers have proposed many reversible logic-based arithmetic and logic units (ALU). However, the research in the area of fault tolerant ALU is still under progress. The aim of this paper is to bridge the knowledge gap for a new researcher in area of fault tolerance using parity preserving logic gates rather than searching huge data through various sources. This paper also presents a high functionality based novel fault tolerant arithmetic and logic unit architecture. A comparison on optimization aspects is presented in tabular form and results shows that proposed ALU architecture is optimum balance in terms of all aspects of reversible logic synthesis. The proposed ALU architecture is coded in Verilog HDL and simulated using Xilinx ISE design suit 14.2 tool. The quantum cost of all gates used in proposed architecture is verified using RCViewer + tool.
R originated in the 1970s at Bell Labs and has since evolved significantly. It is an open-source programming language used widely for statistical analysis and graphics. While powerful, R has some drawbacks like poor performance for large datasets and a steep learning curve. However, its key advantages including being free, having a large community of users, and extensive libraries have made it a popular tool, especially for academic research.
This document summarizes a lecture on rate-distortion optimization in video coding. It discusses various rate-distortion optimization problems including operational rate-distortion theory, joint source-channel coding optimization, storage constraint allocation, delay-constrained allocation, buffer-constrained allocation, and the multi-user problem. It also covers convex optimization techniques like the Lagrangian method that can help solve some of these optimization problems.
Containerisation and Dynamic Frameworks in ICCMA’19Carlo Taticchi
The International Competition on Computational Models of Argumentation (ICCMA) is a successful event dedicated to advancing the state of the art of solvers in Abstract Argumentation. We describe two proposals that will further improve the third and next edition of the competition, i.e. ICCMA 2019. The first novelty concerns the packaging of each solver-application participating in the com- petition in a virtual “light” container (using Docker): this allows for easy deployment and to (re)running all of the submissions on different architectures (Linux, Windows, macOS, and also in the cloud). The second proposal consists of a new track focused on solvers processing dynamic frameworks, i.e., solvers described in terms of changes w.r.t. previous ones: a solver can reuse the solution obtained previously to be faster on the same framework modulo a new argument/attack.
XtremeDistil: Multi-stage Distillation for Massive Multilingual ModelsSubhabrata Mukherjee
Massive distillation of pre-trained language models like multilingual BERT with 35x compression and 51x speedup (98% smaller and faster) retaining 95% F1-score over 41 languages
Chemical Data Mining: Open Source & ReproducibleRajarshi Guha
This document discusses using the R programming environment for cheminformatics and chemical data mining. It describes how R is enhanced by open source software like the Chemistry Development Kit (CDK) which provides capabilities for working with chemical structure data within R. The CDK allows reading/writing different file formats, generating fingerprints, visualizing molecules, and accessing public databases directly from R to streamline workflows like quantitative structure-activity relationship modeling. This enables reproducible science by ensuring analyses can be updated based on the latest data.
This document contains Leonard Bong's resume. It summarizes his experience as a senior software engineer with over 25 years of experience programming in C, C++, Java, JavaScript, Python, Perl, Tcl/Tk, and SQL. He has extensive experience designing and implementing software tools for semiconductor design and manufacturing processes at companies like National Semiconductor, Texas Instruments, and Oracle. His resume lists his education as a BS and MS in Computer Science from Central Michigan University.
Research Inventy : International Journal of Engineering and Science is publis...researchinventy
This document summarizes a research paper that presents a flexible LDPC decoder design implemented on an FPGA. The decoder uses a partially parallel approach that allows it to support different code rates and block sizes. It consists of processing blocks, a message permutation block, and control logic. The processing blocks perform variable and check node operations in parallel. The message permutation block connects the blocks according to the parity check matrix. This provides flexibility to implement different codes. The control logic stores the code configuration and manages the decoding process. Simulation results showed the decoder runs at 89.29MHz on a Spartan 6 FPGA, using 765 slices and providing a better throughput to area ratio than previous designs.
Sr. Kakarla is a physical design engineer with over 6 years of experience in roles including IC physical design, CAD STA, layout design automation, and corporate design solutions. He has worked at Qualcomm and Intel on projects ranging from 32nm to 10nm technology nodes. Sr. Kakarla seeks a challenging role implementing high performance and low power SOC designs.
This document provides an overview of using R for cheminformatics. It discusses prerequisites for the workshop, including installing relevant packages. The outline covers an overview of the R language itself, parallel and distributed computing in R, and accessing databases from R. The document also provides some basic information on getting help, running code, and saving work in R.
Design and implementation of Closed Loop Control of Three Phase Interleaved P...IJMTST Journal
A single-phase, three-level, single-stage power-factor corrected AC/DC converter operated under closed
loop manner is presented. That operates with a single controller to regulate the output voltage and the input
inductor act as a boost inductor to have a single stage power factor correction with good output response. The
paper deals with a new single stage three level ac-dc converter which performs both power factor correction
and voltage regulation in a single stage. The proposed converter has two separate controllers, one for power
factor correction and the other for regulating the output voltage. A comprehensive review of the existing single
stage topologies has been carried out. Then the operating principle, control scheme and the design of the new
converter are presented. The proposed converter is having an input power factor close to unity and better
voltage regulation compared to the conventional ac-dc converter topologies. Proposed topology is evaluated
through Matlab/Simulink platform and simulation results are conferred.
This document discusses different API design options like REST, gRPC, and GraphQL. It provides an overview of Apache Cassandra and describes a methodology for designing APIs that involves modeling the data, workflows, and technologies. A decision tree is presented to analyze criteria for different APIs like performance, data integrity, and developer experience to determine which option is best. REST is best for CRUD APIs, gRPC for high performance services, and GraphQL for complex queries.
This document provides an introduction to the Legion programming paradigm for building task-oriented applications. It describes an implementation of an n-body simulation using Legion to demonstrate its programming model. The n-body problem calculates gravitational interactions between particles. Legion defines tasks that operate on logical regions of data in parallel. It handles mapping regions to physical resources. The document outlines registering tasks, launching them, defining domains, and Legion's object-oriented approach. It also describes the computing environment used, a heterogeneous cluster, for the n-body implementation.
Mohamed Mohsen Ibrahim is seeking new opportunities. He has a Bachelor's degree in Electrical Engineering from Ain Shams University and over 5 years of experience in IT support and as an Application Analyst. His skills include SQL, C/C++, networking, Unix administration, and embedded systems. He is self-motivated, a strong team player, and familiar with multinational work environments.
Este documento discute la importancia de la colaboración entre programadores y estrategas digitales. Explica que más de la mitad de las estrategias de distribución requieren tecnología web y que es necesario medir los resultados para determinar qué estrategias funcionan mejor con los usuarios. También enfatiza que los programadores deben entregar datos para que los estrategas puedan analizarlos, y que trabajando juntos pueden iterar sobre lo que funcione mejor para sus usuarios.
El documento describe la experiencia de una alumna con la herramienta Office 365 para tareas académicas. Office 365 permite trabajar en conjunto en tiempo real y acceder a correos electrónicos de forma sincronizada desde cualquier lugar. También facilita compartir archivos y comunicarse con compañeros. Las ventajas de las herramientas colaborativas incluyen la generación de equipos de trabajo y mejora continua de contenidos, mientras que las desventajas son la dependencia a la infraestructura y el riesgo de percepción de informalidad.
Wiring Hacker Synapses - Cola: Real-Time Shared EditingMustafa Isik
The document discusses collaborative coding techniques using Eclipse Communication Framework (ECF). It describes how ECF enables real-time shared editing between participants using operational transformations to resolve conflicts from out-of-order changes. The key challenges are ensuring consistency when changes occur over a network with latency and resolving conflicts when participants diverge with multiple local changes before remote operations arrive. Solutions involve ordering message passing with network protocols and transforming operations to preserve intentions when resolving conflicts.
Jean Christophe Meyer: Histoire Parallèle/Die Woche vor 50 Jahren – Lieu de m...EUscreen
Content in Motion | Curating Europe’s Audiovisual Heritage Conference, December 3-4 2015; www.euscreenxl2015.eu
This paper is aimed first at analyzing the initial public impact of Histoire Parallèle/Die Woche vor 50 Jahren, which aired weekly first on la Sept and then on French-German TV channel Arte from 1989 to 2001.
The 55-minute show exploited newsreel material, systematically presenting it in the original full length after exactly 50 years after its initial release in movie theatres. It covered a period stretching from the beginning of World War II until the end of the Marshall Plan. It simultaneously illustrated contradictive perspectives of several nations at war with each other. This part of the show lasted for 40 minutes. Then, for the final quarter of an hour, the show’s mainstay host, historian Marc Ferro discussed the material presented with a guest, usually a scholar. Despite the fact that the show could never be sold to foreign channels or rebroadcast, it still arouses great interest. Therefore our paper intends to explore how and to what extent content curation may contribute to it becoming a transnational or European Lieu de mémoire.
The document describes the Eurotex ID project which brought together museums, fashion schools, and textile companies across Europe. Young designers studied historical textile collections and created new fashion and textile designs inspired by the past. The project culminated in a conference highlighting collaborations between heritage institutions and designers, and an exhibition featuring the student designs alongside their historical inspirations. Contact and event details are provided for those interested in learning more.
Miniaturization, cost, functionality, complexity and power dissipation are important and necessary design traits which need attention in circuit designing. There is a trade off between miniaturization and power dissipation. Smart technology is always searching for new paradigms to continue improve power dissipation. Reversible logic is one of smart computing deployed to avoid power dissipation. Researchers have proposed many reversible logic-based arithmetic and logic units (ALU). However, the research in the area of fault tolerant ALU is still under progress. The aim of this paper is to bridge the knowledge gap for a new researcher in area of fault tolerance using parity preserving logic gates rather than searching huge data through various sources. This paper also presents a high functionality based novel fault tolerant arithmetic and logic unit architecture. A comparison on optimization aspects is presented in tabular form and results shows that proposed ALU architecture is optimum balance in terms of all aspects of reversible logic synthesis. The proposed ALU architecture is coded in Verilog HDL and simulated using Xilinx ISE design suit 14.2 tool. The quantum cost of all gates used in proposed architecture is verified using RCViewer + tool.
R originated in the 1970s at Bell Labs and has since evolved significantly. It is an open-source programming language used widely for statistical analysis and graphics. While powerful, R has some drawbacks like poor performance for large datasets and a steep learning curve. However, its key advantages including being free, having a large community of users, and extensive libraries have made it a popular tool, especially for academic research.
This document summarizes a lecture on rate-distortion optimization in video coding. It discusses various rate-distortion optimization problems including operational rate-distortion theory, joint source-channel coding optimization, storage constraint allocation, delay-constrained allocation, buffer-constrained allocation, and the multi-user problem. It also covers convex optimization techniques like the Lagrangian method that can help solve some of these optimization problems.
Containerisation and Dynamic Frameworks in ICCMA’19Carlo Taticchi
The International Competition on Computational Models of Argumentation (ICCMA) is a successful event dedicated to advancing the state of the art of solvers in Abstract Argumentation. We describe two proposals that will further improve the third and next edition of the competition, i.e. ICCMA 2019. The first novelty concerns the packaging of each solver-application participating in the com- petition in a virtual “light” container (using Docker): this allows for easy deployment and to (re)running all of the submissions on different architectures (Linux, Windows, macOS, and also in the cloud). The second proposal consists of a new track focused on solvers processing dynamic frameworks, i.e., solvers described in terms of changes w.r.t. previous ones: a solver can reuse the solution obtained previously to be faster on the same framework modulo a new argument/attack.
XtremeDistil: Multi-stage Distillation for Massive Multilingual ModelsSubhabrata Mukherjee
Massive distillation of pre-trained language models like multilingual BERT with 35x compression and 51x speedup (98% smaller and faster) retaining 95% F1-score over 41 languages
Chemical Data Mining: Open Source & ReproducibleRajarshi Guha
This document discusses using the R programming environment for cheminformatics and chemical data mining. It describes how R is enhanced by open source software like the Chemistry Development Kit (CDK) which provides capabilities for working with chemical structure data within R. The CDK allows reading/writing different file formats, generating fingerprints, visualizing molecules, and accessing public databases directly from R to streamline workflows like quantitative structure-activity relationship modeling. This enables reproducible science by ensuring analyses can be updated based on the latest data.
This document contains Leonard Bong's resume. It summarizes his experience as a senior software engineer with over 25 years of experience programming in C, C++, Java, JavaScript, Python, Perl, Tcl/Tk, and SQL. He has extensive experience designing and implementing software tools for semiconductor design and manufacturing processes at companies like National Semiconductor, Texas Instruments, and Oracle. His resume lists his education as a BS and MS in Computer Science from Central Michigan University.
Research Inventy : International Journal of Engineering and Science is publis...researchinventy
This document summarizes a research paper that presents a flexible LDPC decoder design implemented on an FPGA. The decoder uses a partially parallel approach that allows it to support different code rates and block sizes. It consists of processing blocks, a message permutation block, and control logic. The processing blocks perform variable and check node operations in parallel. The message permutation block connects the blocks according to the parity check matrix. This provides flexibility to implement different codes. The control logic stores the code configuration and manages the decoding process. Simulation results showed the decoder runs at 89.29MHz on a Spartan 6 FPGA, using 765 slices and providing a better throughput to area ratio than previous designs.
Sr. Kakarla is a physical design engineer with over 6 years of experience in roles including IC physical design, CAD STA, layout design automation, and corporate design solutions. He has worked at Qualcomm and Intel on projects ranging from 32nm to 10nm technology nodes. Sr. Kakarla seeks a challenging role implementing high performance and low power SOC designs.
This document provides an overview of using R for cheminformatics. It discusses prerequisites for the workshop, including installing relevant packages. The outline covers an overview of the R language itself, parallel and distributed computing in R, and accessing databases from R. The document also provides some basic information on getting help, running code, and saving work in R.
Design and implementation of Closed Loop Control of Three Phase Interleaved P...IJMTST Journal
A single-phase, three-level, single-stage power-factor corrected AC/DC converter operated under closed
loop manner is presented. That operates with a single controller to regulate the output voltage and the input
inductor act as a boost inductor to have a single stage power factor correction with good output response. The
paper deals with a new single stage three level ac-dc converter which performs both power factor correction
and voltage regulation in a single stage. The proposed converter has two separate controllers, one for power
factor correction and the other for regulating the output voltage. A comprehensive review of the existing single
stage topologies has been carried out. Then the operating principle, control scheme and the design of the new
converter are presented. The proposed converter is having an input power factor close to unity and better
voltage regulation compared to the conventional ac-dc converter topologies. Proposed topology is evaluated
through Matlab/Simulink platform and simulation results are conferred.
This document discusses different API design options like REST, gRPC, and GraphQL. It provides an overview of Apache Cassandra and describes a methodology for designing APIs that involves modeling the data, workflows, and technologies. A decision tree is presented to analyze criteria for different APIs like performance, data integrity, and developer experience to determine which option is best. REST is best for CRUD APIs, gRPC for high performance services, and GraphQL for complex queries.
This document provides an introduction to the Legion programming paradigm for building task-oriented applications. It describes an implementation of an n-body simulation using Legion to demonstrate its programming model. The n-body problem calculates gravitational interactions between particles. Legion defines tasks that operate on logical regions of data in parallel. It handles mapping regions to physical resources. The document outlines registering tasks, launching them, defining domains, and Legion's object-oriented approach. It also describes the computing environment used, a heterogeneous cluster, for the n-body implementation.
Mohamed Mohsen Ibrahim is seeking new opportunities. He has a Bachelor's degree in Electrical Engineering from Ain Shams University and over 5 years of experience in IT support and as an Application Analyst. His skills include SQL, C/C++, networking, Unix administration, and embedded systems. He is self-motivated, a strong team player, and familiar with multinational work environments.
Este documento discute la importancia de la colaboración entre programadores y estrategas digitales. Explica que más de la mitad de las estrategias de distribución requieren tecnología web y que es necesario medir los resultados para determinar qué estrategias funcionan mejor con los usuarios. También enfatiza que los programadores deben entregar datos para que los estrategas puedan analizarlos, y que trabajando juntos pueden iterar sobre lo que funcione mejor para sus usuarios.
El documento describe la experiencia de una alumna con la herramienta Office 365 para tareas académicas. Office 365 permite trabajar en conjunto en tiempo real y acceder a correos electrónicos de forma sincronizada desde cualquier lugar. También facilita compartir archivos y comunicarse con compañeros. Las ventajas de las herramientas colaborativas incluyen la generación de equipos de trabajo y mejora continua de contenidos, mientras que las desventajas son la dependencia a la infraestructura y el riesgo de percepción de informalidad.
Wiring Hacker Synapses - Cola: Real-Time Shared EditingMustafa Isik
The document discusses collaborative coding techniques using Eclipse Communication Framework (ECF). It describes how ECF enables real-time shared editing between participants using operational transformations to resolve conflicts from out-of-order changes. The key challenges are ensuring consistency when changes occur over a network with latency and resolving conflicts when participants diverge with multiple local changes before remote operations arrive. Solutions involve ordering message passing with network protocols and transforming operations to preserve intentions when resolving conflicts.
Jean Christophe Meyer: Histoire Parallèle/Die Woche vor 50 Jahren – Lieu de m...EUscreen
Content in Motion | Curating Europe’s Audiovisual Heritage Conference, December 3-4 2015; www.euscreenxl2015.eu
This paper is aimed first at analyzing the initial public impact of Histoire Parallèle/Die Woche vor 50 Jahren, which aired weekly first on la Sept and then on French-German TV channel Arte from 1989 to 2001.
The 55-minute show exploited newsreel material, systematically presenting it in the original full length after exactly 50 years after its initial release in movie theatres. It covered a period stretching from the beginning of World War II until the end of the Marshall Plan. It simultaneously illustrated contradictive perspectives of several nations at war with each other. This part of the show lasted for 40 minutes. Then, for the final quarter of an hour, the show’s mainstay host, historian Marc Ferro discussed the material presented with a guest, usually a scholar. Despite the fact that the show could never be sold to foreign channels or rebroadcast, it still arouses great interest. Therefore our paper intends to explore how and to what extent content curation may contribute to it becoming a transnational or European Lieu de mémoire.
The document describes the Eurotex ID project which brought together museums, fashion schools, and textile companies across Europe. Young designers studied historical textile collections and created new fashion and textile designs inspired by the past. The project culminated in a conference highlighting collaborations between heritage institutions and designers, and an exhibition featuring the student designs alongside their historical inspirations. Contact and event details are provided for those interested in learning more.
Primary School in Bin Boćkioćki is located at 4 Dubieńska Street in the village of Boćki, Poland. It is named after Pope John Paul II and has 150 pupils and 20 teachers. The school day runs from 8am to 12:30pm, with three lessons before lunch and two or three after. Afternoon activities include sports teams, an English club, drama club, art and dance club, and scouting troop.
The timple is a 5-string traditional Canarian instrument typically about 38-40 centimeters long. It is most commonly tuned to D, A, E, C, and G. Simon Morales Tavío of Lanzarote is known as the father of modern timples from the mid-20th century onward. The timple is usually handcrafted out of wood and features a characteristic hump on its back, earning it the affectionate nickname of "resounding little camel" in some island areas. Some of the most renowned timplists include Totoyo Millares, Domingo Rodríguez Oramas, Benito Cabrera, Germán López,
BBC Developing Communities of Innovation - Thinking Digital Manchester 2015BBC
"Tales from the Frontline: Public Services and the creative Industries"
An overview of last 5 years establishing BBC R&D based at MediaCity:UK and how we have built communities of innovation.
The talk shares examples of BBC's digital collaborations with Academia, Industry, SMe's and the Audience including demos and showreels.
The document discusses active and passive voice and provides examples of sentences rewritten from active to passive voice. It includes a student's name, class, and homework assignment which asks them to rewrite 7 sentences changing them from active to passive voice. The original sentences describe actions taken by Remy, Colette, and Linguini.
EY India Attractiveness Survey 2015 – Reasons to Invest in India & Key Factor...EY
With leading 32% of the investors ranking India as the most attractive market this year, India has emerged as No. 1 FDI destination in the world during the first half of 2015. Download the India Attractive Survey to know more.
1-Oportunidades de Creación de Nuevos NegociosJontxu Pardo
Este documento presenta la biografía de Jontxu Pardo, un emprendedor fundador de varias empresas. También incluye información de contacto para Jontxu y consejos sobre cómo desarrollar un plan de negocios y reconocer oportunidades empresariales.
El documento proporciona información sobre los hábitos higiénicos para el personal manipulador de alimentos. Incluye formatos para realizar el seguimiento de la limpieza y desinfección de las áreas de trabajo de un restaurante, con detalles sobre las áreas, desinfectantes, concentraciones, dosis y frecuencias. También incluye un plan para el manejo de plagas y residuos, así como normas de higiene para los manipuladores de alimentos.
Obsolescencia del conocimiento en la sociedad del acceso.
Pequeña reflexion sobre la caducidad del conocimiento en los sistemas educativos y como poder afrontarlos. Hecho con prezi
These are the slides corresponding to the presentation done at ISWC 2013 in Sydney, for the paper with the same title. Remember to check the main URL: http://j.mp/benchmarkqueryrewriting
Tool-Driven Technology Transfer in Software EngineeringHeiko Koziolek
This talk presentst the tool-driven technology transfer process ABB Corporate Research applies in selected software engineering University collaborations. As an example, we have created an add-in to a popular UML tool and developed the tooling in close interaction with the target users. Centering the technology transfer around tool implementations brings many benefits such as the need to make conceptual contributions applicable and the ability to quickly benefit from the new concepts. A challenge to this form of technology transfer is the long-term commitment to the maintenance of the tooling, which we try to address by creating an open developer community. Tool-driven technology transfer projects have proven to be valuable a instrument of bringing advanced software engineering technologies into our organization.
Implementation of Efficiency CORDIC Algorithmfor Sine & Cosine GenerationIOSR Journals
Abstract: This paper presents an area-time efficient coordinate rotation digital computer (CORDIC) algorithm that completely eliminates the scale-factor. A generalized micro-rotation selection technique based on high speed most-significant-1-detection obviates the complex search algorithms for identifying the micro-rotations. This algorithm is redefined as the elementary angles for reducing the number of CORDIC iterations. Compared to the existing re-cursive architectures the proposed one has 17% lower slice-delay product on Xilinx Spartan XC2S200E device. The CORDIC processor pro-vides the flexibility to manipulate the number of iterations depending on the accuracy, area and latency requirements. Index Terms—coordinate rotation digital computer (CORDIC), cosine/sine, field-programmable gate array(FPGA),most-significant-1, recursive architecture.
This document proposes using relational concept analysis (RCA) to suggest refactorings for correcting design defects, specifically the "Blob" defect. RCA models relationships between classes and suggests cohesive groups of fields and methods that could be moved or extracted. The approach was validated on 4 open-source programs, suggesting relevant refactorings with 50-77% precision. Future work includes generalizing to other defects, assessing more programs, and discussing suggested refactorings with developers.
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...Intel® Software
Integrated into Intel® Advisor, Cache-aware Roofline Modeling (CARM) provides insight into how an application behaves by helping to determine a) how optimally it works on a given hardware, b) the main factors that limit performance, c) if the workload is memory or compute-bound, and d) the right strategy to improve application performance.
This document discusses industrializing machine learning pipelines. It proposes a machine learning blueprint to improve collaboration between data scientists and engineers. The blueprint includes technical components like extracting, preprocessing, training, evaluating and predicting data. It defines roles for data scientists and engineers and recommends common tools and code to streamline workflows. The document also uses a case study of catalog categorization to illustrate how the blueprint was applied to classify videos using a machine learning pipeline with various technical components like TensorFlow, Apache Beam and BigQuery.
This document describes an approach for supporting software change tasks using automated query reformulations. It begins with an example of a software change request between Alex and Bob. It then discusses using techniques like TextRank and POSRank, adapted from PageRank, to identify important terms from change requests for querying a codebase. The approach was evaluated on a dataset of over 1,900 change tasks from eight open source projects. Experimental results found the proposed approach improved 57.84% of queries and outperformed existing state-of-the-art methods based on measures like query effectiveness and retrieval performance.
AnXplorer is a new optimization tool for analog and RF circuit design introduced by AgO. It automates the manual and iterative process of sizing circuit elements to meet objectives. AnXplorer uses multi-algorithmic optimization and equation-based modeling to find the global minimum for difficult problems. It generates a centered netlist that meets objectives while covering process, temperature, and voltage variation to reduce respins. Customers can expect a 5x improvement in engineering efficiency and reduced time to market.
Logic Programming as a Service (LPaaS): Intelligence for the IoTAndrea Omicini
Talk @ ICNSC 2017, Calabria, Italy, 16 May 2017
Abstract: The widespread diffusion of low-cost computing devices, such as Arduino boards and Raspberry Pi, along with improvements of Cloud computing platforms, are paving the way towards a whole new set of opportunities for Internet of Things (IoT) applications and services. Varying degrees of intelligence are often required for supporting adaptation and self-management—yet, they should be provided in a light-weight, easy to use and customise, highly-interoperable way. Accordingly, in this paper we explore the idea of Logic Programming as a Service (LPaaS) as a novel and promising re-interpretation of distributed logic programming in the IoT era. After introducing the reference context and motivating scenarios of LPaaS as a key enabling technology for intelligent IoT, we define the LPaaS general system architecture. Then, we present a prototype implementation built on top of the tuProlog system, which provides the required interoperability and customisation. We showcase the LPaaS potential through a case study designed as a simplification of the motivating scenarios.
A High-Level Programming Approach for using FPGAs in HPC using Functional Des...waqarnabi
(1) The authors present an approach for using FPGAs in high-performance computing (HPC) that involves using functional descriptions, vector type-transformations, and cost-modeling. (2) Their approach uses type transformations to generate design variants from a functional program and develops an intermediate language and cost model. (3) The cost model provides fast, lightweight estimates of performance and resource usage for different design variants to enable automated design space exploration for FPGA-based HPC applications.
The document discusses the syllabus and objectives for the subject CS6660 - Compiler Design. It outlines 5 units that will be covered: introduction to compilers, lexical analysis, syntax analysis, syntax directed translation and runtime environment, and code optimization and code generation. Each unit defines the concepts and techniques that will be learned. The objectives are for students to understand compiler design principles, parsing techniques, different translation levels, and how to optimize code generation.
This document proposes a design procedure for a re-configurable convolutional neural network (CNN) engine for field-programmable gate array (FPGA) applications. The procedure includes developing an accurate CNN model using TensorFlow and Python, and implementing a re-configurable CNN engine from scratch using register-transfer level design. The proposed engine was synthesized for 180nm CMOS technology and achieved 96% accuracy on MNIST and CIFAR-10 datasets. A graphical user interface was also designed for loading and testing datasets on the hardware engine.
This document summarizes Avesta Sasan's background and research focus. It begins with an introduction of Avesta Sasan and their position as an Associate Professor at UC Davis. The document then discusses some challenges with keeping hardware development pace with the rapid growth and increasing resource demands of artificial intelligence. This includes charts showing the growth in model size and training costs outpacing hardware improvements. It argues that contributions are needed from across machine learning, hardware architecture, circuit design, and manufacturing to improve AI speed and efficiency. The document proposes using machine learning techniques in electronic design automation flows to help automate and accelerate physical design and RTL synthesis steps in order to help close the gap between hardware and AI.
This presentation is about electronic style calculations for structural engineers. It deals with why we have a problem, how we have problem and how change.
Aggregate Computing Platforms: Bridging the GapsRoberto Casadei
This presentation, held in the context of the CS & Eng M.D. course "Pervasive Computing" (Unibo, Cesena), drafts some analysis for an Aggregate Computing platform and suggests areas of investigation.
Crossing the chasm between ontology engineering and application developmentPaola Espinoza-Arias
The document describes research on generating application programming interfaces (APIs) from ontologies to facilitate consumption of knowledge graphs by application developers. It presents a state of the art analysis of existing approaches for generating APIs from ontologies and outlines a proposed methodology. The methodology includes defining a method for generating APIs based on ontology artifacts like competency questions, and developing a tool called OATAPI to automate API generation. The goal is to help bridge the gap between ontology engineering and application development.
A Fast and Accurate Cost Model for FPGA Design Space Exploration in HPC Appli...waqarnabi
Slides for our paper at RAW, nominated for best paper award, related to our work on developing an optimizing compiler for running scientific code on FPGAs.
OPTIMIZED MULTIPLIER USING REVERSIBLE MULTICONTROL INPUT TOFFOLI GATESVLSICS Design
Reversible logic is an important area to carry the computation into the world of quantum computing. In this paper a 4-bit multiplier using a new reversible logic gate called BVPPG gate is presented. BVPPG gate is a 5 x 5 reversible gate which is designed to generate partial products required to perform multiplication and also duplication of operand bits is obtained. This reduces the total cost of the circuit. Toffoli gate is the universal and also most flexible reversible logic gate. So we have used the Toffoli gates to construct the designed multiplier.
This document presents an introduction to programmable logic controllers (PLCs) delivered by Global Technocrats and Consultants. It discusses PLC programming options including ladder diagrams and languages like RSLogix 5000. It also covers PLC basics such as inputs and outputs, architecture, wiring, functions, and network protocols. Specific protocols mentioned include ASibus, Profibus, DeviceNet and Modbus. The document concludes with review questions about PLC applications, inputs, outputs and relay behavior.
Implementation of Rotation and Vectoring-Mode Reconfigurable CORDICijtsrd
CORDIC or CO-ordinate Rotation Digital Computer is a fast, simple, efficient and powerful algorithm used for diverse Digital Signal Processing applications. Primarily developed for real-time airborne computations, it uses a unique computing technique which is especially suitable for solving the trigonometric relationships involved in plane co-ordinate rotation and conversion from rectangular to polar form. It comprises a special serial arithmetic unit having three shift registers, three adders/subtractors, Look-Up table and special interconnections. In this project A CORDIC-based processor for sine/cosine calculation was designed using VHDL programming in Xilinx ISE 13.2. The CORDIC module was tested for its functionality and correctness by test-bench analysis. Subsequently, FPGA implementation of the CORDIC core followed by Chip Scope Pro analysis of the output logic waveforms was performed. Kothapally Mounika | P. Pavan Kumar | K. Shobha Rani"Implementation of Rotation and Vectoring-Mode Reconfigurable CORDIC" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-2 | Issue-4 , June 2018, URL: http://www.ijtsrd.com/papers/ijtsrd14396.pdf http://www.ijtsrd.com/engineering/electronics-and-communication-engineering/14396/implementation-of-rotation-and-vectoring-mode-reconfigurable-cordic/kothapally-mounika
Similar to Query Rewriting Optimisation Techniques for Ontology-Based Data Access (20)
Immersive Learning That Works: Research Grounding and Paths ForwardLeonel Morgado
We will metaverse into the essence of immersive learning, into its three dimensions and conceptual models. This approach encompasses elements from teaching methodologies to social involvement, through organizational concerns and technologies. Challenging the perception of learning as knowledge transfer, we introduce a 'Uses, Practices & Strategies' model operationalized by the 'Immersive Learning Brain' and ‘Immersion Cube’ frameworks. This approach offers a comprehensive guide through the intricacies of immersive educational experiences and spotlighting research frontiers, along the immersion dimensions of system, narrative, and agency. Our discourse extends to stakeholders beyond the academic sphere, addressing the interests of technologists, instructional designers, and policymakers. We span various contexts, from formal education to organizational transformation to the new horizon of an AI-pervasive society. This keynote aims to unite the iLRN community in a collaborative journey towards a future where immersive learning research and practice coalesce, paving the way for innovative educational research and practice landscapes.
The debris of the ‘last major merger’ is dynamically youngSérgio Sacani
The Milky Way’s (MW) inner stellar halo contains an [Fe/H]-rich component with highly eccentric orbits, often referred to as the
‘last major merger.’ Hypotheses for the origin of this component include Gaia-Sausage/Enceladus (GSE), where the progenitor
collided with the MW proto-disc 8–11 Gyr ago, and the Virgo Radial Merger (VRM), where the progenitor collided with the
MW disc within the last 3 Gyr. These two scenarios make different predictions about observable structure in local phase space,
because the morphology of debris depends on how long it has had to phase mix. The recently identified phase-space folds in Gaia
DR3 have positive caustic velocities, making them fundamentally different than the phase-mixed chevrons found in simulations
at late times. Roughly 20 per cent of the stars in the prograde local stellar halo are associated with the observed caustics. Based
on a simple phase-mixing model, the observed number of caustics are consistent with a merger that occurred 1–2 Gyr ago.
We also compare the observed phase-space distribution to FIRE-2 Latte simulations of GSE-like mergers, using a quantitative
measurement of phase mixing (2D causticality). The observed local phase-space distribution best matches the simulated data
1–2 Gyr after collision, and certainly not later than 3 Gyr. This is further evidence that the progenitor of the ‘last major merger’
did not collide with the MW proto-disc at early times, as is thought for the GSE, but instead collided with the MW disc within
the last few Gyr, consistent with the body of work surrounding the VRM.
When I was asked to give a companion lecture in support of ‘The Philosophy of Science’ (https://shorturl.at/4pUXz) I decided not to walk through the detail of the many methodologies in order of use. Instead, I chose to employ a long standing, and ongoing, scientific development as an exemplar. And so, I chose the ever evolving story of Thermodynamics as a scientific investigation at its best.
Conducted over a period of >200 years, Thermodynamics R&D, and application, benefitted from the highest levels of professionalism, collaboration, and technical thoroughness. New layers of application, methodology, and practice were made possible by the progressive advance of technology. In turn, this has seen measurement and modelling accuracy continually improved at a micro and macro level.
Perhaps most importantly, Thermodynamics rapidly became a primary tool in the advance of applied science/engineering/technology, spanning micro-tech, to aerospace and cosmology. I can think of no better a story to illustrate the breadth of scientific methodologies and applications at their best.
hematic appreciation test is a psychological assessment tool used to measure an individual's appreciation and understanding of specific themes or topics. This test helps to evaluate an individual's ability to connect different ideas and concepts within a given theme, as well as their overall comprehension and interpretation skills. The results of the test can provide valuable insights into an individual's cognitive abilities, creativity, and critical thinking skills
The technology uses reclaimed CO₂ as the dyeing medium in a closed loop process. When pressurized, CO₂ becomes supercritical (SC-CO₂). In this state CO₂ has a very high solvent power, allowing the dye to dissolve easily.
Or: Beyond linear.
Abstract: Equivariant neural networks are neural networks that incorporate symmetries. The nonlinear activation functions in these networks result in interesting nonlinear equivariant maps between simple representations, and motivate the key player of this talk: piecewise linear representation theory.
Disclaimer: No one is perfect, so please mind that there might be mistakes and typos.
dtubbenhauer@gmail.com
Corrected slides: dtubbenhauer.com/talks.html
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...Advanced-Concepts-Team
Presentation in the Science Coffee of the Advanced Concepts Team of the European Space Agency on the 07.06.2024.
Speaker: Diego Blas (IFAE/ICREA)
Title: Gravitational wave detection with orbital motion of Moon and artificial
Abstract:
In this talk I will describe some recent ideas to find gravitational waves from supermassive black holes or of primordial origin by studying their secular effect on the orbital motion of the Moon or satellites that are laser ranged.
The cost of acquiring information by natural selectionCarl Bergstrom
This is a short talk that I gave at the Banff International Research Station workshop on Modeling and Theory in Population Biology. The idea is to try to understand how the burden of natural selection relates to the amount of information that selection puts into the genome.
It's based on the first part of this research paper:
The cost of information acquisition by natural selection
Ryan Seamus McGee, Olivia Kosterlitz, Artem Kaznatcheev, Benjamin Kerr, Carl T. Bergstrom
bioRxiv 2022.07.02.498577; doi: https://doi.org/10.1101/2022.07.02.498577
ESPP presentation to EU Waste Water Network, 4th June 2024 “EU policies driving nutrient removal and recycling
and the revised UWWTD (Urban Waste Water Treatment Directive)”
The binding of cosmological structures by massless topological defectsSérgio Sacani
Assuming spherical symmetry and weak field, it is shown that if one solves the Poisson equation or the Einstein field
equations sourced by a topological defect, i.e. a singularity of a very specific form, the result is a localized gravitational
field capable of driving flat rotation (i.e. Keplerian circular orbits at a constant speed for all radii) of test masses on a thin
spherical shell without any underlying mass. Moreover, a large-scale structure which exploits this solution by assembling
concentrically a number of such topological defects can establish a flat stellar or galactic rotation curve, and can also deflect
light in the same manner as an equipotential (isothermal) sphere. Thus, the need for dark matter or modified gravity theory is
mitigated, at least in part.
The binding of cosmological structures by massless topological defects
Query Rewriting Optimisation Techniques for Ontology-Based Data Access
1. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
QUERY REWRITING OPTIMISATION TECHNIQUES
FOR ONTOLOGY-BASED DATA ACCESS
Jos´e Mora L´opez
jmora@fi.upm.es
Supervisor: Oscar Corcho
ocorcho@fi.upm.es
Ontology Engineering Group
Facultad de Inform´atica, Universidad Polit´ecnica de Madrid
Madrid - December 17, 2015
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 1 / 52
2. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
QUERY REWRITING IN OBDA
QUERY
REWRITING
ontology
rewritten
query
Mappings
query
query
translation
translated
query
query
execution
data source
results
results
translation
translated
results
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 2 / 52
3. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
OUTLINE
1 INTRODUCTION
2 STATE OF THE ART
3 OBJECTIVES
4 OPTIMISATIONS IN THE PRESENCE OF MAPPINGS
5 ENGINEERING OPTIMISATIONS
6 EBOX-BASED OPTIMISATIONS
7 CONCLUSIONS AND FUTURE LINES
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 3 / 52
4. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
1 INTRODUCTION
2 STATE OF THE ART
3 OBJECTIVES
4 OPTIMISATIONS IN THE PRESENCE OF MAPPINGS
5 ENGINEERING OPTIMISATIONS
6 EBOX-BASED OPTIMISATIONS
7 CONCLUSIONS AND FUTURE LINES
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 4 / 52
5. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
1 INTRODUCTION
2 STATE OF THE ART
3 OBJECTIVES
4 OPTIMISATIONS IN THE PRESENCE OF MAPPINGS
5 ENGINEERING OPTIMISATIONS
6 EBOX-BASED OPTIMISATIONS
7 CONCLUSIONS AND FUTURE LINES
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 5 / 52
6. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
USUAL LOGICS IN OBDA
axiom1
logic
DL-Litecore
DL-LiteF
DL-LiteR
Rapid’s
ELHIO
Datalog±2
Horn-SHIQ
B1 B2, B1 ¬B2
≥ 2R1 ⊥
R1 R2, R1 ¬R2
B1 ∃R1, ∃R1 B1
B1 ∃R1.B2
∃R1.B1 B2
B1 B2 B3
{a} B, B {a}, B(a)
n-ary predicates
trans(R1)
B1 ∀R1.B2, B1 ≤ 1R1.B2
1∀i, j, Bi represents a basic concept and Rj represents a role that may be basic or inverted.
2as implemented in Nyaya
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 6 / 52
7. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
SYSTEMS IN THE STATE OF THE ART
System Input Output Reference
Quonto DL-LiteR UCQ Calvanese et al. (2007)
REQUIEM ELHIO Datalog or UCQ P´erez-Urbina et al. (2009)
Presto DL-LiteR Datalog Rosati and Almatelli (2010)
Rapid DL-LiteR
3
Datalog or UCQ Chortaras et al. (2011)
Nyaya Datalog±
UCQ Gottlob et al. (2011)
Venetis’ DL-LiteR UCQ Venetis et al. (2011)
Prexto DL-LiteR and EBox Datalog or UCQ Rosati (2012)
Clipper Horn-SHIQ Datalog Eiter et al. (2012)
kyrie ELHIO Datalog or UCQ Mora and Corcho (2013)
kyrie2 ELHIO and EBox Datalog or UCQ Mora et al. (2014)
3Close to OWL2 QL, B1 ∃R.B2 axioms are supported
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 7 / 52
8. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
GENERAL QUERY REWRITING ALGORITHM
ontology
convert to
internal rep-
resentation
query
reachability
test
rewrite into
Datalog
Datalog
unfold into
UCQ
UCQ
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 8 / 52
9. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
REQUIEM QUERY REWRITING ALGORITHM
query
reachability
convert to
FOL
ontology
remove
functions
greedy
unfold
greedy
mode?
na¨ıve unfold
na¨ıve
mode?
Datalog
optimise UCQ
YesNo
Yes
No
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 9 / 52
10. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
LIMITATIONS IN THE STATE OF THE ART
1. Lack of effectiveness, e.g. rewriting to empty predicates
2. Redundancy, e.g. retrieving the same values multiple times in a query
3. Lack of efficiency, e.g. doing operations that lead to previous points
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 10 / 52
11. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
1 INTRODUCTION
2 STATE OF THE ART
3 OBJECTIVES
4 OPTIMISATIONS IN THE PRESENCE OF MAPPINGS
5 ENGINEERING OPTIMISATIONS
6 EBOX-BASED OPTIMISATIONS
7 CONCLUSIONS AND FUTURE LINES
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 11 / 52
12. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
OBJECTIVES
Improve existing query rewriting techniques by:
1. Using the information provided by mappings
2. Using more efficient query rewriting algorithms
3. Using an EBox to avoid redundant answers
4. Using ELHIO, for the applicability of results both in theory and practice
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 12 / 52
13. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
HYPOTHESES AND CONTRIBUTIONS
H1. Knowing the presence or absence of a predicate mappings allows
reducing the size of the rewritten queries and the time needed to produce
them
H2. Additional computations can be done during query rewriting so as to
obtain a gain in efficiency that compensates for the cost of these
computations
H3. EBoxes may allow reducing the size of the rewritten queries and the time
needed in the query process
C1. An algorithm4
that considers the presence of mappings to improve
results and process of query rewriting
C2. An algorithm including several optimisations in the process of query
rewriting
C3. An algorithm that uses the knowledge in an EBox to remove redundancy
in the rewritten queries and the query rewriting process
4All algorithms were empirically evaluated and formally validated
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 13 / 52
14. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
HYPOTHESES AND CONTRIBUTIONS
H1. Knowing the presence or absence of a predicate mappings allows
reducing the size of the rewritten queries and the time needed to produce
them
H2. Additional computations can be done during query rewriting so as to
obtain a gain in efficiency that compensates for the cost of these
computations
H3. EBoxes may allow reducing the size of the rewritten queries and the time
needed in the query process
C1. An algorithm4
that considers the presence of mappings to improve
results and process of query rewriting
C2. An algorithm including several optimisations in the process of query
rewriting
C3. An algorithm that uses the knowledge in an EBox to remove redundancy
in the rewritten queries and the query rewriting process
4All algorithms were empirically evaluated and formally validated
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 13 / 52
15. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
1 INTRODUCTION
2 STATE OF THE ART
3 OBJECTIVES
4 OPTIMISATIONS IN THE PRESENCE OF MAPPINGS
5 ENGINEERING OPTIMISATIONS
6 EBOX-BASED OPTIMISATIONS
7 CONCLUSIONS AND FUTURE LINES
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 14 / 52
16. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
OBDA CONTEXT
QUERY
REWRITING
ontology
rewritten
query
Mappings
query
query
translation
translated
query
query
execution
data source
results
results
translation
translated
results
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 15 / 52
17. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
OBDA CONTEXT
QUERY
REWRITING
ontology
rewritten
query
Mappings
query
query
translation
translated
query
query
execution
data source
results
results
translation
translated
results
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 15 / 52
18. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
QUERY REWRITING ALGORITHM
query
reachability
convert to
FOL
ontology
remove
functions
greedy
unfold
greedy
mode?
na¨ıve unfold
na¨ıve
mode?
Datalog
optimise UCQ
YesNo
Yes
No
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 16 / 52
19. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
QUERY REWRITING ALGORITHM
query
reachability
convert to
FOL
mappings
ontology
remove
functions
remove
non-mapped
predicates
greedy
unfold
greedy
mode?
na¨ıve unfold
na¨ıve
mode?
Datalog
optimise UCQ
YesNo
Yes
No
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 16 / 52
21. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
MAPPING-BASED REACHABILITY
Principle: predicates that provide no answers are not needed
Two working modes:
H (highest): first mapped predicate stops, following ones are contained
L (lowest): last mapped predicate stops, following ones are empty
Example
Query: Q(x) ← Student(x)
Mapped predicates: GradStudent, PhDStudent
Ontology:
GradStudent Student
PhDStudent GradStudent
MasterStudent GradStudent
H rewriting
Q(x) ← GradStudent(x)
L rewriting
Q(x) ← GradStudent(x)
Q(x) ← PhDStudent(x)
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 18 / 52
22. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
MAPPING-BASED REACHABILITY
Principle: predicates that provide no answers are not needed
Two working modes:
H (highest): first mapped predicate stops, following ones are contained
L (lowest): last mapped predicate stops, following ones are empty
Example
Query: Q(x) ← Student(x)
Mapped predicates: GradStudent, PhDStudent
Ontology:
GradStudent Student
PhDStudent GradStudent
MasterStudent GradStudent
H rewriting
Q(x) ← GradStudent(x)
L rewriting
Q(x) ← GradStudent(x)
Q(x) ← PhDStudent(x)
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 18 / 52
23. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
RESOLUTION WITH MAPPINGS
For each non-mapped predicate p
1. Check that all clauses with p in the head have all their body predicates
mapped
Otherwise: select a different predicate
Break cycles: choose one predicate per cycle to exclude from resolution
2. Resolve selecting p
3. Clauses containing p can be removed
All clauses containing predicates selected for resolution can be deleted
Example: Mapped {PhDStudent, researches}
TBox:
∃researches.PhDThesis PhDStudent
∃researches−
.PhDStudent PhDThesis
Ontology (Datalog):
PhDStudent(x) ← researches(x, y), PhDThesis(y)
PhDThesis(x) ← researches(y, x), PhDStudent(y)
Resolves: PhDStudent(x) ← researches(x, z), researches(z, y), PhDStudent(y)
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 19 / 52
24. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
RESOLUTION WITH MAPPINGS
For each non-mapped predicate p
1. Check that all clauses with p in the head have all their body predicates
mapped
Otherwise: select a different predicate
Break cycles: choose one predicate per cycle to exclude from resolution
2. Resolve selecting p
3. Clauses containing p can be removed
All clauses containing predicates selected for resolution can be deleted
Example: Mapped {PhDStudent, researches}
TBox:
∃researches.PhDThesis PhDStudent
∃researches−
.PhDStudent PhDThesis
Ontology (Datalog):
PhDStudent(x) ← researches(x, y), PhDThesis(y)
PhDThesis(x) ← researches(y, x), PhDStudent(y)
Resolves: PhDStudent(x) ← researches(x, z), researches(z, y), PhDStudent(y)
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 19 / 52
25. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
EVALUATION SETUP
Ontology: hydrOntology (Vilches-Bl´azquez et al., 2007)
155 concepts
expressiveness SHIN(D)
several mappings
The National Atlas data source, with 32 mappings, 1,100 hydrographical
instances
The Numerical Cartographic Database (BCN200) with 57 mappings that
provide 60,000 toponyms
EuroGlobalMap (EGM), 32 mappings that provide 3,500 Spanish
hydrographical toponyms
Hypothesis:
H1. Knowing the presence or absence of a predicate mappings allows reducing
the size of the rewritten queries and the time needed to produce them
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 20 / 52
26. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
EVALUATION RESULTS
Contextual
Information
Ontology:
HydrOntology
Original
Datalog
size:
686 clauses
Ignored
statements:
405
Time unit:
Milliseconds
Working modes:
Naive
Greedy
Full
Mappings None BCN200 Atlas EGM
Mappings # 0 57 32 32
Prune method A H L H L H L
Mode Phase Clauses generated after each phase
N
Prune 535 323 445 287 415 336 429
Saturation 412 25 50 17 24 19 31
Unfolding 2333 25 46 17 22 19 28
Prune 2333 25 46 17 22 19 28
G
Prune 535 323 445 287 415 336 429
Saturation 407 25 49 15 24 19 30
Unfolding 894 25 45 15 22 19 27
Prune 688 25 45 15 22 19 27
F
Prune 535 323 445 287 415 336 429
Saturation 407 25 49 15 24 19 30
Unfolding 2245 25 45 15 22 19 27
Prune 911 25 45 15 22 19 27
N Total time 2735 172 625 156 359 250 625
G Total time 2250 172 512 156 360 218 532
F Total time 3719 172 531 156 344 235 531
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 21 / 52
27. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
EVALUATION RESULTS
Contextual
Information
Ontology:
HydrOntology
Original
Datalog
size:
686 clauses
Ignored
statements:
405
Time unit:
Milliseconds
Working modes:
Naive
Greedy
Full
Mappings None BCN200 Atlas EGM
Mappings # 0 57 32 32
Prune method A H L H L H L
Mode Phase Clauses generated after each phase
N
Prune 535 323 445 287 415 336 429
Saturation 412 25 50 17 24 19 31
Unfolding 2333 25 46 17 22 19 28
Prune 2333 25 46 17 22 19 28
G
Prune 535 323 445 287 415 336 429
Saturation 407 25 49 15 24 19 30
Unfolding 894 25 45 15 22 19 27
Prune 688 25 45 15 22 19 27
F
Prune 535 323 445 287 415 336 429
Saturation 407 25 49 15 24 19 30
Unfolding 2245 25 45 15 22 19 27
Prune 911 25 45 15 22 19 27
N Total time 2735 172 625 156 359 250 625
G Total time 2250 172 512 156 360 218 532
F Total time 3719 172 531 156 344 235 531
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 21 / 52
28. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
HYPOTHESIS VALIDATED
Ontology: hydrOntology (Vilches-Bl´azquez et al., 2007)
155 concepts
expressiveness SHIN(D)
several mappings
The National Atlas data source, with 32 mappings, 1,100 hydrographical
instances
The Numerical Cartographic Database (BCN200) with 57 mappings that
provide 60,000 toponyms
EuroGlobalMap (EGM), 32 mappings that provide 3,500 Spanish
hydrographical toponyms
Hypothesis:
H1. Knowing the presence or absence of a predicate mappings allows reducing
the size of the rewritten queries and the time needed to produce them
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 22 / 52
29. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
HYPOTHESIS VALIDATED
Ontology: hydrOntology (Vilches-Bl´azquez et al., 2007)
155 concepts
expressiveness SHIN(D)
several mappings
The National Atlas data source, with 32 mappings, 1,100 hydrographical
instances
The Numerical Cartographic Database (BCN200) with 57 mappings that
provide 60,000 toponyms
EuroGlobalMap (EGM), 32 mappings that provide 3,500 Spanish
hydrographical toponyms
Hypothesis:
H1. Knowing the presence or absence of a predicate mappings allows reducing
the size of the rewritten queries and the time needed to produce them
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 22 / 52
30. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
1 INTRODUCTION
2 STATE OF THE ART
3 OBJECTIVES
4 OPTIMISATIONS IN THE PRESENCE OF MAPPINGS
5 ENGINEERING OPTIMISATIONS
6 EBOX-BASED OPTIMISATIONS
7 CONCLUSIONS AND FUTURE LINES
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 23 / 52
31. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
OBDA CONTEXT
QUERY
REWRITING
ontology
rewritten
query
Mappings
query
query
translation
translated
query
query
execution
data source
results
results
translation
translated
results
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 24 / 52
32. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
OBDA CONTEXT
QUERY
REWRITING
ontology
rewritten
query
Mappings
query
query
translation
preprocess
clauses
translated
query
query
execution
data source
results
results
translation
translated
results
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 24 / 52
33. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
QUERY REWRITING PROCESS
query
reachability
test
remove
functions
greedy
unfold
greedy
mode?
na¨ıve unfold
na¨ıve
mode?
Datalog
convert to
FOL
ontology
optimise UCQ
YesNo
Yes
No
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 25 / 52
34. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
QUERY REWRITING PROCESS
query
reachability
test
remove
functions
remove
auxiliar
predicates
reachability
test
unfold
preprocess
Datalog
UCQ
convert to
FOL
ontology
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 25 / 52
35. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
ONTOLOGY PREPROCESSING
Some inferences can be done without the query
This saves some time for later processing
Entirely in FOL
Example: Mapped {PhDStudent, researches}
TBox:
∃researches.PhDThesis PhDStudent
∃researches−
.PhDStudent PhDThesis
Ontology (Datalog):
PhDStudent(x) ← researches(x, y), PhDThesis(y)
PhDThesis(x) ← researches(y, x), PhDStudent(y)
Resolves: PhDStudent(x) ← researches(x, z), researches(z, y), PhDStudent(y)
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 26 / 52
36. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
ONTOLOGY PREPROCESSING
Some inferences can be done without the query
This saves some time for later processing
Entirely in FOL
Example: Mapped {PhDStudent, researches}
TBox:
∃researches.PhDThesis PhDStudent
∃researches−
.PhDStudent PhDThesis
Ontology (Datalog):
PhDStudent(x) ← researches(x, y), PhDThesis(y)
PhDThesis(x) ← researches(y, x), PhDStudent(y)
Resolves: PhDStudent(x) ← researches(x, z), researches(z, y), PhDStudent(y)
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 26 / 52
37. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
CLAUSE SUBSUMPTION CHECK
Checked as soon as a new clause is generated
Deleted clauses do not participate in resolution
Incremental effect
Example:
Query: Q(x, y) ← Student(x), takesCourse(x, y), Course(y)
Consider big taxonomies for both Student and Course
TBox:
∃takesCourse Student
∃takesCourse− Course
Rewritten query: Q(x, y) ← takesCourse(x, y)
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 27 / 52
38. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
CLAUSE SUBSUMPTION CHECK
Checked as soon as a new clause is generated
Deleted clauses do not participate in resolution
Incremental effect
Example:
Query: Q(x, y) ← Student(x), takesCourse(x, y), Course(y)
Consider big taxonomies for both Student and Course
TBox:
∃takesCourse Student
∃takesCourse− Course
Rewritten query: Q(x, y) ← takesCourse(x, y)
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 27 / 52
39. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
PRIORITISING SOME INFERENCES
Free slection resolution accepts some degree of freedom in order
Prioritising resolution with shorter clauses increases likelihood of
producing subsuming clauses
Increases effect of previous optimisation
Example:
Query: Q(x, y) ← Student(x), takesCourse(x, y), Course(y)
Consider big taxonomies for both Student and Course
TBox:
∃takesCourse Student
∃takesCourse− Course
Rewritten query: Q(x, y) ← takesCourse(x, y)
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 28 / 52
40. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
PRIORITISING SOME INFERENCES
Free slection resolution accepts some degree of freedom in order
Prioritising resolution with shorter clauses increases likelihood of
producing subsuming clauses
Increases effect of previous optimisation
Example:
Query: Q(x, y) ← Student(x), takesCourse(x, y), Course(y)
Consider big taxonomies for both Student and Course
TBox:
∃takesCourse Student
∃takesCourse− Course
Rewritten query: Q(x, y) ← takesCourse(x, y)
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 28 / 52
41. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
CONSTRAINING SEARCHES
Different groups of clauses have different properties
query clauses
length
Keeping them separately means restricting the attention (computation) to
a group
Most operations select a random clause in a group, e.g. prioritising
inferences
In our examples
selection is not random, sets of clauses are preferred
selection is not specific, several clauses in the set
grouping clauses in categories is effective and efficient
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 29 / 52
42. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
CONSTRAINING SEARCHES
Different groups of clauses have different properties
query clauses
length
Keeping them separately means restricting the attention (computation) to
a group
Most operations select a random clause in a group, e.g. prioritising
inferences
In our examples
selection is not random, sets of clauses are preferred
selection is not specific, several clauses in the set
grouping clauses in categories is effective and efficient
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 29 / 52
43. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
EVALUATION SETUP
Usual assets in query rewriting (ontologies and queries)
Adolena, path1, path5, StockExchange, University (LUBM), Vicodi.
Added ontologies (marked with “X”) using auxiliar predicates for qualified
existentials (Calvanese et al., 2007)
Added not so usual ontologies: Galen-Lite, Core
Added ontologies in ELHIO and beyond OWL2 QL (marked with “E”)
From 5 to 9 queries for each
Hypothesis:
H2. Additional computations can be done during query rewriting so as to obtain
a gain in efficiency that compensates for the cost of these computations
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 30 / 52
48. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
HYPOTHESIS VALIDATION
Usual assets in query rewriting (ontologies and queries)
Adolena, path1, path5, StockExchange, University (LUBM), Vicodi.
Added ontologies (marked with “X”) using auxiliar predicates for qualified
existentials (Calvanese et al., 2007)
Added not so usual ontologies: Galen-Lite, Core
Added ontologies in ELHIO and beyond OWL2 QL (marked with “E”)
From 5 to 9 queries for each
Hypothesis:
H2. Additional computations can be done during query rewriting so as to obtain
a gain in efficiency that compensates for the cost of these computations
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 33 / 52
49. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
HYPOTHESIS VALIDATION
Usual assets in query rewriting (ontologies and queries)
Adolena, path1, path5, StockExchange, University (LUBM), Vicodi.
Added ontologies (marked with “X”) using auxiliar predicates for qualified
existentials (Calvanese et al., 2007)
Added not so usual ontologies: Galen-Lite, Core
Added ontologies in ELHIO and beyond OWL2 QL (marked with “E”)
From 5 to 9 queries for each
Hypothesis:
H2. Additional computations can be done during query rewriting so as to obtain
a gain in efficiency that compensates for the cost of these computations
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 33 / 52
50. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
1 INTRODUCTION
2 STATE OF THE ART
3 OBJECTIVES
4 OPTIMISATIONS IN THE PRESENCE OF MAPPINGS
5 ENGINEERING OPTIMISATIONS
6 EBOX-BASED OPTIMISATIONS
7 CONCLUSIONS AND FUTURE LINES
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 34 / 52
51. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
SOA: EBOX DEFINITION
ABoxes may present dependencies in their completeness
(Rodr´ıguez-Muro and Calvanese, 2011)
An EBox specifies the extensional constraints of the ABox (Rosati, 2012)
Example:
TBox: {PhDStudent Student}
ABox:
PhDStudents: Foo, Bar.
Students: Bar.
EBox: {Student PhDStudent}
The set of PhDStudents contains all Students, extensionally
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 35 / 52
52. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
SOA: EBOX DEFINITION
ABoxes may present dependencies in their completeness
(Rodr´ıguez-Muro and Calvanese, 2011)
An EBox specifies the extensional constraints of the ABox (Rosati, 2012)
Example:
TBox: {PhDStudent Student}
ABox:
PhDStudents: Foo, Bar.
Students: Bar.
EBox: {Student PhDStudent}
The set of PhDStudents contains all Students, extensionally
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 35 / 52
53. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
SOA: EBOX DEFINITION
ABoxes may present dependencies in their completeness
(Rodr´ıguez-Muro and Calvanese, 2011)
An EBox specifies the extensional constraints of the ABox (Rosati, 2012)
Example:
TBox: {PhDStudent Student}
ABox:
PhDStudents: Foo, Bar.
Students: Bar.
EBox: {Student PhDStudent}
The set of PhDStudents contains all Students, extensionally
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 35 / 52
54. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
OBDA CONTEXT
ontology preprocess clauses
QUERY
REWRITING
rewritten
query
Mappings
query
query
translation
translated
query
query
execution
data source
results
results
translation
translated
results
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 36 / 52
55. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
OBDA CONTEXT
ontology preprocess
EBox
clauses
QUERY
REWRITING
EBox
generation
rewritten
query
Mappings
query
query
translation
translated
query
query
execution
data source
results
results
translation
translated
results
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 36 / 52
56. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
QUERY REWRITING PROCESS
ontology
convert to
FOL
preprocess reachability
test
query
remove
functions
remove
auxiliar
predicates +
reachability
test
unfold Datalog
UCQ
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 37 / 52
57. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
QUERY REWRITING PROCESS
ontology
SCC removal
EBox
convert to
Datalog unfold
use EBox
use EBox
convert to
FOL
preprocess reachability
test
query
remove
functions
Datalog
UCQ
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 37 / 52
58. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
REMOVAL OF SCCS
Strongly Connected Component (SCC): set of vertexes in a directed
graph that can be accessed from each other
Two conditions for removal (ensure no intensional values)
SCC to be removed cannot be accessible from another
Whole SCC needs to be implied by the EBox
Analogous in DL and Datalog
Vertexes are axioms/clauses
Edges are shared predicates
Direction: from head to body
For the example, clauses:
γ: GradStudent Student
ϕ: PhDStudent GradStudent
imply:
γ ϕ
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 38 / 52
59. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
REMOVAL OF SCCS
Strongly Connected Component (SCC): set of vertexes in a directed
graph that can be accessed from each other
Two conditions for removal (ensure no intensional values)
SCC to be removed cannot be accessible from another
Whole SCC needs to be implied by the EBox
Analogous in DL and Datalog
Vertexes are axioms/clauses
Edges are shared predicates
Direction: from head to body
For the example, clauses:
γ: GradStudent Student
ϕ: PhDStudent GradStudent
imply:
γ ϕ
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 38 / 52
60. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
REMOVAL OF SCCS
Strongly Connected Component (SCC): set of vertexes in a directed
graph that can be accessed from each other
Two conditions for removal (ensure no intensional values)
SCC to be removed cannot be accessible from another
Whole SCC needs to be implied by the EBox
Analogous in DL and Datalog
Vertexes are axioms/clauses
Edges are shared predicates
Direction: from head to body
For the example, clauses:
γ: GradStudent Student
ϕ: PhDStudent GradStudent
imply:
γ ϕ
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 38 / 52
61. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
REMOVAL OF SCCS
Strongly Connected Component (SCC): set of vertexes in a directed
graph that can be accessed from each other
Two conditions for removal (ensure no intensional values)
SCC to be removed cannot be accessible from another
Whole SCC needs to be implied by the EBox
Analogous in DL and Datalog
Vertexes are axioms/clauses
Edges are shared predicates
Direction: from head to body
For the example, clauses:
γ: GradStudent Student
ϕ: PhDStudent GradStudent
imply:
γ ϕ
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 38 / 52
62. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
REMOVAL OF SCCS: EXAMPLE
A
BC
Q
DE
FG
H I
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 39 / 52
63. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
REMOVAL OF SCCS: EXAMPLE
A
BC
Q
DEE
FFG
HH
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 39 / 52
64. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
REMOVAL OF SCCS: EXAMPLE
A
BC
Q
DEE
FFGG
HH
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 39 / 52
65. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
REMOVAL OF SCCS: EXAMPLE
A
BC
Q
D
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 39 / 52
66. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
EXTENSIONALLY EMPTY PREDICATES REMOVAL
Analogous to the removal of non-mapped predicates
Extended to preprocessing
Possibility of cycles with function symbols
Keep predicates in the heads for reachability
Example: Mapped {PhDStudent, researches}
Ontology:
∃researches.PhDThesis PhDStudent
∃researches−
.PhDStudent PhDThesis
Ontology (Datalog):
PhDStudent(x) ← researches(x, y), PhDThesis(y)
PhDThesis(x) ← researches(y, x), PhDStudent(y)
Resolves: PhDStudent(x) ← researches(x, z), researches(z, y), PhDStudent(y)
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 40 / 52
67. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
EXTENSIONALLY EMPTY PREDICATES REMOVAL
Analogous to the removal of non-mapped predicates
Extended to preprocessing
Possibility of cycles with function symbols
Keep predicates in the heads for reachability
Example: Mapped {PhDStudent, researches}
Ontology:
∃researches.PhDThesis PhDStudent
∃researches−
.PhDStudent PhDThesis
Ontology (Datalog):
PhDStudent(x) ← researches(x, y), PhDThesis(y)
PhDThesis(x) ← researches(y, x), PhDStudent(y)
Resolves: PhDStudent(x) ← researches(x, z), researches(z, y), PhDStudent(y)
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 40 / 52
68. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
EXTENSIONALLY SUBSUMED CLAUSES REMOVAL
Analogous to the removal of subsumed clauses
More costly: requires resolution with EBox clauses
Detected when some clause is generated from a clause and the purely
extensional EBox
When detected: delete
Example:
Query: Q(x) ← Student(x)
TBox: PhDStudent Student
EBox: Student PhDStudent
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 41 / 52
69. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
EXTENSIONALLY SUBSUMED CLAUSES REMOVAL
Analogous to the removal of subsumed clauses
More costly: requires resolution with EBox clauses
Detected when some clause is generated from a clause and the purely
extensional EBox
When detected: delete
Example:
Query: Q(x) ← Student(x)
TBox: PhDStudent Student
EBox: Student PhDStudent
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 41 / 52
70. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
EXTENSIONAL CONDENSATION OF CLAUSES
clause subsumption
clause condensation
extensional clause subsumption
extensional clause condensation
Detected when some subsuming clause is generated from a clause and
the purely extensional EBox
When detected: replace
Example:
Query: Q(x, y) ← Student(x), takesCourse(x, y), Course(y)
Consider big taxonomies for both Student and Course
EBox:
∃takesCourse Student
∃takesCourse− Course
Rewritten query: Q(x, y) ← takesCourse(x, y)
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 42 / 52
71. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
EXTENSIONAL CONDENSATION OF CLAUSES
clause subsumption
clause condensation
extensional clause subsumption
extensional clause condensation
Detected when some subsuming clause is generated from a clause and
the purely extensional EBox
When detected: replace
Example:
Query: Q(x, y) ← Student(x), takesCourse(x, y), Course(y)
Consider big taxonomies for both Student and Course
EBox:
∃takesCourse Student
∃takesCourse− Course
Rewritten query: Q(x, y) ← takesCourse(x, y)
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 42 / 52
72. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
EVALUATION SETUP
Same queries. Extended previous assets with synthetically-generated EBoxes,
according to parameters:
SIZE ratio between EBox and TBox sizes (in axioms), e.g. size 2
means that EBox size is twice the TBox size
COVER percentage of axioms extracted from the TBox, remaining
axioms are generated randomly from the concepts
REVERSE among the axioms extracted, percentage of axioms that will
have their left hand side and right hand side swapped
For the evaluation:
Values for parameters have been 0, 0.2, 0.4, 0.6, 0.8 and 1
If a parameter is 0 then following parameters are 0 too
Hypothesis:
H3. EBoxes may allow reducing the size of the rewritten queries and the time
needed in the query process
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 43 / 52
73. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
EVALUATION: OPT. WITH AN EBOX (RESULTS)query
Datalog Datalog UCQ UCQ
time(ms) size time(ms) size
I II III IV I II III IV I II III IV I II III IV
1 0 0 157 235 15 13 14 9 0 0 516 672 15 13 14 9
2 16 16 157 234 10 10 10 10 16 16 500 656 10 10 10 10
3 0 0 125 235 35 30 28 15 31 15 485 813 72 57 54 15
4 0 16 219 188 41 38 22 16 63 94 735 719 185 170 3 42
5 16 16 172 250 8 5 7 1 32 31 609 719 30 9 15 1
6 0 15 234 188 18 14 14 11 0 15 578 641 18 14 14 11
7 0 0 125 172 27 23 23 20 94 125 1359 1359 180 140 140 110
Query independent information
EBox I II III IV
Preprocess time 109 2047 24266 2859
Preprocess size 222 195 171 111
size ratio 0.0 0.2 0.8 0.8
cover % 0.0 0.8 0.2 0.8
reverse % 0.0 0.0 0.0 0.0
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 44 / 52
74. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
EVALUATION: OPT. WITH AN EBOX (RESULTS)query
Datalog Datalog UCQ UCQ
time(ms) size time(ms) size
I II III IV I II III IV I II III IV I II III IV
1 0 0 157 235 15 13 14 9 0 0 516 672 15 13 14 9
2 16 16 157 234 10 10 10 10 16 16 500 656 10 10 10 10
3 0 0 125 235 35 30 28 15 31 15 485 813 72 57 54 15
4 0 16 219 188 41 38 22 16 63 94 735 719 185 170 3 42
5 16 16 172 250 8 5 7 1 32 31 609 719 30 9 15 1
6 0 15 234 188 18 14 14 11 0 15 578 641 18 14 14 11
7 0 0 125 172 27 23 23 20 94 125 1359 1359 180 140 140 110
Query independent information
EBox I II III IV
Preprocess time 109 2047 24266 2859
Preprocess size 222 195 171 111
size ratio 0.0 0.2 0.8 0.8
cover % 0.0 0.8 0.2 0.8
reverse % 0.0 0.0 0.0 0.0
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 44 / 52
75. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
EVALUATION, AGGREGATED RESULTS
0 0.2 0.4 0.6 0.8 1
102.3
102.4
102.5
size of the EBox
Averagetime(ms)toproducetheDatalogrewriting
cover 0 0.2 0.4 0.6 0.8 1
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 45 / 52
76. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
EVALUATION, AGGREGATED RESULTS
0 0.2 0.4 0.6 0.8 1
102.3
102.4
102.5
size of the EBox
Averagetime(ms)toproducetheDatalogrewriting
cover 0 0.2 0.4 0.6 0.8 1
0 0.2 0.4 0.6 0.8 1
103
104
size of the EBox
Averagetime(ms)toproducetheUCQrewriting
cover 0 0.2 0.4 0.6 0.8 1
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 45 / 52
77. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
EVALUATION, AGGREGATED RESULTS
0 0.2 0.4 0.6 0.8 1
102.3
102.4
102.5
size of the EBox
Averagetime(ms)toproducetheDatalogrewriting
cover 0 0.2 0.4 0.6 0.8 1
0 0.2 0.4 0.6 0.8 1
103
104
size of the EBox
Averagetime(ms)toproducetheUCQrewriting
cover 0 0.2 0.4 0.6 0.8 1
0 0.2 0.4 0.6 0.8 1
101
101.2
101.4
size of the EBox
AverageclausesintheDatalogrewriting cover 0 0.2 0.4 0.6 0.8 1
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 45 / 52
78. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
EVALUATION, AGGREGATED RESULTS
0 0.2 0.4 0.6 0.8 1
102.3
102.4
102.5
size of the EBox
Averagetime(ms)toproducetheDatalogrewriting
cover 0 0.2 0.4 0.6 0.8 1
0 0.2 0.4 0.6 0.8 1
103
104
size of the EBox
Averagetime(ms)toproducetheUCQrewriting
cover 0 0.2 0.4 0.6 0.8 1
0 0.2 0.4 0.6 0.8 1
101
101.2
101.4
size of the EBox
AverageclausesintheDatalogrewriting cover 0 0.2 0.4 0.6 0.8 1
0 0.2 0.4 0.6 0.8 1
101.5
102
102.5
size of the EBox
AverageclausesintheUCQrewriting
cover 0 0.2 0.4 0.6 0.8 1
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 45 / 52
79. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
HYPOTHESIS VALIDATION
Same queries. Extended previous assets with synthetically-generated EBoxes,
according to parameters:
SIZE ratio between EBox and TBox sizes (in axioms), e.g. size 2
means that EBox size is twice the TBox size
COVER percentage of axioms extracted from the TBox, remaining
axioms are generated randomly from the concepts
REVERSE among the axioms extracted, percentage of axioms that will
have their left hand side and right hand side swapped
For the evaluation:
Values for parameters have been 0, 0.2, 0.4, 0.6, 0.8 and 1
If a parameter is 0 then following parameters are 0 too
Hypothesis:
H3. EBoxes may allow reducing the size of the rewritten queries and the time
needed in the query process
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 46 / 52
80. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
HYPOTHESIS VALIDATION
Same queries. Extended previous assets with synthetically-generated EBoxes,
according to parameters:
SIZE ratio between EBox and TBox sizes (in axioms), e.g. size 2
means that EBox size is twice the TBox size
COVER percentage of axioms extracted from the TBox, remaining
axioms are generated randomly from the concepts
REVERSE among the axioms extracted, percentage of axioms that will
have their left hand side and right hand side swapped
For the evaluation:
Values for parameters have been 0, 0.2, 0.4, 0.6, 0.8 and 1
If a parameter is 0 then following parameters are 0 too
Hypothesis:
H3. EBoxes may allow reducing the size of the rewritten queries and the time
needed in the query process
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 46 / 52
81. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
1 INTRODUCTION
2 STATE OF THE ART
3 OBJECTIVES
4 OPTIMISATIONS IN THE PRESENCE OF MAPPINGS
5 ENGINEERING OPTIMISATIONS
6 EBOX-BASED OPTIMISATIONS
7 CONCLUSIONS AND FUTURE LINES
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 47 / 52
82. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
CONCLUSIONS I
Small modifications to the query rewriting process from scientific and
engineering points of view, with:
Big impact on the results
Great applicability
Strong focus on a pragmatic perspective, maintaining the expressiveness
Theoretically sound and validated.
Formalisation: 10 propositions, 1 corollary
Empirical evaluations that show the obtained results, full data:
http://purl.org/net/jmora/mappings/fao/analysis
http://purl.org/net/jmora/kyrie/mappings/analysis
http://purl.org/net/jmora/kyrie/optimisations/evaluation
http://purl.org/net/jmora/kyrie/benchmark
http://purl.org/net/jmora/kyrie/queries/analysis
http://purl.org/net/jmora/kyrie/extensional/evaluation
Open source code:
https://bitbucket.org/jmora/kyrie
https://bitbucket.org/jmora/kyrie-aux
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 48 / 52
83. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
CONCLUSIONS II
Main publications:
Jose Mora and Oscar Corcho. Engineering optimisations in query rewriting
for OBDA. In Proceedings of the 9th International Conference on Semantic
Systems (I-SEMANTICS 13), pages 4148, Graz, Austria, September 2013.
Jose Mora and Oscar Corcho. Towards a systematic benchmarking of
ontology-based query rewriting systems. In The 12th International Semantic
Web Conference (ISWC2013), pages 369384, Sydney, Australia, October
2013.
Jose Mora, Riccardo Rosati, and Oscar Corcho. kyrie2: Query Rewriting
under Extensional Constraints in ELHIO. In The 13th International
Semantic Web Conference (ISWC2014), pages 568583, Riva del Garda, Italy,
October 2014.
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 49 / 52
84. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
BETTER EVALUATION
Some assets used (often reused) for evaluation in papers, e.g.
(P´erez-Urbina et al., 2009)
Only systematic approach: (Imprialou et al., 2012)
Algorithm to generate queries automatically
Good for coverage testing (detected several problems)
Queries are generated automatically (chase)
All have a similar structure
No statement about how OBDA systems are actually used
Over this state of the art:
Usual assets used, added usual assets out of query rewriting
Generated some additional assets to evaluate specific features
Meta-evaluation: analysed the characteristics of queries and ontologies
Having a set of standard benchmarks would ease evaluation,
reproducibility, comparison, etc.
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 50 / 52
85. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
BETTER EVALUATION
Some assets used (often reused) for evaluation in papers, e.g.
(P´erez-Urbina et al., 2009)
Only systematic approach: (Imprialou et al., 2012)
Algorithm to generate queries automatically
Good for coverage testing (detected several problems)
Queries are generated automatically (chase)
All have a similar structure
No statement about how OBDA systems are actually used
Over this state of the art:
Usual assets used, added usual assets out of query rewriting
Generated some additional assets to evaluate specific features
Meta-evaluation: analysed the characteristics of queries and ontologies
Having a set of standard benchmarks would ease evaluation,
reproducibility, comparison, etc.
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 50 / 52
86. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
BETTER EVALUATION
Some assets used (often reused) for evaluation in papers, e.g.
(P´erez-Urbina et al., 2009)
Only systematic approach: (Imprialou et al., 2012)
Algorithm to generate queries automatically
Good for coverage testing (detected several problems)
Queries are generated automatically (chase)
All have a similar structure
No statement about how OBDA systems are actually used
Over this state of the art:
Usual assets used, added usual assets out of query rewriting
Generated some additional assets to evaluate specific features
Meta-evaluation: analysed the characteristics of queries and ontologies
Having a set of standard benchmarks would ease evaluation,
reproducibility, comparison, etc.
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 50 / 52
87. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
FUTURE LINES
Improve
expressiveness (SPIN, OWL2 RL, OWL2 DL, ...)
efficiency (process but especially results)
awareness of context (provenance, cost model, data quality, ...)
standardization (SPARQL, ...)
applicability: types of data sources (streams, web services, ...)
knowledge available, e.g. predefined queries for visualisation
benchmarks
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 51 / 52
88. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
QUERY REWRITING OPTIMISATION TECHNIQUES
FOR ONTOLOGY-BASED DATA ACCESS
Jos´e Mora L´opez
jmora@fi.upm.es
Supervisor: Oscar Corcho
ocorcho@fi.upm.es
Ontology Engineering Group
Facultad de Inform´atica, Universidad Polit´ecnica de Madrid
Madrid - December 17, 2015
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 52 / 52
89. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
FOL ELHIO
A(a) A(a), {a} A
P(a, b) P(a, b)
x≈a ← A(x) A {a}
A2(x) ← A1(x) A1 A2
A3(x) ← A1(x) ∧ A2(x) A1 A2 A3
P(x, f(x)) ← A(x) A ∃P
P(x, f(x)) ← A1(x),A2(f(x)) ← A1(x) A1 ∃P.A2
P(f(x), x) ← A A ∃P−
P(f(x), x) ← A1(x),A2(f(x)) ← A1(x) A1 ∃P−
.A2
A(x) ← P(x, y) ∃P A
A2(x) ← P(x, y), A1(y) ∃P.A1 A2
A(x) ← P(y, x) ∃P−
A
A2(x) ← P(y, x), A1(y) ∃P−
.A1 A2
S(x, y) ← P(x, y) P S, P−
S−
S(x, y) ← P(y, x) P S−
, P S−
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 53 / 52
90. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
RESEARCH QUESTIONS
1. Can the information about mappings be useful for query rewriting in an
OBDA context?
2. Can the information from mappings be used in an efficient way?
3. How can we tune current algorithms for query rewriting in OBDA?
4. How can we measure the appropriateness of the solutions to the query
rewriting problem?
5. Can EBoxes be used in query rewriting?
6. What is the impact of the different EBoxes?
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 54 / 52
91. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
ASSUMPTIONS AND LIMITATIONS
A1. We can obtain an exhaustive list of the predicates in the ontology that are
mapped to some data source for a set of mappings.
A2. Predicates that are not mapped cannot be retrieved from the data source.
A3. EBoxes can be derived from schemata and data available in data sources.
A4. Set semantics: redundant answers have no impact on the correctness.
A5. Some predicates in the ontology may not be mapped to the data source.
L1. We restrict the input queries to UCQs and output to Datalog. Unfolding
of the output to UCQs will be attempted.
L2. The mapping-based reduction in time and size for rewriting depends on
how many ontology predicates are mapped.
L3. The impact of additional computations in query rewriting may be
negative in some cases, e.g. those specifically designed for that.
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 55 / 52
92. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
ASSUMPTIONS AND LIMITATIONS
A1. We can obtain an exhaustive list of the predicates in the ontology that are
mapped to some data source for a set of mappings.
A2. Predicates that are not mapped cannot be retrieved from the data source.
A3. EBoxes can be derived from schemata and data available in data sources.
A4. Set semantics: redundant answers have no impact on the correctness.
A5. Some predicates in the ontology may not be mapped to the data source.
L1. We restrict the input queries to UCQs and output to Datalog. Unfolding
of the output to UCQs will be attempted.
L2. The mapping-based reduction in time and size for rewriting depends on
how many ontology predicates are mapped.
L3. The impact of additional computations in query rewriting may be
negative in some cases, e.g. those specifically designed for that.
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 55 / 52
93. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
FULL LIST OF PUBLICATIONS I
1. Domenico Lembo, Jose Mora, Riccardo Rosati, Domenico Fabio Savo, and
Evgenij Thorstensen. Mapping analysis in ontology-based data access:
Algorithms and complexity. In ISWC, Bethlehem, Pennsylvania, USA,
2015
2. Ernesto Jimenez-Ruiz, Evgeny Kharlamov, Dmitriy Zheleznyakov,
Christoph Pinkel, Martin Skjæveland, Evgenij Thorstensen, Jose Mora,
and Ian Horrocks. Bootox: Bootstrapping owl 2 ontologies and r2rml
mappings from relational databases. In ISWC, Bethlehem, Pennsylvania,
USA, 2015
3. Jose Mora, Riccardo Rosati, and Oscar Corcho. kyrie2: Query rewriting
under extensional constraints in ELHIO. In ISWC, Trentino, Italy, 2014
4. Marco Console, Jose Mora, Riccardo Rosati, Valerio Santarelli, and
Domenico Fabio Savo. Effective computation of maximal sound
approximations of description logic ontologies. In ISWC, Trentino, Italy,
2014
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 56 / 52
94. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
FULL LIST OF PUBLICATIONS II
5. Domenico Lembo, Jose Mora, Riccardo Rosati, Domenico Fabio Savo, and
Evgenij Thorstensen. Towards mapping analysis in ontology-based data
access. In Web Reason- ing and Rule Systems, Athens, Greece, 2014
6. Jose Mora and Oscar Corcho. Towards a systematic benchmarking of
ontology-based query rewriting systems. In ISWC, Sydney, Australia,
2013
7. Jose Mora and Oscar Corcho. Engineering optimisations in query
rewriting for OBDA. In I-SEMANTICS ’13, Graz, Austria, 2013
8. Ghislain Atemezing, Oscar Corcho, Daniel Garijo, Jose Mora, Mar´ıa
Poveda-Villal´on, Pablo Rozas, Daniel Vila-Suero, and Boris
Villaz´on-Terrazas. Transforming meteorological data into linked data.
Semantic Web, 2012
9. Jose Mora, Jos´e ´Angel Ramos, and Guadalupe Aguado de Cea.
Enhancing the expressiveness of linguistic structures. In E-LKR’12),
Castell´on de la Plana, Spain, 2012
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 57 / 52
95. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
FULL LIST OF PUBLICATIONS III
10. B. Villaz´on-Terrazas, D. Vila-Suero, D. Garijo, L. M. Vilches-Bl´azquez, M.
Poveda- Villal´on, J. Mora, O. Corcho, and A. G´omez-P´erez. Publishing
linked data-there is no one-size-fits-all formula. European Data Forum,
2012
11. Alexander de Le´on, Victor Saquicela, Luis M. Vilches, Boris
Villaz´on-Terrazas, Freddy Priyatna, Oscar Corcho, Carlos Buil, Jose Mora,
and Jean-Paul Calbimonte. Geographical linked data: a spanish use case.
In I-SEMANTICS ’10, Graz, Austria, 2010
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 58 / 52
96. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
REFERENCES I
Diego Calvanese, Giuseppe De Giacomo, Domenico Lembo, Maurizio Lenzerini, and Riccardo
Rosati. Tractable Reasoning and Efficient Query Answering in Description Logics: The
DL-Lite Family. Journal of Automated Reasoning, 39(3):385–429, October 2007. doi:
10.1007/s10817-007-9078-x.
Alexandros Chortaras, Despoina Trivela, and Giorgos Stamou. Optimized Query Rewriting for
OWL 2 QL. In Automated Deduction – CADE-23, volume 6803, pages 192–206. Springer, 2011.
ISBN 978-3-642-22437-9.
Thomas Eiter, Magdalena Ortiz, Mantas Simkus, Trung-Kien Tran, and Guohui Xiao. Query
rewriting for Horn-SHIQ plus rules. In 26th AAAI Conference on Artificial Intelligence, 2012.
Georg Gottlob, Giorgio Orsi, and Andreas Pieris. Ontological Queries: Rewriting and
Optimization (Extended Version). arXiv:1112.0343, December 2011.
Martha Imprialou, Giorgos Stoilos, and Bernardo Cuenca Grau. Benchmarking ontology-based
query rewriting systems. In Proceedings of the Twenty-Sixth AAAI Conference on Artificial
Intelligence, AAAI, 2012.
Jose Mora and Oscar Corcho. Towards a systematic benchmarking of ontology-based query
rewriting systems. In The 12th International Semantic Web Conference (ISWC2013), pages
369–384, Sydney, Australia, 2013.
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 59 / 52
97. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
REFERENCES II
Jose Mora, Riccardo Rosati, and Oscar Corcho. kyrie2: Query Rewriting under Extensional
Constraints in ELHIO. In The 13th International Semantic Web Conference (ISWC2014), Trentino,
Italy, 2014.
H´ector P´erez-Urbina, Ian Horrocks, and Boris Motik. Efficient Query Answering for OWL 2. In
Lecture Notes in Computer Science, volume 5823, pages 489–504. Springer, 2009.
Mariano Rodr´ıguez-Muro and Diego Calvanese. Dependencies: Making ontology based data
access work in practice. Proc. of the 5th Alberto Mendelzon Int. Workshop on Foundations of Data
Management, 2011.
Riccardo Rosati. Prexto: Query Rewriting under Extensional Constraints in DL-Lite. In The
Semantic Web: Research and Applications, volume 7295 of Lecture Notes in Computer Science, pages
360–374. Springer Berlin / Heidelberg, 2012. ISBN 978-3-642-30283-1.
Riccardo Rosati and Alessandro Almatelli. Improving Query Answering over DL-Lite
Ontologies. In Proceedings of the Twelfth International Conference on the Principles of Knowledge
Representation and Reasoning. AAAI Press, 2010.
Tassos Venetis, Giorgos Stoilos, and Giorgos Stamou. Query rewriting under query extensions for
OWL 2 QL ontologies. In The 7th International Workshop on Scalable Semantic Web Knowledge Base
Systems (SSWS 2011), page 59, 2011.
Luis Manuel Vilches-Bl´azquez, Miguel ´Angel Bernab´e-Poveda, Mari Carmen Su´arez-Figueroa,
Asunci´on G´omez-P´erez, and Antonio F. Rodr´ıguez-Pascual. Towntology hydrOntology:
Relationship between Urban and Hydrographic Features in the Geographic Information
Domain. Ontologies for Urban Development, 61:73–84, 2007.
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 60 / 52
98. Introduction State of the Art Objectives Mappings Engineering EBox Conclusions References
QUERY REWRITING OPTIMISATION TECHNIQUES
FOR ONTOLOGY-BASED DATA ACCESS
Jos´e Mora L´opez
jmora@fi.upm.es
Supervisor: Oscar Corcho
ocorcho@fi.upm.es
Ontology Engineering Group
Facultad de Inform´atica, Universidad Polit´ecnica de Madrid
Madrid - December 17, 2015
jmora@fi.upm.es Optimising OBDA query rewriting Madrid - December 17, 2015 61 / 52