On the 21st of November 2013, Yannis Tzitzikas, FORTH, presented the Integrating heterogeneous and distributed information about marine species through a top level ontology paper at the 7th Metadata and Semantic Research Conference in Thessaloniki, Greece.
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 1Gianpaolo Coro
An e-Infrastructure is a distributed network of service nodes, residing on multiple sites and managed by one or more organizations. e-Infrastructures allow scientists residing at distant places to collaborate. They offer a multiplicity of facilities as-a-service, supporting data sharing and usage at different levels of abstraction, e.g. data transfer, data harmonization, data processing workflows etc. e-Infrastructures are gaining an important place in the field of biodiversity conservation. Their computational capabilities help scientists to reuse models, obtain results in shorter time and share these results with other colleagues. They are also used to access several and heterogeneous biodiversity catalogues.
In this course, the D4Science e-Infrastructure will be used to conduct experiments in the field of biodiversity conservation. D4Science hosts models and contributions by several international organizations involved in the biodiversity conservation field. The course will give students an overview of the models, the practices and the methods that large international organizations like FAO and UNESCO apply by means of D4Science. At the same time, the course will introduce students to the basic concepts under e-Infrastructures, Virtual Research Environments, data sharing and experiments reproducibility.
Florida State University (FSU) entered into a formal digital preservation strategy agreement with Florida Digital Archive (FDA) in 2009. However, prior to joining FDA, FSU requested permission from FDA to develop a plan to preserve a faculty member's research data. FDA agreed to allow the development of a FSU demo preservation of FSU Biological Scientist, Dr. A.K.S.K. Prasad, images of biological silica collection which was later presented in several national and international conference presentations.
This talk will include oral history and a presentation detailing the development of FSU utilizing locally developed preservation strategy of DAITSS, known as Dark Archive in the Sunshine State, starting with demo preservation of faculty research data which was later used to influence senior management to join FDA.
Transforming repositories: from repository managers to institutional data man...JISC KeepIt project
The last decade has seen support for digital preservation transformed. There are now a multitude of organisations, training courses, and software development tools to help guide managers of digital data towards preservation decisions and solutions. But how well do these approaches understand the needs and requirements of users? This presentation was given at ECA 2010, a conference for digital archiving professionals. But not everyone can be a digital archiving specialist. At a time of exploding volumes of digital content, especially on the Web, many non-specialists need help in preserving digital content. The presentation looks at the applicability and practicality of all this support for one class of user, digital repositories, and in particular institutional repositories (IRs) and their managers. We report on a course on digital preservation tools, designed by repository managers as part of the JISC KeepIt project. Positive feedback from the evaluations of this course have show that the emergence of the tools used in this course is a great story for digital preservation.
Knowledge Organisation Systems in ETDs: A Comparative StudyBhojaraju Gunjal
Gunjal, Bhojaraju & Urs, Shalini R (2010): Knowledge Organisation Systems in ETDs: A Comparative Study; In Proceedings of 13th International Symposium on Electronic Theses and Dissertations (ETD 2010); June 16 -18, 2010, Austin, TX, USA. https://conferences.tdl.org/utlibraries/etd2010/paper/view/28
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 1Gianpaolo Coro
An e-Infrastructure is a distributed network of service nodes, residing on multiple sites and managed by one or more organizations. e-Infrastructures allow scientists residing at distant places to collaborate. They offer a multiplicity of facilities as-a-service, supporting data sharing and usage at different levels of abstraction, e.g. data transfer, data harmonization, data processing workflows etc. e-Infrastructures are gaining an important place in the field of biodiversity conservation. Their computational capabilities help scientists to reuse models, obtain results in shorter time and share these results with other colleagues. They are also used to access several and heterogeneous biodiversity catalogues.
In this course, the D4Science e-Infrastructure will be used to conduct experiments in the field of biodiversity conservation. D4Science hosts models and contributions by several international organizations involved in the biodiversity conservation field. The course will give students an overview of the models, the practices and the methods that large international organizations like FAO and UNESCO apply by means of D4Science. At the same time, the course will introduce students to the basic concepts under e-Infrastructures, Virtual Research Environments, data sharing and experiments reproducibility.
Florida State University (FSU) entered into a formal digital preservation strategy agreement with Florida Digital Archive (FDA) in 2009. However, prior to joining FDA, FSU requested permission from FDA to develop a plan to preserve a faculty member's research data. FDA agreed to allow the development of a FSU demo preservation of FSU Biological Scientist, Dr. A.K.S.K. Prasad, images of biological silica collection which was later presented in several national and international conference presentations.
This talk will include oral history and a presentation detailing the development of FSU utilizing locally developed preservation strategy of DAITSS, known as Dark Archive in the Sunshine State, starting with demo preservation of faculty research data which was later used to influence senior management to join FDA.
Transforming repositories: from repository managers to institutional data man...JISC KeepIt project
The last decade has seen support for digital preservation transformed. There are now a multitude of organisations, training courses, and software development tools to help guide managers of digital data towards preservation decisions and solutions. But how well do these approaches understand the needs and requirements of users? This presentation was given at ECA 2010, a conference for digital archiving professionals. But not everyone can be a digital archiving specialist. At a time of exploding volumes of digital content, especially on the Web, many non-specialists need help in preserving digital content. The presentation looks at the applicability and practicality of all this support for one class of user, digital repositories, and in particular institutional repositories (IRs) and their managers. We report on a course on digital preservation tools, designed by repository managers as part of the JISC KeepIt project. Positive feedback from the evaluations of this course have show that the emergence of the tools used in this course is a great story for digital preservation.
Knowledge Organisation Systems in ETDs: A Comparative StudyBhojaraju Gunjal
Gunjal, Bhojaraju & Urs, Shalini R (2010): Knowledge Organisation Systems in ETDs: A Comparative Study; In Proceedings of 13th International Symposium on Electronic Theses and Dissertations (ETD 2010); June 16 -18, 2010, Austin, TX, USA. https://conferences.tdl.org/utlibraries/etd2010/paper/view/28
Knowledge Organization System (KOS) for biodiversity information resources, G...Dag Endresen
Slides from a presentation on the Knowledge Organization System (KOS) work program for GBIF. KOS developments for biodiversity information resources and input to the emerging Vocabulary Management Task Group (VoMaG).
Links
GBIF KOS prototype tools, http://kos.gbif.org/
Tool: Semantic Wiki prototype, http://terms.gbif.org/wiki/
Tool: ISOcat prototype demo, http://kos.gbif.org/isocat/
GBIF concept vocabulary term browser, http://kos.gbif.org/termbrowser/
GBIF Resources Repository, http://rs.gbif.org/terms/
GBIF Vocabulary Server, http://vocabularies.gbif.org/
GBIF Resources Browser, http://tools.gbif.org/resource-browser/
The iMarine initiative provides a data infrastructure aimed at facilitating open access, the sharing of data, collaborative analysis, processing and mining processing, as well as the dissemination of newly generated knowledge. The iMarine data infrastructure is developed to support decision making in high-level challenges that require policy decisions typical of the ecosystem approach. The iMarine offering can be articulated in six bundles. A “bundle” is a set of services and technologies grouped according to a family of related tasks for achieving a common objective. Bundles can be customized and/or enriched into flexible, purpose-built Virtual Research Environments (VRE). Virtual research environments offer flexible and secure web-based, community-centric platforms, so researchers can work together on common challenges. Each VRE in the infrastructure is tightly integrated with the underlying gCube enabling software, and can access and re-purpose data from other iMarine applications.
Marine Knowledge Meeting, 11-12 Oct 2012, Brussels: All About iMarine iMarine283644
iMarine is empowering users in the marine community and beyond by providing a highly efficient e-Infrastructure to accelerate data discovery, exchange, and analysis, tools and platforms that facilitates scientific discovery. Funded by the European Commission's 7th Framework Programme, a number of iMarine services are already available through the iMarine Gateway supplying cross disciplinary data supporting experts in the field.
A step into the future of iMarine: The iMarine Public-centred Partnership Bus...iMarine283644
Presentation by Marc Taconet - FAO-FI, Chief Fisheries Statistics and Information Branch (FIPS) & iMarine Board Chair, Patricio Bernal - IUCN High Seas Initiatives and Hervé Camount - Terradue, Program Manager on the sustainability plan of the iMarine initiative
This is the device Klassmate which I have developed for classroom teaching. Which costs 70000 INR and is portable interactive whiteboard. with Microsoft Kinect sensor.
Text (personal views position statement) to accompany presentation on what research infrastructures really need for data, XLDB-Europe, 8-10th June 2011, Edinburgh
As the volume and complexity of data from myriad Earth Observing platforms, both remote sensing and in-situ increases so does the demand for access to both data and information products from these data. The audience no longer is restricted to an investigator team with specialist science credentials. Non-specialist users from scientists from other disciplines, science-literate public, to teachers, to the general public and decision makers want access. What prevents them from this access to resources? It is the very complexity and specialist developed data formats, data set organizations and specialist terminology. What can be done in response? We must shift the burden from the user to the data provider. To achieve this our developed data infrastructures are likely to need greater degrees of internal code and data structure complexity to achieve (relatively) simpler end-user complexity. Evidence from numerous technical and consumer markets supports this scenario. We will cover the elements of modern data environments, what the new use cases are and how we can respond to them.
Knowledge Organization System (KOS) for biodiversity information resources, G...Dag Endresen
Slides from a presentation on the Knowledge Organization System (KOS) work program for GBIF. KOS developments for biodiversity information resources and input to the emerging Vocabulary Management Task Group (VoMaG).
Links
GBIF KOS prototype tools, http://kos.gbif.org/
Tool: Semantic Wiki prototype, http://terms.gbif.org/wiki/
Tool: ISOcat prototype demo, http://kos.gbif.org/isocat/
GBIF concept vocabulary term browser, http://kos.gbif.org/termbrowser/
GBIF Resources Repository, http://rs.gbif.org/terms/
GBIF Vocabulary Server, http://vocabularies.gbif.org/
GBIF Resources Browser, http://tools.gbif.org/resource-browser/
The iMarine initiative provides a data infrastructure aimed at facilitating open access, the sharing of data, collaborative analysis, processing and mining processing, as well as the dissemination of newly generated knowledge. The iMarine data infrastructure is developed to support decision making in high-level challenges that require policy decisions typical of the ecosystem approach. The iMarine offering can be articulated in six bundles. A “bundle” is a set of services and technologies grouped according to a family of related tasks for achieving a common objective. Bundles can be customized and/or enriched into flexible, purpose-built Virtual Research Environments (VRE). Virtual research environments offer flexible and secure web-based, community-centric platforms, so researchers can work together on common challenges. Each VRE in the infrastructure is tightly integrated with the underlying gCube enabling software, and can access and re-purpose data from other iMarine applications.
Marine Knowledge Meeting, 11-12 Oct 2012, Brussels: All About iMarine iMarine283644
iMarine is empowering users in the marine community and beyond by providing a highly efficient e-Infrastructure to accelerate data discovery, exchange, and analysis, tools and platforms that facilitates scientific discovery. Funded by the European Commission's 7th Framework Programme, a number of iMarine services are already available through the iMarine Gateway supplying cross disciplinary data supporting experts in the field.
A step into the future of iMarine: The iMarine Public-centred Partnership Bus...iMarine283644
Presentation by Marc Taconet - FAO-FI, Chief Fisheries Statistics and Information Branch (FIPS) & iMarine Board Chair, Patricio Bernal - IUCN High Seas Initiatives and Hervé Camount - Terradue, Program Manager on the sustainability plan of the iMarine initiative
This is the device Klassmate which I have developed for classroom teaching. Which costs 70000 INR and is portable interactive whiteboard. with Microsoft Kinect sensor.
Text (personal views position statement) to accompany presentation on what research infrastructures really need for data, XLDB-Europe, 8-10th June 2011, Edinburgh
As the volume and complexity of data from myriad Earth Observing platforms, both remote sensing and in-situ increases so does the demand for access to both data and information products from these data. The audience no longer is restricted to an investigator team with specialist science credentials. Non-specialist users from scientists from other disciplines, science-literate public, to teachers, to the general public and decision makers want access. What prevents them from this access to resources? It is the very complexity and specialist developed data formats, data set organizations and specialist terminology. What can be done in response? We must shift the burden from the user to the data provider. To achieve this our developed data infrastructures are likely to need greater degrees of internal code and data structure complexity to achieve (relatively) simpler end-user complexity. Evidence from numerous technical and consumer markets supports this scenario. We will cover the elements of modern data environments, what the new use cases are and how we can respond to them.
Presented by Tony Mathys at a Current Issues and Applications of the Geospatial Technologies Lecture, Department of Geography and Environment, Aberdeen University, 24 February 2012
AUTHENTICITY AND OAIS.THE CASPAR MODEL AND THE INTERPARES PRINCIPLES & OUTPUTSDigitalPreservationEurope
The presentation was given at the Delos Summer School (Tirrenia, June 2008). It illustrates the research activity related to the CASPAR project with specific reference to the authenticity and the need and the possibility to ensure continuity among international projects and their results. The first part is dedicated to define the concepts and the critical issues by taking into account the outputs of InterPARES and the basic framework of OAIS. Both the aspects have been included in the position paper on authenticity defined within the CASPAR project. The second part concerns the effort (under development as part of the Caspar framework for digital preservation) to build management tools fo ensure the verification and documentation of the digital objects authenticity.
Reference Model for an Open Archival Information Systems (OAIS): Overview and...faflrt
ALA/FAFLRT Workshop on Open Archival Information Service (OAIS). Presented by Alan Wood/A.E.Wood & Erickson/Lockheed Martin, Don Sawyer/NASA/GSFC, and Lou Reich/CSC. Sponsored by ALA Federal and Armed Forces Libraries Roundtable (FAFLRT). Presented on June 16, 2001 at the ALA Annual Conference.
Preprint-WCMRI,IFERP,Singapore,28 October 2022.pdfChristo Ananth
Call for Papers- Special Session: World Conference on Multidisciplinary Research and Innovation (WCMRI-22), (Session 1: Information and Communication Technology), Singapore
Christo Ananth
Professor, Samarkand State University, Uzbekistan
Supporting the research lifecycle of geo-GSNL initiative through HPC and Rese...Raul Palma
Volcanic eruptions are among the most spectacular and dangerous phenomena on Earth, capable of generating disasters at various scales. The Geohazard Supersites and Natural Laboratories initiatives (GSNL) today is a network of 11 Supersites, including volcanoes and seismic areas. Complex algorithms are used to analyse these data and important information on the volcano activity. In addition to computing power and resources, researchers from the geo-gnsl community, as many other data-intensive science communities, are calling for innovative ways to manage their data, methods and other resources, which can enhance the visibility of scientific breakthroughs, encourage reuse, and foster a broader research accessibility. In this contribution we present the results of EVER-EST project (H2020-EINFRA-2015-1), in which we created in collaboration with different partners a virtual research environment (VRE) for Earth Science (https://vre.ever-est.eu/), embracing the research object concept and technologies at its core.
Similar to Integrating Heterogeneous and Distributed Information about Marine Species through a Top Level Ontology (20)
iMarine data e-infrastructure: Data access, harmonization, analysis, and mana...iMarine283644
On the 22 July 2014, OpenChannels.org and the EBM Tools Network, two of the premier sources of information about coastal and marine planning and management tools in the United States and internationally, hosted the iMarine webinar: iMarine Data e-Infrastructure Initiative for Fisheries Management and Conservation of Marine Living Resources.
The webinar focused on the presentation of the iMarine initiative and its powerful data e-infrastructures and services, followed by a presentation of a set of use cases related to Geospatial Analysis, Ecology, Biodiversity and Life History Traits. The presentations were given by Pasquale Pagano, CNR-ISTI and iMarine Technical Director and Gianpaolo Coro, CNR-ISTI. Watch the video of the webinar here https://www.youtube.com/watch?v=lgf30BPyBbk
In computational statistics, algorithms often have specialized implementations that address very specific problems. Every so often, these algorithms are applicable also to other problems than the original ones. Today, interest is growing towards modular and pluggable solutions that enable the repetition and validation of the experiments made by other scientists and allow the exploitation of those algorithms in other contexts. Furthermore, such procedures are requested to be remotely hosted and to “hide” the complexity of the calculations, managed by remote computational infrastructures behind the scenes. For such reasons, the usual solution of supplying modular software libraries containing implementations of algorithms is leaving the place to Web Services accessible through standard protocols and hosting such implementations. The protocols describing the computational capabilities of these Services are more and more elaborate, so that modular workflows can rely on them.
Part 1 - What is a data e-infrastructure?
Part 2 - Serving policy frameworks facing BIG challenges
Part 3 - The power of an e-Infrastructure - Synergies and efficiencies through Global collaboration communities
Operation “Blue Star” is the only event in the history of Independent India where the state went into war with its own people. Even after about 40 years it is not clear if it was culmination of states anger over people of the region, a political game of power or start of dictatorial chapter in the democratic setup.
The people of Punjab felt alienated from main stream due to denial of their just demands during a long democratic struggle since independence. As it happen all over the word, it led to militant struggle with great loss of lives of military, police and civilian personnel. Killing of Indira Gandhi and massacre of innocent Sikhs in Delhi and other India cities was also associated with this movement.
How to Make a Field invisible in Odoo 17Celine George
It is possible to hide or invisible some fields in odoo. Commonly using “invisible” attribute in the field definition to invisible the fields. This slide will show how to make a field invisible in odoo 17.
Instructions for Submissions thorugh G- Classroom.pptxJheel Barad
This presentation provides a briefing on how to upload submissions and documents in Google Classroom. It was prepared as part of an orientation for new Sainik School in-service teacher trainees. As a training officer, my goal is to ensure that you are comfortable and proficient with this essential tool for managing assignments and fostering student engagement.
Ethnobotany and Ethnopharmacology:
Ethnobotany in herbal drug evaluation,
Impact of Ethnobotany in traditional medicine,
New development in herbals,
Bio-prospecting tools for drug discovery,
Role of Ethnopharmacology in drug evaluation,
Reverse Pharmacology.
Unit 8 - Information and Communication Technology (Paper I).pdfThiyagu K
This slides describes the basic concepts of ICT, basics of Email, Emerging Technology and Digital Initiatives in Education. This presentations aligns with the UGC Paper I syllabus.
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxEduSkills OECD
Andreas Schleicher presents at the OECD webinar ‘Digital devices in schools: detrimental distraction or secret to success?’ on 27 May 2024. The presentation was based on findings from PISA 2022 results and the webinar helped launch the PISA in Focus ‘Managing screen time: How to protect and equip students against distraction’ https://www.oecd-ilibrary.org/education/managing-screen-time_7c225af4-en and the OECD Education Policy Perspective ‘Students, digital devices and success’ can be found here - https://oe.cd/il/5yV
We all have good and bad thoughts from time to time and situation to situation. We are bombarded daily with spiraling thoughts(both negative and positive) creating all-consuming feel , making us difficult to manage with associated suffering. Good thoughts are like our Mob Signal (Positive thought) amidst noise(negative thought) in the atmosphere. Negative thoughts like noise outweigh positive thoughts. These thoughts often create unwanted confusion, trouble, stress and frustration in our mind as well as chaos in our physical world. Negative thoughts are also known as “distorted thinking”.
Integrating Heterogeneous and Distributed Information about Marine Species through a Top Level Ontology
1. Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
1
Integrating
Heterogeneous and Distributed Information
about Marine Species
through a Top Level Ontology
Y. Tzitzikas 1,2 , C. Alloca 1 , C. Bekiari 1 , Y. Marketakis 1 , P. Fafalios 1,2 ,
M. Doerr 1 , N. Minadakis 1 , T.Patkos 1 , L. Candela 3
1 Institute of Computer Science, FORTH-ICS
2 Computer Science Department, University of Crete, GREECE
3 Consiglio Nazionale delle Ricerche, CNR-ISTI, Pisa, Italy
7th Metadata and Semantics Research Conference (MTSR), Thessaloniki, Nov 19-22, 2013
1
2. Outline
• Context, Problem, Objectives
• Main Approaches for Integration
• The Followed Approach
– The Ontology MarineTLO
• Objectives, Benefits, Architecture
– The MarineTLO-based Warehouse
• Exploitation Scenarios
• Concluding Remarks
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
3
Context: iMarine
Id: It is an FP7 Research Infrastructure Project (2011-2014)
Final goal: launch an initiative aimed at establishing and operating an einfrastructure supporting the principles of the Ecosystem Approach to fisheries
management and conservation of marine living resources.
Partners:
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
4
2
3. Problem and objectives
The Problem
• There are several sources of the marine domain, but each of
them stores complementary information structured according
to its needs.
Our objective
• Harmonize and integrate (link, connect) information of the
marine domain
– Specific motivating scenario and use cases will be given at the end
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
5
Marine Information: in several sources
WoRMS: World Register of Marine Species
Registers more than 200K species
ECOSCOPE- A Knowledge Base About Marine
Ecosystems (IRD, France)
FLOD (Fisheries Linked Data) of
Food and Agriculture Organization (FAO) of the United
Nations
FishBase: Probably the largest and most extensively
accessed online database
of fish species.
DBpedia
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
6
3
4. Marine Information:
in several sources
Storing
Taxonomic information
complementary
information
Ecosystem information (e.g. which fish eats which fish)
Commercial codes
General information, occurrence data, including
information from other sources
General information, figures
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
Marine Information:
in several sources
7
Using and accessed through
different technologies
Web services (SOAP/WSDL)
RDF + OWL files
SPARQL Endpoint
Relational Database
SPARQL Endpoint
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
8
4
5. Main approaches for Integration
In general there are two main approaches for integration
Warehouse approach (materialized integration)
• Design Phase: The underlying sources (and their parts) have to be selected
• Creation Phase: Process for getting and creating the warehouse
• Maintenance Phase: Ability to create the warehouse from scratch, and/or ability
to update parts of it
• Mappings are exploited to extract information from data sources, to transform it
to the target model and then to store it at the central repository
Mediator approach (virtual integration)
• The mediator receives a query formulated in terms of the unified model/schema.
The mappings are used to enable query translation. The derived sub-queries are
sent to the wrappers of the individual sources, which transform them into
queries over the underlying sources. The results of these sub-queries are sent
back to the mediator where they are assembled to form the final answer
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
9
Main approaches for integration (cont.)
Warehouse
•
•
•
•
•
Mediator
•
Benefit: Flexibility in transformation
logic (including ability to curate and fix
problems)
Benefit: Decoupling of the release
management of the integrated resource
•
from the management cycles of the
underlying sources
Benefit: Decoupling of access load from
the underlying sources.
Benefit: Faster responses (in query
answering but also in other tasks, e.g. if
one wants to use it for applying an entity
matching technique).
Benefit: One advantage (but in some
cases disadvantage) of virtual
integration is the real-time
reflection of source updates in
integrated access
Comment: The higher complexity of
the system (and the quality of
service demands on the sources) is
only justified if immediate access to
updates is indeed required.
Shortcomings You have to pay the cost for
hosting the warehouse. You have to refresh
periodically the warehouse
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
10
5
6. Main approaches for integration (cont.)
In both cases we need a unified model/schema
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
11
The ontology MarineTLO
(Marine Top Level Ontology)
6
7. MarineTLO: Objectives
• MarineTLO aims at being a global core model that
– provides a common, agreed-upon and understanding of the concepts
and relationships holding in the marine domain to enable knowledge
sharing, information exchanging and integration between
heterogeneous sources
– covers with suitable abstractions the marine domain to enable the
most fundamental queries,
– can be extended to any level of detail on demand, and
– allows data originating from distinct sources to be adequately mapped
and integrated
• MarineTLO is not supposed to be the single ontology covering
the entirety of what exists
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
13
MarineTLO: Benefits from a Top-Level Ontology
• The adoption of a global core model has various
benefits:
– reduced effort for improving and evolving
• the focus is given on one model, rather than many (the results are
beneficial for the entire community
– reduced effort for constructing mappings
• this approach avoids the inevitable combinatorial explosion and
complexities that results from pair-wise mappings between
individual metadata formats and/or ontologies
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
14
7
8. MarineTLO: Key Design Principles
• Formulation
– It is an object-oriented semantic model, expressed to a form
comprehensible to both documentation experts and information
scientists while readily can be converted to machine-readable formats
such as RDF Schema, OWL, etc
• Metaclasses
– certain types of inference about classes is supported in an analogous
way as classes support certain types of inference about instances
• Monotonicity
– It aims to be monotonic in the sense of Domain Theory: the existing
constructs and the deductions made from them should remain valid and
well-formed, even as new constructs are added to the MarinTLO
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
15
MarineTLO: Query capabilities
It allows formulating complex queries, e.g.:
1.Given the scientific name of a species, find its predators with
the related taxon-rank classification and with the different
codes that the organizations use to refer to them.
2. Given the scientific name of a species, find the ecosystems,
waterareas and countries that this species is native to, and
the common names that are used for this species in each of
the countries
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
16
8
9. The notion of competence queries as driver
#Query For a scientific name of a species (e.g. Thunnus Albacares or Poromitra Crassiceps),
find/give me
Q1
the biological environments (e.g. ecosystems) in which the species has been introduced and more
general descriptive information of it (such as the country)
Q2
its common names and their complementary info (e.g. languages and countries where they are
used)
Q3
Q4
Q5
Q6
the water areas and their FAO codes in which the species is native
the countries in which the species lives
the water areas and the FAO portioning code associated with a country
the presentation w.r.t Country, Ecosystem, Water Area and Exclusive Economical Zone (of the
water area)
Q7
the projection w.r.t. Ecosystem and Competitor, providing for each competitor the identification
information (e.g. several codes provided by different organizations)
Q8
a map w.r.t. Country and Predator, providing for each predator both the identification information
and the biological classification
Q9
who discovered it, in which year, the biological classification, the identification information, the
common names - providing for each common name the language, the countries where it is used
in.
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
17
MarineTLO as Product
• The “full” version of MarineTLO (Version3.0.0)
– aims at covering any part of the marine domain
– contains 70 classes and 41 properties
• The “operational” version, for the needs of
iMarine(Version 3.0.0)
– used for building MarineTLO Warehouse (Version 3.0.0)
– contains 92 classes and 41 properties
– applied for integrating data mainly from FLOD, ECOSCOPE, part of
WoRMS and FISHBASE sources
• URL: www.ics.forth.gr/isl/MarineTLO
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
18
9
10. Class Level (excerpt)
S-Class Level (Version 3.0.0)
Temporal
Phenomenon
Country Code
Assignment
Country
Ecosystem
Code
Assignment
Ecosystem
Event
Human Activity
Exclusive
Economic Zone
TLO Entity
Scientific Name
Assignment
Attribute
Assignment
Common Name
Assignment
Physical Man Made Thing
Man Made Object
Water Area Code
Assignment
Man Made Thing
Actor
Persistent Item
Conceptual Object
Vessel
Codification System
Identifier
EEZCode
Physical Thing
Area
FAOGearTypeIdentifier
Sub Area
Water
Area
FAOVesselTypeIdentifier
Division
Sub Division
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
FORTH, i-Marine, Ostend, January 2013
19
19
Meta Class Level (excerpt)
Meta Class Level (Version 3.0.0)
Marine
Ecosystem Type
Temporal
Phenomen
on Type
Ecosystem
Type
Event
Type
Human Activity
Type
Attribute
Assignment
Type
TLO Entity
Type
Digital Object type
Actor Type
Persistent Item
Type
Identifier Type
Conceptual Object
Type
Gear Type
Physical
Thing Type
Equipment
Type
Biotic Element
Type
ECOSCOPE
Marine
Animal Type
FLOD Marine
Animal Type
WoRMS Marine
Animal Type
Vessel Type
Marine Animal
Type
Fish Base
Marine
Animal Type
DBpedia Marine
Animal Type
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
FORTH, i-Marine, Ostend, January 2013
20
20
10
11. Example 1: ThunnusAlbacares
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
21
Example 2: Scientific name assignment
Event
assignedDate
assignedIdentifier
relatedAuthorshipAssigment
relatedIdentifierAssigment
Attribute Assignment
PersistentItem
assignedName
MarineSpecies
relatedIdentifierAssigment
Thunnus_alba
cares
relatedAuthorshipAssigment
blank_node_Thu
nnus_albacares
assignedDate
name
reference
Actor
Scientific Name
Assignment
blank_node_Bo
nnaterre
assignedName
Yannis Tzitzikas et al., MTSR 2013,
“1788” “Thunnus Albacares”
Thessaloniki
name
“Bonnaterre”
22
11
12. Example 3: Species Establishment
isAssociatedWith
isAssocitedWith
Ecosystem
Country
Water Area
usualluIsBioticElementOf
usualluIsBioticElementOf
native
Introduced
Endemic
native
Introduced
Endemic
usualluIsBioticElementOf
native
Introduced
Endemic
Marine Species
Poromitra
crassiceps
isAssocitedWith
Antarctic
isAssocitedWith
Elephant I
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
Atlantic
Antarctic
23
Exploiting MarineTLO
12
13. Ways to use/exploit MarineTLO
1. For constructing semantic warehouses which:
– can answer queries which cannot be answered by the underlying
sources individually
– can aid the construction of mappings between instances
– can be exploited for various other task
• We shall see how they are exploited in the context of semantic postprocessing of search results
2. Various other uses
–
–
For publishing Linked Data
For mashing up facts
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
Publishing
Linked Data,
Mashups
Constructing
Warehouses offering
Complex query
answering
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
25
For semanticpost processing
of search results
26
13
14. The MarineTLO-based
Warehouse
MarineTLO
Warehouse
Warehouse construction and evolution process
Define requirements in terms
of competence queries
produces
Queries
Fetch the data from the selected sources
(SPARQL endpoints, services, etc)
Triples
Transform and Ingest to the Warehouse
Apply the rules to the warehouse
MaTWare
creates
Inspect the connectivity of the Warehouse
Formulate rules creating sameAs
relationships
uses
uses
produces Rules for
Instance
Matching
uses
sameAs triples
MaTWare
Ingest the sameAs relationships to the warehouse
Warehouse
Test and evaluate the Warehouse (using
competence queries)
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
MaTWare
28
14
15. The MarineTLO-based warehouse’s contents: used sources
RDF
Triple Store
MarineTLO
FLOD-to-TLO
mapping
ECOSCOPE-to-TLO
mapping
DBpedia-to-TLO
mapping
FishBase-to-TLO
mapping
ECOSCOPE
FLOD
WoRMS-to-TLO
mapping
WoRMS
(part of)
DBpedia
(part of)
FishBase
(part of)
Replicate
Replicate
Replicate
Replicate
Replicate
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
29
The MarineTLO-based warehouse’s contents: in numbers
• Now contains information about 37,000 distinct marine
species (including Fishbase). Number of triples: 2,970,058
Source
DBpedia
14,291
FLOD
Common Species (size of intersections)
Species
Number
10,849
WoRMS
1124
Ecoscope
277
FishBase
31,277
FLOD
DBpedia
FLOD
3,046
WoRMS
Ecoscope
Fishbase
731
56
9833
768
73
6141
53
1288
WoRMS
Ecoscope
Yannis Tzitzikas et al., MTSR 2013,
iMarine 2nd Review, September
Thessaloniki
2013,Brussels
53
30
15
16. The MarineTLO-based warehouse’s contents: concepts
Concepts
Ecoscope
FLOD
WoRMS DBpedia Fishbase
Species
Scientific Names
Authorships
Common Names
Predators
Ecosystems
Countries
Water Areas
Vessels
Gears
EEZ
Yannis Tzitzikas et al., MTSR 2013,
iMarine 2nd Review, September
Thessaloniki
2013,Brussels
31
Exploiting the
MarineTLO-based Warehouse
for
Semantic Post-Processing of Search Results
16
17. For Semantic Post-Processing: The process
web
browsing
contents
query
terms
(top-L) results
(+ metadata)
Entity
Mining
MarineTLO
Warehouse
entities / contents
Visualization/Interaction
(faceted search, entity
exploration, annotation,
top-k graphs, etc.)
semantic
data
Semantic
Analysis
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
• Grouping,
• Ranking
• Retrieving more
properties
33
XSearch-Portlet Screenshot
The
Warehou
se is used
The
Warehou
se is used
Search
Results
Result of
Entity
Mining
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
Result of
textual
clustering
34
17
18. Example of an
EntityCard
of Xsearch (if the entity’s
type is Species)
The
Warehou
se is used
From DBpedia
From FLOD
From Ecoscope
From WoRMS
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
XSearch as a bookmarklet
The
Warehou
se is used
Annotating entities over the original page
Entity
exploration
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
36
18
19. Concluding Remarks
Concluding Remarks
•
To tackle the need for having integrated sets of facts about marine species,
and thus to assist research about species and biodiversity, we have described
a top level ontology for that domain.
– It provides a unified and coherent core model for schema mapping which enables
formulating and answering queries which cannot be answered by any individual
source.
•
•
We detailed the process of constructing MarineTLO-based warehouses. The
current warehouse contains information about more than 37K marine
species
We have identified and described particular use cases and applications that
exploit this ontology and it warehouse.
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
38
19
20. Future Work and Research
• Next steps
– Finalize and make accessible the next release of the
warehouse (in 2013)
• Current and Future Research
– Focus on quality/connectivity issues
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
39
Links
• MarineTLO
• http://www.ics.forth.gr/isl/MarineTLO/
• TripleStores
– MarineTLO-Warehouse: http://virtuoso.i-marine.d4science.org:8890/sparql
– also browsable through http://virtuoso.i-marine.d4science.org:8890/fct
• Systems
– X-Search and gCube Search
• Portlet: https://i-marine.d4science.org/ (in various VREs, e.g. FCPPS ,
iSearch)
• Web Applications:
– http://62.217.127.118/x-search/ (over Bing and MarineTLO-Warehouse)
– http://62.217.127.118/x-search-fao/ (over ECOSCOPE and MarineTLO-Warehouse)
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
40
20
21. Thank you for your attention
Visit and send us feedback:
www.ics.forth.gr/isl/MarineTLO
Yannis Tzitzikas et al., MTSR 2013,
Thessaloniki
41
21