Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Technologies and practices for maintaining and publishing earth science vocabularies
1. Simon J D Cox, Jonathan Yu, Megan Williams, Fabrizio Giabardo, Dominic Lowe
16 April 2015
LAND AND WATER FLAGSHIP
Technologies and practices for maintaining
and publishing earth science vocabularies
2. Are these the same?
Publishing earth science vocabularies | Cox, Yu, Williams, Giabardo, Lowe
“nitrogen”
“dissolved nitrogen”
“Total nitrogen, water, filtered, milligrams per liter”
“Concentration of nitrogen (total) per unit volume of the water body [dissolved
plus reactive particulate phase] by oxidation and colorimetric autoanalysis“
“Concentration of nitrogen (total) per unit mass of the water body [dissolved plus
reactive particulate <GF/F phase] by filtration and high temperature Pt catalytic
oxidation”
“Concentration (moles or mass) of total nitrogen (i.e. nitrogen in all chemical
forms) in suspended particulate material per unit volume of the water column.”
“Concentration of nitrogen (total) {'PON'} per unit volume of the water body
[particulate 2-10um phase] by filtration, acidification and elemental analysis”
“Dissolved total and organic nitrogen concentrations in the water column”
2 |
6. W3C Data Cube ontology
Publishing earth science vocabularies | Cox, Yu, Williams, Giabardo, Lowe6 |
Each axis or variable
specified as a skos:Concept
Values of coded-properties
selected from a
skos:ConceptScheme
Homogeneous observations,
common structure definition
The RDF Data Cube Vocabulary, Cyganiak & Reynolds, W3C Recommendation 2014
18. RDFS
Semantic web dead long live semantic web | Simon Cox18 |
GeochronEra
TemporalReference
System
component
member
skos:ConceptScheme
skos:Concept
skos:hasTopConcept skos:narrowersubClassOf
subClassOf
subPropertyOf
subPropertyOf
domain
domain
domain
range
range
range
domain range
19. Inferencing
• Entailments and reasoning
• What does this combination of axioms
imply?
• Is there anything unexpected?
Phanerozoic
Cenozoic
Neogene
Stratigraphic
Chart
GeochronEra
TemporalReference
System
type
type
type
type
component member
member
hasTopConcept narrower
narrowernarrowerTransitive
Concept
ConceptScheme
broaderTransitive
Semantic web dead long live semantic web | Simon Cox19 |
20. Formalization and encoding process
Create order within
existing excel
spreadsheets
Every layout is
different
Publishing earth science vocabularies | Cox, Yu, Williams, Giabardo, Lowe20 |
21. Formalization and encoding process
RDF 123
Every mapping
is different
Publishing earth science vocabularies | Cox, Yu, Williams, Giabardo, Lowe21 |
22. Formalization and encoding process
Turtle,
in text editor …
Publishing earth science vocabularies | Cox, Yu, Williams, Giabardo, Lowe22 |
25. • Physical documents, PDF
• Tables on web pages
• Bespoke XML documents
• RDF documents, OWL documents
• Web services
• RESTful web resources, Linked data
Vocabulary services | Cox & Yu
Delivery
26. Publish as linked data
URI = web-scale foreign-key
Publishing earth science vocabularies | Cox, Yu, Williams, Giabardo, Lowe26 |
27. Linked vocabularies can be shared and re-used
Publishing earth science vocabularies | Cox, Yu, Williams, Giabardo, Lowe27 |
32. Governance issues
What is the best way to re-use existing content already published as
linked data?
Do we fix it for them? Do we re-claim it?
Vocabulary deployment and governance | Cox32 |
33. Modeling flaws
GCMD science keywords
• Same textual definition, same label
• Different parent, different URI
– are they the same concept?
Vocabulary deployment and governance | Cox33 |
34. Re-base the URI?
<http://registry.it.csiro.au/def/kwa/gcmd/ABRASION>
a skos:Concept ;
rdfs:label "ABRASION" ;
dct:description "Mechanical scraping of a rock surface by friction between
rocks and moving particles."@en ;
owl:sameAs
<http://gcmdservices.gsfc.nasa.gov/kms/concept/8f57f4b0-5177-4362-81e8-ced75d37d1aa> ,
<http://gcmdservices.gsfc.nasa.gov/kms/concept/fd29bf77-df38-4b80-8148-8184fa41d843> ,
<http://gcmdservices.gsfc.nasa.gov/kms/concept/efacd4f6-59ea-4019-8265-8cc81ecc99c0> ,
<http://gcmdservices.gsfc.nasa.gov/kms/concept/f6e19e2e-555a-4d40-9833-c7513d92c813> ;
skos:prefLabel "ABRASION"@en .
Vocabulary deployment and governance | Cox34 |
41. Summary
• Term vocabularies can be formalized in RDF (SKOS, OWL) and
published as linked data
• Much content available, but needs converting (‘lifting’) to
semantic technologies
• Excel, RDF123, Text editor, SKOS, LDR and SISSVoc are our enablers
(but people are essential)
Publishing earth science vocabularies | Cox, Yu, Williams, Giabardo, Lowe41 |
43. LAND AND WATER FLAGSHIP
Thank youEnvironmental Informatics Infrastructure
Simon J D Cox
Research Scientist
t +61 3 9252 6342
e simon.cox@csiro.au
w people.csiro.au/C/S/Simon-Cox
Jonathan Yu
Research Engineer
t +61 3 9252 6440
e jonathan.yu@csiro.au
w people.csiro.au/C/S/Jonathan-Yu
47. Simplified Knowledge Organization System
SKOS: a W3C Standard
Focus on the concept rather than the term
• Web/Linked data principle: Concept is identified by a URI
• Concept is annotated with text labels (i.e. the traditional ‘term’)
• Structured using hierarchical relations within a vocabulary
• broader, narrower
• Matching relations between vocabularies
• broadMatch, closeMatch, exactMatch
Publishing earth science vocabularies | Cox, Yu, Williams, Giabardo, Lowe47 |
48. • Physical documents, PDF
• Tables on web pages
• Bespoke XML documents
• RDF documents, OWL documents
• Web services
• RESTful web resources, Linked data
Publishing earth science vocabularies | Cox, Yu, Williams, Giabardo, Lowe
Delivery
50. Governance
Clear roles:
• Content is determined by the experts
• Formalization may uncover inconsistencies
• History and status must be visible
• No deletions! - retirement or supercession
Publishing earth science vocabularies | Cox, Yu, Williams, Giabardo, Lowe50 |