SlideShare a Scribd company logo
1 of 45
TWC
Exploring the Web of Data for
Earth and Environmental Sciences
Xiaogang (Marshall) Ma
Tetherless World Constellation
Rensselaer Polytechnic Institute
@MarshallXMamax7@rpi.edu
x.marshall.ma
rpi.edu/~max7
0000-0002-9110-7369MarshallXMa
TWCOutline
• Web of Data
– Semantic Web, RDF, Ontology, Linked Open Data
• Weaving the Web of Data
– OneGeology-Europe
– Global Change Information System
– Deep Carbon Observatory-Data Science
– Deep Time Data Infrastructure
• Exploring the Web of Data
– Semantic similarity
– Concept mapping
• Summary
– Semantic eScience
2
TWCSemantic Web
“The Semantic Web is an extension of the current web in
which information is given well-defined meaning, better
enabling computers and people to work in cooperation. ”
3
Berners-Lee et al., 2001. Sci. Amer.
TWCWeb of Documents vs. Web of Data
Back to the early 1990s
• HTML and URL
• Markup language and ways for
connecting resources
• Below the file level
• Stopped at the text level
4
The First Website
http://info.cern.ch/hypertext/WWW/TheProject.html
<a href=“…
TWC
Since the early 2000s…
• XML, RDF, OWL and URIs
• Markup language and ways for
connecting resources
• Below the file level
• Below the text level
• At the data level
Web of Documents vs. Web of Data
(cont.)
Miller, 2004
5
TWCResource Description
Framework (RDF)
• A standard of W3C
• RDF is made up of triples
– <subject, predicate, object>
• RDFS extends RDF with a standard “ontology vocabulary”
– Class, Property
– subClassOf
– Domain, Range
– Type
– …
6http://www.w3.org/RDF/ http://www.w3.org/TR/rdf-schema/
<Mozart, composed, The Magic Flute>
<Mozart, isA, Musician>
<The Magic Flute, isA, Opera>
<Musician, rdf:type, owl:Class>
<Musician, rdfs:subClassOf, Artist>
<composed, rdf:type, owl:ObjectProperty>
<composed, rdfs:domain, Musician>
<composed, rdfs:range, Opera>
TWCOntology
• The term ontology is originated from philosophy
– The study of the nature of existence
• For the Semantic Web purpose
– An ontology is the specification of a shared conceptualization of a
domain
7
Aristotle
(384 – 322 BCE)
I Ching (Book of Changes)
(c. 450 – 250 BCE)
TWCAn Ontology Spectrum
Enriching semantic expressions
Catalog Glossary Thesaurus
Conceptual
schema
Formal
assertionsTaxonomy
Alphanumerical list
Supergroup-subgroup Formal superclass-subclass
Hierarchy
Disjoint subclass;
transitive properties;
symmetric properties, etc.
8
(Ma et al., 2010; adapted from Welty, 2002; McGuinness, 2003;
Obrst, 2003; Uschold and Gruninger, 2004; Borgo et al., 2005)
TWCQuerying RDF Data
• Query Languages such as SPARQL
– Most forms of the query languages contain a set of triple patterns
– Triple patterns are like RDF triples except that each of the subject,
predicate and object may be a variable
9
SELECT ?x ?y
WHERE
{
?x composed ?y .
}
<Mozart, composed, The Magic Flute>
<Mozart, isA, Musician>
<The Magic Flute, isA, Opera>
• ?x, ?y are variables
• ?x composed ?y represents a
<subject, predicate, object> triple
Mozart The Magic Flute
Data
Query
Result
TWCQuerying RDF Data
• Query Languages such as SPARQL
– Most forms of the query languages contain a set of triple patterns
– Triple patterns are like RDF triples except that each of the subject,
predicate and object may be a variable
10
SELECT ?x ?y
WHERE
{
?x composed ?y .
}
<Mozart, composed, The Magic Flute>
<Mozart, isA, Musician>
<The Magic Flute, isA, Opera>
• ?x, ?y are variables
• ?x composed ?y represents a
<subject, predicate, object> triple
Mozart The Magic Flute
Data
Query
Result
Images courtesy of OneGeology
Correlation between
Mozart and Geology?
TWCA Vision of The Semantic Web
Machine-processable, global
Web standards:
• Assigning unambiguous
identifiers (URI)
• Expressing data, including
metadata (RDF)
• Capturing ontologies (OWL)
• Query, rules, transformations,
deployment, application spaces,
logic, proof, trust
Bratt, 2006, Emerging Web Technologies to Watch 11
TWC
(Berners-Lee, 2006) (Image sources: w3c.org and 5stardata.info)
Linked Open Data
12
TWCThe Linking Open Data Cloud
(Aug. 2014)
13From: lod-cloud.net
TWCOneGeology-Europe
• 20 European nations
providing national geologic
maps at scale ~1: 1M
• Harmonized geological
terms and map legends
• Multilingual labels in 18
languages
• Central portal for data
browsing/query among
distributed data sources
A contribution to
INSPIRE
http://www.onegeology-europe.org
14
Recent Works in Geoscience
TWC
15
Federated query
Result of geologic units with age ‘Cenozoic - from 66 million years to today’
http://onegeology-europe.brgm.fr/geoportal/viewer.jsp
TWC
16
Distributed datasets:
Mismatches of geological
units across political
boundaries
Italy/France near
Cuneo/Colmar
Cambrian Carboniferous
(Asch et al., 2012)
(Ma et al., 2014)
Felsic and hornblendic gneisses
Granitic rocks
Wyoming/Colorado
(Base map courtesy:
OneGeology-Europe and USGS)
There are still works
to be done….
TWC
17
Ma et al., 2011, nGeo
Data Interoperability:
“Data should be discoverable, accessible, decodable,
understandable and usable, and data sharing should be
legal and ethical for all participants.”
Original image from: http://ehna.org
TWC
18
http://data.globalchange.gov
TWC
“Figure 1.2: Sea Level Rise: Past, Present, and Future” in draft NCA3
An example question of provenance tracing:
What are the NASA contributions to Figure 1.2 in the draft NCA3?
19
TWCCurrent result:
GCIS ontology version 1.1
Classes and
properties
representing a
brief structure of
the draft NCA3
GCIS
ontology
version
1.1
TWC
Classes and properties about sensors, instruments, platforms,
and algorithms, etc. that datasets are generated from
GCIS
ontology
version 1.1
21
TWC
22
Image from nature.com
Ma et al., 2014, nClimate
Provenance Documentation:
“Linking a range of observations and model outputs, research
activities, people and organizations involved in the production of
scientific findings with the supporting data sets and methods
used to generate them”
TWCDeep Carbon Virtual Observatory
Fox, 2014
http://deepcarbon.net
DCVO: A cyber-
enabled platform
for linked science
23
Deep Carbon Observatory
(2009-2019)
• Deep Energy
• Deep Life
• Extreme Physics and
Chemistry
• Reservoirs and Fluxes
TWCDeep Carbon Virtual Observatory
• A vision of the DCVO:
– A conceptual model of the interplay between data, people,
publication, instruments, models, organizations, etc.
– Identify, annotate and link all key entities, agents and
activities
– A repository for datasets and associated metadata
– Unique and powerful data and metadata visualization for
dissemination of information
– Collaboration tools for scientific efforts
– An integrated portal for diverse content and applications
Fox et al., 2014
24
http://deepcarbon.net
TWCDeep Time Data Infrastructure
(2015-2025)
Studying the co-evolution of geosphere and biosphere
25
http://www.wmkeck.org/grant-programs/research/medical-
research-grant-abstracts/science-and-engineering-2014
Deep Time Data Workshop
at AGU 2014
Vast amounts of data related to planetary
evolution through deep time:
• Mineralogy and Petrology
• Paleobiology and Paleontology
• Paleotectonics and Paleomagnetism
• Geochemistry and Geochronology
• Genomics and Proteomics
…
Short-term goal (2015-2017):
Develop, curate, and integrate diverse data
resources to focus on our planet’s changing
near-surface oxidation state and the rise of
oxygen through deep time
TWCDeep Time Data Infrastructure
(2015-2025)
Studying the co-evolution of geo- and biospheres
26
http://www.wmkeck.org/grant-programs/research/medical-research-grant-
abstracts/science-and-engineering-2014
Deep Time Data Workshop
at AGU 2014
Vast amounts of data related to planetary
evolution through deep time:
• Mineralogy and Petrology
• Paleobiology and Paleontology
• Paleotectonics and Paleomagnetism
• Geochemistry and Geochronology
• Genomics and Proteomics
…
Short-term goal (2015-2017):
Develop, curate, and integrate diverse data
resources to focus on our planet’s changing
near-surface oxidation state and the rise of
oxygen through deep time
Hysted, G., Downs, R.T., and Hazen, R.M. (2015)
Mineral frequency distribution data conform to a
LNRE model: Prediction of Earth’s “missing”
minerals. Mathematical Geosciences, in press.
TWCExploring the Web of Data
• Geoscience vocabularies and ontologies are increasingly
created and used
– Concept recognition, comparison and interlinking will improve the
quality of data integration
TWCExploring the Web of Data
• Geoscience vocabularies and ontologies are increasingly
created and used
– Concept recognition, comparison and inter-linking will improve the
quality of data integration
• SEM+: a tool for concept mapping in geoscience
– SEM: Similarity-based Entity Matching
– Compute semantic similarity between concepts
– Suggest possible linking
Zheng et al., 2015, ESIn
TWCSEM+: Similarity-based Entity Matching
Input Entities
Prefix-
blocking
Blocking
Computing
Similarity
Entity Blocks
Final
Matches
Selecting
Matches
Information Entropy based
Weighted Similarity (IEWS) Model
Information
Entropy
Computation
Triple-wise
Similarity
Computation
Similarity Matrix
Workflow of SEM+
TWCBlocking Algorithm
Input Entities
Prefix-
blocking
Blocking
Computing
Similarity
Entity Blocks
Final
Matches
Selecting
Matches
Information Entropy based
Weighted Similarity (IEWS) Model
Information
Entropy
Computation
Triple-wise
Similarity
Computation
Similarity Matrix
Input Entities
Prefix-
blocking
Blocking
Entity Blocks
• Input: two or more large sets of concepts
isc:Archean
rdf:type gts:GeochronologicEra , skos:Concept ;
rdfs:comment "younger bound-2500.0"@en , "older
bound-4000.0"@en ;
rdfs:label "Archean Eon"@en ;
gts:rank
<http://resource.geosciml.org/ontology/timescale/rank/
Eon> ;
… …
An example concept
TWCPrefix: Rare Keywords
Input Entities
Prefix-
blocking
Blocking
Computing
Similarity
Entity Blocks
Final
Matches
Selecting
Matches
Information Entropy based
Weighted Similarity (IEWS) Model
Information
Entropy
Computation
Triple-wise
Similarity
Computation
Similarity Matrix
Input Entities
Prefix-
blocking
Blocking
Entity Blocks
• Input: two or more large sets of concepts
• Blocking: group similar concepts
• Efficiency: reduce number of concept pairs
• Grouping concepts based on keywords in
their literal descriptions
• Intuition: Concepts that share more rare
keywords (prefix) are more likely to be similar
• Prefix-blocking: ‘prefixes’ are keywords that
belong to the least number of concepts
• The final similarity computation will only
apply to concepts in the same block
TWCConcept Blocks
Input Entities
Prefix-
blocking
Blocking
Computing
Similarity
Entity Blocks
Final
Matches
Selecting
Matches
Information Entropy based
Weighted Similarity (IEWS) Model
Information
Entropy
Computation
Triple-wise
Similarity
Computation
Similarity Matrix
Input Entities
Prefix-
blocking
Blocking
Entity Blocks
• Input: two or more large sets of concepts
• Blocking: group similar concepts
4 concepts & their keywords
w = {A, B, C, E, K, L}
x = {C,D, E, L}
y = {B, K, E, L}
z = {A, B, L}
Result blocks
lb=2, then:
A: {w, z}
C: {w, x}
D: {x}
K: {w, y}
Prefix-blocking
lw: size of a block
lb: blocking parameter
If lw > lb, remove that
block
• Output: concept blocks
TWCIEWS Model
Input Entities
Prefix-
blocking
Blocking
Computing
Similarity
Entity Blocks
Final
Matches
Selecting
Matches
Information Entropy based
Weighted Similarity (IEWS) Model
Information
Entropy
Computation
Triple-wise
Similarity
Computation
Similarity Matrix
TWCTriple-wise Similarity
Input Entities
Prefix-
blocking
Blocking
Computing
Similarity
Entity Blocks
Final
Matches
Selecting
Matches
Information Entropy based
Weighted Similarity (IEWS) Model
Information
Entropy
Computation
Triple-wise
Similarity
Computation
Similarity Matrix
Triple-wise
Similarity
Computation
• Similarity between two concepts c and c’
– Similarity between triples describing the two
concepts
– Triple-wise (pv) similarity: Simpv
• A challenge: property mapping
_:Boston rdfs:type _:t1
_:t1 rdfs:label ‘City’
is same as
_:Boston _:category ‘City’.
Example: property mapping
𝑆𝑖𝑚 𝑝𝑣 𝑝𝑣, 𝑝𝑣′ =
1. 𝑆𝑖𝑚𝑙 𝑣, 𝑣′ 𝑖𝑓 𝑣 𝑎𝑛𝑑 𝑣′ 𝑎𝑟𝑒 𝑏𝑜𝑡ℎ 𝑙𝑖𝑡𝑒𝑟𝑎𝑙
2. 𝑆𝑖𝑚 𝐹
𝑣, 𝑣′
𝑖𝑓 𝑣 𝑎𝑛𝑑 𝑣′
𝑎𝑟𝑒 𝑏𝑜𝑡ℎ 𝑈𝑅𝐼
3. 𝐺𝑒𝑡 𝑙𝑖𝑡𝑒𝑟𝑎𝑙 𝑣𝑎𝑙𝑢𝑒 𝑎𝑛𝑑 𝑡ℎ𝑒𝑛 𝑢𝑠𝑒 𝑆𝑖𝑚𝑙
𝑖𝑓 𝑣 𝑖𝑠 𝑙𝑖𝑡𝑒𝑟𝑎𝑙 𝑎𝑛𝑑 𝑣′ 𝑖𝑠 𝑈𝑅𝐼
TWCSimilarity between Two Triple Values
Input Entities
Prefix-
blocking
Blocking
Computing
Similarity
Entity Blocks
Final
Matches
Selecting
Matches
Information Entropy based
Weighted Similarity (IEWS) Model
Information
Entropy
Computation
Triple-wise
Similarity
Computation
Similarity Matrix
Triple-wise
Similarity
Computation
𝑆𝑖𝑚 𝑝𝑣 𝑝𝑣, 𝑝𝑣′ =
1. 𝑆𝑖𝑚𝑙 𝑣, 𝑣′ 𝑖𝑓 𝑣 𝑎𝑛𝑑 𝑣′ 𝑎𝑟𝑒 𝑏𝑜𝑡ℎ 𝑙𝑖𝑡𝑒𝑟𝑎𝑙
2. 𝑆𝑖𝑚 𝐹
𝑣, 𝑣′
𝑖𝑓 𝑣 𝑎𝑛𝑑 𝑣′
𝑎𝑟𝑒 𝑏𝑜𝑡ℎ 𝑈𝑅𝐼
3. 𝐺𝑒𝑡 𝑙𝑖𝑡𝑒𝑟𝑎𝑙 𝑣𝑎𝑙𝑢𝑒 𝑎𝑛𝑑 𝑡ℎ𝑒𝑛 𝑢𝑠𝑒 𝑆𝑖𝑚𝑙
𝑖𝑓 𝑣 𝑖𝑠 𝑙𝑖𝑡𝑒𝑟𝑎𝑙 𝑎𝑛𝑑 𝑣′ 𝑖𝑠 𝑈𝑅𝐼
• Similarity between two triple values
• For case 1, compute similarity using Lin’s method (Lin, 1998. ICML)
• For case 2, use another equation 𝑆𝑖𝑚 𝐹 𝑐, 𝑐′ recursively
– Here we only traverse URIs to the depth of three
• For case 3, first extract literal value of v’ and then use Siml
TWCSimilarity between Two Concepts
Input Entities
Prefix-
blocking
Blocking
Computing
Similarity
Entity Blocks
Final
Matches
Selecting
Matches
Information Entropy based
Weighted Similarity (IEWS) Model
Information
Entropy
Computation
Triple-wise
Similarity
Computation
Similarity Matrix
Triple-wise
Similarity
Computation
𝑆𝑖𝑚 𝑐, 𝑐′ =
𝑆𝑖𝑚 𝑝𝑣
𝑆𝑖𝑚 𝑝𝑣 + α 𝑃𝑉1 − 𝑆𝑖𝑚 𝑝𝑣 + β( 𝑃𝑉2 − 𝑆𝑖𝑚 𝑝𝑣)
• Similarity between two concepts c and c’
• Apply Jaccard similarity (Jaccard, 1912. New Phytologist)
• |𝑃𝑉1|: number of pvs in concept c
• |𝑃𝑉2|: number of pvs in concept c’
• α and β: coefficients of variation on the similarity measure on c and c’
unique description
TWCInformation Entropy
Input Entities
Prefix-
blocking
Blocking
Computing
Similarity
Entity Blocks
Final
Matches
Selecting
Matches
Information Entropy based
Weighted Similarity (IEWS) Model
Information
Entropy
Computation
Triple-wise
Similarity
Computation
Similarity Matrix
• Information entropy: Quantified measure
of uncertainty of information content
(Shannon, 1948)
• The amount of information of a property
can be quantified as information entropy
Properties are not equally
important for concept
description. A tripe describing
the Social Security Number is
more important than a triple of
name, to identify a person
Example
• X: a property
• P(xi): possibility of X obtaining a particular value xi
• Information Entropy of X:
𝐻 𝑋 = −
𝑖=1
𝑛
𝑃 𝑥𝑖 𝑙𝑜𝑔 𝑏 (𝑃 𝑥𝑖 )
Information
Entropy
Computation
TWCJoint Information Entropy
Input Entities
Prefix-
blocking
Blocking
Computing
Similarity
Entity Blocks
Final
Matches
Selecting
Matches
Information Entropy based
Weighted Similarity (IEWS) Model
Information
Entropy
Computation
Triple-wise
Similarity
Computation
Similarity Matrix
• Joint information entropy: a measure of the uncertainty associated with
a set of variables
• X1, …, Xn: a list of properties
• x1, … , xn: particular values of X1, …, Xn, respectively
• P(x1, … , xn): possibility of those values occurring together
• Joint Information Entropy of X1, …, Xn :
𝐻 X1, …, Xn = −
𝑥1
…
𝑥 𝑛
𝑃 𝑥1, … , 𝑥𝑛 𝑙𝑜𝑔 𝑏[𝑃 𝑥1, … , 𝑥𝑛 ]
Information
Entropy
Computation
TWCInformation Entropy
Input Entities
Prefix-
blocking
Blocking
Computing
Similarity
Entity Blocks
Final
Matches
Selecting
Matches
Information Entropy based
Weighted Similarity (IEWS) Model
Information
Entropy
Computation
Triple-wise
Similarity
Computation
Similarity Matrix
• P: the set of properties in 𝑆𝑖𝑚 𝑝𝑣
• H(P): information entropy of the common descriptions
𝑆𝑖𝑚 𝐹 𝑐, 𝑐′ = 𝐻(𝑃)
𝑆𝑖𝑚 𝑝𝑣
𝑆𝑖𝑚 𝑝𝑣 + α 𝑃𝑉1 − 𝑆𝑖𝑚 𝑝𝑣 + β( 𝑃𝑉2 − 𝑆𝑖𝑚 𝑝𝑣)
TWCSelecting Matches between Concepts
Input Entities
Prefix-
blocking
Blocking
Computing
Similarity
Entity Blocks
Final
Matches
Selecting
Matches
Information Entropy based
Weighted Similarity (IEWS) Model
Information
Entropy
Computation
Triple-wise
Similarity
Computation
Similarity Matrix
Final
Matches
Selecting
Matches
Similarity Matrix
TWCSummary
• eScience: the digital or electronic facilitation of science
• Semantic eScience
– A virtuous circle between science and semantic technologies
– Data driven + Knowledge driven?
• My understanding of Semantic eScience: AIR3
– Anyone can say anything on any topic
– Interoperability, interactivity, intercreativity
– The right information for the right person at the right time
41
Thanks for listening
@MarshallXMamax7@rpi.edu
TWC• Backup slides
TWC
43
Earth Resource Form
Environmental Impact Value
Exploration Activity Type
Exploration Result
UNFC Value
Earth Resource Expression
Earth Resource Shape
Enduse Potential
Mineral Occurrence Type
Mining Activity Type
Processing Activity Type
Mining Waste Type Value
Commodity Code
Mineral Deposit Group
Mineral Deposit Type
Product Value
Recently finished CGI vocabularies
• Construct a collection of vocabularies for
populating information interchange
documents and enabling interoperability
• Provide labels for concepts, scope to
various communities defined by
language, science domain, or application
domain
CGI Geoscience Terminology Workgroup
http://cgi-iugs.org/tech_collaboration/
geoscience_terminology_working_group.html
TWCPrior to 2005, we built systems;
Now we build frameworks
• Rough definitions
– Systems have very well-define entry and exit
points. A user tends to know when they are
using one. Options for extensions are limited
and usually require engineering
– Frameworks have many entry and use points.
Users often do not know when they are using
one. Extension points are part of the design
– Platforms are built on frameworks
(Fox, 2014)
44
Geoinformatics
TWCSemantic eScience
• Artificial Intelligence accelerates scientific discovery
– Data search, synthesis and hypothesis representation
– Data analysis: reasoning with models of the data
(Gil et al., 2014)
Image from science.com
A state-of-the-art example:
Hanalyzer (high-throughput analyzer)
• Uses natural language processing to
automatically extract a semantic network from
all PubMed papers relevant to a scientist
• Uses Semantic Web technology to integrate
assertions from other biomedical sources
• Reasons about the network to find new
correlations that suggest new genes to
investigate
45
(Leach et al., 2009)

More Related Content

What's hot

Tufts Spatial Data Rescue: Crawling at-risk Government Data
Tufts Spatial Data Rescue: Crawling at-risk Government DataTufts Spatial Data Rescue: Crawling at-risk Government Data
Tufts Spatial Data Rescue: Crawling at-risk Government DataKyle Monahan
 
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...Micah Altman
 
DBpedia Mappings Wiki, SMWCon Fall 2013, Berlin
DBpedia Mappings Wiki, SMWCon Fall 2013, BerlinDBpedia Mappings Wiki, SMWCon Fall 2013, Berlin
DBpedia Mappings Wiki, SMWCon Fall 2013, BerlinAnja Jentzsch
 
Digital archaeology and museums
Digital archaeology and museumsDigital archaeology and museums
Digital archaeology and museumsdejp3
 
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as KnowledgeRDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as KnowledgeNational Institute of Informatics
 
GeoSemantic Technologies for Archaeological Resources
GeoSemantic Technologies for Archaeological ResourcesGeoSemantic Technologies for Archaeological Resources
GeoSemantic Technologies for Archaeological ResourcesPaul Cripps
 
CAA 2014: Geosemantic Tools for Archaeological Research
CAA 2014: Geosemantic Tools for Archaeological ResearchCAA 2014: Geosemantic Tools for Archaeological Research
CAA 2014: Geosemantic Tools for Archaeological ResearchPaul Cripps
 
Comparing and matching archaeological excavation data for integration in onto...
Comparing and matching archaeological excavation data for integration in onto...Comparing and matching archaeological excavation data for integration in onto...
Comparing and matching archaeological excavation data for integration in onto...ariadnenetwork
 
New Discovery Tools for Digital Humanities and Spatial Data (Summary of the J...
New Discovery Tools for Digital Humanities and Spatial Data (Summary of the J...New Discovery Tools for Digital Humanities and Spatial Data (Summary of the J...
New Discovery Tools for Digital Humanities and Spatial Data (Summary of the J...Micah Altman
 

What's hot (12)

Tufts Spatial Data Rescue: Crawling at-risk Government Data
Tufts Spatial Data Rescue: Crawling at-risk Government DataTufts Spatial Data Rescue: Crawling at-risk Government Data
Tufts Spatial Data Rescue: Crawling at-risk Government Data
 
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...
 
Resources, resources, resources: the three rs of the Web
Resources, resources, resources: the three rs of the WebResources, resources, resources: the three rs of the Web
Resources, resources, resources: the three rs of the Web
 
DBpedia Mappings Wiki, SMWCon Fall 2013, Berlin
DBpedia Mappings Wiki, SMWCon Fall 2013, BerlinDBpedia Mappings Wiki, SMWCon Fall 2013, Berlin
DBpedia Mappings Wiki, SMWCon Fall 2013, Berlin
 
Digital archaeology and museums
Digital archaeology and museumsDigital archaeology and museums
Digital archaeology and museums
 
Sw4 sh slides
Sw4 sh slidesSw4 sh slides
Sw4 sh slides
 
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as KnowledgeRDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
 
GeoSemantic Technologies for Archaeological Resources
GeoSemantic Technologies for Archaeological ResourcesGeoSemantic Technologies for Archaeological Resources
GeoSemantic Technologies for Archaeological Resources
 
CAA 2014: Geosemantic Tools for Archaeological Research
CAA 2014: Geosemantic Tools for Archaeological ResearchCAA 2014: Geosemantic Tools for Archaeological Research
CAA 2014: Geosemantic Tools for Archaeological Research
 
Comparing and matching archaeological excavation data for integration in onto...
Comparing and matching archaeological excavation data for integration in onto...Comparing and matching archaeological excavation data for integration in onto...
Comparing and matching archaeological excavation data for integration in onto...
 
New Discovery Tools for Digital Humanities and Spatial Data (Summary of the J...
New Discovery Tools for Digital Humanities and Spatial Data (Summary of the J...New Discovery Tools for Digital Humanities and Spatial Data (Summary of the J...
New Discovery Tools for Digital Humanities and Spatial Data (Summary of the J...
 
Semantic Web Technology
Semantic Web TechnologySemantic Web Technology
Semantic Web Technology
 

Similar to Exploring the Web of Data for Earth and Environmental Sciences

EarthCube's OceanLink - Project Overview and Presentation Updates (March 2014)
EarthCube's OceanLink - Project Overview and Presentation Updates (March 2014)EarthCube's OceanLink - Project Overview and Presentation Updates (March 2014)
EarthCube's OceanLink - Project Overview and Presentation Updates (March 2014)EarthCube
 
Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Obser...
Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Obser...Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Obser...
Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Obser...Xiaogang (Marshall) Ma
 
Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)Kerstin Lehnert
 
SENESCHAL: Semantic ENrichment Enabling Sustainability of arCHAeological Link...
SENESCHAL: Semantic ENrichment Enabling Sustainability of arCHAeological Link...SENESCHAL: Semantic ENrichment Enabling Sustainability of arCHAeological Link...
SENESCHAL: Semantic ENrichment Enabling Sustainability of arCHAeological Link...CIGScotland
 
Ancient History of the UK Web
Ancient History of the UK WebAncient History of the UK Web
Ancient History of the UK WebScott A. Hale
 
Dig the new breed: how open approaches can empower archaeologists
Dig the new breed: how open approaches can empower archaeologistsDig the new breed: how open approaches can empower archaeologists
Dig the new breed: how open approaches can empower archaeologistsDART Project
 
Supporting the research lifecycle of geo-GSNL initiative through HPC and Rese...
Supporting the research lifecycle of geo-GSNL initiative through HPC and Rese...Supporting the research lifecycle of geo-GSNL initiative through HPC and Rese...
Supporting the research lifecycle of geo-GSNL initiative through HPC and Rese...Raul Palma
 
Tectonic Storytelling with Open Source and Digital Object Identifiers - a cas...
Tectonic Storytelling with Open Source and Digital Object Identifiers - a cas...Tectonic Storytelling with Open Source and Digital Object Identifiers - a cas...
Tectonic Storytelling with Open Source and Digital Object Identifiers - a cas...Peter Löwe
 
Esri and the Scientific Community
Esri and the Scientific CommunityEsri and the Scientific Community
Esri and the Scientific CommunityDawn Wright
 
Lightning Talks: All EartCube Funded Projects
Lightning Talks: All EartCube Funded ProjectsLightning Talks: All EartCube Funded Projects
Lightning Talks: All EartCube Funded ProjectsEarthCube
 
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...EarthCube
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsJon Voss
 
INSPIRE Hackathon Webinar Intro to Linked Data and Semantics
INSPIRE Hackathon Webinar   Intro to Linked Data and SemanticsINSPIRE Hackathon Webinar   Intro to Linked Data and Semantics
INSPIRE Hackathon Webinar Intro to Linked Data and Semanticsplan4all
 
Semantic Web in Physical Science
Semantic Web in Physical ScienceSemantic Web in Physical Science
Semantic Web in Physical Sciencepetermurrayrust
 
EAA 2017 Re-engineering the process: How best to share, connect, re-use & pro...
EAA 2017 Re-engineering the process: How best to share, connect, re-use & pro...EAA 2017 Re-engineering the process: How best to share, connect, re-use & pro...
EAA 2017 Re-engineering the process: How best to share, connect, re-use & pro...Keith.May
 
Linking Open Data with Drupal
Linking Open Data with DrupalLinking Open Data with Drupal
Linking Open Data with Drupalemmanuel_jamin
 
Semantically-Enabled Environmental Data Discovery and Integration: Demonstrat...
Semantically-Enabled Environmental Data Discovery and Integration: Demonstrat...Semantically-Enabled Environmental Data Discovery and Integration: Demonstrat...
Semantically-Enabled Environmental Data Discovery and Integration: Demonstrat...Tatiana Tarasova
 
WOW13_RPITWC_Web Observatories
WOW13_RPITWC_Web ObservatoriesWOW13_RPITWC_Web Observatories
WOW13_RPITWC_Web Observatoriesgloriakt
 

Similar to Exploring the Web of Data for Earth and Environmental Sciences (20)

EarthCube's OceanLink - Project Overview and Presentation Updates (March 2014)
EarthCube's OceanLink - Project Overview and Presentation Updates (March 2014)EarthCube's OceanLink - Project Overview and Presentation Updates (March 2014)
EarthCube's OceanLink - Project Overview and Presentation Updates (March 2014)
 
Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Obser...
Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Obser...Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Obser...
Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Obser...
 
Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)
 
SENESCHAL: Semantic ENrichment Enabling Sustainability of arCHAeological Link...
SENESCHAL: Semantic ENrichment Enabling Sustainability of arCHAeological Link...SENESCHAL: Semantic ENrichment Enabling Sustainability of arCHAeological Link...
SENESCHAL: Semantic ENrichment Enabling Sustainability of arCHAeological Link...
 
Ancient History of the UK Web
Ancient History of the UK WebAncient History of the UK Web
Ancient History of the UK Web
 
Dig the new breed: how open approaches can empower archaeologists
Dig the new breed: how open approaches can empower archaeologistsDig the new breed: how open approaches can empower archaeologists
Dig the new breed: how open approaches can empower archaeologists
 
Supporting the research lifecycle of geo-GSNL initiative through HPC and Rese...
Supporting the research lifecycle of geo-GSNL initiative through HPC and Rese...Supporting the research lifecycle of geo-GSNL initiative through HPC and Rese...
Supporting the research lifecycle of geo-GSNL initiative through HPC and Rese...
 
Tectonic Storytelling with Open Source and Digital Object Identifiers - a cas...
Tectonic Storytelling with Open Source and Digital Object Identifiers - a cas...Tectonic Storytelling with Open Source and Digital Object Identifiers - a cas...
Tectonic Storytelling with Open Source and Digital Object Identifiers - a cas...
 
Esri and the Scientific Community
Esri and the Scientific CommunityEsri and the Scientific Community
Esri and the Scientific Community
 
Lightning Talks: All EartCube Funded Projects
Lightning Talks: All EartCube Funded ProjectsLightning Talks: All EartCube Funded Projects
Lightning Talks: All EartCube Funded Projects
 
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
 
INSPIRE Hackathon Webinar Intro to Linked Data and Semantics
INSPIRE Hackathon Webinar   Intro to Linked Data and SemanticsINSPIRE Hackathon Webinar   Intro to Linked Data and Semantics
INSPIRE Hackathon Webinar Intro to Linked Data and Semantics
 
Towards Knowledge-Enabled Society
Towards Knowledge-Enabled SocietyTowards Knowledge-Enabled Society
Towards Knowledge-Enabled Society
 
Semantic Web in Physical Science
Semantic Web in Physical ScienceSemantic Web in Physical Science
Semantic Web in Physical Science
 
EAA 2017 Re-engineering the process: How best to share, connect, re-use & pro...
EAA 2017 Re-engineering the process: How best to share, connect, re-use & pro...EAA 2017 Re-engineering the process: How best to share, connect, re-use & pro...
EAA 2017 Re-engineering the process: How best to share, connect, re-use & pro...
 
E research overview gahegan bioinformatics workshop 2010
E research overview gahegan bioinformatics workshop 2010E research overview gahegan bioinformatics workshop 2010
E research overview gahegan bioinformatics workshop 2010
 
Linking Open Data with Drupal
Linking Open Data with DrupalLinking Open Data with Drupal
Linking Open Data with Drupal
 
Semantically-Enabled Environmental Data Discovery and Integration: Demonstrat...
Semantically-Enabled Environmental Data Discovery and Integration: Demonstrat...Semantically-Enabled Environmental Data Discovery and Integration: Demonstrat...
Semantically-Enabled Environmental Data Discovery and Integration: Demonstrat...
 
WOW13_RPITWC_Web Observatories
WOW13_RPITWC_Web ObservatoriesWOW13_RPITWC_Web Observatories
WOW13_RPITWC_Web Observatories
 

More from Xiaogang (Marshall) Ma

From data portal to knowledge portal: Leveraging semantic technologies to sup...
From data portal to knowledge portal: Leveraging semantic technologies to sup...From data portal to knowledge portal: Leveraging semantic technologies to sup...
From data portal to knowledge portal: Leveraging semantic technologies to sup...Xiaogang (Marshall) Ma
 
Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal
Adoption of RDA DTR and PID in Deep Carbon Observatory Data PortalAdoption of RDA DTR and PID in Deep Carbon Observatory Data Portal
Adoption of RDA DTR and PID in Deep Carbon Observatory Data PortalXiaogang (Marshall) Ma
 
Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semant...
Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semant...Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semant...
Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semant...Xiaogang (Marshall) Ma
 
Why Data Science Matters - 2014 WDS Data Stewardship Award Lecture
Why Data Science Matters - 2014 WDS Data Stewardship Award LectureWhy Data Science Matters - 2014 WDS Data Stewardship Award Lecture
Why Data Science Matters - 2014 WDS Data Stewardship Award LectureXiaogang (Marshall) Ma
 
Why data science matters and what we can do with it
Why data science matters and what we can do with itWhy data science matters and what we can do with it
Why data science matters and what we can do with itXiaogang (Marshall) Ma
 
CLOSED - Call for Papers: Semantic eScience special issue in Earth Science In...
CLOSED - Call for Papers: Semantic eScience special issue in Earth Science In...CLOSED - Call for Papers: Semantic eScience special issue in Earth Science In...
CLOSED - Call for Papers: Semantic eScience special issue in Earth Science In...Xiaogang (Marshall) Ma
 
Ontology Development for Provenance Tracing in National Climate Assessment o...
Ontology Development for Provenance Tracing in National Climate Assessment o...Ontology Development for Provenance Tracing in National Climate Assessment o...
Ontology Development for Provenance Tracing in National Climate Assessment o...Xiaogang (Marshall) Ma
 
A short review of Connected China: A visualization of elite social networks i...
A short review of Connected China: A visualization of elite social networks i...A short review of Connected China: A visualization of elite social networks i...
A short review of Connected China: A visualization of elite social networks i...Xiaogang (Marshall) Ma
 
Ontology spectrum for geological data interoperability (PhD defense nov 2011)
Ontology spectrum for geological data interoperability (PhD defense nov 2011)Ontology spectrum for geological data interoperability (PhD defense nov 2011)
Ontology spectrum for geological data interoperability (PhD defense nov 2011)Xiaogang (Marshall) Ma
 
A use case-driven iterative method for building a provenance-aware GCIS onto...
A use case-driven iterative method for building a provenance-aware GCIS onto...A use case-driven iterative method for building a provenance-aware GCIS onto...
A use case-driven iterative method for building a provenance-aware GCIS onto...Xiaogang (Marshall) Ma
 
A short story of geologic time ontologies and vocabularies
A short story of geologic time ontologies and vocabulariesA short story of geologic time ontologies and vocabularies
A short story of geologic time ontologies and vocabulariesXiaogang (Marshall) Ma
 
Exploratory visualization of earth science data in a Semantic Web context
Exploratory visualization of earth science data in a Semantic Web contextExploratory visualization of earth science data in a Semantic Web context
Exploratory visualization of earth science data in a Semantic Web contextXiaogang (Marshall) Ma
 

More from Xiaogang (Marshall) Ma (13)

From data portal to knowledge portal: Leveraging semantic technologies to sup...
From data portal to knowledge portal: Leveraging semantic technologies to sup...From data portal to knowledge portal: Leveraging semantic technologies to sup...
From data portal to knowledge portal: Leveraging semantic technologies to sup...
 
Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal
Adoption of RDA DTR and PID in Deep Carbon Observatory Data PortalAdoption of RDA DTR and PID in Deep Carbon Observatory Data Portal
Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal
 
Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semant...
Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semant...Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semant...
Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semant...
 
Why Data Science Matters - 2014 WDS Data Stewardship Award Lecture
Why Data Science Matters - 2014 WDS Data Stewardship Award LectureWhy Data Science Matters - 2014 WDS Data Stewardship Award Lecture
Why Data Science Matters - 2014 WDS Data Stewardship Award Lecture
 
Why data science matters and what we can do with it
Why data science matters and what we can do with itWhy data science matters and what we can do with it
Why data science matters and what we can do with it
 
CLOSED - Call for Papers: Semantic eScience special issue in Earth Science In...
CLOSED - Call for Papers: Semantic eScience special issue in Earth Science In...CLOSED - Call for Papers: Semantic eScience special issue in Earth Science In...
CLOSED - Call for Papers: Semantic eScience special issue in Earth Science In...
 
Ontology Development for Provenance Tracing in National Climate Assessment o...
Ontology Development for Provenance Tracing in National Climate Assessment o...Ontology Development for Provenance Tracing in National Climate Assessment o...
Ontology Development for Provenance Tracing in National Climate Assessment o...
 
A short review of Connected China: A visualization of elite social networks i...
A short review of Connected China: A visualization of elite social networks i...A short review of Connected China: A visualization of elite social networks i...
A short review of Connected China: A visualization of elite social networks i...
 
Ontology spectrum for geological data interoperability (PhD defense nov 2011)
Ontology spectrum for geological data interoperability (PhD defense nov 2011)Ontology spectrum for geological data interoperability (PhD defense nov 2011)
Ontology spectrum for geological data interoperability (PhD defense nov 2011)
 
A use case-driven iterative method for building a provenance-aware GCIS onto...
A use case-driven iterative method for building a provenance-aware GCIS onto...A use case-driven iterative method for building a provenance-aware GCIS onto...
A use case-driven iterative method for building a provenance-aware GCIS onto...
 
A short introduction to GIS
A short introduction to GISA short introduction to GIS
A short introduction to GIS
 
A short story of geologic time ontologies and vocabularies
A short story of geologic time ontologies and vocabulariesA short story of geologic time ontologies and vocabularies
A short story of geologic time ontologies and vocabularies
 
Exploratory visualization of earth science data in a Semantic Web context
Exploratory visualization of earth science data in a Semantic Web contextExploratory visualization of earth science data in a Semantic Web context
Exploratory visualization of earth science data in a Semantic Web context
 

Recently uploaded

Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...amitlee9823
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxolyaivanovalion
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 

Recently uploaded (20)

(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptx
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 

Exploring the Web of Data for Earth and Environmental Sciences

  • 1. TWC Exploring the Web of Data for Earth and Environmental Sciences Xiaogang (Marshall) Ma Tetherless World Constellation Rensselaer Polytechnic Institute @MarshallXMamax7@rpi.edu x.marshall.ma rpi.edu/~max7 0000-0002-9110-7369MarshallXMa
  • 2. TWCOutline • Web of Data – Semantic Web, RDF, Ontology, Linked Open Data • Weaving the Web of Data – OneGeology-Europe – Global Change Information System – Deep Carbon Observatory-Data Science – Deep Time Data Infrastructure • Exploring the Web of Data – Semantic similarity – Concept mapping • Summary – Semantic eScience 2
  • 3. TWCSemantic Web “The Semantic Web is an extension of the current web in which information is given well-defined meaning, better enabling computers and people to work in cooperation. ” 3 Berners-Lee et al., 2001. Sci. Amer.
  • 4. TWCWeb of Documents vs. Web of Data Back to the early 1990s • HTML and URL • Markup language and ways for connecting resources • Below the file level • Stopped at the text level 4 The First Website http://info.cern.ch/hypertext/WWW/TheProject.html <a href=“…
  • 5. TWC Since the early 2000s… • XML, RDF, OWL and URIs • Markup language and ways for connecting resources • Below the file level • Below the text level • At the data level Web of Documents vs. Web of Data (cont.) Miller, 2004 5
  • 6. TWCResource Description Framework (RDF) • A standard of W3C • RDF is made up of triples – <subject, predicate, object> • RDFS extends RDF with a standard “ontology vocabulary” – Class, Property – subClassOf – Domain, Range – Type – … 6http://www.w3.org/RDF/ http://www.w3.org/TR/rdf-schema/ <Mozart, composed, The Magic Flute> <Mozart, isA, Musician> <The Magic Flute, isA, Opera> <Musician, rdf:type, owl:Class> <Musician, rdfs:subClassOf, Artist> <composed, rdf:type, owl:ObjectProperty> <composed, rdfs:domain, Musician> <composed, rdfs:range, Opera>
  • 7. TWCOntology • The term ontology is originated from philosophy – The study of the nature of existence • For the Semantic Web purpose – An ontology is the specification of a shared conceptualization of a domain 7 Aristotle (384 – 322 BCE) I Ching (Book of Changes) (c. 450 – 250 BCE)
  • 8. TWCAn Ontology Spectrum Enriching semantic expressions Catalog Glossary Thesaurus Conceptual schema Formal assertionsTaxonomy Alphanumerical list Supergroup-subgroup Formal superclass-subclass Hierarchy Disjoint subclass; transitive properties; symmetric properties, etc. 8 (Ma et al., 2010; adapted from Welty, 2002; McGuinness, 2003; Obrst, 2003; Uschold and Gruninger, 2004; Borgo et al., 2005)
  • 9. TWCQuerying RDF Data • Query Languages such as SPARQL – Most forms of the query languages contain a set of triple patterns – Triple patterns are like RDF triples except that each of the subject, predicate and object may be a variable 9 SELECT ?x ?y WHERE { ?x composed ?y . } <Mozart, composed, The Magic Flute> <Mozart, isA, Musician> <The Magic Flute, isA, Opera> • ?x, ?y are variables • ?x composed ?y represents a <subject, predicate, object> triple Mozart The Magic Flute Data Query Result
  • 10. TWCQuerying RDF Data • Query Languages such as SPARQL – Most forms of the query languages contain a set of triple patterns – Triple patterns are like RDF triples except that each of the subject, predicate and object may be a variable 10 SELECT ?x ?y WHERE { ?x composed ?y . } <Mozart, composed, The Magic Flute> <Mozart, isA, Musician> <The Magic Flute, isA, Opera> • ?x, ?y are variables • ?x composed ?y represents a <subject, predicate, object> triple Mozart The Magic Flute Data Query Result Images courtesy of OneGeology Correlation between Mozart and Geology?
  • 11. TWCA Vision of The Semantic Web Machine-processable, global Web standards: • Assigning unambiguous identifiers (URI) • Expressing data, including metadata (RDF) • Capturing ontologies (OWL) • Query, rules, transformations, deployment, application spaces, logic, proof, trust Bratt, 2006, Emerging Web Technologies to Watch 11
  • 12. TWC (Berners-Lee, 2006) (Image sources: w3c.org and 5stardata.info) Linked Open Data 12
  • 13. TWCThe Linking Open Data Cloud (Aug. 2014) 13From: lod-cloud.net
  • 14. TWCOneGeology-Europe • 20 European nations providing national geologic maps at scale ~1: 1M • Harmonized geological terms and map legends • Multilingual labels in 18 languages • Central portal for data browsing/query among distributed data sources A contribution to INSPIRE http://www.onegeology-europe.org 14 Recent Works in Geoscience
  • 15. TWC 15 Federated query Result of geologic units with age ‘Cenozoic - from 66 million years to today’ http://onegeology-europe.brgm.fr/geoportal/viewer.jsp
  • 16. TWC 16 Distributed datasets: Mismatches of geological units across political boundaries Italy/France near Cuneo/Colmar Cambrian Carboniferous (Asch et al., 2012) (Ma et al., 2014) Felsic and hornblendic gneisses Granitic rocks Wyoming/Colorado (Base map courtesy: OneGeology-Europe and USGS) There are still works to be done….
  • 17. TWC 17 Ma et al., 2011, nGeo Data Interoperability: “Data should be discoverable, accessible, decodable, understandable and usable, and data sharing should be legal and ethical for all participants.” Original image from: http://ehna.org
  • 19. TWC “Figure 1.2: Sea Level Rise: Past, Present, and Future” in draft NCA3 An example question of provenance tracing: What are the NASA contributions to Figure 1.2 in the draft NCA3? 19
  • 20. TWCCurrent result: GCIS ontology version 1.1 Classes and properties representing a brief structure of the draft NCA3 GCIS ontology version 1.1
  • 21. TWC Classes and properties about sensors, instruments, platforms, and algorithms, etc. that datasets are generated from GCIS ontology version 1.1 21
  • 22. TWC 22 Image from nature.com Ma et al., 2014, nClimate Provenance Documentation: “Linking a range of observations and model outputs, research activities, people and organizations involved in the production of scientific findings with the supporting data sets and methods used to generate them”
  • 23. TWCDeep Carbon Virtual Observatory Fox, 2014 http://deepcarbon.net DCVO: A cyber- enabled platform for linked science 23 Deep Carbon Observatory (2009-2019) • Deep Energy • Deep Life • Extreme Physics and Chemistry • Reservoirs and Fluxes
  • 24. TWCDeep Carbon Virtual Observatory • A vision of the DCVO: – A conceptual model of the interplay between data, people, publication, instruments, models, organizations, etc. – Identify, annotate and link all key entities, agents and activities – A repository for datasets and associated metadata – Unique and powerful data and metadata visualization for dissemination of information – Collaboration tools for scientific efforts – An integrated portal for diverse content and applications Fox et al., 2014 24 http://deepcarbon.net
  • 25. TWCDeep Time Data Infrastructure (2015-2025) Studying the co-evolution of geosphere and biosphere 25 http://www.wmkeck.org/grant-programs/research/medical- research-grant-abstracts/science-and-engineering-2014 Deep Time Data Workshop at AGU 2014 Vast amounts of data related to planetary evolution through deep time: • Mineralogy and Petrology • Paleobiology and Paleontology • Paleotectonics and Paleomagnetism • Geochemistry and Geochronology • Genomics and Proteomics … Short-term goal (2015-2017): Develop, curate, and integrate diverse data resources to focus on our planet’s changing near-surface oxidation state and the rise of oxygen through deep time
  • 26. TWCDeep Time Data Infrastructure (2015-2025) Studying the co-evolution of geo- and biospheres 26 http://www.wmkeck.org/grant-programs/research/medical-research-grant- abstracts/science-and-engineering-2014 Deep Time Data Workshop at AGU 2014 Vast amounts of data related to planetary evolution through deep time: • Mineralogy and Petrology • Paleobiology and Paleontology • Paleotectonics and Paleomagnetism • Geochemistry and Geochronology • Genomics and Proteomics … Short-term goal (2015-2017): Develop, curate, and integrate diverse data resources to focus on our planet’s changing near-surface oxidation state and the rise of oxygen through deep time Hysted, G., Downs, R.T., and Hazen, R.M. (2015) Mineral frequency distribution data conform to a LNRE model: Prediction of Earth’s “missing” minerals. Mathematical Geosciences, in press.
  • 27. TWCExploring the Web of Data • Geoscience vocabularies and ontologies are increasingly created and used – Concept recognition, comparison and interlinking will improve the quality of data integration
  • 28. TWCExploring the Web of Data • Geoscience vocabularies and ontologies are increasingly created and used – Concept recognition, comparison and inter-linking will improve the quality of data integration • SEM+: a tool for concept mapping in geoscience – SEM: Similarity-based Entity Matching – Compute semantic similarity between concepts – Suggest possible linking Zheng et al., 2015, ESIn
  • 29. TWCSEM+: Similarity-based Entity Matching Input Entities Prefix- blocking Blocking Computing Similarity Entity Blocks Final Matches Selecting Matches Information Entropy based Weighted Similarity (IEWS) Model Information Entropy Computation Triple-wise Similarity Computation Similarity Matrix Workflow of SEM+
  • 30. TWCBlocking Algorithm Input Entities Prefix- blocking Blocking Computing Similarity Entity Blocks Final Matches Selecting Matches Information Entropy based Weighted Similarity (IEWS) Model Information Entropy Computation Triple-wise Similarity Computation Similarity Matrix Input Entities Prefix- blocking Blocking Entity Blocks • Input: two or more large sets of concepts isc:Archean rdf:type gts:GeochronologicEra , skos:Concept ; rdfs:comment "younger bound-2500.0"@en , "older bound-4000.0"@en ; rdfs:label "Archean Eon"@en ; gts:rank <http://resource.geosciml.org/ontology/timescale/rank/ Eon> ; … … An example concept
  • 31. TWCPrefix: Rare Keywords Input Entities Prefix- blocking Blocking Computing Similarity Entity Blocks Final Matches Selecting Matches Information Entropy based Weighted Similarity (IEWS) Model Information Entropy Computation Triple-wise Similarity Computation Similarity Matrix Input Entities Prefix- blocking Blocking Entity Blocks • Input: two or more large sets of concepts • Blocking: group similar concepts • Efficiency: reduce number of concept pairs • Grouping concepts based on keywords in their literal descriptions • Intuition: Concepts that share more rare keywords (prefix) are more likely to be similar • Prefix-blocking: ‘prefixes’ are keywords that belong to the least number of concepts • The final similarity computation will only apply to concepts in the same block
  • 32. TWCConcept Blocks Input Entities Prefix- blocking Blocking Computing Similarity Entity Blocks Final Matches Selecting Matches Information Entropy based Weighted Similarity (IEWS) Model Information Entropy Computation Triple-wise Similarity Computation Similarity Matrix Input Entities Prefix- blocking Blocking Entity Blocks • Input: two or more large sets of concepts • Blocking: group similar concepts 4 concepts & their keywords w = {A, B, C, E, K, L} x = {C,D, E, L} y = {B, K, E, L} z = {A, B, L} Result blocks lb=2, then: A: {w, z} C: {w, x} D: {x} K: {w, y} Prefix-blocking lw: size of a block lb: blocking parameter If lw > lb, remove that block • Output: concept blocks
  • 33. TWCIEWS Model Input Entities Prefix- blocking Blocking Computing Similarity Entity Blocks Final Matches Selecting Matches Information Entropy based Weighted Similarity (IEWS) Model Information Entropy Computation Triple-wise Similarity Computation Similarity Matrix
  • 34. TWCTriple-wise Similarity Input Entities Prefix- blocking Blocking Computing Similarity Entity Blocks Final Matches Selecting Matches Information Entropy based Weighted Similarity (IEWS) Model Information Entropy Computation Triple-wise Similarity Computation Similarity Matrix Triple-wise Similarity Computation • Similarity between two concepts c and c’ – Similarity between triples describing the two concepts – Triple-wise (pv) similarity: Simpv • A challenge: property mapping _:Boston rdfs:type _:t1 _:t1 rdfs:label ‘City’ is same as _:Boston _:category ‘City’. Example: property mapping 𝑆𝑖𝑚 𝑝𝑣 𝑝𝑣, 𝑝𝑣′ = 1. 𝑆𝑖𝑚𝑙 𝑣, 𝑣′ 𝑖𝑓 𝑣 𝑎𝑛𝑑 𝑣′ 𝑎𝑟𝑒 𝑏𝑜𝑡ℎ 𝑙𝑖𝑡𝑒𝑟𝑎𝑙 2. 𝑆𝑖𝑚 𝐹 𝑣, 𝑣′ 𝑖𝑓 𝑣 𝑎𝑛𝑑 𝑣′ 𝑎𝑟𝑒 𝑏𝑜𝑡ℎ 𝑈𝑅𝐼 3. 𝐺𝑒𝑡 𝑙𝑖𝑡𝑒𝑟𝑎𝑙 𝑣𝑎𝑙𝑢𝑒 𝑎𝑛𝑑 𝑡ℎ𝑒𝑛 𝑢𝑠𝑒 𝑆𝑖𝑚𝑙 𝑖𝑓 𝑣 𝑖𝑠 𝑙𝑖𝑡𝑒𝑟𝑎𝑙 𝑎𝑛𝑑 𝑣′ 𝑖𝑠 𝑈𝑅𝐼
  • 35. TWCSimilarity between Two Triple Values Input Entities Prefix- blocking Blocking Computing Similarity Entity Blocks Final Matches Selecting Matches Information Entropy based Weighted Similarity (IEWS) Model Information Entropy Computation Triple-wise Similarity Computation Similarity Matrix Triple-wise Similarity Computation 𝑆𝑖𝑚 𝑝𝑣 𝑝𝑣, 𝑝𝑣′ = 1. 𝑆𝑖𝑚𝑙 𝑣, 𝑣′ 𝑖𝑓 𝑣 𝑎𝑛𝑑 𝑣′ 𝑎𝑟𝑒 𝑏𝑜𝑡ℎ 𝑙𝑖𝑡𝑒𝑟𝑎𝑙 2. 𝑆𝑖𝑚 𝐹 𝑣, 𝑣′ 𝑖𝑓 𝑣 𝑎𝑛𝑑 𝑣′ 𝑎𝑟𝑒 𝑏𝑜𝑡ℎ 𝑈𝑅𝐼 3. 𝐺𝑒𝑡 𝑙𝑖𝑡𝑒𝑟𝑎𝑙 𝑣𝑎𝑙𝑢𝑒 𝑎𝑛𝑑 𝑡ℎ𝑒𝑛 𝑢𝑠𝑒 𝑆𝑖𝑚𝑙 𝑖𝑓 𝑣 𝑖𝑠 𝑙𝑖𝑡𝑒𝑟𝑎𝑙 𝑎𝑛𝑑 𝑣′ 𝑖𝑠 𝑈𝑅𝐼 • Similarity between two triple values • For case 1, compute similarity using Lin’s method (Lin, 1998. ICML) • For case 2, use another equation 𝑆𝑖𝑚 𝐹 𝑐, 𝑐′ recursively – Here we only traverse URIs to the depth of three • For case 3, first extract literal value of v’ and then use Siml
  • 36. TWCSimilarity between Two Concepts Input Entities Prefix- blocking Blocking Computing Similarity Entity Blocks Final Matches Selecting Matches Information Entropy based Weighted Similarity (IEWS) Model Information Entropy Computation Triple-wise Similarity Computation Similarity Matrix Triple-wise Similarity Computation 𝑆𝑖𝑚 𝑐, 𝑐′ = 𝑆𝑖𝑚 𝑝𝑣 𝑆𝑖𝑚 𝑝𝑣 + α 𝑃𝑉1 − 𝑆𝑖𝑚 𝑝𝑣 + β( 𝑃𝑉2 − 𝑆𝑖𝑚 𝑝𝑣) • Similarity between two concepts c and c’ • Apply Jaccard similarity (Jaccard, 1912. New Phytologist) • |𝑃𝑉1|: number of pvs in concept c • |𝑃𝑉2|: number of pvs in concept c’ • α and β: coefficients of variation on the similarity measure on c and c’ unique description
  • 37. TWCInformation Entropy Input Entities Prefix- blocking Blocking Computing Similarity Entity Blocks Final Matches Selecting Matches Information Entropy based Weighted Similarity (IEWS) Model Information Entropy Computation Triple-wise Similarity Computation Similarity Matrix • Information entropy: Quantified measure of uncertainty of information content (Shannon, 1948) • The amount of information of a property can be quantified as information entropy Properties are not equally important for concept description. A tripe describing the Social Security Number is more important than a triple of name, to identify a person Example • X: a property • P(xi): possibility of X obtaining a particular value xi • Information Entropy of X: 𝐻 𝑋 = − 𝑖=1 𝑛 𝑃 𝑥𝑖 𝑙𝑜𝑔 𝑏 (𝑃 𝑥𝑖 ) Information Entropy Computation
  • 38. TWCJoint Information Entropy Input Entities Prefix- blocking Blocking Computing Similarity Entity Blocks Final Matches Selecting Matches Information Entropy based Weighted Similarity (IEWS) Model Information Entropy Computation Triple-wise Similarity Computation Similarity Matrix • Joint information entropy: a measure of the uncertainty associated with a set of variables • X1, …, Xn: a list of properties • x1, … , xn: particular values of X1, …, Xn, respectively • P(x1, … , xn): possibility of those values occurring together • Joint Information Entropy of X1, …, Xn : 𝐻 X1, …, Xn = − 𝑥1 … 𝑥 𝑛 𝑃 𝑥1, … , 𝑥𝑛 𝑙𝑜𝑔 𝑏[𝑃 𝑥1, … , 𝑥𝑛 ] Information Entropy Computation
  • 39. TWCInformation Entropy Input Entities Prefix- blocking Blocking Computing Similarity Entity Blocks Final Matches Selecting Matches Information Entropy based Weighted Similarity (IEWS) Model Information Entropy Computation Triple-wise Similarity Computation Similarity Matrix • P: the set of properties in 𝑆𝑖𝑚 𝑝𝑣 • H(P): information entropy of the common descriptions 𝑆𝑖𝑚 𝐹 𝑐, 𝑐′ = 𝐻(𝑃) 𝑆𝑖𝑚 𝑝𝑣 𝑆𝑖𝑚 𝑝𝑣 + α 𝑃𝑉1 − 𝑆𝑖𝑚 𝑝𝑣 + β( 𝑃𝑉2 − 𝑆𝑖𝑚 𝑝𝑣)
  • 40. TWCSelecting Matches between Concepts Input Entities Prefix- blocking Blocking Computing Similarity Entity Blocks Final Matches Selecting Matches Information Entropy based Weighted Similarity (IEWS) Model Information Entropy Computation Triple-wise Similarity Computation Similarity Matrix Final Matches Selecting Matches Similarity Matrix
  • 41. TWCSummary • eScience: the digital or electronic facilitation of science • Semantic eScience – A virtuous circle between science and semantic technologies – Data driven + Knowledge driven? • My understanding of Semantic eScience: AIR3 – Anyone can say anything on any topic – Interoperability, interactivity, intercreativity – The right information for the right person at the right time 41 Thanks for listening @MarshallXMamax7@rpi.edu
  • 43. TWC 43 Earth Resource Form Environmental Impact Value Exploration Activity Type Exploration Result UNFC Value Earth Resource Expression Earth Resource Shape Enduse Potential Mineral Occurrence Type Mining Activity Type Processing Activity Type Mining Waste Type Value Commodity Code Mineral Deposit Group Mineral Deposit Type Product Value Recently finished CGI vocabularies • Construct a collection of vocabularies for populating information interchange documents and enabling interoperability • Provide labels for concepts, scope to various communities defined by language, science domain, or application domain CGI Geoscience Terminology Workgroup http://cgi-iugs.org/tech_collaboration/ geoscience_terminology_working_group.html
  • 44. TWCPrior to 2005, we built systems; Now we build frameworks • Rough definitions – Systems have very well-define entry and exit points. A user tends to know when they are using one. Options for extensions are limited and usually require engineering – Frameworks have many entry and use points. Users often do not know when they are using one. Extension points are part of the design – Platforms are built on frameworks (Fox, 2014) 44 Geoinformatics
  • 45. TWCSemantic eScience • Artificial Intelligence accelerates scientific discovery – Data search, synthesis and hypothesis representation – Data analysis: reasoning with models of the data (Gil et al., 2014) Image from science.com A state-of-the-art example: Hanalyzer (high-throughput analyzer) • Uses natural language processing to automatically extract a semantic network from all PubMed papers relevant to a scientist • Uses Semantic Web technology to integrate assertions from other biomedical sources • Reasons about the network to find new correlations that suggest new genes to investigate 45 (Leach et al., 2009)