SlideShare a Scribd company logo
1 of 128
Download to read offline
Wright State University
Wright State University
CORE Scholar
CORE Scholar
Kno.e.sis Publications
The Ohio Center of Excellence in Knowledge-
Enabled Computing (Kno.e.sis)
5-2007
Semantic Web: Technologies and Applications for the Real-World
Semantic Web: Technologies and Applications for the Real-World
Amit P. Sheth
Wright State University - Main Campus, amit@sc.edu
Follow this and additional works at: https://corescholar.libraries.wright.edu/knoesis
Part of the Bioinformatics Commons, Communication Technology and New Media Commons,
Databases and Information Systems Commons, OS and Networks Commons, and the Science and
Technology Studies Commons
Repository Citation
Repository Citation
Sheth, A. P. (2007). Semantic Web: Technologies and Applications for the Real-World. .
https://corescholar.libraries.wright.edu/knoesis/640
This Presentation is brought to you for free and open access by the The Ohio Center of Excellence in Knowledge-
Enabled Computing (Kno.e.sis) at CORE Scholar. It has been accepted for inclusion in Kno.e.sis Publications by an
authorized administrator of CORE Scholar. For more information, please contact library-corescholar@wright.edu.
Semantic Web: Technologies and
Applications for the Real-World
Amit Sheth
LexisNexis Ohio Eminent Scholar
Kno.e.sis Center
Wright State University
http://knoesis.wright.edu
Tutorial at WWW2007 by Amit Sheth and Susie Stephens, Banff, Alberta, Canada. May 2007
Semantic Web: Technologies and
Applications for the Real-World
Susie Stephens
Principal Research Scientist
Tutorial at WWW2007 by Amit Sheth and Susie Stephens, Banff, Alberta, Canada. May 2007
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Tutorial Outline
Introduction to the Semantic Web 1.30-2.00pm
Real-World Applications (1):
Enabling Technologies and Experiences
2.00-3.00pm
Break 3.00-3.30pm
Real-World Applications (2):
Details and War Stories, Case Studies
3.30-4.30pm
Semantic Web in Practice 4.30-5.00pm
3
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Introduction to the Semantic Web
4
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Agenda
• Characterizing the Semantic Web
• Semantic Web Standards
• Semantic Web Capabilities
5
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Characterizing the Semantic Web
• Semantic Web is an interoperability technology
• An architecture for interconnected communities and
vocabularies
• A set of interoperable standards for knowledge exchange
6
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Creating a Web of Data
Source: Ivan Herman
7
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Mashing Data
Source: W3C
8
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Drivers for the Semantic Web
• Business models develop rapidly these days, so infrastructure
that supports change is needed
• Organizations are increasingly forming and disbanding
collaborations
• Data is growing so quickly that it is no longer possible for
individuals to identify patterns in their heads
• Increasing recognition of the benefits of collective intelligence
9
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Semantic Web Technologies
Source: W3C
10
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Resource Description Framework (RDF)
• RDF became a W3C standard recommendation in 2004
• RDF is a language for representing information about
resources in the Web
• Common framework for expressing information
• Information may be made available to applications other
than those for which it was originally created
• Any information in RDF can be connected to any other
information in RDF
• Resources are identified with Uniform Resource
Identifiers (URIs)
11
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Resource Description Framework (RDF)
Source: W3C
12
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
RDF Schema (RDFS)
• Vocabulary for describing properties and classes of
RDF resources
• Semantics for hierarchies of properties and classes
• A resource may belong to several classes
13
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Web Ontology Language (OWL)
• OWL became a W3C standard recommendation in 2004
• Explicitly represents meaning of terms in vocabularies and
the relationships between those terms
• Separate layers have been defined balancing expressibility
vs. implementability (OWL Lite, OWL DL, OWL Full)
• Supports inferencing
14
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Merging Ontologies
Source: Siderean Software
15
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
SPARQL
• SPARQL is expected to become a W3C standard
recommendation in Q3 2007
• It is a query language based on graph patterns
• Protocol layer for using SPARQL over HTTP
• There are SPARQL endpoints on the Web
• SPARQL can be used to construct graphs
16
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
SPARQL as a Unifying Source
Source: Ivan Herman
17
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Gleaning Resource Descriptions from
Dialects of Languages (GRDDL)
• GRDDL is a mechanism for Gleaning Resource Descriptions
from Dialects of Languages
• Introduces markup based on existing standards for declaring
that an XML document includes data compatible with RDF and
for linking to algorithms for extracting this data from the document
• GRDDL should reach W3C standard recommendation in Q2
2007
18
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Rules
• OWL DL and OWL Lite are based on Description Logic;
there are things that DL cannot express
• Attempts to combine ontologies and rules include RuleML,
SWRL, cwm, etc.
• There is also an increasing number of rule-based systems
that want to interchange rules
• The W3C Rule Interchange Format Working Group is
focused on this area
19
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Semantic Annotations for WSDL (SAWSDL)
• Develop a mechanism to enable semantic annotation of Web
service descriptions
• Using the WSDL 2.0 extension mechanism to build simple and
generic support for adding service descriptions for Web services
• This W3C standard is at Candidate Recommendation
• Tools and Use cases available (Google: SAWSDL)
20
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Enhanced Enterprise Search
Source: Oracle OTN
21
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Improved Reliability of Search Results
Source: Segala
22
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Web of Data
Source: Freebase
23
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Data Mashups
Source: BioDASH
24
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Data Mashups
Source: BioDASH
25
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Document Management
Source: MGH
26
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Semantic Data Warehouse
Source: University of Texas
27
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Heterogeneous Data Aggregation
• Ordnance Survey maintains the definitive mapping data of
Great Britain
• Use ontologies to bridge semantically diverse sources of data,
e.g., Hydrology, Administrative Geography, Buildings and
Places, etc.
• Efficient queries via the ontology or directly in RDF
• Advantages include efficient data integration, data
repurposing, better quality control and classification
28
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Integration with Semantic Mediation
• BT uses semantic descriptions of system interfaces and
messages to support integration of Operational Support
Systems (OSS)
• Internet Service Providers integrate their OSS with those of
BT (via a gateway)
• The approach helps overcome the increasing complexity of
supply chains, reduces costs and time-to-market, ontologies
allow for a reuse of services
29
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Natural Language Query of Applications
• Tata Consulting Services provides a natural language
interface to business applications
• Users pose questions and invoke actions using natural
language
• An OWL ontology aids in the retrieval of relevant data and
concepts
• Advantages include distinct semantics for various domain
concepts, external concepts linked in, simple interface
30
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Decision Support
Source: AGFA
31
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Summary
• The Semantic Web provides functionality of interest to business,
scientific and Web communities
• Capabilities are provided in data integration, search, semantic
mediation, decision support, etc.
• Semantic Web standards are maturing
•The Semantic Web is increasingly being adopted
32
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Real World Applications
with Enabling Capabilities/Technologies
33
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Enablers and Techniques
Semantic Web Application Lifecycle
34
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Syntax, Structure and Semantics
Semantics:
Meaning &
Use of Data
35
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Describing Semantic Web to
Nontechnical Users
Labeling data on Web so that both humans and
machines can more effectively use them
Associating meaning to data that machines can
understand so as to achieve lot more automation and
off-load more work to machines
Exploiting common vocabulary and richer modeling of
subject area for much better integration of data
36
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Ontology: Agreement with Common Vocabulary &
Domain Knowledge; Schema + Knowledge base
Semantic Annotation (meatadata Extraction): Manual,
Semi-automatic (automatic with human verification),
Automatic
Reasoning/computation: semantics enabled search,
integration, complex queries, analysis (paths,
subgraph), pattern finding, mining, hypothesis
validation, discovery, visualization
Enablers and Techniques
37
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
A Typical Enterprise SW Application
Lifecycle
• Build Ontology
– Build Schema (model level representation
– Populate with Knowledgebase (people, location,
organizations, events)
• Automatic Semantic Annotation (Extract Semantic
Metadata)
– Any type of document, multiple sources of documents
– Metadata can be stored with or sparely from documents
• Applications: search (ranked list of documents of interest
(semantic search), integrate/portal, summarize/explain, analyze,
make decisions
– Reasoning techniques: graph analysis, inferencing
Types of content/documents, Use of standards, Scalability,
Performance
38
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Semagix Freedom Architecture: for building
ontology-driven information system
© Semagix, Inc.
Managing Semantic Content on the Web: http://portal.acm.org/citation.cfm?id=613729
39
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Building ontology
Three broad approaches:
Option 1: social process/manual: many years, committees
– Can be based on metadata standard
Option 2: automatic taxonomy generation (statistical clustering/NLP):
limitation/problems on quality, dependence on corpus, naming
Option 3: Descriptional component (schema) designed by domain experts;
Description base (assertional component, extension) by automated
processes
Option 2 is being investigated in several research projects;
Option 3 is currently supported by technologies such as Semagix Freedom
40
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Time, Space
Gene Ontology, Glycomics
Pharma Drug, Treatment-Diagnosis
Repertoire Management
Equity Markets
Anti-money Laundering, Financial Risk, Terrorism
Can be Public, Government, Limited Availability, Commercial
Ontology Examples
41
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Ontology Language/ Representation
Spectrum
Is Disjoint Subclass of
with transitivity
property
Modal Logic
Logical Theory
Thesaurus
Has Narrower Meaning Than
Taxonomy
Is Sub-Classification of
Conceptual Model
Is Subclass of
DB Schemas, XML Schema
First Order Logic
Relational Model
XML
ER
Extended ER
Description Logic
DAML+OIL, OWL, UML
RDFS, XTM
Syntactic Interoperability
Structural Interoperability
Semantic Interoperability
42
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Ontology
Semantic Query
Server
1. Ontology Model Creation (Description) 2. Knowledge Agent Creation
3. Automatic aggregation of Knowledge
4. Querying the Ontology
Ontology Creation and Maintenance
Steps (Approach 1)
© Semagix, Inc.
43
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Some observations on Ontology
Development and Maintenance
• Type of domain
Modeling of human centric world (sports,
entertainment, legal, financial services)
Versus
Natural world (life sciences, astronomy, …)
• Schema
• Sources of knowledge
44
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
GlycO
is a focused ontology for the description of glycomics
models the biosynthesis, metabolism, and biological
relevance of complex glycans
models complex carbohydrates as sets of simpler
structures that are connected with rich relationships
An ontology for structure and function of Glycopeptides
Published through the National Center for Biomedical
Ontology (NCBO) and Open Biomedical Ontologies (OBO)
See:GlycoDoc, GlycO
45
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
GlycO
Challenge – model hundreds of thousands of complex
carbohydrate entities
But, the differences between the entities are small (E.g.
just one component)
How to model all the concepts but preclude redundancy
→ ensure maintainability, scalability
47
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
GlycO population
Assumption: with a large body of background
knowledge, learning and extraction techniques can be
used to assert facts.
Asserted facts are compositions of individual building
blocks
Because the building blocks are richly described, the
extracted larger structures will be of high quality
48
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
GlycO Population
Multiple data sources used in populating the ontology
o KEGG - Kyoto Encyclopedia of Genes and Genomes
o SWEETDB
o CARBBANK Database
Each data source has a different schema for storing
data
There is significant overlap of instances in the data
sources
Hence, entity disambiguation and a common
representational format are needed
49
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Diverse Data From Multiple Sources
Assures Quality
Democratic principle
Some sources can be wrong, but not all will be
More likely to have homogeneity in correct data than in
erroneous data
50
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Has
CarbBank
ID?
IUPAC to
LINUCS
LINUCS to
GLYDE
Compare to
Knowledge
Base
Already in
KB?
YES
NO
Semagix Freedom knowledge
extractor
Instance
Data
YES:
next Instance
Insert into
KB
NO
Ontology population workflow
51
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Has
CarbBank
ID?
IUPAC to
LINUCS
LINUCS to
GLYDE
Compare to
Knowledge
Base
Already in
KB?
YES
NO
Semagix Freedom knowledge
extractor
Instance
Data
YES:
next Instance
Insert into
KB
NO
[][Asn]{[(4+1)][b-D-GlcpNAc]
{[(4+1)][b-D-GlcpNAc]
{[(4+1)][b-D-Manp]
{[(3+1)][a-D-Manp]
{[(2+1)][b-D-GlcpNAc]
{}[(4+1)][b-D-GlcpNAc]
{}}[(6+1)][a-D-Manp]
{[(2+1)][b-D-GlcpNAc]{}}}}}}
Ontology population workflow
52
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Has
CarbBank
ID?
IUPAC to
LINUCS
LINUCS to
GLYDE
Compare to
Knowledge
Base
Already in
KB?
YES
NO
Semagix Freedom knowledge
extractor
Instance
Data
YES:
next Instance
Insert into
KB
NO
<Glycan>
<aglycon name="Asn"/>
<residue link="4" anomeric_carbon="1" anomer="b" chirality="D" monosaccharide="GlcNAc">
<residue link="4" anomeric_carbon="1" anomer="b" chirality="D" monosaccharide="GlcNAc">
<residue link="4" anomeric_carbon="1" anomer="b" chirality="D" monosaccharide="Man" >
<residue link="3" anomeric_carbon="1" anomer="a" chirality="D" monosaccharide="Man" >
<residue link="2" anomeric_carbon="1" anomer="b" chirality="D" monosaccharide="GlcNAc" >
</residue>
<residue link="4" anomeric_carbon="1" anomer="b" chirality="D" monosaccharide="GlcNAc" >
</residue>
</residue>
<residue link="6" anomeric_carbon="1" anomer="a" chirality="D" monosaccharide="Man" >
<residue link="2" anomeric_carbon="1" anomer="b" chirality="D" monosaccharide="GlcNAc">
</residue>
</residue>
</residue>
</residue>
</residue>
</Glycan>
Ontology population workflow
53
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Has
CarbBank
ID?
IUPAC to
LINUCS
LINUCS to
GLYDE
Compare to
Knowledge
Base
Already in
KB?
YES
NO
Semagix Freedom knowledge
extractor
Instance
Data
YES:
next Instance
Insert into
KB
NO
Ontology population workflow
54
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Diverse Data From Multiple Sources
Assures Quality
Holds only, when the data in each source is independent
In the case of GlycO, the sources that were meant to assure
quality were not diverse.
One original source (Carbbank) was copied by several
Databases without curation
Errors in the original propagated
Errors in KEGG and Carbbank are the same
Cannot use these sources for comparison
Needs curation by the expert community
?
55
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
N. Takahashi and K. Kato, Trends in Glycosciences and Glycotechnology, 15: 235-251
β-D-GlcpNAc
β-D-GlcpNAc
β-D-Manp-(1-4)- -(1-4)-
α-D-Manp -(1-6)+
β-D-GlcpNAc-(1-2)-
α-D-Manp -(1-3)+
β-D-GlcpNAc-(1-4)-
β-D-GlcpNAc-(1-2)+
GlycoTree
56
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
N-Glycosylation metabolic pathway
GNT-I
attaches GlcNAc at position 2
UDP-N-acetyl-D-glucosamine + alpha-D-Mannosyl-1,3-(R1)-beta-D-mannosyl-R2
<=>
UDP + N-Acetyl-$beta-D-glucosaminyl-1,2-alpha-D-mannosyl-1,3-(R1)-beta-D-mannosyl-$R2
GNT-V
attaches GlcNAc at position 6
UDP-N-acetyl-D-glucosamine + G00020 <=> UDP + G00021
N-acetyl-glucosaminyl_transferase_V
N-glycan_beta_GlcNAc_9
N-glycan_alpha_man_4
58
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Pathway Steps - Glycan
Pathway visualization tool by M. Eavenson and M. Janik, LSDIS Lab, Univ. of Georgia
Abundance of this glycan
in three experiments
60
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 61
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
An ontology for capturing process and lifecycle information related to
proteomic experiments
Two aspects of glycoproteomics:
What is it? → identification
How much of it is there? → quantification
Heterogeneity in data generation process, instrumental parameters,
formats
Need data and process provenance → ontology-mediated provenance
Hence, ProPreO models both the glycoproteomics experimental process
and attendant data
Published through the National Center for Biomedical Ontology (NCBO)
and Open Biomedical Ontologies (OBO)
ProPreO ontology
More at: http://knoesis.wright.edu/research/bioinformatics/
62
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Semantic Annotation
Shallow Annotation: SemTag
Intermediate Annotation:
• SEE/Semagix: http://lsdis.cs.uga.edu/library/download/HSK02-SEE.pdf
• Ontotext: http://www.ontotext.com/kim/semanticannotation.html)
Advanced: Schema-driven Relationship Extraction
Engine (SCREEN): http://knoesis.wright.edu/research/discovery
63
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Extracting a Text Document:
Syntactic approach
INCIDENT MANAGEMENT SITUATION REPORT
Friday August 1, 1997 - 0530 MDT
NATIONAL PREPAREDNESS LEVEL II
CURRENT SITUATION: Alaska continues to experience large fire activity. Additional fires have been
staffed for structure protection.
SIMELS, Galena District, BLM. This fire is on the east side of the Innoko Flats, between Galena and McGr
The fore is active on the southern perimeter, which is burning into a continuous stand of black spruce. The
fire has increased in size, but was not mapped due to thick smoke. The slopover on the eastern perimeter is
35% contained, while protection of the historic cabit continues.
CHINIKLIK MOUNTAIN, Galena District, BLM. A Type II Incident Management Team (Wehking) is
assigned to the Chiniklik fire. The fire is contained. Major areas of heat have been mopped up. The fire is
contained. Major areas of heat have been mopped-up. All crews and overhead will mop-up where the fire
burned beyond the meadows. No flare-ups occurred today. Demobilization is planned for this weekend,
depending on the results of infrared scanning.
LAYOUT
Date => day month int ‘,’int
64
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Extraction
Agent
Web Page Enhanced Metadata Asset
Taalee Extraction and Knowledgebase
Enhancement
Taalee Semantic Engine, also IBM Semantics Tools
65
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Metadata Extraction and Semantic
Enhancement
Semantic Enhancement Engine
[Hammond, Sheth, Kochut 2002]
Also, WebFountatin,
KIM from OntoText
66
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Semantic Annotation - document
67
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Sample Created Metadata
<Entity id="122805"
class="DrugOntology#prescription_drug_brandname">
Bextra
<Relationship id=”442134”
class="DrugOntology#has_interaction">
<Entity id="14280" class="DrugOntology
#interaction_with_physical_condition>sulfa allergy
</Entity>
</Relationship>
</Entity>
Excerpt of Drug Ontology
Excerpt of Drug Ontology
Semantic Annotation – news feed
68
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Semantic annotation of
scientific/experimental data
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Scientific Data
Computational Methods
Ontology instances
ProPreO population: transformation to rdf
70
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
830.9570 194.9604 2
580.2985 0.3592
688.3214 0.2526
779.4759 38.4939
784.3607 21.7736
1543.7476 1.3822
1544.7595 2.9977
1562.8113 37.4790
1660.7776 476.5043
parent ion m/z
fragment ion m/z
ms/ms peaklist data
fragment ion
abundance
parent ion
abundance
parent ion charge
ProPreO: Ontology-mediated provenance
Mass Spectrometry (MS) Data
http://knoesis.org/research/bioinformatics
71
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
<ms-ms_peak_list>
<parameter instrument=“micromass_QTOF_2_quadropole_time_of_flight_mass_spectrometer”
mode=“ms-ms”/>
<parent_ion m-z=“830.9570” abundance=“194.9604” z=“2”/>
<fragment_ion m-z=“580.2985” abundance=“0.3592”/>
<fragment_ion m-z=“688.3214” abundance=“0.2526”/>
<fragment_ion m-z=“779.4759” abundance=“38.4939”/>
<fragment_ion m-z=“784.3607” abundance=“21.7736”/>
<fragment_ion m-z=“1543.7476” abundance=“1.3822”/>
<fragment_ion m-z=“1544.7595” abundance=“2.9977”/>
<fragment_ion m-z=“1562.8113” abundance=“37.4790”/>
<fragment_ion m-z=“1660.7776” abundance=“476.5043”/>
</ms-ms_peak_list>
Ontological
Concepts
ProPreO: Ontology-mediated provenance
Semantically Annotated MS Data
http://knoesis.org/research/bioinformatics
72
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Real World Applications
Application Types – by broad capabilities
•Search (also browsing, personalization)
•Integration (also interoperability)
•Analysis (also visualization)
73
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Characterizing applications
Dimension: Maturity: Prototypes and demonstrations—show
case research, technique
Systems using real world data, real <- our focus
Operational Systems <- our focus
Dimension: Type of use: General User, Consumer, Targeted
Users/community on the Web,
Business/Science/Engineering/Government User
Dimension: scalability -- Enterprise to Web
Dimension: realism Toy/synthetic data to Real World Data
74
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Review of Applications
Ontology Development, Ontology Population Sources,
Ontology Quality Issues
Semantic Annotation/Metadata Extraction
Application Development Platforms/Support
Standards Usage
Scalability
Performance
75
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Taalee’s Semantic Search
Highly customizable, precise and freshest A/V search
Context and Domain Specific Attributes Uniform Metadata for Content from Multiple
Sources, Can be sorted by any field
Delightful, relevant information,
exceptional targeting opportunity
http://www.streamingmediaworld.com/gen/reviews/searchassociation/
76
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Creating
a
Web
of
related
information
What
can
a
context
do?
77
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Ontology driven Semantic Directory
Search for
company
‘Commerce
One’
Links to news on
companies that
compete against
Commerce One
Links to news on
companies Commerce
One competes against
(To view news on
Ariba, click on the link
for Ariba)
Crucial news on
Commerce One’s
competitors
(Ariba) can be
accessed easily
and automatically
78
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
What
else
can
a
context
do?
(a
commercial
perspective)
Semantic
Enrichment
Semantic Targeting
79
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Semantic/Interactive Targeting
Buy Al Pacino Videos
Buy Russell Crowe Videos
Buy Christopher Plummer Videos
Buy Diane Venora Videos
Buy Philip Baker Hall Videos
Buy The Insider Video
Precisely targeted through the use of Structured Metadata and
integration from multiple sources
80
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Focused
relevant
content
organized
by topic
(semantic
categorization)
Automatic Content
Aggregation
from multiple
content providers
and feeds
Related
news not
specifically
asked for
(Semantic
Associations)
Competitive
research
inferred
automatically
Automatic
3rd party
content
integration
Semantic Application – Equity Dashboard
Managing Semantic Content on the Web, Internet Computing 2002
81
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Active Semantic Medical Records
(operational since January 2006)
Goals:
• Increase efficiency
• Reduce Errors, Improve Patient Satisfaction & Reporting
• Improve Profitability (better billing)
Technologies:
• Ontologies, semantic annotations &
rules
• Service Oriented Architecture
Thanks -- Dr. Agrawal, Dr. Wingeth, and others. Active Semantic
Electronic Medical Record ISWC2006 paper
http://iswc2006.semanticweb.org/items/in_use_8.php
Semantic Applications: Health Care
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Passenger Threat Analysis
Need to Know -> Demo
Financial Irregularity *
* a classified application
Primary Funding by ARDA, Secondary Funding by NSF
An Ontological Approach to the Document Access Problem of Insider Threat
Semantic Web Applications in
Government
83
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Aim: Legislation (PATRIOT ACT) requires banks to identify ‘who’ they are doing business with
Problem
Volume of internal and external data needed to be accessed
Complex name matching and disambiguation criteria
Requirement to ‘risk score’ certain attributes of this data
Approach
Creation of a ‘risk ontology’ populated from trusted sources (OFAC etc); Sophisticated entity
disambiguation
Semantic querying, Rules specification & processing
Solution
Rapid and accurate KYC checks
Risk scoring of relationships allowing for prioritisation of results; Full visibility of sources and
trustworthiness
Semantic Application in a Global Bank
84
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Watch list Organization
Company
Hamas
WorldCom
FBI Watchlist
Ahmed Yaseer
appears on Watchlist
member of organization
works for Company
Ahmed Yaseer:
• Appears on
Watchlist ‘FBI’
• Works for Company
‘WorldCom’
• Member of
organization ‘Hamas’
The Process
85
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Example of
Fraud Prevention
application used in
financial services
User will be able to navigate
the ontology using a number
of different interfaces
World Wide
Web content
Public
Records
BLOGS,
RSS
Un-structure text, Semi-structured Data
Watch Lists
Law
Enforcement Regulators
Semi-structured Government Data
Establishing
New Account
Global Investment Bank
86
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
N-Glycosylation Process (NGP)
Cell Culture
Glycoprotein Fraction
Glycopeptides Fraction
extract
Separation technique I
Glycopeptides Fraction
n*m
n
Signal integration
Data correlation
Peptide Fraction
Peptide Fraction
ms data ms/ms data
ms peaklist ms/ms peaklist
Peptide list
N-dimensional array
Glycopeptide identification
and quantification
proteolysis
Separation technique II
PNGase
Mass spectrometry
Data reduction
Data reduction
Peptide identification
binning
n
1
87
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Workflow based on Web Services =
Web Process
88
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Semantic Web Process to incorporate provenance
Storage
Standard
Format
Data
Raw
Data
Filtered
Data
Search
Results
Final
Output
Agent Agent Agent
Agent
Biological
Sample
Analysis
by MS/MS
Raw Data
to
Standard
Format
Data
Pre-
process
DB
Search
(Mascot/
Sequest)
Results
Post-
process
(ProValt)
O I O I O I O I O
Biological Information
Semantic
Annotation
Applications
ISiS – Integrated Semantic Information
and Knowledge System
89
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
• Evaluate the specific effects of changing a biological parameter:
Retrieve abundance data for a given protein expressed by three
different cell types of a specific organism.
• Retrieve raw data supporting a structural assignment: Find all the
raw ms data files that contain the spectrum of a given peptide
sequence having a specific modification and charge state.
• Detect errors: Find and compare all peptide lists identified in
Mascot output files obtained using a similar organism, cell-type,
sample preparation protocol, and mass spectrometry
conditions.
ProPreO concepts highlighted in red
A Web Service
Must Be Invoked
Semantic Annotation Facilitates
Complex Queries
90
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
• Semantic Browser: contextual browsing of
PubMed
More Life Science Applications
More at Knoesis research in life sciences
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
9284
documents
4733
documents
Biologically
active substance
Lipid
Disease or
Syndrome
affects
causes
affects
causes
complicates
Fish Oils Raynaud’s Disease
???????
instance_of instance_of
5
documents
UMLS
MeSH
PubMed
Relationship Extraction
92
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
UMLS – A high level schema of the
biomedical domain
• 136 classes and 49 relationships
• Synonyms of all relationship – using variant lookup
(tools from NLM)
MeSH
• Terms already asserted as instance of one or more
classes in UMLS
PubMed
• Abstracts annotated with one or more MeSH terms
T147—effect
T147—induce
T147—etiology
T147—cause
T147—effecting
T147—induced
About the data used
93
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Abstract
Classification/Annotation
Example PubMed abstract (for the
domain expert)
94
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
SS-Tagger (University of Tokyo)
SS-Parser (University of Tokyo)
(TOP (S (NP (NP (DT An) (JJ excessive) (ADJP (JJ endogenous) (CC or) (JJ
exogenous) ) (NN stimulation) ) (PP (IN by) (NP (NN estrogen) ) ) ) (VP (VBZ
induces) (NP (NP (JJ adenomatous) (NN hyperplasia) ) (PP (IN of) (NP (DT
the) (NN endometrium) ) ) ) ) ) )
Method – Parse Sentences in PubMed
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Modifiers
Modified entities
Composite Entities
Method – Identify entities and
Relationships in Parse Tree
96
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
PubMed
Complex
Query
Supporting
Document
sets
retrieved
Migraine
Stress
Patient
affects
isa
Magnesium
Calcium Channel
Blockers
inhibit
Keyword query: Migraine[MH] + Magnesium[MH]
Hypothesis Driven retrieval of Scientific
Literature
97
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Eli Lilly Case Study
98
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Advances in Science
• Tremendous advances in biology over the last decade
• Sequencing of the human genome
• Technology for large scale expression studies and patient genotyping
• High resolution imaging (e.g. whole organism, cell based)
• Discovery and development of siRNA and other techniques
• Facilitates a more tailored approach to therapeutics
99
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Tailored Therapeutics
• The end of the blockbuster era
• The right dose to the right patient at the right time
• Lowering cost of pharmaceutical development
• Improved patient response to medication
100
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Critical Path Initiative
Source: Innovation or Stagnation, FDA Report, March 2004
101
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
The Web of Heterogeneous Data
Cell/Assay
Technologies
102
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Integrative Informatics Platform
Discovery
Clinical
Public
Proteomics Imaging
In-Vitro/Vivo
Genotyping
SNPs/Haplotypes
XREP
plus
IQ
Proteomics
Informatics
System/Semantic Integration Layer
Integrative Informatics
PGI
Lipid
Informatics
Tailored Therapeutic Workbench (TTW)
Integrative Data Mining/Query System
Lipidomics
Gene Expression
Info
Mining
Text
Translational
Informatics
103
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Discovery Target Assessment Tool
104
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Use Cases for RDF
• Patient Stratification
• If we could 'semantically' describe a patient with biomarker/genomic
properties, we could 'semantically' compare them. New definition of
similarity.
• Cellomics Data
• If we could 'semantically' describe cellular localization with other
properties, we could discover novel indicators (e.g. cell size relates
to expression) - emergent properties.
• Exon Descriptors
• If we could 'semantically' describe all the exons on a genome and
relate transcription and SNPs to other functional consequences, we
could 'reason' across the genome (i.e. query in interesting ways).
Adding power to interrogation.
105
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Summary
• The life sciences industry needs to increase productivity
• The Semantic Web is a promising technology for data
integration and decision support
• Lilly is using Semantic Web technologies for the Target
Assessment Tool within drug discovery
106
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Acknowledgements
John Reynders, Ph.D, Discovery Informatics
Alan Palkowitz, Ph.D., Discovery Chemistry
Jim Stephens, Ph.D., Discovery Toxicology
Jude Onyia, Ph.D., Discovery Biology
Harry Harlow, Ph.D., Integrative Biology
Dan Robertson, Ph.D., Computational Chemistry
Discovery Target Assessment (D-TAT) Team
107
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Semantic Web in Practice
108
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Agenda
• Tools
• Data / vocabularies
• Collateral
• Community
• Pointers for getting going
110
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Semantic Web Infrastructure
• Triple stores
• RDFizers
• Ontology Editors / Reasoning Systems
• Application Frameworks
111
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Triple Stores
• 3Store
• Aduna
• AllegroGraph
• Boca
• Joseki
• Kowari
• Mulgara
• Oracle RDF Data Model
• Profium Metadata Server
• RDF Gateway
• RDFStore
• Sesame
• Virtuoso
• YARS
There are many others available too…
112
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Boca, IBM
Source: IBM
113
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
RDF Data Model, Oracle
• Object-relational implementation
• Set of triples form an RDF/OWL graph (model)
• Optimized storage structure: repeated values stored only once
• Can handle multiple lexical forms of the same value
• Incremental load and bulk load
• SPARQL-like graph pattern embedded in SQL query
• Native inferencing for RDF, RDFS & user-defined rules
• Support for OWL and Semantic Operators in the next release
114
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Virtuoso, OpenLink
• Hybrid Data Server that combines SQL, RDF, XML, and Full
Text Data Management
• Includes a Virtual / Federated DBMS Layer that enables
transparent access to data from 3rd party SQL, RDF, XML, and
Web Services
• Produces RDF Instance Data in Physical and Virtual forms from
local or 3rd party data sources
• Provides full support for the SPARQL Query Language against
Physical and Virtual RDF Graphs
• Query Optimizer is specifically tuned for high-performance data
access across all realms
• Includes in-built middleware for producing RDF instance data
on-the-fly from non RDF Data Sources (e.g. (X)HTML,
Microformats, Web Services, Binary Files)
115
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Adapting SQL Databases
Source: Tim Berners-Lee
116
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Mapping Relational to RDF
• D2RQ
• SquirrelRDF
• DartGrid
• SPASQL
Source: DartGrid
117
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
RDFizers
• Relational -> RDF
• XML -> RDF
• Excel -> RDF
• JPEG -> RDF
• BibTEX -> RDF
A directory of RDFizers is provided at:
http://simile.mit.edu/wiki/RDFizers
• Java -> RDF
• Weather -> RDF
• Palm -> RDF
• Outlook -> RDF
• Flickr -> RDF
118
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Ontology Editors and Environments
Protégé, SWOOP, GrOWL, TopBraid, Ontotrack, SemanticWorks, ..
Source: Ian Horrocks
119
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Reasoning Systems
Pellet
KAON2
CEL
120
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Semantic Web Tools
• RDF programming environment for 14+ languages
• C, C++, C# and .Net, Haskell, Java, Javascript, Lisp, Obj-C, PHP,
Perl, Prolog, Python, Ruby, Tcl/Tk
• Selection of on-line validators
• BBN OWL Validator, OWL Consistency Checker, WonderWeb OWL-
DL Validator, RDF Validator, RDF/XML & N3 Validator, ConsVISor
OWL Consistency Checker
• SPARQL Endpoints
• SPARQLer, SPARQLette, XML Army Knife, OpenLink Virtuoso
•Semantic Web Crawlers
• Swoogle, SWSE, Zitgist
121
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Semantic Web Tools
• RDF Browsers
• BrowseRDF, /facet, Longwell, mSpace, Siderean Software, Exhibit
• Semantic Web Browsers
• DISCO, ObjectViewer, OpenLink RDF Browser, Tabulator Browser, Haystack
• Labeling
• Adobe XMP
• Information Extraction
• Amilcare, Language and Computing
• Visualization
• IsaViz, Perfuse, Tom Sawyer, RDF-Gravity
• Relationship Analytics
• Cogito
• Content Management
• Profium Semantic Information Router
• Information Integration
• Ontoprise, Software AG, @Semantics, webMethods, Revelytix, Ontology Works
Over 500 tools are now available
122
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Lists of Tools
• http://sites.wiwiss.fu-berlin.de/suhl/bizer/toolkits/index.htm
• http://esw.w3.org/topic/SemanticWebTools
• http://www.mkbergman.com/?p=291
• http://planetrdf.com/guide/
• http://www.sekt-project.org/resources/sekt_components.html
123
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
How to get RDF Data?
• Write your own RDF in your preferred syntax
• Add RDF to XML directly (in its own namespace), e.g. in SVG
• Use intelligent scrapers or wrappers to extract RDF from a Web
page and then generate automatically (e.g. via an XSLT script)
• Formalize the scraper approach with GRDDL
• RDFa extend (X)HTML by defining general attributes to add
metadata to any element
• Create bridge to relational databases
• Use bridge from other data sources
124
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
RDF Data
• Annotea Bookmark File
• DBLP
• dbpedia
• dbtune
• Geonames
• MusicBrainz
• RDF Book Mashup
• Revyu
• US Census Data
• WordNet
• BIND
• BrainPharm
• Entrez Gene
• HIVSDB
• KEGG
• NeuroNames
• Reactome
• SenseLab
• SWAN publication & hypothesis
• UniProt
125
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Vocabularies
• eClassOwl: eBusiness ontology for products and services
• Gene Ontology: describes gene and gene products
• BioPAX: for biological pathway data
• SKOS core: describes knowledge systems, thesauri, glossaries
• Dublin Core: about information resources, digital libraries, with
extensions for rights, permissions, digital rights management
• FOAF: about people and their organizations
• DOAP: on the descriptions of software products
• Music Ontology: describes CDs, music tracks, etc.
• SIOC: for semantically-Interlinked Online Communities
Source: Ivan Herman
126
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Collateral
• Much good information at W3C
• http://www.w3.org/2001/sw/
• New FAQ on the Semantic Web
• http://www.w3.org/2001/sw/SW-FAQ
• Semantic Web Case Studies and Use Cases
• http://www.w3.org/2001/sw/sweo/public/UseCases
• List of Semantic Web books
• http://esw.w3.org/topic/SwBooks
• Dave Beckett’s Resources
• PlanetRDF a blog aggregator on Semantic Web topics
127
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Public Fora at W3C
• Semantic Web Interest Group
• A forum for developers with an archived mailing list, and a constant
IRC presence on freenode.net#swig
• Semantic Web for Health Care & Life Sciences: SW-HCLS
•Semantic Web Deployment Working Group
• Archives of working group are public
• Semantic Web Education and Outreach IG
• Community Projects
– Whitelisting Email Senders with FOAF
– Linking Open Data on the Semantic Web
– Knowee Contact Organizer
– POWDER Browser Extension
128
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Pointers for Getting Going
• Use robust URIs
• Reuse existing data and ontologies
• A little semantics goes a long way
• Model the real world rather than data artifacts
• Build upon your infrastructure incrementally
129
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Summary
• Many Semantic Web tools are available
• Data and vocabularies are increasingly being made
available in RDF/OWL
• Many books, tutorial and overviews are available to help
you get going
• Several public fora for community activities
130
Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007
Copy of the tutorial will be available at:
http://knoesis.wright.edu -> Library
131

More Related Content

Similar to emantic web technologies and applications for Ins

Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentPeter Haase
 
Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...Gautier Poupeau
 
Linked Open Data_mlanet13
Linked Open Data_mlanet13Linked Open Data_mlanet13
Linked Open Data_mlanet13Kristi Holmes
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosEUCLID project
 
Towards Semantic APIs for Research Data Services (Invited Talk)
Towards Semantic APIs for Research Data Services (Invited Talk)Towards Semantic APIs for Research Data Services (Invited Talk)
Towards Semantic APIs for Research Data Services (Invited Talk)Anna Fensel
 
How does semantic technology work?
How does semantic technology work? How does semantic technology work?
How does semantic technology work? Graeme Wood
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareIMC Technologies
 
Ontologies for Emergency & Disaster Management
Ontologies for Emergency & Disaster Management Ontologies for Emergency & Disaster Management
Ontologies for Emergency & Disaster Management Stephane Fellah
 
Linked Data Platform as a novel approach for Enterprise Application Integra...
Linked Data Platform as a novel approach for Enterprise Application Integra...Linked Data Platform as a novel approach for Enterprise Application Integra...
Linked Data Platform as a novel approach for Enterprise Application Integra...Nandana Mihindukulasooriya
 
The Web of data and web data commons
The Web of data and web data commonsThe Web of data and web data commons
The Web of data and web data commonsJesse Wang
 
Open standards for linked organisations | meeting Estonia - Flemish Governmen...
Open standards for linked organisations | meeting Estonia - Flemish Governmen...Open standards for linked organisations | meeting Estonia - Flemish Governmen...
Open standards for linked organisations | meeting Estonia - Flemish Governmen...Raf Buyle
 
Semantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchSemantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchChimezie Ogbuji
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic WebOscar Corcho
 
Eiah Data Model: Semantic Interoperability between Distributed Digital Reposi...
Eiah Data Model: Semantic Interoperability between Distributed Digital Reposi...Eiah Data Model: Semantic Interoperability between Distributed Digital Reposi...
Eiah Data Model: Semantic Interoperability between Distributed Digital Reposi...Saeed Moaddeli
 
Core Geospatial Ontologies
Core Geospatial OntologiesCore Geospatial Ontologies
Core Geospatial OntologiesStephane Fellah
 
Building an electronic repository and archives on Dataverse in the European O...
Building an electronic repository and archives on Dataverse in the European O...Building an electronic repository and archives on Dataverse in the European O...
Building an electronic repository and archives on Dataverse in the European O...vty
 
WebGUI And The Semantic Web
WebGUI And The Semantic WebWebGUI And The Semantic Web
WebGUI And The Semantic WebWilliam McKee
 

Similar to emantic web technologies and applications for Ins (20)

Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application Development
 
Linked (Open) Data
Linked (Open) DataLinked (Open) Data
Linked (Open) Data
 
Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...
 
Linked Open Data_mlanet13
Linked Open Data_mlanet13Linked Open Data_mlanet13
Linked Open Data_mlanet13
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application Scenarios
 
Towards Semantic APIs for Research Data Services (Invited Talk)
Towards Semantic APIs for Research Data Services (Invited Talk)Towards Semantic APIs for Research Data Services (Invited Talk)
Towards Semantic APIs for Research Data Services (Invited Talk)
 
How does semantic technology work?
How does semantic technology work? How does semantic technology work?
How does semantic technology work?
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the Software
 
Ontologies for Emergency & Disaster Management
Ontologies for Emergency & Disaster Management Ontologies for Emergency & Disaster Management
Ontologies for Emergency & Disaster Management
 
Linked Data Platform as a novel approach for Enterprise Application Integra...
Linked Data Platform as a novel approach for Enterprise Application Integra...Linked Data Platform as a novel approach for Enterprise Application Integra...
Linked Data Platform as a novel approach for Enterprise Application Integra...
 
The Web of data and web data commons
The Web of data and web data commonsThe Web of data and web data commons
The Web of data and web data commons
 
Linked Data to Improve the OER Experience
Linked Data to Improve the OER ExperienceLinked Data to Improve the OER Experience
Linked Data to Improve the OER Experience
 
Open standards for linked organisations | meeting Estonia - Flemish Governmen...
Open standards for linked organisations | meeting Estonia - Flemish Governmen...Open standards for linked organisations | meeting Estonia - Flemish Governmen...
Open standards for linked organisations | meeting Estonia - Flemish Governmen...
 
Semantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchSemantic Web use cases in outcomes research
Semantic Web use cases in outcomes research
 
Semantic web
Semantic webSemantic web
Semantic web
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
Eiah Data Model: Semantic Interoperability between Distributed Digital Reposi...
Eiah Data Model: Semantic Interoperability between Distributed Digital Reposi...Eiah Data Model: Semantic Interoperability between Distributed Digital Reposi...
Eiah Data Model: Semantic Interoperability between Distributed Digital Reposi...
 
Core Geospatial Ontologies
Core Geospatial OntologiesCore Geospatial Ontologies
Core Geospatial Ontologies
 
Building an electronic repository and archives on Dataverse in the European O...
Building an electronic repository and archives on Dataverse in the European O...Building an electronic repository and archives on Dataverse in the European O...
Building an electronic repository and archives on Dataverse in the European O...
 
WebGUI And The Semantic Web
WebGUI And The Semantic WebWebGUI And The Semantic Web
WebGUI And The Semantic Web
 

Recently uploaded

定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一Fs
 
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girl
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call GirlVIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girl
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girladitipandeya
 
Git and Github workshop GDSC MLRITM
Git and Github  workshop GDSC MLRITMGit and Github  workshop GDSC MLRITM
Git and Github workshop GDSC MLRITMgdsc13
 
Russian Call girls in Dubai +971563133746 Dubai Call girls
Russian  Call girls in Dubai +971563133746 Dubai  Call girlsRussian  Call girls in Dubai +971563133746 Dubai  Call girls
Russian Call girls in Dubai +971563133746 Dubai Call girlsstephieert
 
Complet Documnetation for Smart Assistant Application for Disabled Person
Complet Documnetation   for Smart Assistant Application for Disabled PersonComplet Documnetation   for Smart Assistant Application for Disabled Person
Complet Documnetation for Smart Assistant Application for Disabled Personfurqan222004
 
VIP Kolkata Call Girl Alambazar 👉 8250192130 Available With Room
VIP Kolkata Call Girl Alambazar 👉 8250192130  Available With RoomVIP Kolkata Call Girl Alambazar 👉 8250192130  Available With Room
VIP Kolkata Call Girl Alambazar 👉 8250192130 Available With Roomdivyansh0kumar0
 
Denver Web Design brochure for public viewing
Denver Web Design brochure for public viewingDenver Web Design brochure for public viewing
Denver Web Design brochure for public viewingbigorange77
 
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...aditipandeya
 
Sushant Golf City / best call girls in Lucknow | Service-oriented sexy call g...
Sushant Golf City / best call girls in Lucknow | Service-oriented sexy call g...Sushant Golf City / best call girls in Lucknow | Service-oriented sexy call g...
Sushant Golf City / best call girls in Lucknow | Service-oriented sexy call g...akbard9823
 
VIP Kolkata Call Girls Salt Lake 8250192130 Available With Room
VIP Kolkata Call Girls Salt Lake 8250192130 Available With RoomVIP Kolkata Call Girls Salt Lake 8250192130 Available With Room
VIP Kolkata Call Girls Salt Lake 8250192130 Available With Roomgirls4nights
 
Russian Call Girls in Kolkata Ishita 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Ishita 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Ishita 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Ishita 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 
Low Rate Call Girls Kolkata Avani 🤌 8250192130 🚀 Vip Call Girls Kolkata
Low Rate Call Girls Kolkata Avani 🤌  8250192130 🚀 Vip Call Girls KolkataLow Rate Call Girls Kolkata Avani 🤌  8250192130 🚀 Vip Call Girls Kolkata
Low Rate Call Girls Kolkata Avani 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一Fs
 
VIP Call Girls Kolkata Ananya 🤌 8250192130 🚀 Vip Call Girls Kolkata
VIP Call Girls Kolkata Ananya 🤌  8250192130 🚀 Vip Call Girls KolkataVIP Call Girls Kolkata Ananya 🤌  8250192130 🚀 Vip Call Girls Kolkata
VIP Call Girls Kolkata Ananya 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girlsstephieert
 
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一Fs
 

Recently uploaded (20)

young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
 
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
 
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girl
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call GirlVIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girl
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girl
 
Rohini Sector 26 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 26 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 26 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 26 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
Git and Github workshop GDSC MLRITM
Git and Github  workshop GDSC MLRITMGit and Github  workshop GDSC MLRITM
Git and Github workshop GDSC MLRITM
 
Russian Call girls in Dubai +971563133746 Dubai Call girls
Russian  Call girls in Dubai +971563133746 Dubai  Call girlsRussian  Call girls in Dubai +971563133746 Dubai  Call girls
Russian Call girls in Dubai +971563133746 Dubai Call girls
 
Call Girls In South Ex 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In South Ex 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICECall Girls In South Ex 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In South Ex 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
 
Complet Documnetation for Smart Assistant Application for Disabled Person
Complet Documnetation   for Smart Assistant Application for Disabled PersonComplet Documnetation   for Smart Assistant Application for Disabled Person
Complet Documnetation for Smart Assistant Application for Disabled Person
 
VIP Kolkata Call Girl Alambazar 👉 8250192130 Available With Room
VIP Kolkata Call Girl Alambazar 👉 8250192130  Available With RoomVIP Kolkata Call Girl Alambazar 👉 8250192130  Available With Room
VIP Kolkata Call Girl Alambazar 👉 8250192130 Available With Room
 
Denver Web Design brochure for public viewing
Denver Web Design brochure for public viewingDenver Web Design brochure for public viewing
Denver Web Design brochure for public viewing
 
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...
 
Sushant Golf City / best call girls in Lucknow | Service-oriented sexy call g...
Sushant Golf City / best call girls in Lucknow | Service-oriented sexy call g...Sushant Golf City / best call girls in Lucknow | Service-oriented sexy call g...
Sushant Golf City / best call girls in Lucknow | Service-oriented sexy call g...
 
VIP Kolkata Call Girls Salt Lake 8250192130 Available With Room
VIP Kolkata Call Girls Salt Lake 8250192130 Available With RoomVIP Kolkata Call Girls Salt Lake 8250192130 Available With Room
VIP Kolkata Call Girls Salt Lake 8250192130 Available With Room
 
Russian Call Girls in Kolkata Ishita 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Ishita 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Ishita 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Ishita 🤌 8250192130 🚀 Vip Call Girls Kolkata
 
Low Rate Call Girls Kolkata Avani 🤌 8250192130 🚀 Vip Call Girls Kolkata
Low Rate Call Girls Kolkata Avani 🤌  8250192130 🚀 Vip Call Girls KolkataLow Rate Call Girls Kolkata Avani 🤌  8250192130 🚀 Vip Call Girls Kolkata
Low Rate Call Girls Kolkata Avani 🤌 8250192130 🚀 Vip Call Girls Kolkata
 
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
 
VIP Call Girls Kolkata Ananya 🤌 8250192130 🚀 Vip Call Girls Kolkata
VIP Call Girls Kolkata Ananya 🤌  8250192130 🚀 Vip Call Girls KolkataVIP Call Girls Kolkata Ananya 🤌  8250192130 🚀 Vip Call Girls Kolkata
VIP Call Girls Kolkata Ananya 🤌 8250192130 🚀 Vip Call Girls Kolkata
 
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
 
Model Call Girl in Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in  Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in  Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
 
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
 

emantic web technologies and applications for Ins

  • 1. Wright State University Wright State University CORE Scholar CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 5-2007 Semantic Web: Technologies and Applications for the Real-World Semantic Web: Technologies and Applications for the Real-World Amit P. Sheth Wright State University - Main Campus, amit@sc.edu Follow this and additional works at: https://corescholar.libraries.wright.edu/knoesis Part of the Bioinformatics Commons, Communication Technology and New Media Commons, Databases and Information Systems Commons, OS and Networks Commons, and the Science and Technology Studies Commons Repository Citation Repository Citation Sheth, A. P. (2007). Semantic Web: Technologies and Applications for the Real-World. . https://corescholar.libraries.wright.edu/knoesis/640 This Presentation is brought to you for free and open access by the The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) at CORE Scholar. It has been accepted for inclusion in Kno.e.sis Publications by an authorized administrator of CORE Scholar. For more information, please contact library-corescholar@wright.edu.
  • 2. Semantic Web: Technologies and Applications for the Real-World Amit Sheth LexisNexis Ohio Eminent Scholar Kno.e.sis Center Wright State University http://knoesis.wright.edu Tutorial at WWW2007 by Amit Sheth and Susie Stephens, Banff, Alberta, Canada. May 2007
  • 3. Semantic Web: Technologies and Applications for the Real-World Susie Stephens Principal Research Scientist Tutorial at WWW2007 by Amit Sheth and Susie Stephens, Banff, Alberta, Canada. May 2007
  • 4. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Tutorial Outline Introduction to the Semantic Web 1.30-2.00pm Real-World Applications (1): Enabling Technologies and Experiences 2.00-3.00pm Break 3.00-3.30pm Real-World Applications (2): Details and War Stories, Case Studies 3.30-4.30pm Semantic Web in Practice 4.30-5.00pm 3
  • 5. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Introduction to the Semantic Web 4
  • 6. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Agenda • Characterizing the Semantic Web • Semantic Web Standards • Semantic Web Capabilities 5
  • 7. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Characterizing the Semantic Web • Semantic Web is an interoperability technology • An architecture for interconnected communities and vocabularies • A set of interoperable standards for knowledge exchange 6
  • 8. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Creating a Web of Data Source: Ivan Herman 7
  • 9. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Mashing Data Source: W3C 8
  • 10. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Drivers for the Semantic Web • Business models develop rapidly these days, so infrastructure that supports change is needed • Organizations are increasingly forming and disbanding collaborations • Data is growing so quickly that it is no longer possible for individuals to identify patterns in their heads • Increasing recognition of the benefits of collective intelligence 9
  • 11. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Semantic Web Technologies Source: W3C 10
  • 12. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Resource Description Framework (RDF) • RDF became a W3C standard recommendation in 2004 • RDF is a language for representing information about resources in the Web • Common framework for expressing information • Information may be made available to applications other than those for which it was originally created • Any information in RDF can be connected to any other information in RDF • Resources are identified with Uniform Resource Identifiers (URIs) 11
  • 13. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Resource Description Framework (RDF) Source: W3C 12
  • 14. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 RDF Schema (RDFS) • Vocabulary for describing properties and classes of RDF resources • Semantics for hierarchies of properties and classes • A resource may belong to several classes 13
  • 15. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Web Ontology Language (OWL) • OWL became a W3C standard recommendation in 2004 • Explicitly represents meaning of terms in vocabularies and the relationships between those terms • Separate layers have been defined balancing expressibility vs. implementability (OWL Lite, OWL DL, OWL Full) • Supports inferencing 14
  • 16. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Merging Ontologies Source: Siderean Software 15
  • 17. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 SPARQL • SPARQL is expected to become a W3C standard recommendation in Q3 2007 • It is a query language based on graph patterns • Protocol layer for using SPARQL over HTTP • There are SPARQL endpoints on the Web • SPARQL can be used to construct graphs 16
  • 18. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 SPARQL as a Unifying Source Source: Ivan Herman 17
  • 19. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Gleaning Resource Descriptions from Dialects of Languages (GRDDL) • GRDDL is a mechanism for Gleaning Resource Descriptions from Dialects of Languages • Introduces markup based on existing standards for declaring that an XML document includes data compatible with RDF and for linking to algorithms for extracting this data from the document • GRDDL should reach W3C standard recommendation in Q2 2007 18
  • 20. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Rules • OWL DL and OWL Lite are based on Description Logic; there are things that DL cannot express • Attempts to combine ontologies and rules include RuleML, SWRL, cwm, etc. • There is also an increasing number of rule-based systems that want to interchange rules • The W3C Rule Interchange Format Working Group is focused on this area 19
  • 21. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Semantic Annotations for WSDL (SAWSDL) • Develop a mechanism to enable semantic annotation of Web service descriptions • Using the WSDL 2.0 extension mechanism to build simple and generic support for adding service descriptions for Web services • This W3C standard is at Candidate Recommendation • Tools and Use cases available (Google: SAWSDL) 20
  • 22. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Enhanced Enterprise Search Source: Oracle OTN 21
  • 23. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Improved Reliability of Search Results Source: Segala 22
  • 24. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Web of Data Source: Freebase 23
  • 25. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Data Mashups Source: BioDASH 24
  • 26. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Data Mashups Source: BioDASH 25
  • 27. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Document Management Source: MGH 26
  • 28. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Semantic Data Warehouse Source: University of Texas 27
  • 29. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Heterogeneous Data Aggregation • Ordnance Survey maintains the definitive mapping data of Great Britain • Use ontologies to bridge semantically diverse sources of data, e.g., Hydrology, Administrative Geography, Buildings and Places, etc. • Efficient queries via the ontology or directly in RDF • Advantages include efficient data integration, data repurposing, better quality control and classification 28
  • 30. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Integration with Semantic Mediation • BT uses semantic descriptions of system interfaces and messages to support integration of Operational Support Systems (OSS) • Internet Service Providers integrate their OSS with those of BT (via a gateway) • The approach helps overcome the increasing complexity of supply chains, reduces costs and time-to-market, ontologies allow for a reuse of services 29
  • 31. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Natural Language Query of Applications • Tata Consulting Services provides a natural language interface to business applications • Users pose questions and invoke actions using natural language • An OWL ontology aids in the retrieval of relevant data and concepts • Advantages include distinct semantics for various domain concepts, external concepts linked in, simple interface 30
  • 32. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Decision Support Source: AGFA 31
  • 33. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Summary • The Semantic Web provides functionality of interest to business, scientific and Web communities • Capabilities are provided in data integration, search, semantic mediation, decision support, etc. • Semantic Web standards are maturing •The Semantic Web is increasingly being adopted 32
  • 34. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Real World Applications with Enabling Capabilities/Technologies 33
  • 35. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Enablers and Techniques Semantic Web Application Lifecycle 34
  • 36. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Syntax, Structure and Semantics Semantics: Meaning & Use of Data 35
  • 37. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Describing Semantic Web to Nontechnical Users Labeling data on Web so that both humans and machines can more effectively use them Associating meaning to data that machines can understand so as to achieve lot more automation and off-load more work to machines Exploiting common vocabulary and richer modeling of subject area for much better integration of data 36
  • 38. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Ontology: Agreement with Common Vocabulary & Domain Knowledge; Schema + Knowledge base Semantic Annotation (meatadata Extraction): Manual, Semi-automatic (automatic with human verification), Automatic Reasoning/computation: semantics enabled search, integration, complex queries, analysis (paths, subgraph), pattern finding, mining, hypothesis validation, discovery, visualization Enablers and Techniques 37
  • 39. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 A Typical Enterprise SW Application Lifecycle • Build Ontology – Build Schema (model level representation – Populate with Knowledgebase (people, location, organizations, events) • Automatic Semantic Annotation (Extract Semantic Metadata) – Any type of document, multiple sources of documents – Metadata can be stored with or sparely from documents • Applications: search (ranked list of documents of interest (semantic search), integrate/portal, summarize/explain, analyze, make decisions – Reasoning techniques: graph analysis, inferencing Types of content/documents, Use of standards, Scalability, Performance 38
  • 40. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Semagix Freedom Architecture: for building ontology-driven information system © Semagix, Inc. Managing Semantic Content on the Web: http://portal.acm.org/citation.cfm?id=613729 39
  • 41. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Building ontology Three broad approaches: Option 1: social process/manual: many years, committees – Can be based on metadata standard Option 2: automatic taxonomy generation (statistical clustering/NLP): limitation/problems on quality, dependence on corpus, naming Option 3: Descriptional component (schema) designed by domain experts; Description base (assertional component, extension) by automated processes Option 2 is being investigated in several research projects; Option 3 is currently supported by technologies such as Semagix Freedom 40
  • 42. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Time, Space Gene Ontology, Glycomics Pharma Drug, Treatment-Diagnosis Repertoire Management Equity Markets Anti-money Laundering, Financial Risk, Terrorism Can be Public, Government, Limited Availability, Commercial Ontology Examples 41
  • 43. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Ontology Language/ Representation Spectrum Is Disjoint Subclass of with transitivity property Modal Logic Logical Theory Thesaurus Has Narrower Meaning Than Taxonomy Is Sub-Classification of Conceptual Model Is Subclass of DB Schemas, XML Schema First Order Logic Relational Model XML ER Extended ER Description Logic DAML+OIL, OWL, UML RDFS, XTM Syntactic Interoperability Structural Interoperability Semantic Interoperability 42
  • 44. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Ontology Semantic Query Server 1. Ontology Model Creation (Description) 2. Knowledge Agent Creation 3. Automatic aggregation of Knowledge 4. Querying the Ontology Ontology Creation and Maintenance Steps (Approach 1) © Semagix, Inc. 43
  • 45. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Some observations on Ontology Development and Maintenance • Type of domain Modeling of human centric world (sports, entertainment, legal, financial services) Versus Natural world (life sciences, astronomy, …) • Schema • Sources of knowledge 44
  • 46. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 GlycO is a focused ontology for the description of glycomics models the biosynthesis, metabolism, and biological relevance of complex glycans models complex carbohydrates as sets of simpler structures that are connected with rich relationships An ontology for structure and function of Glycopeptides Published through the National Center for Biomedical Ontology (NCBO) and Open Biomedical Ontologies (OBO) See:GlycoDoc, GlycO 45
  • 47. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 GlycO Challenge – model hundreds of thousands of complex carbohydrate entities But, the differences between the entities are small (E.g. just one component) How to model all the concepts but preclude redundancy → ensure maintainability, scalability 47
  • 48. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 GlycO population Assumption: with a large body of background knowledge, learning and extraction techniques can be used to assert facts. Asserted facts are compositions of individual building blocks Because the building blocks are richly described, the extracted larger structures will be of high quality 48
  • 49. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 GlycO Population Multiple data sources used in populating the ontology o KEGG - Kyoto Encyclopedia of Genes and Genomes o SWEETDB o CARBBANK Database Each data source has a different schema for storing data There is significant overlap of instances in the data sources Hence, entity disambiguation and a common representational format are needed 49
  • 50. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Diverse Data From Multiple Sources Assures Quality Democratic principle Some sources can be wrong, but not all will be More likely to have homogeneity in correct data than in erroneous data 50
  • 51. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Has CarbBank ID? IUPAC to LINUCS LINUCS to GLYDE Compare to Knowledge Base Already in KB? YES NO Semagix Freedom knowledge extractor Instance Data YES: next Instance Insert into KB NO Ontology population workflow 51
  • 52. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Has CarbBank ID? IUPAC to LINUCS LINUCS to GLYDE Compare to Knowledge Base Already in KB? YES NO Semagix Freedom knowledge extractor Instance Data YES: next Instance Insert into KB NO [][Asn]{[(4+1)][b-D-GlcpNAc] {[(4+1)][b-D-GlcpNAc] {[(4+1)][b-D-Manp] {[(3+1)][a-D-Manp] {[(2+1)][b-D-GlcpNAc] {}[(4+1)][b-D-GlcpNAc] {}}[(6+1)][a-D-Manp] {[(2+1)][b-D-GlcpNAc]{}}}}}} Ontology population workflow 52
  • 53. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Has CarbBank ID? IUPAC to LINUCS LINUCS to GLYDE Compare to Knowledge Base Already in KB? YES NO Semagix Freedom knowledge extractor Instance Data YES: next Instance Insert into KB NO <Glycan> <aglycon name="Asn"/> <residue link="4" anomeric_carbon="1" anomer="b" chirality="D" monosaccharide="GlcNAc"> <residue link="4" anomeric_carbon="1" anomer="b" chirality="D" monosaccharide="GlcNAc"> <residue link="4" anomeric_carbon="1" anomer="b" chirality="D" monosaccharide="Man" > <residue link="3" anomeric_carbon="1" anomer="a" chirality="D" monosaccharide="Man" > <residue link="2" anomeric_carbon="1" anomer="b" chirality="D" monosaccharide="GlcNAc" > </residue> <residue link="4" anomeric_carbon="1" anomer="b" chirality="D" monosaccharide="GlcNAc" > </residue> </residue> <residue link="6" anomeric_carbon="1" anomer="a" chirality="D" monosaccharide="Man" > <residue link="2" anomeric_carbon="1" anomer="b" chirality="D" monosaccharide="GlcNAc"> </residue> </residue> </residue> </residue> </residue> </Glycan> Ontology population workflow 53
  • 54. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Has CarbBank ID? IUPAC to LINUCS LINUCS to GLYDE Compare to Knowledge Base Already in KB? YES NO Semagix Freedom knowledge extractor Instance Data YES: next Instance Insert into KB NO Ontology population workflow 54
  • 55. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Diverse Data From Multiple Sources Assures Quality Holds only, when the data in each source is independent In the case of GlycO, the sources that were meant to assure quality were not diverse. One original source (Carbbank) was copied by several Databases without curation Errors in the original propagated Errors in KEGG and Carbbank are the same Cannot use these sources for comparison Needs curation by the expert community ? 55
  • 56. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 N. Takahashi and K. Kato, Trends in Glycosciences and Glycotechnology, 15: 235-251 β-D-GlcpNAc β-D-GlcpNAc β-D-Manp-(1-4)- -(1-4)- α-D-Manp -(1-6)+ β-D-GlcpNAc-(1-2)- α-D-Manp -(1-3)+ β-D-GlcpNAc-(1-4)- β-D-GlcpNAc-(1-2)+ GlycoTree 56
  • 57. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 N-Glycosylation metabolic pathway GNT-I attaches GlcNAc at position 2 UDP-N-acetyl-D-glucosamine + alpha-D-Mannosyl-1,3-(R1)-beta-D-mannosyl-R2 <=> UDP + N-Acetyl-$beta-D-glucosaminyl-1,2-alpha-D-mannosyl-1,3-(R1)-beta-D-mannosyl-$R2 GNT-V attaches GlcNAc at position 6 UDP-N-acetyl-D-glucosamine + G00020 <=> UDP + G00021 N-acetyl-glucosaminyl_transferase_V N-glycan_beta_GlcNAc_9 N-glycan_alpha_man_4 58
  • 58. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Pathway Steps - Glycan Pathway visualization tool by M. Eavenson and M. Janik, LSDIS Lab, Univ. of Georgia Abundance of this glycan in three experiments 60
  • 59. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 61
  • 60. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 An ontology for capturing process and lifecycle information related to proteomic experiments Two aspects of glycoproteomics: What is it? → identification How much of it is there? → quantification Heterogeneity in data generation process, instrumental parameters, formats Need data and process provenance → ontology-mediated provenance Hence, ProPreO models both the glycoproteomics experimental process and attendant data Published through the National Center for Biomedical Ontology (NCBO) and Open Biomedical Ontologies (OBO) ProPreO ontology More at: http://knoesis.wright.edu/research/bioinformatics/ 62
  • 61. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Semantic Annotation Shallow Annotation: SemTag Intermediate Annotation: • SEE/Semagix: http://lsdis.cs.uga.edu/library/download/HSK02-SEE.pdf • Ontotext: http://www.ontotext.com/kim/semanticannotation.html) Advanced: Schema-driven Relationship Extraction Engine (SCREEN): http://knoesis.wright.edu/research/discovery 63
  • 62. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Extracting a Text Document: Syntactic approach INCIDENT MANAGEMENT SITUATION REPORT Friday August 1, 1997 - 0530 MDT NATIONAL PREPAREDNESS LEVEL II CURRENT SITUATION: Alaska continues to experience large fire activity. Additional fires have been staffed for structure protection. SIMELS, Galena District, BLM. This fire is on the east side of the Innoko Flats, between Galena and McGr The fore is active on the southern perimeter, which is burning into a continuous stand of black spruce. The fire has increased in size, but was not mapped due to thick smoke. The slopover on the eastern perimeter is 35% contained, while protection of the historic cabit continues. CHINIKLIK MOUNTAIN, Galena District, BLM. A Type II Incident Management Team (Wehking) is assigned to the Chiniklik fire. The fire is contained. Major areas of heat have been mopped up. The fire is contained. Major areas of heat have been mopped-up. All crews and overhead will mop-up where the fire burned beyond the meadows. No flare-ups occurred today. Demobilization is planned for this weekend, depending on the results of infrared scanning. LAYOUT Date => day month int ‘,’int 64
  • 63. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Extraction Agent Web Page Enhanced Metadata Asset Taalee Extraction and Knowledgebase Enhancement Taalee Semantic Engine, also IBM Semantics Tools 65
  • 64. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Metadata Extraction and Semantic Enhancement Semantic Enhancement Engine [Hammond, Sheth, Kochut 2002] Also, WebFountatin, KIM from OntoText 66
  • 65. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Semantic Annotation - document 67
  • 66. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Sample Created Metadata <Entity id="122805" class="DrugOntology#prescription_drug_brandname"> Bextra <Relationship id=”442134” class="DrugOntology#has_interaction"> <Entity id="14280" class="DrugOntology #interaction_with_physical_condition>sulfa allergy </Entity> </Relationship> </Entity> Excerpt of Drug Ontology Excerpt of Drug Ontology Semantic Annotation – news feed 68
  • 67. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Semantic annotation of scientific/experimental data
  • 68. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Scientific Data Computational Methods Ontology instances ProPreO population: transformation to rdf 70
  • 69. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 830.9570 194.9604 2 580.2985 0.3592 688.3214 0.2526 779.4759 38.4939 784.3607 21.7736 1543.7476 1.3822 1544.7595 2.9977 1562.8113 37.4790 1660.7776 476.5043 parent ion m/z fragment ion m/z ms/ms peaklist data fragment ion abundance parent ion abundance parent ion charge ProPreO: Ontology-mediated provenance Mass Spectrometry (MS) Data http://knoesis.org/research/bioinformatics 71
  • 70. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 <ms-ms_peak_list> <parameter instrument=“micromass_QTOF_2_quadropole_time_of_flight_mass_spectrometer” mode=“ms-ms”/> <parent_ion m-z=“830.9570” abundance=“194.9604” z=“2”/> <fragment_ion m-z=“580.2985” abundance=“0.3592”/> <fragment_ion m-z=“688.3214” abundance=“0.2526”/> <fragment_ion m-z=“779.4759” abundance=“38.4939”/> <fragment_ion m-z=“784.3607” abundance=“21.7736”/> <fragment_ion m-z=“1543.7476” abundance=“1.3822”/> <fragment_ion m-z=“1544.7595” abundance=“2.9977”/> <fragment_ion m-z=“1562.8113” abundance=“37.4790”/> <fragment_ion m-z=“1660.7776” abundance=“476.5043”/> </ms-ms_peak_list> Ontological Concepts ProPreO: Ontology-mediated provenance Semantically Annotated MS Data http://knoesis.org/research/bioinformatics 72
  • 71. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Real World Applications Application Types – by broad capabilities •Search (also browsing, personalization) •Integration (also interoperability) •Analysis (also visualization) 73
  • 72. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Characterizing applications Dimension: Maturity: Prototypes and demonstrations—show case research, technique Systems using real world data, real <- our focus Operational Systems <- our focus Dimension: Type of use: General User, Consumer, Targeted Users/community on the Web, Business/Science/Engineering/Government User Dimension: scalability -- Enterprise to Web Dimension: realism Toy/synthetic data to Real World Data 74
  • 73. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Review of Applications Ontology Development, Ontology Population Sources, Ontology Quality Issues Semantic Annotation/Metadata Extraction Application Development Platforms/Support Standards Usage Scalability Performance 75
  • 74. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Taalee’s Semantic Search Highly customizable, precise and freshest A/V search Context and Domain Specific Attributes Uniform Metadata for Content from Multiple Sources, Can be sorted by any field Delightful, relevant information, exceptional targeting opportunity http://www.streamingmediaworld.com/gen/reviews/searchassociation/ 76
  • 75. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Creating a Web of related information What can a context do? 77
  • 76. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Ontology driven Semantic Directory Search for company ‘Commerce One’ Links to news on companies that compete against Commerce One Links to news on companies Commerce One competes against (To view news on Ariba, click on the link for Ariba) Crucial news on Commerce One’s competitors (Ariba) can be accessed easily and automatically 78
  • 77. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 What else can a context do? (a commercial perspective) Semantic Enrichment Semantic Targeting 79
  • 78. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Semantic/Interactive Targeting Buy Al Pacino Videos Buy Russell Crowe Videos Buy Christopher Plummer Videos Buy Diane Venora Videos Buy Philip Baker Hall Videos Buy The Insider Video Precisely targeted through the use of Structured Metadata and integration from multiple sources 80
  • 79. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Focused relevant content organized by topic (semantic categorization) Automatic Content Aggregation from multiple content providers and feeds Related news not specifically asked for (Semantic Associations) Competitive research inferred automatically Automatic 3rd party content integration Semantic Application – Equity Dashboard Managing Semantic Content on the Web, Internet Computing 2002 81
  • 80. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Active Semantic Medical Records (operational since January 2006) Goals: • Increase efficiency • Reduce Errors, Improve Patient Satisfaction & Reporting • Improve Profitability (better billing) Technologies: • Ontologies, semantic annotations & rules • Service Oriented Architecture Thanks -- Dr. Agrawal, Dr. Wingeth, and others. Active Semantic Electronic Medical Record ISWC2006 paper http://iswc2006.semanticweb.org/items/in_use_8.php Semantic Applications: Health Care
  • 81. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Passenger Threat Analysis Need to Know -> Demo Financial Irregularity * * a classified application Primary Funding by ARDA, Secondary Funding by NSF An Ontological Approach to the Document Access Problem of Insider Threat Semantic Web Applications in Government 83
  • 82. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Aim: Legislation (PATRIOT ACT) requires banks to identify ‘who’ they are doing business with Problem Volume of internal and external data needed to be accessed Complex name matching and disambiguation criteria Requirement to ‘risk score’ certain attributes of this data Approach Creation of a ‘risk ontology’ populated from trusted sources (OFAC etc); Sophisticated entity disambiguation Semantic querying, Rules specification & processing Solution Rapid and accurate KYC checks Risk scoring of relationships allowing for prioritisation of results; Full visibility of sources and trustworthiness Semantic Application in a Global Bank 84
  • 83. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Watch list Organization Company Hamas WorldCom FBI Watchlist Ahmed Yaseer appears on Watchlist member of organization works for Company Ahmed Yaseer: • Appears on Watchlist ‘FBI’ • Works for Company ‘WorldCom’ • Member of organization ‘Hamas’ The Process 85
  • 84. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Example of Fraud Prevention application used in financial services User will be able to navigate the ontology using a number of different interfaces World Wide Web content Public Records BLOGS, RSS Un-structure text, Semi-structured Data Watch Lists Law Enforcement Regulators Semi-structured Government Data Establishing New Account Global Investment Bank 86
  • 85. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 N-Glycosylation Process (NGP) Cell Culture Glycoprotein Fraction Glycopeptides Fraction extract Separation technique I Glycopeptides Fraction n*m n Signal integration Data correlation Peptide Fraction Peptide Fraction ms data ms/ms data ms peaklist ms/ms peaklist Peptide list N-dimensional array Glycopeptide identification and quantification proteolysis Separation technique II PNGase Mass spectrometry Data reduction Data reduction Peptide identification binning n 1 87
  • 86. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Workflow based on Web Services = Web Process 88
  • 87. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Semantic Web Process to incorporate provenance Storage Standard Format Data Raw Data Filtered Data Search Results Final Output Agent Agent Agent Agent Biological Sample Analysis by MS/MS Raw Data to Standard Format Data Pre- process DB Search (Mascot/ Sequest) Results Post- process (ProValt) O I O I O I O I O Biological Information Semantic Annotation Applications ISiS – Integrated Semantic Information and Knowledge System 89
  • 88. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 • Evaluate the specific effects of changing a biological parameter: Retrieve abundance data for a given protein expressed by three different cell types of a specific organism. • Retrieve raw data supporting a structural assignment: Find all the raw ms data files that contain the spectrum of a given peptide sequence having a specific modification and charge state. • Detect errors: Find and compare all peptide lists identified in Mascot output files obtained using a similar organism, cell-type, sample preparation protocol, and mass spectrometry conditions. ProPreO concepts highlighted in red A Web Service Must Be Invoked Semantic Annotation Facilitates Complex Queries 90
  • 89. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 • Semantic Browser: contextual browsing of PubMed More Life Science Applications More at Knoesis research in life sciences
  • 90. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 9284 documents 4733 documents Biologically active substance Lipid Disease or Syndrome affects causes affects causes complicates Fish Oils Raynaud’s Disease ??????? instance_of instance_of 5 documents UMLS MeSH PubMed Relationship Extraction 92
  • 91. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 UMLS – A high level schema of the biomedical domain • 136 classes and 49 relationships • Synonyms of all relationship – using variant lookup (tools from NLM) MeSH • Terms already asserted as instance of one or more classes in UMLS PubMed • Abstracts annotated with one or more MeSH terms T147—effect T147—induce T147—etiology T147—cause T147—effecting T147—induced About the data used 93
  • 92. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Abstract Classification/Annotation Example PubMed abstract (for the domain expert) 94
  • 93. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 SS-Tagger (University of Tokyo) SS-Parser (University of Tokyo) (TOP (S (NP (NP (DT An) (JJ excessive) (ADJP (JJ endogenous) (CC or) (JJ exogenous) ) (NN stimulation) ) (PP (IN by) (NP (NN estrogen) ) ) ) (VP (VBZ induces) (NP (NP (JJ adenomatous) (NN hyperplasia) ) (PP (IN of) (NP (DT the) (NN endometrium) ) ) ) ) ) ) Method – Parse Sentences in PubMed
  • 94. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Modifiers Modified entities Composite Entities Method – Identify entities and Relationships in Parse Tree 96
  • 95. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 PubMed Complex Query Supporting Document sets retrieved Migraine Stress Patient affects isa Magnesium Calcium Channel Blockers inhibit Keyword query: Migraine[MH] + Magnesium[MH] Hypothesis Driven retrieval of Scientific Literature 97
  • 96. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Eli Lilly Case Study 98
  • 97. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Advances in Science • Tremendous advances in biology over the last decade • Sequencing of the human genome • Technology for large scale expression studies and patient genotyping • High resolution imaging (e.g. whole organism, cell based) • Discovery and development of siRNA and other techniques • Facilitates a more tailored approach to therapeutics 99
  • 98. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Tailored Therapeutics • The end of the blockbuster era • The right dose to the right patient at the right time • Lowering cost of pharmaceutical development • Improved patient response to medication 100
  • 99. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Critical Path Initiative Source: Innovation or Stagnation, FDA Report, March 2004 101
  • 100. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 The Web of Heterogeneous Data Cell/Assay Technologies 102
  • 101. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Integrative Informatics Platform Discovery Clinical Public Proteomics Imaging In-Vitro/Vivo Genotyping SNPs/Haplotypes XREP plus IQ Proteomics Informatics System/Semantic Integration Layer Integrative Informatics PGI Lipid Informatics Tailored Therapeutic Workbench (TTW) Integrative Data Mining/Query System Lipidomics Gene Expression Info Mining Text Translational Informatics 103
  • 102. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Discovery Target Assessment Tool 104
  • 103. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Use Cases for RDF • Patient Stratification • If we could 'semantically' describe a patient with biomarker/genomic properties, we could 'semantically' compare them. New definition of similarity. • Cellomics Data • If we could 'semantically' describe cellular localization with other properties, we could discover novel indicators (e.g. cell size relates to expression) - emergent properties. • Exon Descriptors • If we could 'semantically' describe all the exons on a genome and relate transcription and SNPs to other functional consequences, we could 'reason' across the genome (i.e. query in interesting ways). Adding power to interrogation. 105
  • 104. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Summary • The life sciences industry needs to increase productivity • The Semantic Web is a promising technology for data integration and decision support • Lilly is using Semantic Web technologies for the Target Assessment Tool within drug discovery 106
  • 105. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Acknowledgements John Reynders, Ph.D, Discovery Informatics Alan Palkowitz, Ph.D., Discovery Chemistry Jim Stephens, Ph.D., Discovery Toxicology Jude Onyia, Ph.D., Discovery Biology Harry Harlow, Ph.D., Integrative Biology Dan Robertson, Ph.D., Computational Chemistry Discovery Target Assessment (D-TAT) Team 107
  • 106. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Semantic Web in Practice 108
  • 107. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Agenda • Tools • Data / vocabularies • Collateral • Community • Pointers for getting going 110
  • 108. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Semantic Web Infrastructure • Triple stores • RDFizers • Ontology Editors / Reasoning Systems • Application Frameworks 111
  • 109. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Triple Stores • 3Store • Aduna • AllegroGraph • Boca • Joseki • Kowari • Mulgara • Oracle RDF Data Model • Profium Metadata Server • RDF Gateway • RDFStore • Sesame • Virtuoso • YARS There are many others available too… 112
  • 110. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Boca, IBM Source: IBM 113
  • 111. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 RDF Data Model, Oracle • Object-relational implementation • Set of triples form an RDF/OWL graph (model) • Optimized storage structure: repeated values stored only once • Can handle multiple lexical forms of the same value • Incremental load and bulk load • SPARQL-like graph pattern embedded in SQL query • Native inferencing for RDF, RDFS & user-defined rules • Support for OWL and Semantic Operators in the next release 114
  • 112. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Virtuoso, OpenLink • Hybrid Data Server that combines SQL, RDF, XML, and Full Text Data Management • Includes a Virtual / Federated DBMS Layer that enables transparent access to data from 3rd party SQL, RDF, XML, and Web Services • Produces RDF Instance Data in Physical and Virtual forms from local or 3rd party data sources • Provides full support for the SPARQL Query Language against Physical and Virtual RDF Graphs • Query Optimizer is specifically tuned for high-performance data access across all realms • Includes in-built middleware for producing RDF instance data on-the-fly from non RDF Data Sources (e.g. (X)HTML, Microformats, Web Services, Binary Files) 115
  • 113. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Adapting SQL Databases Source: Tim Berners-Lee 116
  • 114. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Mapping Relational to RDF • D2RQ • SquirrelRDF • DartGrid • SPASQL Source: DartGrid 117
  • 115. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 RDFizers • Relational -> RDF • XML -> RDF • Excel -> RDF • JPEG -> RDF • BibTEX -> RDF A directory of RDFizers is provided at: http://simile.mit.edu/wiki/RDFizers • Java -> RDF • Weather -> RDF • Palm -> RDF • Outlook -> RDF • Flickr -> RDF 118
  • 116. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Ontology Editors and Environments Protégé, SWOOP, GrOWL, TopBraid, Ontotrack, SemanticWorks, .. Source: Ian Horrocks 119
  • 117. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Reasoning Systems Pellet KAON2 CEL 120
  • 118. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Semantic Web Tools • RDF programming environment for 14+ languages • C, C++, C# and .Net, Haskell, Java, Javascript, Lisp, Obj-C, PHP, Perl, Prolog, Python, Ruby, Tcl/Tk • Selection of on-line validators • BBN OWL Validator, OWL Consistency Checker, WonderWeb OWL- DL Validator, RDF Validator, RDF/XML & N3 Validator, ConsVISor OWL Consistency Checker • SPARQL Endpoints • SPARQLer, SPARQLette, XML Army Knife, OpenLink Virtuoso •Semantic Web Crawlers • Swoogle, SWSE, Zitgist 121
  • 119. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Semantic Web Tools • RDF Browsers • BrowseRDF, /facet, Longwell, mSpace, Siderean Software, Exhibit • Semantic Web Browsers • DISCO, ObjectViewer, OpenLink RDF Browser, Tabulator Browser, Haystack • Labeling • Adobe XMP • Information Extraction • Amilcare, Language and Computing • Visualization • IsaViz, Perfuse, Tom Sawyer, RDF-Gravity • Relationship Analytics • Cogito • Content Management • Profium Semantic Information Router • Information Integration • Ontoprise, Software AG, @Semantics, webMethods, Revelytix, Ontology Works Over 500 tools are now available 122
  • 120. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Lists of Tools • http://sites.wiwiss.fu-berlin.de/suhl/bizer/toolkits/index.htm • http://esw.w3.org/topic/SemanticWebTools • http://www.mkbergman.com/?p=291 • http://planetrdf.com/guide/ • http://www.sekt-project.org/resources/sekt_components.html 123
  • 121. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 How to get RDF Data? • Write your own RDF in your preferred syntax • Add RDF to XML directly (in its own namespace), e.g. in SVG • Use intelligent scrapers or wrappers to extract RDF from a Web page and then generate automatically (e.g. via an XSLT script) • Formalize the scraper approach with GRDDL • RDFa extend (X)HTML by defining general attributes to add metadata to any element • Create bridge to relational databases • Use bridge from other data sources 124
  • 122. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 RDF Data • Annotea Bookmark File • DBLP • dbpedia • dbtune • Geonames • MusicBrainz • RDF Book Mashup • Revyu • US Census Data • WordNet • BIND • BrainPharm • Entrez Gene • HIVSDB • KEGG • NeuroNames • Reactome • SenseLab • SWAN publication & hypothesis • UniProt 125
  • 123. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Vocabularies • eClassOwl: eBusiness ontology for products and services • Gene Ontology: describes gene and gene products • BioPAX: for biological pathway data • SKOS core: describes knowledge systems, thesauri, glossaries • Dublin Core: about information resources, digital libraries, with extensions for rights, permissions, digital rights management • FOAF: about people and their organizations • DOAP: on the descriptions of software products • Music Ontology: describes CDs, music tracks, etc. • SIOC: for semantically-Interlinked Online Communities Source: Ivan Herman 126
  • 124. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Collateral • Much good information at W3C • http://www.w3.org/2001/sw/ • New FAQ on the Semantic Web • http://www.w3.org/2001/sw/SW-FAQ • Semantic Web Case Studies and Use Cases • http://www.w3.org/2001/sw/sweo/public/UseCases • List of Semantic Web books • http://esw.w3.org/topic/SwBooks • Dave Beckett’s Resources • PlanetRDF a blog aggregator on Semantic Web topics 127
  • 125. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Public Fora at W3C • Semantic Web Interest Group • A forum for developers with an archived mailing list, and a constant IRC presence on freenode.net#swig • Semantic Web for Health Care & Life Sciences: SW-HCLS •Semantic Web Deployment Working Group • Archives of working group are public • Semantic Web Education and Outreach IG • Community Projects – Whitelisting Email Senders with FOAF – Linking Open Data on the Semantic Web – Knowee Contact Organizer – POWDER Browser Extension 128
  • 126. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Pointers for Getting Going • Use robust URIs • Reuse existing data and ontologies • A little semantics goes a long way • Model the real world rather than data artifacts • Build upon your infrastructure incrementally 129
  • 127. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Summary • Many Semantic Web tools are available • Data and vocabularies are increasingly being made available in RDF/OWL • Many books, tutorial and overviews are available to help you get going • Several public fora for community activities 130
  • 128. Semantic Web: Technologies and Applications for the Real-World, Sheth & Stephens WWW2007 Copy of the tutorial will be available at: http://knoesis.wright.edu -> Library 131