SlideShare a Scribd company logo
1 of 102
The Semantic Web Landscape
                          A Practical Introduction


Contact:
Lee Feigenbaum
lee@cambridgesemantics.com
VP Technology
Co-chair, W3C SPARQL Working Group
                                               ©2011 Cambridge Semantics Inc. All rights reserved.
Example: Alzheimer’s Drug Discovery




    What genes are involved in signal transduction
       and are related to pyramidal neurons?




2                                      ©2011 Cambridge Semantics Inc. All rights reserved.
General search: 223,000 hits, 0 results




3                                 ©2011 Cambridge Semantics Inc. All rights reserved.
Domain-limited search: Still 2,580 potential results




4                                       ©2011 Cambridge Semantics Inc. All rights reserved.
Specific databases: Too many silos!




5                              ©2011 Cambridge Semantics Inc. All rights reserved.
Linked Scientific Data: 32 targeted results




6                                   ©2011 Cambridge Semantics Inc. All rights reserved.
What’s the trick?




    1. Agreement on common terms and
       relationships
    2. Incremental, flexible data structure
    3. Good-enough modeling
    4. Query interface tailored to the data model




7                                      ©2011 Cambridge Semantics Inc. All rights reserved.
WHAT IS THE SEMANTIC WEB?


8                        ©2011 Cambridge Semantics Inc. All rights reserved.
Names




9           ©2011 Cambridge Semantics Inc. All rights reserved.
Branding

     •   Semantic Web
     •   Web of Data
     •   Giant Global Graph
     •   Data Web
     •   Web 3.0
     •   Linked Data Web
     •   Semantic Data Web
     •   Enterprise Information Web



10                                     ©2011 Cambridge Semantics Inc. All rights reserved.
Semantic Web – 1st View

     • “The Semantic Web”
       – Link explicit data on the World Wide Web in a machine-
         readable fashion
          •   …government data
          •   …commercial data
          •   …scientific data
          •   …social data
       – In order to enable applications such as…
          • …targeted, semantic search
          • …data browsing
          • …automated agents

          World Wide Web : Web pages :: The Semantic Web : Data

11                                                  ©2011 Cambridge Semantics Inc. All rights reserved.
Semantic Web – 2nd View

     • “Semantic Web technologies”
       – A family of technology standards that ‘play nice together’,
         including:
           • Flexible data model
           • Expressive ontology language
           • Distributed query language
       – Drive enterprise applications, including:
           •   Data integration & virtualization
           •   Business intelligence
           •   Large knowledgebases
           •   …
        The technologies enable us to build applications and solutions that
               were not possible, practical, or feasible traditionally.

12                                                        ©2011 Cambridge Semantics Inc. All rights reserved.
A Common & Coherent Set of Technology Standards



      • A common set of technologies:
         – ...enables diverse uses
         – ...encourages interoperability
      • A coherent set of technologies:
         – …encourage incremental application
         – …provide a substantial base for innovation
      • A standard set of technologies:
         – ...reduces proprietary vendor lock-in
         – ...encourages many choices for tool sets

13                                                    ©2011 Cambridge Semantics Inc. All rights reserved.
The (In)Famous Layer Cake




14                         ©2011 Cambridge Semantics Inc. All rights reserved.
The (In)Famous Layer Cake




15                         ©2011 Cambridge Semantics Inc. All rights reserved.
Semantic Web Technology Timeline


     1999     2001       2004        2007         2008                     2011




                                                                             RIF




16                                   ©2011 Cambridge Semantics Inc. All rights reserved.
2011: Where we are

     As technologies & tools have evolved, Semantic Web
     advocates have progressed through stages:

                  Report on…                           Execute on…

       Semantic Web vision                  Initial experiments

       Experiments                          Technology standards

       Technology standards                 Software packages

       Software packages                    Proofs of concept

       Proofs of concept                    Initial production implementations

                                            2nd, 3rd, … implementations—
       Initial production implementations
                                            network effect

17                                                                ©2011 Cambridge Semantics Inc. All rights reserved.
2011: Where we’re not
                                              Image from Trey Ideker via Enoch Huang




     Semantic Web technologies are not a ‘magic crank’ for discovering
         new drugs (or solving other problems, for that matter)!

18                                                      ©2011 Cambridge Semantics Inc. All rights reserved.
2011: Where we’re not (cont’d)

                “Ontology” vs.
                “ontology”?                              XML vs. RDF?

     Semantic Web vs.
     Linked Data?
                                                   Data integration vs.
                                                   reasoning vs. KBs
       RDFa vs. microformats                       vs. search vs. app.
       vs. microdata vs.                           development vs. …
       schema.org
           The Semantic Web still suffers from confusing and conflicting
                 messaging, each of which claims it’s “correct”.
19                                                        ©2011 Cambridge Semantics Inc. All rights reserved.
2011: Where we’re not (cont’d)




     We don’t yet have standard solutions for privacy, trust, probability,
              and other elements of the Semantic Web vision.


20                                                       ©2011 Cambridge Semantics Inc. All rights reserved.
What do Semantic Web solutions look like?




21                                 ©2011 Cambridge Semantics Inc. All rights reserved.
RDF is…




     Resource Description Framework




22                             ©2011 Cambridge Semantics Inc. All rights reserved.
RDF is…




     The data model of the Semantic Web.




23                               ©2011 Cambridge Semantics Inc. All rights reserved.
RDF is…




     A flexible data model that features unambiguous
     identifiers and named relations between pairs of
                        resources.




24                                      ©2011 Cambridge Semantics Inc. All rights reserved.
RDF is…

         A labeled, directed graph of relations between
                  resources and literal values.

     • RDF graphs are collections of triples
     • Triples are made up of a subject, a predicate, and an
       object

                             predicate
                   subject                object



     • Resources and relationships are named with URIs

25                                            ©2011 Cambridge Semantics Inc. All rights reserved.
Example RDF triples

     • “Lee Feigenbaum works for Cambridge Semantics”
                    Lee        works for     Cambridge
                Feigenbaum                   Semantics


     • “Lee Feigenbaum was born in 1978”
                    Lee         born in
                                             1978
                Feigenbaum


     • “Cambridge Semantics is headquartered in
       Massachusetts”
                 Cambridge   headquartered
                                             Massachusetts
                 Semantics




26                                                   ©2011 Cambridge Semantics Inc. All rights reserved.
Triples connect to form graphs

                   Lee             works for    Cambridge
               Feigenbaum                       Semantics



                                                            headquartered
     born in
                            lives in



     1978                                                 Massachusetts




                                                capital

                                       Boston




27                                                        ©2011 Cambridge Semantics Inc. All rights reserved.
Why RDF? What’s different here?

     • The graph data structure makes merging data with
       shared identifiers trivial
     • Triples act as a least common denominator for
       expressing data
     • URIs for naming remove ambiguity
       – …the same identifier means the same thing




28                                             ©2011 Cambridge Semantics Inc. All rights reserved.
Why RDF? Coping With Change

         Flexible
          Graph
                            URIs for           Agility
         Model              naming            On-the-fly



                                                 The World Changes

                                               Traditionally:
                                               Change is costly
                                               Semantics:
                                               Change is cheap




     RDB 1          RDB 2



29                                     ©2011 Cambridge Semantics Inc. All rights reserved.
Why RDF? Add Meaning to Data

     With traditional technology:
      Cust ID   Name             Referred By     Work Phone
      29212     Travis Ember     Janet Cassy
                                 Barbara Cassy   212-555-5001                     Inside the
      30012     Jessica Evalta   Brian Meedly    617-555-2325                     database
      59235     Hector Samton    Agatha Browne   732-555-8715




      29212     Travis Ember     Janet Cassy     212-555-5001
      30012     Jessica Evalta   Brian Meedly    617-555-2325
                                                                                Outside the
      59235     Hector Samton    Agatha Browne   732-555-8715
                                                                                 database

      No one knows what these numbers and names mean!
30                                                            ©2011 Cambridge Semantics Inc. All rights reserved.
Why RDF? Add Meaning to Data

     With Semantic Web technology:
                       name                        Text

           Person               referred by                                            Data
                                                         Text                       description
                       mobile
                       phone                      Text

                       name                     Travis Ember

          Person2912            referred by                                           Data,
                                                    Janet Cassy
                                                                                    wherever it
                       mobile                                                        appears
                       phone                  212-555-5001


           The meaning always travels with the data
31                                                                ©2011 Cambridge Semantics Inc. All rights reserved.
What does RDF look like?

     • RDF is the model, for which there are several
       concrete syntaxes:
        – RDF/XML – standard, complex XML syntax
        – Turtle – common, textual, triples-oriented syntax
           • …currently being standardized by the RDF working group
        – N3 – more expressive superset of Turtle
        – N-Triples – textual, line-oriented, useful for streaming



       When writing RDF by hand and in many guides, examples,
        and discussions these days, you’ll see Turtle most often.


32                                                      ©2011 Cambridge Semantics Inc. All rights reserved.
A Bit of Turtle

      • Write a triple by writing its parts separated by spaces
        (subject predicate object)


     @prefix ex: <http://example.org/myvocab/> .
     @prefix geo: <http://geonames.example/> .

     ex:LeeFeigenbaum        ex:employer       ex:CambridgeSemantics .
     ex:LeeFeigenbaum        ex:birthYear      1978 .
     ex:CambridgeSemantics   ex:headquarters   geo:BostonMA .
     geo:BostonMA            ex:population     574000 .




33                                                  ©2011 Cambridge Semantics Inc. All rights reserved.
SPARQL is…




     SPARQL Protocol And RDF Query Language




34                                ©2011 Cambridge Semantics Inc. All rights reserved.
SPARQL is…




     The query language of the Semantic Web.




35                                 ©2011 Cambridge Semantics Inc. All rights reserved.
SPARQL is…




     A SQL-like language for querying sets of RDF
                       graphs.




36                                    ©2011 Cambridge Semantics Inc. All rights reserved.
SPARQL is…




       A simple protocol for issuing queries and
           receiving results over HTTP. So…

     Every SPARQL client works with every SPARQL
                        server!




37                                     ©2011 Cambridge Semantics Inc. All rights reserved.
Why SPARQL?

     SPARQL lets us:
     • Pull information from structured and semi-structured
       data.
     • Explore data by discovering unknown relationships.
     • Query and search an integrated view of disparate
       data sources.
     • Glue separate software applications together by
       transforming data from one vocabulary to another.
     • Update RDF data in bulk


38                                          ©2011 Cambridge Semantics Inc. All rights reserved.
Dealer 2
Dealer 1              Dealer 3
                                                                             Employee     ERP / Budget
                                                                             Directory      System
              Web                       EPA Fuel Efficiency
                                           Spreadsheet




                                      SPARQL Query Engine

    What automobiles get more than 25 miles per gallon and can be purchased at a
    dealer located within 10 miles of one of my employees?

                                                  SELECT ?automobile
                                                  WHERE {
                                                    ?automobile a ex:Car ; epa:mpg ?mpg ;
                                                       ex:dealer ?dealer .
                                                    ?employee a ex:Employee ; geo:loc ?loc .
                                                    ?dealer geo:loc ?dealerloc .
                                                    FILTER(?mpg > 25 &&
                                                            geo:dist(?loc, ?dealerloc) <= 10) .
                                                  }
                      Web dashboard                                 SPARQL query
The SPARQL 1.1 Landscape Includes

     • A query language
        – Now with aggregates, subqueries, property
          paths, negation, & more
     • An update language
     • An HTTP protocol for issuing SPARQL queries &
       updates
     • A REST protocol for reading/writing RDF data
     • A service description mechanism & vocabulary
     • Basic federated query extensions
     • Standard semantics for mixing query with reasoning

40                                               ©2011 Cambridge Semantics Inc. All rights reserved.
From the explicit to the inferred

     • 3 pieces of the Semantic Web technology stack are
       about describing a domain well enough to capture
       (some of) the meaning of resources and relationships
       in the domain
        – RDF Schema
        – OWL
        – RIF



          Apply knowledge to data to get more data.

41                                          ©2011 Cambridge Semantics Inc. All rights reserved.
RDFS is…




     RDF Schema




42                ©2011 Cambridge Semantics Inc. All rights reserved.
RDF Schema is…

     • Elements of:
       – Vocabulary (defining terms)
          • I define a relationship called “prescribed dose.”


       – Schema (defining types)
          • “prescribed dose” relates “treatments” to “dosages”
              – (my prescribed dose is 2mg; therefore 2mg is a dosage)


       – Taxonomy (defining hierarchies)
          • Any “doctor” is a “medical professional”
              – (therefore Dr. Brown is a medical professional)




43                                                         ©2011 Cambridge Semantics Inc. All rights reserved.
WOL OWL is…




     Web Ontology Language




44                       ©2011 Cambridge Semantics Inc. All rights reserved.
OWL is…

     • Elements of ontology
       – Same/different identity
          • “author” and “auteur” are the same relation
          • two resources with the same “ISBN” are the same “book”
       – More expressive type definitions
          • A “cycle” is a “vehicle” with at least one “wheel”
          • A “bicycle” is a “cycle” with exactly two “wheels”
       – More expressive relation definitions
          • “sibling” is a symmetric predicate
          • the value of the “favorite dwarf” relation must be one of “happy”,
            “sleepy”, “sneezy”, “grumpy”, “dopey”, “bashful”, “doc”




45                                                        ©2011 Cambridge Semantics Inc. All rights reserved.
OWL: Rich Class Definitions

     • A class is a (named) collection of things with similar
       attributes




Image courtesy of Fabien Gandon
46                                              ©2011 Cambridge Semantics Inc. All rights reserved.
OWL: Rich Class Definitions

     • A class is a (named) collection of things with similar
       attributes




Image courtesy of Fabien Gandon
47                                              ©2011 Cambridge Semantics Inc. All rights reserved.
OWL: Rich Class Definitions

     • A class is a (named) collection of things with similar
       attributes




Image courtesy of Fabien Gandon
48                                              ©2011 Cambridge Semantics Inc. All rights reserved.
OWL: Rich Class Definitions




49                          ©2011 Cambridge Semantics Inc. All rights reserved.
Why Ontologies? Put Data Within Reach of Domain Experts




        High-fidelity mappings
        make data reusable for
           many situations


50                                          ©2011 Cambridge Semantics Inc. All rights reserved.
RIF is…




     Rules Interchange Format




51                          ©2011 Cambridge Semantics Inc. All rights reserved.
RIF is…

     • Standard representation for exchanging sets of
       logical and business rules
     • Logical rules
        – A buyer buys an item from a seller if the seller sells the
          item to the buyer
        – A customer becomes a "Gold" customer as soon as his
          cumulative purchases during the current year top $5000
     • Production rules
        – Customers that become "Gold" customers must be notified
          immediately, and a golden customer card will be printed
          and sent to them within one week
        – For shopping carts worth more than $1000, "Gold"
          customers receive an additional discount of 10% of the
          total amount
52                                                  ©2011 Cambridge Semantics Inc. All rights reserved.
Fantasy Land Architecture



                                              Ontology /

                                 +             Schema




     Custom   Custom    Custom       Custom            Custom                     Custom
       UI       UI        UI           UI                UI                         UI




53                                                  ©2011 Cambridge Semantics Inc. All rights reserved.
Reality



                            Internet
                                                                  DB2
                                                                  XML
                                                                                        LDAP
                                                Oracle                                 Directory
                                                 RDB




     Custom   Custom   Custom          Custom               Custom                     Custom
       UI       UI       UI              UI                   UI                         UI




54                                                       ©2011 Cambridge Semantics Inc. All rights reserved.
R2RML is…




     Relational to RDF Mapping Language




55                               ©2011 Cambridge Semantics Inc. All rights reserved.
R2RML is…




     An RDF vocabulary for specifying mappings from
        relational data to RDF data (and SPARQL).




                   The following R2RML slides are courtesy of Alex Miller:
         http://www.slideshare.net/alexmiller/releasing-relational-data-to-the-semantic-web-7634727


56                                                                               ©2011 Cambridge Semantics Inc. All rights reserved.
GRDDL is…




     Gleaning Resource Descriptions from Dialects of
                       Language




60                                      ©2011 Cambridge Semantics Inc. All rights reserved.
GRDDL is…



     A method for authoritatively getting RDF data
          from XML and XHTML documents.




61                                     ©2011 Cambridge Semantics Inc. All rights reserved.
Linked Data is…

     • A simple set of 4 guidelines for publishing RDF data on
       the Web (over HTTP)
        – Developed by Tim Berners-Lee in 2006


     1. Use URIs as names for things
       •   Globally unique identity
     2. Use HTTP URIs
       •   Everyone has a Web browser/client
     3. When someone looks up a URI, provide useful
        information
       •   …in the form of RDF data
     4. Include links to other URIs
       •   Foster discovery of additional information

62                                                      ©2011 Cambridge Semantics Inc. All rights reserved.
The LOD Cloud, 2007




63                         ©2011 Cambridge Semantics Inc. All rights reserved.
The LOD Cloud, 2008




64                         ©2011 Cambridge Semantics Inc. All rights reserved.
The LOD Cloud, 2009




65                         ©2011 Cambridge Semantics Inc. All rights reserved.
LOD, 2011




66          ©2011 Cambridge Semantics Inc. All rights reserved.
RDFa is…




     RDF in Attributes




67                       ©2011 Cambridge Semantics Inc. All rights reserved.
RDFa is…




     A collection of HTML attributes that allow RDF to
            be embedded directly in Web pages.




68                                       ©2011 Cambridge Semantics Inc. All rights reserved.
RDFa Example
     <p vocab="http://schema.org/"
        prefix="foaf: http://xmlns.com/foaf/0.1/"
        about="#manu"
        typeof="Person">

             My name is <span property="name">Manu Sporny</span>
     and you can give me a ring via
     <span property="telephone">1-800-555-0155</span>.
     <img rel="image" src="http://manu.sporny.org/images/manu.png" />
     I have a
     <a rel="foaf:weblog" href="http://manu.sporny.org/">blog</a>.

     </p>
                           Example courtesy of Manu Sporny:
                         http://manu.sporny.org/2011/rdfa-lite/
69                                                                ©2011 Cambridge Semantics Inc. All rights reserved.
Why RDFa?

     • Don’t Repeat Yourself (DRY)
     • In-context metadata (copy & paste)
     • Authoritative (no screen scraping)




70                                          ©2011 Cambridge Semantics Inc. All rights reserved.
RDFa in action




71                    ©2011 Cambridge Semantics Inc. All rights reserved.
SEMANTIC WEB LANDSCAPE TODAY


72                       ©2011 Cambridge Semantics Inc. All rights reserved.
Semantic Web Tools




     In 2011, there are a wide variety of open-source
      and commercial Semantic Web tools available.




73                                      ©2011 Cambridge Semantics Inc. All rights reserved.
Types of RDF Tools

     • Triple stores
        – Built on relational database—increasingly less common
        – Native RDF store
     • Development libraries
     • Full-featured application servers



       Most RDF tools contain some elements of each of
                           these.


74                                               ©2011 Cambridge Semantics Inc. All rights reserved.
Finding RDF Tools

     • Community-maintained lists
        – http://esw.w3.org/topic/SemanticWebTools
     • Emphasis on large triple stores
        – http://esw.w3.org/topic/LargeTripleStores
     • Michael Bergman’s Sweet Tools searchable list:
        – http://www.mkbergman.com/?page_id=325
     • Community forums:
        – http://answers.semanticweb.com
        – #swig on irc.freenode.net
        – semantic-web@w3.org


75                                               ©2011 Cambridge Semantics Inc. All rights reserved.
Types of SPARQL Tools

     • Query engines
        – Things that can run queries
        – Most RDF stores provide a SPARQL engine
     • Query rewriters
        – E.g. to query relational databases (more later)
     • Endpoints
        – Things that accept queries on the Web and return results
     • Client libraries
        – Things that make it easy to ask queries



76                                                  ©2011 Cambridge Semantics Inc. All rights reserved.
Finding SPARQL Tools

     • Community-maintained list of query engines
        – http://esw.w3.org/topic/SparqlImplementations
     • Publicly accessible SPARQL endpoints
        – http://esw.w3.org/topic/SparqlEndpoints
     • Michael Bergman’s Sweet Tools searchable list:
        – http://www.mkbergman.com/?page_id=325
     • Community forums:
        – http://answers.semanticweb.com
        – #swig on irc.freenode.net
        – semantic-web@w3.org


77                                              ©2011 Cambridge Semantics Inc. All rights reserved.
OWL Tools and Infrastructure

     • Editors/environments
        – Protégé, TopBraid, Oiled, Ontotrack, …


     • Reasoning systems
        – Pellet, FaCT++, Hermit, Racer, CEL, …


     • Reasoning integrated into RDF databases
        – OWLIM, Oracle RDF, Stardog, Virtuoso




78                                                 ©2011 Cambridge Semantics Inc. All rights reserved.
Visualizing and Publishing Vocabularies




79                                ©2011 Cambridge Semantics Inc. All rights reserved.
Reusable, public ontologies



                                                     FOAF




     The Event Ontology



                          Measurement Units Ontology



80                                      ©2011 Cambridge Semantics Inc. All rights reserved.
What about… everything else?




     Standards don’t yet exist, but many tools exist to
      derive RDF and/or run SPARQL queries against
                  other sources of data.




81                                        ©2011 Cambridge Semantics Inc. All rights reserved.
LDAP Directories




                 Squirrel RDF
     http://jena.sourceforge.net/SquirrelRDF/



82                                              ©2011 Cambridge Semantics Inc. All rights reserved.
Excel spreadsheets




                         Anzo for Excel
     http://www.cambridgesemantics.com/products/anzo_for_excel



83                                                     ©2011 Cambridge Semantics Inc. All rights reserved.
Web-based data sources




                    Virtuoso Sponger Cartridges
     http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VirtSponger



84                                                          ©2011 Cambridge Semantics Inc. All rights reserved.
Unstructured Text




                Calais
       http://www.opencalais.com/



85                                  ©2011 Cambridge Semantics Inc. All rights reserved.
Unstructured Text




       Zemanta Web Service
      http://developer.zemanta.com/




86                                    ©2011 Cambridge Semantics Inc. All rights reserved.
Semantic Web In Use: Social Data

     • People, relationships
        – Friend Of A Friend (“FOAF”) – foaf:knows
        – Self-published or site-published (LiveJournal, hi5, …)
     • Blogs, discussion forums, mailing lists
        – Semantically Interlinked Online Communities (“SIOC”)
        – Plug-ins for popular blogging & CMS platforms
     • Calendars, vCards, reviews, …
        – One-offs




87                                                  ©2011 Cambridge Semantics Inc. All rights reserved.
Social Data Example

     • Facebook Open Graph Protocol




88                                        ©2011 Cambridge Semantics Inc. All rights reserved.
Semantic Web In Use: Scientific Data




     May 12, 2009                              89
89                                        ©2011 Cambridge Semantics Inc. All rights reserved.
Semantic Web In Use: Enterprises on the Web

     • Thesis: Describe your business more precisely and
       drive more (and better) traffic to your site
     • Example: NYTimes publishes their article
       classification scheme as linked data
     • Example: Best Buy, Overstock.com use RDFa to
       annotate product listings




90                                          ©2011 Cambridge Semantics Inc. All rights reserved.
Measurable Results




     • 30% increase in search-engine traffic
     • 15% increase in click-through-rate for search ads




91                                           ©2011 Cambridge Semantics Inc. All rights reserved.
Semantic Web In Use: Inside the Enterprise

     • Many and Varied Applications Across Industries
        – Health care and pharma
           • integration, classification, ontologies
        – Oil & Gas
           • integration, classification
        – Finance
           • structured data, ontologies, XBRL
        – Publishing
           • metadata
        – Libraries & museums
           • metadata, classification
        – IT
           • rapid application development & evolution
92                                                       ©2011 Cambridge Semantics Inc. All rights reserved.
Targeting High-Potential Opportunities in Pharma

                                                                 ...
                      Territory    Profile     Preferred
       Regional                                 targets
       Analyst


                             Per-analyst
                           relevance filter




       Universe of
       considered
      opportunities
                                              High-potential
                                              opportunities
                                                                                 Mobile device


93                                                             ©2011 Cambridge Semantics Inc. All rights reserved.
Delivering Dynamic, Data-driven Websites




 “publishing stack is a great innovation for the BBC as dynamicthe first to
     The development of this new high-performance
                                                        we are
                                                                semantic

      use this technology on such a high-profile site. It also puts us at the
cutting edge of development for the next phase of the Internet, Web 3.0.
94                                                                      ©2011 Cambridge Semantics Inc. All rights reserved.
Semantic Web In Use: Government data


     – Since January 2010, 2,500 (large) datasets published as
       Linked Data




     – Since May 2009, 250,000 (smaller)
       datasets published (CSV, XML, …)
     – RPI project to convert datasets to
       Linked Data

95                                               ©2011 Cambridge Semantics Inc. All rights reserved.
TAKE-AWAY ADVICE


96                      ©2011 Cambridge Semantics Inc. All rights reserved.
Where do Semantic Web technologies shine?

     • These are horizontal, enabling technologies.
     • But they apply particularly well to problems with
       these characteristics:
        – Heterogeneous data from multiple, diverse sources
             • Increasing reliance on connections within this data
        –   Rapidly changing information needs
        –   Significant early-mover advantage
        –   Cross-organizational collaboration
        –   Large amounts of data that would benefit from
            classification



97                                                          ©2011 Cambridge Semantics Inc. All rights reserved.
Getting Started with Semantic Web technologies




                  Don’t boil the ocean.




98                                        ©2011 Cambridge Semantics Inc. All rights reserved.
Getting Started with Semantic Web technologies

     • Goal: quick tactical wins on the path to large
       strategic value
     • Be sure to consider the operational ramifications
        – Who does what differently?
     • Ideal Semantic Web projects/applications have an
       incremental path towards broad deployment that
       generates demonstrable value along the way




99                                           ©2011 Cambridge Semantics Inc. All rights reserved.
Choose practical, enterprise-ready tools

      • Look beyond the core Semantic Web capabilities and
        consider:
        –   integration with existing enterprise systems
        –   development & extension models
        –   deployment, logging, maintenance, backup
        –   tooling
        –   user experience

          If you choose to build new components and
       assemble existing components together, it’s quite
           likely you’ll end up reinventing the wheel.

100                                                  ©2011 Cambridge Semantics Inc. All rights reserved.
Plan for Acquiring Expertise

      • What level of expertise is necessary?
         –   Technologies only?
         –   Technologies + API?
         –   Technologies + tooling?
         –   Tooling only?
         –   …
      • How will we acquire the expertise?
         –   In-house (and if so, how?)
         –   Vendor services
         –   3rd-party services
         –   Open-source community

101                                             ©2011 Cambridge Semantics Inc. All rights reserved.
Thanks & Discussion

      • I’m always happy to field questions & engage in
        discussion:

                  lee@cambridgesemantics.com




102                                          ©2011 Cambridge Semantics Inc. All rights reserved.

More Related Content

What's hot

Explaining The Semantic Web
Explaining The Semantic WebExplaining The Semantic Web
Explaining The Semantic WebSourav Sharma
 
The Semantic Web: status and prospects
The Semantic Web: status and prospectsThe Semantic Web: status and prospects
The Semantic Web: status and prospectsGuus Schreiber
 
What can linked data do for digital libraries
What can linked data do for digital librariesWhat can linked data do for digital libraries
What can linked data do for digital librariesSören Auer
 
Chapter 1 semantic web
Chapter 1 semantic webChapter 1 semantic web
Chapter 1 semantic webR A Akerkar
 
OpenLink Virtuoso - Management & Decision Makers Overview
OpenLink Virtuoso - Management & Decision Makers OverviewOpenLink Virtuoso - Management & Decision Makers Overview
OpenLink Virtuoso - Management & Decision Makers OverviewKingsley Uyi Idehen
 
Making the Conceptual Layer Real via HTTP based Linked Data
Making the Conceptual Layer Real via HTTP based Linked DataMaking the Conceptual Layer Real via HTTP based Linked Data
Making the Conceptual Layer Real via HTTP based Linked DataKingsley Uyi Idehen
 
Lecture: Ontologies and the Semantic Web
Lecture: Ontologies and the Semantic WebLecture: Ontologies and the Semantic Web
Lecture: Ontologies and the Semantic WebMarina Santini
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataSören Auer
 
Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
 Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference KeynoteKingsley Uyi Idehen
 
Website Performance at Client Level
Website Performance at Client LevelWebsite Performance at Client Level
Website Performance at Client LevelConstantin Stan
 
The Semantic Web
The Semantic WebThe Semantic Web
The Semantic Webostephens
 
The Semantic Web: What IAs Need to Know About Web 3.0
The Semantic Web: What IAs Need to Know About Web 3.0The Semantic Web: What IAs Need to Know About Web 3.0
The Semantic Web: What IAs Need to Know About Web 3.0Chiara Fox Ogan
 
An Introduction to Semantic Web Technology
An Introduction to Semantic Web TechnologyAn Introduction to Semantic Web Technology
An Introduction to Semantic Web TechnologyAnkur Biswas
 
Linked data for Enterprise Data Integration
Linked data for Enterprise Data IntegrationLinked data for Enterprise Data Integration
Linked data for Enterprise Data IntegrationSören Auer
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareIMC Technologies
 
Cooking up the Semantic Web
Cooking up the Semantic WebCooking up the Semantic Web
Cooking up the Semantic WebOntotext
 

What's hot (20)

Explaining The Semantic Web
Explaining The Semantic WebExplaining The Semantic Web
Explaining The Semantic Web
 
The Semantic Web: status and prospects
The Semantic Web: status and prospectsThe Semantic Web: status and prospects
The Semantic Web: status and prospects
 
Taking Advantage of Semantic Web
Taking Advantage of Semantic WebTaking Advantage of Semantic Web
Taking Advantage of Semantic Web
 
Semantic web
Semantic web Semantic web
Semantic web
 
What can linked data do for digital libraries
What can linked data do for digital librariesWhat can linked data do for digital libraries
What can linked data do for digital libraries
 
Chapter 1 semantic web
Chapter 1 semantic webChapter 1 semantic web
Chapter 1 semantic web
 
OpenLink Virtuoso - Management & Decision Makers Overview
OpenLink Virtuoso - Management & Decision Makers OverviewOpenLink Virtuoso - Management & Decision Makers Overview
OpenLink Virtuoso - Management & Decision Makers Overview
 
Making the Conceptual Layer Real via HTTP based Linked Data
Making the Conceptual Layer Real via HTTP based Linked DataMaking the Conceptual Layer Real via HTTP based Linked Data
Making the Conceptual Layer Real via HTTP based Linked Data
 
Lecture: Ontologies and the Semantic Web
Lecture: Ontologies and the Semantic WebLecture: Ontologies and the Semantic Web
Lecture: Ontologies and the Semantic Web
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
 
Semantics and Web 3.0
Semantics and Web 3.0Semantics and Web 3.0
Semantics and Web 3.0
 
Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
 Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
 
Website Performance at Client Level
Website Performance at Client LevelWebsite Performance at Client Level
Website Performance at Client Level
 
The Semantic Web
The Semantic WebThe Semantic Web
The Semantic Web
 
The Semantic Web: What IAs Need to Know About Web 3.0
The Semantic Web: What IAs Need to Know About Web 3.0The Semantic Web: What IAs Need to Know About Web 3.0
The Semantic Web: What IAs Need to Know About Web 3.0
 
An Introduction to Semantic Web Technology
An Introduction to Semantic Web TechnologyAn Introduction to Semantic Web Technology
An Introduction to Semantic Web Technology
 
Linked data for Enterprise Data Integration
Linked data for Enterprise Data IntegrationLinked data for Enterprise Data Integration
Linked data for Enterprise Data Integration
 
Eprints Application Profile
Eprints Application ProfileEprints Application Profile
Eprints Application Profile
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the Software
 
Cooking up the Semantic Web
Cooking up the Semantic WebCooking up the Semantic Web
Cooking up the Semantic Web
 

Similar to Intro to the Semantic Web Landscape - 2011

Lee Feigenbaum Presentation
Lee Feigenbaum PresentationLee Feigenbaum Presentation
Lee Feigenbaum PresentationMediabistro
 
Ontotext Overview Winter 2012
Ontotext Overview Winter 2012Ontotext Overview Winter 2012
Ontotext Overview Winter 2012Matthew Petrillo
 
First Industrial Results of Semantic Technologies - Claudio Bergamini
First Industrial Results of Semantic Technologies -  Claudio BergaminiFirst Industrial Results of Semantic Technologies -  Claudio Bergamini
First Industrial Results of Semantic Technologies - Claudio BergaminiClaudio Bergamini
 
Research ON Big Data
Research ON Big DataResearch ON Big Data
Research ON Big Datamysqlops
 
Research on big data
Research on big dataResearch on big data
Research on big dataRoby Chen
 
Introduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataIntroduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataAndre Freitas
 
emantic web technologies and applications for Ins
emantic web technologies and applications for Insemantic web technologies and applications for Ins
emantic web technologies and applications for InsTemesgenHabtamu
 
Semantic Web research anno 2006:main streams, popular falacies, current statu...
Semantic Web research anno 2006:main streams, popular falacies, current statu...Semantic Web research anno 2006:main streams, popular falacies, current statu...
Semantic Web research anno 2006:main streams, popular falacies, current statu...Frank van Harmelen
 
Social web & linked data
Social web & linked dataSocial web & linked data
Social web & linked dataSerge Garlatti
 
Digital Asset Management with Alfresco
Digital Asset Management with AlfrescoDigital Asset Management with Alfresco
Digital Asset Management with Alfrescorivetlogic
 
NOSQLEU - Graph Databases and Neo4j
NOSQLEU - Graph Databases and Neo4jNOSQLEU - Graph Databases and Neo4j
NOSQLEU - Graph Databases and Neo4jTobias Lindaaker
 
Solving Real Problems Using Linked Data
Solving Real Problems Using Linked DataSolving Real Problems Using Linked Data
Solving Real Problems Using Linked Datarumito
 
Semantic web technology
Semantic web technologySemantic web technology
Semantic web technologyStanley Wang
 
Implementing Semantic Web applications: reference architecture and challenges
Implementing Semantic Web applications:  reference architecture and challengesImplementing Semantic Web applications:  reference architecture and challenges
Implementing Semantic Web applications: reference architecture and challengesBenjamin Heitmann
 
Maximal: MPL Software Demo - INFORMS Phoenix Oct 2012
Maximal: MPL Software Demo - INFORMS Phoenix Oct 2012Maximal: MPL Software Demo - INFORMS Phoenix Oct 2012
Maximal: MPL Software Demo - INFORMS Phoenix Oct 2012Bjarni Kristjánsson
 
Semantic Web 2.0
Semantic Web 2.0Semantic Web 2.0
Semantic Web 2.0hchen1
 

Similar to Intro to the Semantic Web Landscape - 2011 (20)

Lee Feigenbaum Presentation
Lee Feigenbaum PresentationLee Feigenbaum Presentation
Lee Feigenbaum Presentation
 
Ontotext Overview Winter 2012
Ontotext Overview Winter 2012Ontotext Overview Winter 2012
Ontotext Overview Winter 2012
 
ISWC 2012 - Linked Data Meetup
ISWC 2012 - Linked Data MeetupISWC 2012 - Linked Data Meetup
ISWC 2012 - Linked Data Meetup
 
First Industrial Results of Semantic Technologies - Claudio Bergamini
First Industrial Results of Semantic Technologies -  Claudio BergaminiFirst Industrial Results of Semantic Technologies -  Claudio Bergamini
First Industrial Results of Semantic Technologies - Claudio Bergamini
 
Research ON Big Data
Research ON Big DataResearch ON Big Data
Research ON Big Data
 
Research on big data
Research on big dataResearch on big data
Research on big data
 
Introduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataIntroduction to question answering for linked data & big data
Introduction to question answering for linked data & big data
 
we to deep learning
we to deep learning we to deep learning
we to deep learning
 
emantic web technologies and applications for Ins
emantic web technologies and applications for Insemantic web technologies and applications for Ins
emantic web technologies and applications for Ins
 
Semantic Web research anno 2006:main streams, popular falacies, current statu...
Semantic Web research anno 2006:main streams, popular falacies, current statu...Semantic Web research anno 2006:main streams, popular falacies, current statu...
Semantic Web research anno 2006:main streams, popular falacies, current statu...
 
Social web & linked data
Social web & linked dataSocial web & linked data
Social web & linked data
 
Digital Asset Management with Alfresco
Digital Asset Management with AlfrescoDigital Asset Management with Alfresco
Digital Asset Management with Alfresco
 
NOSQLEU - Graph Databases and Neo4j
NOSQLEU - Graph Databases and Neo4jNOSQLEU - Graph Databases and Neo4j
NOSQLEU - Graph Databases and Neo4j
 
Solving Real Problems Using Linked Data
Solving Real Problems Using Linked DataSolving Real Problems Using Linked Data
Solving Real Problems Using Linked Data
 
Semantic web technology
Semantic web technologySemantic web technology
Semantic web technology
 
Implementing Semantic Web applications: reference architecture and challenges
Implementing Semantic Web applications:  reference architecture and challengesImplementing Semantic Web applications:  reference architecture and challenges
Implementing Semantic Web applications: reference architecture and challenges
 
Semantic web
Semantic webSemantic web
Semantic web
 
Semtech2006
Semtech2006Semtech2006
Semtech2006
 
Maximal: MPL Software Demo - INFORMS Phoenix Oct 2012
Maximal: MPL Software Demo - INFORMS Phoenix Oct 2012Maximal: MPL Software Demo - INFORMS Phoenix Oct 2012
Maximal: MPL Software Demo - INFORMS Phoenix Oct 2012
 
Semantic Web 2.0
Semantic Web 2.0Semantic Web 2.0
Semantic Web 2.0
 

More from LeeFeigenbaum

Evolution Towards Web 3.0: The Semantic Web
Evolution Towards Web 3.0: The Semantic WebEvolution Towards Web 3.0: The Semantic Web
Evolution Towards Web 3.0: The Semantic WebLeeFeigenbaum
 
Taking the Tech out of SemTech
Taking the Tech out of SemTechTaking the Tech out of SemTech
Taking the Tech out of SemTechLeeFeigenbaum
 
CSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web TutorialCSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web TutorialLeeFeigenbaum
 
What;s Coming In SPARQL2?
What;s Coming In SPARQL2?What;s Coming In SPARQL2?
What;s Coming In SPARQL2?LeeFeigenbaum
 

More from LeeFeigenbaum (6)

Evolution Towards Web 3.0: The Semantic Web
Evolution Towards Web 3.0: The Semantic WebEvolution Towards Web 3.0: The Semantic Web
Evolution Towards Web 3.0: The Semantic Web
 
Taking the Tech out of SemTech
Taking the Tech out of SemTechTaking the Tech out of SemTech
Taking the Tech out of SemTech
 
CSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web TutorialCSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web Tutorial
 
What;s Coming In SPARQL2?
What;s Coming In SPARQL2?What;s Coming In SPARQL2?
What;s Coming In SPARQL2?
 
SPARQL 1.1 Status
SPARQL 1.1 StatusSPARQL 1.1 Status
SPARQL 1.1 Status
 
SPARQL Cheat Sheet
SPARQL Cheat SheetSPARQL Cheat Sheet
SPARQL Cheat Sheet
 

Recently uploaded

Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 

Recently uploaded (20)

Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 

Intro to the Semantic Web Landscape - 2011

  • 1. The Semantic Web Landscape A Practical Introduction Contact: Lee Feigenbaum lee@cambridgesemantics.com VP Technology Co-chair, W3C SPARQL Working Group ©2011 Cambridge Semantics Inc. All rights reserved.
  • 2. Example: Alzheimer’s Drug Discovery What genes are involved in signal transduction and are related to pyramidal neurons? 2 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 3. General search: 223,000 hits, 0 results 3 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 4. Domain-limited search: Still 2,580 potential results 4 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 5. Specific databases: Too many silos! 5 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 6. Linked Scientific Data: 32 targeted results 6 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 7. What’s the trick? 1. Agreement on common terms and relationships 2. Incremental, flexible data structure 3. Good-enough modeling 4. Query interface tailored to the data model 7 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 8. WHAT IS THE SEMANTIC WEB? 8 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 9. Names 9 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 10. Branding • Semantic Web • Web of Data • Giant Global Graph • Data Web • Web 3.0 • Linked Data Web • Semantic Data Web • Enterprise Information Web 10 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 11. Semantic Web – 1st View • “The Semantic Web” – Link explicit data on the World Wide Web in a machine- readable fashion • …government data • …commercial data • …scientific data • …social data – In order to enable applications such as… • …targeted, semantic search • …data browsing • …automated agents World Wide Web : Web pages :: The Semantic Web : Data 11 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 12. Semantic Web – 2nd View • “Semantic Web technologies” – A family of technology standards that ‘play nice together’, including: • Flexible data model • Expressive ontology language • Distributed query language – Drive enterprise applications, including: • Data integration & virtualization • Business intelligence • Large knowledgebases • … The technologies enable us to build applications and solutions that were not possible, practical, or feasible traditionally. 12 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 13. A Common & Coherent Set of Technology Standards • A common set of technologies: – ...enables diverse uses – ...encourages interoperability • A coherent set of technologies: – …encourage incremental application – …provide a substantial base for innovation • A standard set of technologies: – ...reduces proprietary vendor lock-in – ...encourages many choices for tool sets 13 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 14. The (In)Famous Layer Cake 14 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 15. The (In)Famous Layer Cake 15 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 16. Semantic Web Technology Timeline 1999 2001 2004 2007 2008 2011 RIF 16 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 17. 2011: Where we are As technologies & tools have evolved, Semantic Web advocates have progressed through stages: Report on… Execute on… Semantic Web vision Initial experiments Experiments Technology standards Technology standards Software packages Software packages Proofs of concept Proofs of concept Initial production implementations 2nd, 3rd, … implementations— Initial production implementations network effect 17 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 18. 2011: Where we’re not Image from Trey Ideker via Enoch Huang Semantic Web technologies are not a ‘magic crank’ for discovering new drugs (or solving other problems, for that matter)! 18 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 19. 2011: Where we’re not (cont’d) “Ontology” vs. “ontology”? XML vs. RDF? Semantic Web vs. Linked Data? Data integration vs. reasoning vs. KBs RDFa vs. microformats vs. search vs. app. vs. microdata vs. development vs. … schema.org The Semantic Web still suffers from confusing and conflicting messaging, each of which claims it’s “correct”. 19 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 20. 2011: Where we’re not (cont’d) We don’t yet have standard solutions for privacy, trust, probability, and other elements of the Semantic Web vision. 20 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 21. What do Semantic Web solutions look like? 21 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 22. RDF is… Resource Description Framework 22 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 23. RDF is… The data model of the Semantic Web. 23 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 24. RDF is… A flexible data model that features unambiguous identifiers and named relations between pairs of resources. 24 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 25. RDF is… A labeled, directed graph of relations between resources and literal values. • RDF graphs are collections of triples • Triples are made up of a subject, a predicate, and an object predicate subject object • Resources and relationships are named with URIs 25 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 26. Example RDF triples • “Lee Feigenbaum works for Cambridge Semantics” Lee works for Cambridge Feigenbaum Semantics • “Lee Feigenbaum was born in 1978” Lee born in 1978 Feigenbaum • “Cambridge Semantics is headquartered in Massachusetts” Cambridge headquartered Massachusetts Semantics 26 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 27. Triples connect to form graphs Lee works for Cambridge Feigenbaum Semantics headquartered born in lives in 1978 Massachusetts capital Boston 27 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 28. Why RDF? What’s different here? • The graph data structure makes merging data with shared identifiers trivial • Triples act as a least common denominator for expressing data • URIs for naming remove ambiguity – …the same identifier means the same thing 28 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 29. Why RDF? Coping With Change Flexible Graph URIs for Agility Model naming On-the-fly The World Changes Traditionally: Change is costly Semantics: Change is cheap RDB 1 RDB 2 29 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 30. Why RDF? Add Meaning to Data With traditional technology: Cust ID Name Referred By Work Phone 29212 Travis Ember Janet Cassy Barbara Cassy 212-555-5001 Inside the 30012 Jessica Evalta Brian Meedly 617-555-2325 database 59235 Hector Samton Agatha Browne 732-555-8715 29212 Travis Ember Janet Cassy 212-555-5001 30012 Jessica Evalta Brian Meedly 617-555-2325 Outside the 59235 Hector Samton Agatha Browne 732-555-8715 database No one knows what these numbers and names mean! 30 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 31. Why RDF? Add Meaning to Data With Semantic Web technology: name Text Person referred by Data Text description mobile phone Text name Travis Ember Person2912 referred by Data, Janet Cassy wherever it mobile appears phone 212-555-5001 The meaning always travels with the data 31 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 32. What does RDF look like? • RDF is the model, for which there are several concrete syntaxes: – RDF/XML – standard, complex XML syntax – Turtle – common, textual, triples-oriented syntax • …currently being standardized by the RDF working group – N3 – more expressive superset of Turtle – N-Triples – textual, line-oriented, useful for streaming When writing RDF by hand and in many guides, examples, and discussions these days, you’ll see Turtle most often. 32 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 33. A Bit of Turtle • Write a triple by writing its parts separated by spaces (subject predicate object) @prefix ex: <http://example.org/myvocab/> . @prefix geo: <http://geonames.example/> . ex:LeeFeigenbaum ex:employer ex:CambridgeSemantics . ex:LeeFeigenbaum ex:birthYear 1978 . ex:CambridgeSemantics ex:headquarters geo:BostonMA . geo:BostonMA ex:population 574000 . 33 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 34. SPARQL is… SPARQL Protocol And RDF Query Language 34 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 35. SPARQL is… The query language of the Semantic Web. 35 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 36. SPARQL is… A SQL-like language for querying sets of RDF graphs. 36 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 37. SPARQL is… A simple protocol for issuing queries and receiving results over HTTP. So… Every SPARQL client works with every SPARQL server! 37 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 38. Why SPARQL? SPARQL lets us: • Pull information from structured and semi-structured data. • Explore data by discovering unknown relationships. • Query and search an integrated view of disparate data sources. • Glue separate software applications together by transforming data from one vocabulary to another. • Update RDF data in bulk 38 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 39. Dealer 2 Dealer 1 Dealer 3 Employee ERP / Budget Directory System Web EPA Fuel Efficiency Spreadsheet SPARQL Query Engine What automobiles get more than 25 miles per gallon and can be purchased at a dealer located within 10 miles of one of my employees? SELECT ?automobile WHERE { ?automobile a ex:Car ; epa:mpg ?mpg ; ex:dealer ?dealer . ?employee a ex:Employee ; geo:loc ?loc . ?dealer geo:loc ?dealerloc . FILTER(?mpg > 25 && geo:dist(?loc, ?dealerloc) <= 10) . } Web dashboard SPARQL query
  • 40. The SPARQL 1.1 Landscape Includes • A query language – Now with aggregates, subqueries, property paths, negation, & more • An update language • An HTTP protocol for issuing SPARQL queries & updates • A REST protocol for reading/writing RDF data • A service description mechanism & vocabulary • Basic federated query extensions • Standard semantics for mixing query with reasoning 40 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 41. From the explicit to the inferred • 3 pieces of the Semantic Web technology stack are about describing a domain well enough to capture (some of) the meaning of resources and relationships in the domain – RDF Schema – OWL – RIF Apply knowledge to data to get more data. 41 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 42. RDFS is… RDF Schema 42 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 43. RDF Schema is… • Elements of: – Vocabulary (defining terms) • I define a relationship called “prescribed dose.” – Schema (defining types) • “prescribed dose” relates “treatments” to “dosages” – (my prescribed dose is 2mg; therefore 2mg is a dosage) – Taxonomy (defining hierarchies) • Any “doctor” is a “medical professional” – (therefore Dr. Brown is a medical professional) 43 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 44. WOL OWL is… Web Ontology Language 44 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 45. OWL is… • Elements of ontology – Same/different identity • “author” and “auteur” are the same relation • two resources with the same “ISBN” are the same “book” – More expressive type definitions • A “cycle” is a “vehicle” with at least one “wheel” • A “bicycle” is a “cycle” with exactly two “wheels” – More expressive relation definitions • “sibling” is a symmetric predicate • the value of the “favorite dwarf” relation must be one of “happy”, “sleepy”, “sneezy”, “grumpy”, “dopey”, “bashful”, “doc” 45 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 46. OWL: Rich Class Definitions • A class is a (named) collection of things with similar attributes Image courtesy of Fabien Gandon 46 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 47. OWL: Rich Class Definitions • A class is a (named) collection of things with similar attributes Image courtesy of Fabien Gandon 47 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 48. OWL: Rich Class Definitions • A class is a (named) collection of things with similar attributes Image courtesy of Fabien Gandon 48 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 49. OWL: Rich Class Definitions 49 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 50. Why Ontologies? Put Data Within Reach of Domain Experts High-fidelity mappings make data reusable for many situations 50 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 51. RIF is… Rules Interchange Format 51 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 52. RIF is… • Standard representation for exchanging sets of logical and business rules • Logical rules – A buyer buys an item from a seller if the seller sells the item to the buyer – A customer becomes a "Gold" customer as soon as his cumulative purchases during the current year top $5000 • Production rules – Customers that become "Gold" customers must be notified immediately, and a golden customer card will be printed and sent to them within one week – For shopping carts worth more than $1000, "Gold" customers receive an additional discount of 10% of the total amount 52 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 53. Fantasy Land Architecture Ontology / + Schema Custom Custom Custom Custom Custom Custom UI UI UI UI UI UI 53 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 54. Reality Internet DB2 XML LDAP Oracle Directory RDB Custom Custom Custom Custom Custom Custom UI UI UI UI UI UI 54 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 55. R2RML is… Relational to RDF Mapping Language 55 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 56. R2RML is… An RDF vocabulary for specifying mappings from relational data to RDF data (and SPARQL). The following R2RML slides are courtesy of Alex Miller: http://www.slideshare.net/alexmiller/releasing-relational-data-to-the-semantic-web-7634727 56 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 57.
  • 58.
  • 59.
  • 60. GRDDL is… Gleaning Resource Descriptions from Dialects of Language 60 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 61. GRDDL is… A method for authoritatively getting RDF data from XML and XHTML documents. 61 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 62. Linked Data is… • A simple set of 4 guidelines for publishing RDF data on the Web (over HTTP) – Developed by Tim Berners-Lee in 2006 1. Use URIs as names for things • Globally unique identity 2. Use HTTP URIs • Everyone has a Web browser/client 3. When someone looks up a URI, provide useful information • …in the form of RDF data 4. Include links to other URIs • Foster discovery of additional information 62 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 63. The LOD Cloud, 2007 63 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 64. The LOD Cloud, 2008 64 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 65. The LOD Cloud, 2009 65 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 66. LOD, 2011 66 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 67. RDFa is… RDF in Attributes 67 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 68. RDFa is… A collection of HTML attributes that allow RDF to be embedded directly in Web pages. 68 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 69. RDFa Example <p vocab="http://schema.org/" prefix="foaf: http://xmlns.com/foaf/0.1/" about="#manu" typeof="Person"> My name is <span property="name">Manu Sporny</span> and you can give me a ring via <span property="telephone">1-800-555-0155</span>. <img rel="image" src="http://manu.sporny.org/images/manu.png" /> I have a <a rel="foaf:weblog" href="http://manu.sporny.org/">blog</a>. </p> Example courtesy of Manu Sporny: http://manu.sporny.org/2011/rdfa-lite/ 69 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 70. Why RDFa? • Don’t Repeat Yourself (DRY) • In-context metadata (copy & paste) • Authoritative (no screen scraping) 70 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 71. RDFa in action 71 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 72. SEMANTIC WEB LANDSCAPE TODAY 72 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 73. Semantic Web Tools In 2011, there are a wide variety of open-source and commercial Semantic Web tools available. 73 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 74. Types of RDF Tools • Triple stores – Built on relational database—increasingly less common – Native RDF store • Development libraries • Full-featured application servers Most RDF tools contain some elements of each of these. 74 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 75. Finding RDF Tools • Community-maintained lists – http://esw.w3.org/topic/SemanticWebTools • Emphasis on large triple stores – http://esw.w3.org/topic/LargeTripleStores • Michael Bergman’s Sweet Tools searchable list: – http://www.mkbergman.com/?page_id=325 • Community forums: – http://answers.semanticweb.com – #swig on irc.freenode.net – semantic-web@w3.org 75 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 76. Types of SPARQL Tools • Query engines – Things that can run queries – Most RDF stores provide a SPARQL engine • Query rewriters – E.g. to query relational databases (more later) • Endpoints – Things that accept queries on the Web and return results • Client libraries – Things that make it easy to ask queries 76 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 77. Finding SPARQL Tools • Community-maintained list of query engines – http://esw.w3.org/topic/SparqlImplementations • Publicly accessible SPARQL endpoints – http://esw.w3.org/topic/SparqlEndpoints • Michael Bergman’s Sweet Tools searchable list: – http://www.mkbergman.com/?page_id=325 • Community forums: – http://answers.semanticweb.com – #swig on irc.freenode.net – semantic-web@w3.org 77 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 78. OWL Tools and Infrastructure • Editors/environments – Protégé, TopBraid, Oiled, Ontotrack, … • Reasoning systems – Pellet, FaCT++, Hermit, Racer, CEL, … • Reasoning integrated into RDF databases – OWLIM, Oracle RDF, Stardog, Virtuoso 78 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 79. Visualizing and Publishing Vocabularies 79 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 80. Reusable, public ontologies FOAF The Event Ontology Measurement Units Ontology 80 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 81. What about… everything else? Standards don’t yet exist, but many tools exist to derive RDF and/or run SPARQL queries against other sources of data. 81 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 82. LDAP Directories Squirrel RDF http://jena.sourceforge.net/SquirrelRDF/ 82 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 83. Excel spreadsheets Anzo for Excel http://www.cambridgesemantics.com/products/anzo_for_excel 83 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 84. Web-based data sources Virtuoso Sponger Cartridges http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VirtSponger 84 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 85. Unstructured Text Calais http://www.opencalais.com/ 85 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 86. Unstructured Text Zemanta Web Service http://developer.zemanta.com/ 86 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 87. Semantic Web In Use: Social Data • People, relationships – Friend Of A Friend (“FOAF”) – foaf:knows – Self-published or site-published (LiveJournal, hi5, …) • Blogs, discussion forums, mailing lists – Semantically Interlinked Online Communities (“SIOC”) – Plug-ins for popular blogging & CMS platforms • Calendars, vCards, reviews, … – One-offs 87 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 88. Social Data Example • Facebook Open Graph Protocol 88 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 89. Semantic Web In Use: Scientific Data May 12, 2009 89 89 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 90. Semantic Web In Use: Enterprises on the Web • Thesis: Describe your business more precisely and drive more (and better) traffic to your site • Example: NYTimes publishes their article classification scheme as linked data • Example: Best Buy, Overstock.com use RDFa to annotate product listings 90 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 91. Measurable Results • 30% increase in search-engine traffic • 15% increase in click-through-rate for search ads 91 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 92. Semantic Web In Use: Inside the Enterprise • Many and Varied Applications Across Industries – Health care and pharma • integration, classification, ontologies – Oil & Gas • integration, classification – Finance • structured data, ontologies, XBRL – Publishing • metadata – Libraries & museums • metadata, classification – IT • rapid application development & evolution 92 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 93. Targeting High-Potential Opportunities in Pharma ... Territory Profile Preferred Regional targets Analyst Per-analyst relevance filter Universe of considered opportunities High-potential opportunities Mobile device 93 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 94. Delivering Dynamic, Data-driven Websites “publishing stack is a great innovation for the BBC as dynamicthe first to The development of this new high-performance we are semantic use this technology on such a high-profile site. It also puts us at the cutting edge of development for the next phase of the Internet, Web 3.0. 94 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 95. Semantic Web In Use: Government data – Since January 2010, 2,500 (large) datasets published as Linked Data – Since May 2009, 250,000 (smaller) datasets published (CSV, XML, …) – RPI project to convert datasets to Linked Data 95 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 96. TAKE-AWAY ADVICE 96 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 97. Where do Semantic Web technologies shine? • These are horizontal, enabling technologies. • But they apply particularly well to problems with these characteristics: – Heterogeneous data from multiple, diverse sources • Increasing reliance on connections within this data – Rapidly changing information needs – Significant early-mover advantage – Cross-organizational collaboration – Large amounts of data that would benefit from classification 97 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 98. Getting Started with Semantic Web technologies Don’t boil the ocean. 98 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 99. Getting Started with Semantic Web technologies • Goal: quick tactical wins on the path to large strategic value • Be sure to consider the operational ramifications – Who does what differently? • Ideal Semantic Web projects/applications have an incremental path towards broad deployment that generates demonstrable value along the way 99 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 100. Choose practical, enterprise-ready tools • Look beyond the core Semantic Web capabilities and consider: – integration with existing enterprise systems – development & extension models – deployment, logging, maintenance, backup – tooling – user experience If you choose to build new components and assemble existing components together, it’s quite likely you’ll end up reinventing the wheel. 100 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 101. Plan for Acquiring Expertise • What level of expertise is necessary? – Technologies only? – Technologies + API? – Technologies + tooling? – Tooling only? – … • How will we acquire the expertise? – In-house (and if so, how?) – Vendor services – 3rd-party services – Open-source community 101 ©2011 Cambridge Semantics Inc. All rights reserved.
  • 102. Thanks & Discussion • I’m always happy to field questions & engage in discussion: lee@cambridgesemantics.com 102 ©2011 Cambridge Semantics Inc. All rights reserved.

Editor's Notes

  1. This initial example shows how a Semantic Web approach differs from both traditional search as well as siloed databases.
  2. Search gives lots of potential hits, but no targeted results. It requires a large investment to manually sort through the hits to see if there are any concrete results within them.
  3. Search within a specific domain is a bit better, but shares the same basic limitations.
  4. This is probably the most common way to find answers – look in specific databases. As soon as a problem spans multiple databases, however, you run into the silo problem: you need to ask part of the question against database A, part against database B, and manually figure out how to combine the results from both to get an answer to the overall question. This is magnified if there are 3, 5, or 20 databases involved in getting an answer.
  5. With a Semantic Web approach, the siloes are broken down, the data is linked together, and a single query can result in a targeted list of specific results.
  6. One of the goals of this tutorial is to de-mystify the all of the names of technologies, tools, projects, etc. that swirl around the Semantic Web story.And since I saw that as I researched this presentation, everyone seems to like this particular Gary Larson cartoon, it behooved me to include it.
  7. Different nuances, but the same actual thing. Still, you can often tell a lot about someone’s view of Semantic Web based on the terms they choose to you to describe it. Linked Data Web has been – relatively speaking – successful in gaining traction.
  8. This is the ultimate vision as per the original Scientific American article. Referred to last week as the “top-down approach”.
  9. Many of the people that have been building the technologies, standards, and tools are doing so with these ends in mind. They have (disruptive, game-changing) problems today and these technologies provide a way to solve them today.
  10. http://www.flickr.com/photos/reverendsam/2367569306/The good – emphasize the importance of the foundational layers (URIs and RDF) ; emphasizes the long-term roadmap/vision of what’s needed for the Semantic WebThe bad – implies that perhaps things can’t be taken serious until all the pieces are in place ; implies an order to the research ; various versions of the cake tell different stories (importance of XML, absence of query, lack of UI/application layer, …)Valentin Zacharias wrote about the “infamy” part of the layer cake here: http://www.valentinzacharias.de/blog/2007/04/ban-semantic-web-layer-cake.html
  11. http://www.flickr.com/photos/reverendsam/2367569306/The good – emphasize the importance of the foundational layers (URIs and RDF) ; emphasizes the long-term roadmap/vision of what’s needed for the Semantic WebThe bad – implies that perhaps things can’t be taken serious until all the pieces are in place ; implies an order to the research ; various versions of the cake tell different stories (importance of XML, absence of query, lack of UI/application layer, …)Valentin Zacharias wrote about the “infamy” part of the layer cake here: http://www.valentinzacharias.de/blog/2007/04/ban-semantic-web-layer-cake.html
  12. What’s been happening this whole time? (Between the introduction of the vision and today.) A lot of technology, standards, tool, and product development. Also, a lot of advocacy.
  13. The Ontology/ontology dichotomy is captured well by Jim Hendler at http://www.cs.rpi.edu/%7Ehendler/presentations/SemTech2008-2Towers.pdf
  14. Of course, these are all unsolved problems in the relational world as well, but they may be magnified with the highly distributed nature of the Semantic Web.
  15. Definition.
  16. Prescriptive.
  17. Descriptive.
  18. Formal.
  19. The first is as opposed to relational tables or XML schemas where the schema needs to be explicitly adjusted to accommodate whatever data is being merged.The second is due to the expressivity of the model – can handle lists, trees, n-ary relations, etc.The third is as opposed to table &amp; column identifiers or XML attribute names.
  20. (This slide best told with animation in the original PowerPoint.)The Semantic Web paradigms allows new and updated data to be brought “into the fold” incrementally, without starting over. This makes it particularly amenable to changing requirements.
  21. Main message: With relational technology, you lose the meaning of the data as soon as it exits the database – and so you end up hard-coding the knowledge of what the data means throughout every part of a software application. This is very error-prone and means it’s extremely tough to change anything.
  22. Main message: With semantics, the data always travels with its meaning, and the data looks the same inside and outside the database.
  23. Definition.
  24. Prescriptive.
  25. Descriptive.
  26. Descriptive (part 2). This is leagues ahead of the situation with SQL! Major deployment help on the Web.
  27. Definition.
  28. Definition.
  29. We call this “semantic data virtualization”.Databases that traditionally manage enterprise data are IT artifacts.They’re crafted by IT, for IT: asking scientists or other business domain experts to understand a relational model with scores of tables, IDs, key/value tables, unused columns, etc. is completely unrealistic.The semantic model is a conceptual model. It eschews IDs, keys, etc. in favor of concepts and relationships expressed/expressible in human language. This is reflected in software that is built with Semantic Web data. This means that when a researcher is linking their results spreadsheet, they’re dealing only in concepts that they’re familiar with (trades, accounts, settlements, securities, etc.). And that in turn means that this approach works regardless of whatever spreadsheet layout a particular collaborator is using: researchers can continue using their current spreadsheets, with no change.
  30. Definition.
  31. Courtesy W3C SWEO group, http://linkeddata.org/docs/eswc2007-poster-linking-open-data.pdf
  32. Also see examples later in this deck – Semantic Web In Use – would be nice to have an example of RDFa markup
  33. Possible answers: Few people are driven by data ownership, data portabilityPeople are drawn to specific sitesPeople _want_ to segment their online profiles (c.f. Facebook vs. LinkedIn)Drupal—which runs 1% of the world’s Web sites—is on the leading edge of adoption of the Semantic Web for content-driven sites. Drupal 7 exposes the semantics of Drupal sites’ natural structures to Google/Yahoo! with RDFa. Also modules for SIOC and Facebook OGP.
  34. The key point here is that though FB published this protocol, it relies on open Semantic Web standards (RDFa) that anyone else can consume. The same semantics allow people to link the “Like” button to the type of artifact being liked (movie, here) and also can allow search engines to give more structure, query engines to find more data, etc.
  35. Image courtesy of http://bio2rdf.org/ .Scientific data makes up a significant portion of the current Linked Data Web. This is information on proteins and genes, pathways, and sequences, chemistry and genetics, … This diagram shows some of the information available and how its linked together. Nodes are sized according to their quantity of data, and links are sized according to the quantity of links.
  36. Google (Rich Snippets) and Yahoo! (originally Search Monkey) consume semantic markup to enhance search listings.
  37. http://searchnewscentral.com/20110207129/Technical/rdfa-the-inside-story-from-best-buy.html
  38. Many enterprise uses of Semantic Web / Linked Data are highlighted at: http://www.w3.org/2001/sw/sweo/public/UseCases/
  39. Question: Where in this scenario do you think Semantic Web concepts and technologies are being employed? What would the alternative be?Answers: integrating data to get as large a universe as possible; rules and reasoning to intelligently filter the data
  40. Combine manual tagging with ontology-driven reasoning and ontology-driven dynamic aggregation (700 index pages, more than the rest of the sports site combined) to produce a dynamic, cross-indexed, cross-linked, useful site for the World Cup.What is the semantic value here? * Produce an information rich site at many levels of aggregation (player, team, geography, group, …) without employing a large fleet of editors to curate the site’s _content_. Instead, maintain an ontology and provide a content tagging process. * Use the ontology to help automate the tagging process (forward-chaining inference based on taxonomies)For more details:http://www.bbc.co.uk/blogs/bbcinternet/2010/07/bbc_world_cup_2010_dynamic_sem.html http://www.bbc.co.uk/blogs/bbcinternet/2010/07/the_world_cup_and_a_call_to_ac.html
  41. Other governments with similar efforts. Australia, Sweden,New Zealand, … , various local governments