Semantic Annotation and Search
   for Resources in the Next
        Generation Web
Ajith H. Ranabahu, Amit Sheth, Maryam Panahiazar, Sanjaya Wijeratne
                          Kno.e.sis Center
                       Wright State University
                            Dayton OH




          W3C Workshop on Data and Services Integration
              October 20-21 2011, Bedford, MA, USA
Agenda
  • The service integration problem
       • What are the patterns we see?
  • What is the best course of action?
       • Making the best use of HTML5 and search engines (Google / Bing)
  • Experience from Kino
       • Annotate-index-enhance-search lifecycle for biology-oriented documents
  • Kino Web
       • Annotations with schema.org and the SA-REST service model
       • SA-REST, Microdata or any other mechanism

Oct 21 2011                                                     2
Before we start - Our Assertions

   • There is no global model or representation
          o Accept it!
          o Instead, we can represent one in a universally acceptable way
   • Human in the loop is important!
          o Don't forget the guy in the trench
   • Grass roots / bottom up
          o Top-down approaches are expensive to adopt


What is the Problem?

   Services are (still) described in multiple ways
   • The SOAP vs. REST debate is less heated than it was, but it still exists
   • SOAP services have found their home in the enterprise




What is the Problem? (Cont.)

REST has become the (de facto) standard in the consumer space
   • No agreed-upon formal description (WSDL 2.0 / WADL?)
   • No specific registry mechanism: developers just Google to find the services
   • Several high-profile composition tools failed! (Google Mashup Editor, Microsoft Popfly)


What have we learnt?

   • Services (and Web APIs / services wrapped by a programming language)
      are primarily composed by humans
         o They read the documentation, copy sample code and use Google generously
   • Special-purpose indexes and registries do not work
         o General-purpose search engines (Google / Bing / Yahoo) have become really good

Anticipated Future Trends

Service consumption and composition are going to remain semi-automated
     • Humans will always be part of the process

General-purpose search engines are going to be the key source of data for
service composers



Our Primary Premise

    Modification of service descriptions via annotations is the best way
    to supplement the upcoming service consumption patterns

    We are not alone in this thinking!
         • The trend towards microdata and 'rich snippets'



A Generic Architecture for the
Annotation / Index / Search Cycle


[Figure: architecture diagram with two halves, Annotation and Search.
Labelled components: Web Documents, Annotation Submission or Acquisition
Process, Third-Party Data Sources, Annotation Enhancement, Index,
Document Extraction.]
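
The cycle named on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the Kino implementation: the `Document` shape, the function names, and the use of a plain synonym table as the "third-party data source" are all hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class Document:
    url: str
    text: str
    # Hypothetical shape: property name -> set of concept labels.
    annotations: dict = field(default_factory=dict)


def annotate(doc, prop, concept):
    """Annotation step: attach a concept to a document under a property."""
    doc.annotations.setdefault(prop, set()).add(concept)


def enhance(doc, related):
    """Enhancement step: add related concepts from a third-party source.

    `related` stands in for that source: concept -> set of related concepts.
    """
    for concepts in doc.annotations.values():
        for concept in list(concepts):  # copy, since the set grows below
            concepts.update(related.get(concept, set()))


def build_index(docs):
    """Index step: map each (property, concept) pair to document URLs."""
    index = defaultdict(set)
    for doc in docs:
        for prop, concepts in doc.annotations.items():
            for concept in concepts:
                index[(prop, concept)].add(doc.url)
    return index


def search(index, prop, concept):
    """Search step: return URLs of documents annotated with the concept."""
    return set(index.get((prop, concept), set()))


page = Document(url="http://example.org/page", text="...")
annotate(page, "organism", "mouse")
enhance(page, {"mouse": {"Mus musculus"}})
index = build_index([page])
print(search(index, "organism", "Mus musculus"))
```

The point of the enhancement step is that a search for a related concept finds the page even though the human annotator never typed that term.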




Experience from Kino (KinoE)

A tool for biologists
   • Modify Web pages using SA-REST annotations
         o Concepts come from the National Center for Biomedical
           Ontology (NCBO)
   • Use a specialized indexing engine that can parse the annotations
       and provide faceted searching
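
Faceted searching over annotations maps naturally onto Solr's standard facet parameters (`facet`, `facet.field`, `fq`). The sketch below only builds a query URL; the base URL and the field names `organism` and `process` are assumptions for illustration and are not taken from the Kino index schema.

```python
from urllib.parse import urlencode


def facet_query_url(base_url, text, facet_fields, filters=None):
    """Build a Solr select URL that asks for facet counts.

    facet_fields: index fields to count values for (hypothetical names).
    filters: optional {field: value} pairs applied as filter queries.
    """
    params = [("q", text), ("wt", "json"), ("facet", "true")]
    params += [("facet.field", name) for name in facet_fields]
    for name, value in (filters or {}).items():
        # Quote the value so multi-word concepts match as a phrase.
        params.append(("fq", f'{name}:"{value}"'))
    return f"{base_url}/select?{urlencode(params)}"


url = facet_query_url(
    "http://localhost:8983/solr",          # assumed local Solr instance
    "kinase",
    ["organism", "process"],
    filters={"organism": "Mus musculus"},
)
print(url)
```

Each annotated concept type becomes one facet field, so a front-end can show counts per concept and narrow the result set with `fq` when the user picks one.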

KinoE Architecture


[Figure: KinoE architecture diagram. Labelled components: Kino browser-based
annotation (Web Pages, Kino Browser Plugin); Kino Search Interfaces (Kino Web
Front-end, Other Front-ends); Kino Back-end (Kino Search API, Kino Index API);
NCBO Ontology Access API, NCBO Ontology Repository, NCBO REST Service;
SOLRJ, SOLR Web Interface, Lucene Index.]
KinoW (Web Edition)


A more general annotator
   • SA-REST Service and Schema.org concepts
   • Mechanism can be Microdata or SA-REST
        o Only Microdata at the moment
   • Publishing targeted towards the original content providers
        o Use WebDAV / Drupal plugin / Wiki plugin etc.
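
To make the Microdata mechanism concrete, here is a minimal sketch of reading `itemtype` and `itemprop` attributes from an annotated page with Python's standard `html.parser`. The sample HTML, the class name `MicrodataExtractor`, and the example URL are assumptions; the extractor is deliberately simplified (flat items only, first text node per property) and is not a full Microdata parser.

```python
from html.parser import HTMLParser


class MicrodataExtractor(HTMLParser):
    """Collects itemtype values and (itemprop, value) pairs from flat microdata."""

    def __init__(self):
        super().__init__()
        self.item_types = []
        self.properties = []
        self._open_prop = None  # itemprop still waiting for its text value

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if "itemtype" in attrs:
            self.item_types.append(attrs["itemtype"])
        if "itemprop" in attrs:
            if "href" in attrs:
                # Link-like elements carry the property value in href.
                self.properties.append((attrs["itemprop"], attrs["href"]))
                self._open_prop = None
            else:
                self._open_prop = attrs["itemprop"]

    def handle_data(self, data):
        # Take the first non-blank text node after an itemprop start tag.
        if self._open_prop and data.strip():
            self.properties.append((self._open_prop, data.strip()))
            self._open_prop = None


SAMPLE = """
<div itemscope itemtype="http://schema.org/WebPage">
  <span itemprop="name">Protein lookup service</span>
  <a itemprop="url" href="http://example.org/api/lookup">endpoint</a>
</div>
"""

extractor = MicrodataExtractor()
extractor.feed(SAMPLE)
print(extractor.item_types)
print(extractor.properties)
```

A crawler or indexer would run something like this over each hosted page and hand the extracted pairs to the index.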


KinoW Architecture

[Figure: KinoW architecture diagram. Labelled components: Browser-based
annotation (Web Pages, Kino Browser Plugin); WebDAV / CMS plugins; Hosted
Site; crawling; Search (Custom Front-ends); Enhancement from Schema.org /
LOD / other third-party concept providers.]

What is possible with this approach?

   • General search engine based service discovery
         o Annotation-driven service discovery
             ▪ Issue queries in Google to find the services you are
               interested in (provided Google supports filtering by
               annotations)
   • Formal structures (WSDL / WADL) can be gleaned from the
      human-readable pages
         o Both humans and machines can make use of them
         o More opportunities for composition tools
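
As a rough illustration of gleaning a formal structure from an annotated page, the sketch below turns a handful of already-extracted values into a minimal WADL document. The function name `to_wadl` and its inputs are hypothetical, and the slides do not describe how Kino would perform this step; only the WADL element names and namespace come from the WADL specification.

```python
import xml.etree.ElementTree as ET

WADL_NS = "http://wadl.dev.java.net/2009/02"


def to_wadl(base, path, http_method, query_params):
    """Build a minimal WADL description for one resource and one method.

    The arguments are assumed to have been extracted from page annotations.
    """
    ET.register_namespace("", WADL_NS)  # serialize WADL as the default namespace

    def q(name):
        return f"{{{WADL_NS}}}{name}"

    app = ET.Element(q("application"))
    resources = ET.SubElement(app, q("resources"), base=base)
    resource = ET.SubElement(resources, q("resource"), path=path)
    method = ET.SubElement(resource, q("method"), name=http_method)
    request = ET.SubElement(method, q("request"))
    for name in query_params:
        ET.SubElement(request, q("param"), name=name, style="query")
    return ET.tostring(app, encoding="unicode")


print(to_wadl("http://example.org/api/", "lookup", "GET", ["q"]))
```

The human-readable page stays the primary artifact; the formal description is derived from it on demand for tools that need one.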


Demonstration
Questions
Extra: Role of Linked Open Data (LOD)?

Act as a huge third-party data repository?




