SlideShare a Scribd company logo
Semantic Annotation and Search
   for Resources in the Next
        Generation Web
Ajith H. Ranabahu, Amit Sheth, Maryam Panahiazar, Sanjaya Wijeratne
                          Kno.e.sis Center
                       Wright State University
                            Dayton OH




          W3C Workshop on Data and Services Integration
              October 20-21 2011, Bedford, MA, USA
Agenda
  •The service Integration problem
       •What are the patterns we see?
  •What is the best course of action?
       •Making the best use of HTML 5 and search engines (Google
       / Bing)
  •Experience from Kino
       •Annotate-Index-enhance-search Lifecycle for biology
       oriented documents
  •Kino Web
       •Annotations with schema.org and SA-REST service model
       •SA-REST, Microdata or any other mechanism

Oct 21 2011                                                     2
Before we start - Our Assertions

   •There is no global model or representation
       o      Accept it!
       o      Instead we can represent one in a universally
              acceptable way
   •Human in the loop is important!
           o Don't forget the guy in the trench
   •Grass roots / bottom up
           o Top down approaches are expensive to adopt


Oct 21 2011                                                   3
What is the Problem ?


   Services are (still) described in multiple ways
   • SOAP vs REST debate is not as bad but
    still exists
   • SOAP services have found its home in the
    enterprise




Oct 21 2011                                          4
What is the Problem ? (Cont)

REST has become the (de facto) standard in
the consumer space
   •
   No agreed upon formal description (WSDL
   2.0 / WADL ?)
   •
   No specific registry mechanism - Developers
   just Google to find the services.
   •
   Several high profile composition tools failed !
   (Google Mashup Editor, Microsoft popfly)


Oct 21 2011                                      5
What have we learnt ?

   •Services (and Web APIs / services wrapped
      by programming language) are primarily
      composed by humans
         o Read the documentation, copy sample code and use
           Google generously
   •Special purpose indexes and registries do
      not work.
         o General purpose search engines (Google / Bing /
           Yahoo) has become really good

Oct 21 2011                                                  6
Anticipated Future Trends

Service consumption and composition is going
to remain a semi-automated process
     • Humans will always be part of the process


General purpose Search engines are going to
be the key source of data for service
composers



Oct 21 2011                                        7
Our Primary Premise

    Modification of service descriptions via
    annotations is the best way to supplement
    the upcoming service consumption
    patterns

    We are not alone in this thinking!
         •The trend towards microdata and 'rich snippets'



Oct 21 2011                                                 8
A Generic Architecture for the
Annotation / Index / Search Cycle


                   Annotation                                Search

     Web
     Documents
                                Annotation Submission    Document Extraction
                                or acquisition Process




               Third party
              Data Sources                                    Index
                             Annotation Enhancement




Oct 21 2011                                                                    9
Experience from Kino (KinoE)

A tool for biologists
   •
   Modify Web pages using SA-REST
   annotations
         o Concepts come from National Center for Biomedical
           Ontologies (NCBO)
   •Use a specialized indexing engine that can
       parse the annotations and provide faceted
       searching

Oct 21 2011                                                10
KinoE Architecture


     Kino browser based annotation                            Kino Search Interfaces

        Web Pages                                 Kino Web             Other Front -
                               Kino Browser
                                                  Front-end               ends
                                  Plugin




                                                   Kino Search API
        NCBO Ontology Access                                                  SOLRJ
               API                                  Kino Index API
        NCBO Ontology
                                                                     SOLR Web Interface
          Repository

      NCBO REST                               Kino Back-end           Lucene Index
      Service
Oct 21 2011                                                                               11
KinoW (Web Edition)


A more general annotator
   •
  SA-REST Service and Schema.org concepts
   •
  Mechanism can be Microdata or SA-REST
        • Only Microdata at the moments
   •Publishing targeted towards the original
       content providers
        • Use WebDAV / Drupal Plugin / Wiki plugin etc


Oct 21 2011                                              13
KinoW Architecture

                                                       Search

     Browser based annotation
                                                                Custom Front
                                                                    -ends
        Web Pages
                     Kino Browser
                        Plugin



                          WebDAV /CMS
                             plugins
                                            crawling


                             Hosted
                              Site
Schema.org / LOD /
 Other third party
 concept providers            Enhancement

Oct 21 2011                                                             14
What is possible with this approach?

   •General search engine based service
      discovery
         o Annotation driven service discoveries
             Issue queries in Google to find the services you
              are interstested (provided Google supports
              filtering by annotations)
   •Formal structures (WSDL / WADL) can be
      gleaned from the human readable pages
         o Both humans and machines can make use
         o More opportunities for composition tools


Oct 21 2011                                                      15
Demonstration
Questions
Extra : Role of LOD?

Act as a huge third party data repository?




Oct 21 2011                                  18

More Related Content

Similar to Semantic Annotation and Search for Resources in the Next Generation Web

Measuring web performance. Velocity EU 2011
Measuring web performance. Velocity EU 2011Measuring web performance. Velocity EU 2011
Measuring web performance. Velocity EU 2011
Stephen Thair
 
Technical standards & the RDTF Vision: some considerations
Technical standards & the RDTF Vision: some considerationsTechnical standards & the RDTF Vision: some considerations
Technical standards & the RDTF Vision: some considerations
Paul Walk
 
WebBee rapid web app development teck stack
WebBee rapid web app development teck stackWebBee rapid web app development teck stack
WebBee rapid web app development teck stack
ALDAN3
 
10 Things I Like in SharePoint 2013 Search
10 Things I Like in SharePoint 2013 Search10 Things I Like in SharePoint 2013 Search
10 Things I Like in SharePoint 2013 Search
SPC Adriatics
 
SPCAdriatics - 10 Things I Like In SharePoint 2013 Search
SPCAdriatics - 10 Things I Like In SharePoint 2013 SearchSPCAdriatics - 10 Things I Like In SharePoint 2013 Search
SPCAdriatics - 10 Things I Like In SharePoint 2013 Search
Agnes Molnar
 
Apache Geode - The First Six Months
Apache Geode -  The First Six MonthsApache Geode -  The First Six Months
Apache Geode - The First Six Months
Anthony Baker
 
Samba management Console
Samba management ConsoleSamba management Console
Samba management Console
Manfred Furuholmen
 
전문가 토크릴레이 1탄 html5 전망 (전종홍 박사)
전문가 토크릴레이 1탄 html5 전망 (전종홍 박사)전문가 토크릴레이 1탄 html5 전망 (전종홍 박사)
전문가 토크릴레이 1탄 html5 전망 (전종홍 박사)
zinyus
 
전문가토크릴레이 1탄 html5 전망 (전종홍 박사)
전문가토크릴레이 1탄 html5 전망 (전종홍 박사)전문가토크릴레이 1탄 html5 전망 (전종홍 박사)
전문가토크릴레이 1탄 html5 전망 (전종홍 박사)
Saltlux zinyus
 
Content is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
Content is King - ECM in SharePoint 2010 - SharePoint Saturday DenverContent is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
Content is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
Chris McNulty
 
Angular jS Introduction by Google
Angular jS Introduction by GoogleAngular jS Introduction by Google
Angular jS Introduction by Google
ASG
 
JISC IE Architecture - external trends and their potential impact
JISC IE Architecture - external trends and their potential impactJISC IE Architecture - external trends and their potential impact
JISC IE Architecture - external trends and their potential impact
Andy Powell
 
Industrial and Academic Experiences with a User Interaction Modeling Language...
Industrial and Academic Experiences with a User Interaction Modeling Language...Industrial and Academic Experiences with a User Interaction Modeling Language...
Industrial and Academic Experiences with a User Interaction Modeling Language...
Marco Brambilla
 
Tagging Up - MMS and Taxonomy In SharePoint 2010
Tagging Up - MMS and Taxonomy In SharePoint 2010Tagging Up - MMS and Taxonomy In SharePoint 2010
Tagging Up - MMS and Taxonomy In SharePoint 2010
Chris McNulty
 
Elasticsearch - Devoxx France 2012 - English version
Elasticsearch - Devoxx France 2012 - English versionElasticsearch - Devoxx France 2012 - English version
Elasticsearch - Devoxx France 2012 - English version
David Pilato
 
Decoupled Architecture and WordPress
Decoupled Architecture and WordPressDecoupled Architecture and WordPress
Decoupled Architecture and WordPress
Pantheon
 
KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101
Chris McNulty
 
BarCamb Connotea by Ian Mulvany
BarCamb Connotea by Ian MulvanyBarCamb Connotea by Ian Mulvany
BarCamb Connotea by Ian Mulvany
Ian Mulvany
 
SharePoint - Right Intro To Development
SharePoint - Right Intro To DevelopmentSharePoint - Right Intro To Development
SharePoint - Right Intro To Development
Mark Rackley
 
Samepoint API
Samepoint APISamepoint API
Samepoint API
Darren Culbreath
 

Similar to Semantic Annotation and Search for Resources in the Next Generation Web (20)

Measuring web performance. Velocity EU 2011
Measuring web performance. Velocity EU 2011Measuring web performance. Velocity EU 2011
Measuring web performance. Velocity EU 2011
 
Technical standards & the RDTF Vision: some considerations
Technical standards & the RDTF Vision: some considerationsTechnical standards & the RDTF Vision: some considerations
Technical standards & the RDTF Vision: some considerations
 
WebBee rapid web app development teck stack
WebBee rapid web app development teck stackWebBee rapid web app development teck stack
WebBee rapid web app development teck stack
 
10 Things I Like in SharePoint 2013 Search
10 Things I Like in SharePoint 2013 Search10 Things I Like in SharePoint 2013 Search
10 Things I Like in SharePoint 2013 Search
 
SPCAdriatics - 10 Things I Like In SharePoint 2013 Search
SPCAdriatics - 10 Things I Like In SharePoint 2013 SearchSPCAdriatics - 10 Things I Like In SharePoint 2013 Search
SPCAdriatics - 10 Things I Like In SharePoint 2013 Search
 
Apache Geode - The First Six Months
Apache Geode -  The First Six MonthsApache Geode -  The First Six Months
Apache Geode - The First Six Months
 
Samba management Console
Samba management ConsoleSamba management Console
Samba management Console
 
전문가 토크릴레이 1탄 html5 전망 (전종홍 박사)
전문가 토크릴레이 1탄 html5 전망 (전종홍 박사)전문가 토크릴레이 1탄 html5 전망 (전종홍 박사)
전문가 토크릴레이 1탄 html5 전망 (전종홍 박사)
 
전문가토크릴레이 1탄 html5 전망 (전종홍 박사)
전문가토크릴레이 1탄 html5 전망 (전종홍 박사)전문가토크릴레이 1탄 html5 전망 (전종홍 박사)
전문가토크릴레이 1탄 html5 전망 (전종홍 박사)
 
Content is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
Content is King - ECM in SharePoint 2010 - SharePoint Saturday DenverContent is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
Content is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
 
Angular jS Introduction by Google
Angular jS Introduction by GoogleAngular jS Introduction by Google
Angular jS Introduction by Google
 
JISC IE Architecture - external trends and their potential impact
JISC IE Architecture - external trends and their potential impactJISC IE Architecture - external trends and their potential impact
JISC IE Architecture - external trends and their potential impact
 
Industrial and Academic Experiences with a User Interaction Modeling Language...
Industrial and Academic Experiences with a User Interaction Modeling Language...Industrial and Academic Experiences with a User Interaction Modeling Language...
Industrial and Academic Experiences with a User Interaction Modeling Language...
 
Tagging Up - MMS and Taxonomy In SharePoint 2010
Tagging Up - MMS and Taxonomy In SharePoint 2010Tagging Up - MMS and Taxonomy In SharePoint 2010
Tagging Up - MMS and Taxonomy In SharePoint 2010
 
Elasticsearch - Devoxx France 2012 - English version
Elasticsearch - Devoxx France 2012 - English versionElasticsearch - Devoxx France 2012 - English version
Elasticsearch - Devoxx France 2012 - English version
 
Decoupled Architecture and WordPress
Decoupled Architecture and WordPressDecoupled Architecture and WordPress
Decoupled Architecture and WordPress
 
KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101
 
BarCamb Connotea by Ian Mulvany
BarCamb Connotea by Ian MulvanyBarCamb Connotea by Ian Mulvany
BarCamb Connotea by Ian Mulvany
 
SharePoint - Right Intro To Development
SharePoint - Right Intro To DevelopmentSharePoint - Right Intro To Development
SharePoint - Right Intro To Development
 
Samepoint API
Samepoint APISamepoint API
Samepoint API
 

Recently uploaded

GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Large Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial ApplicationsLarge Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial Applications
Rohit Gautam
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
Alex Pruden
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
ThomasParaiso2
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 

Recently uploaded (20)

GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Large Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial ApplicationsLarge Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial Applications
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 

Semantic Annotation and Search for Resources in the Next Generation Web

  • 1. Semantic Annotation and Search for Resources in the Next Generation Web Ajith H. Ranabahu, Amit Sheth, Maryam Panahiazar, Sanjaya Wijeratne Kno.e.sis Center Wright State University Dayton OH W3C Workshop on Data and Services Integration October 20-21 2011, Bedford, MA, USA
  • 2. Agenda •The service Integration problem •What are the patterns we see? •What is the best course of action? •Making the best use of HTML 5 and search engines (Google / Bing) •Experience from Kino •Annotate-Index-enhance-search Lifecycle for biology oriented documents •Kino Web •Annotations with schema.org and SA-REST service model •SA-REST, Microdata or any other mechanism Oct 21 2011 2
  • 3. Before we start - Our Assertions •There is no global model or representation o Accept it! o Instead we can represent one in a universally acceptable way •Human in the loop is important! o Don't forget the guy in the trench •Grass roots / bottom up o Top down approaches are expensive to adopt Oct 21 2011 3
  • 4. What is the Problem ? Services are (still) described in multiple ways • SOAP vs REST debate is not as bad but still exists • SOAP services have found its home in the enterprise Oct 21 2011 4
  • 5. What is the Problem ? (Cont) REST has become the (de facto) standard in the consumer space • No agreed upon formal description (WSDL 2.0 / WADL ?) • No specific registry mechanism - Developers just Google to find the services. • Several high profile composition tools failed ! (Google Mashup Editor, Microsoft popfly) Oct 21 2011 5
  • 6. What have we learnt ? •Services (and Web APIs / services wrapped by programming language) are primarily composed by humans o Read the documentation, copy sample code and use Google generously •Special purpose indexes and registries do not work. o General purpose search engines (Google / Bing / Yahoo) has become really good Oct 21 2011 6
  • 7. Anticipated Future Trends Service consumption and composition is going to remain a semi-automated process • Humans will always be part of the process General purpose Search engines are going to be the key source of data for service composers Oct 21 2011 7
  • 8. Our Primary Premise Modification of service descriptions via annotations is the best way to supplement the upcoming service consumption patterns We are not alone in this thinking! •The trend towards microdata and 'rich snippets' Oct 21 2011 8
  • 9. A Generic Architecture for the Annotation / Index / Search Cycle Annotation Search Web Documents Annotation Submission Document Extraction or acquisition Process Third party Data Sources Index Annotation Enhancement Oct 21 2011 9
  • 10. Experience from Kino (KinoE) A tool for biologists • Modify Web pages using SA-REST annotations o Concepts come from National Center for Biomedical Ontologies (NCBO) •Use a specialized indexing engine that can parse the annotations and provide faceted searching Oct 21 2011 10
  • 11. KinoE Architecture Kino browser based annotation Kino Search Interfaces Web Pages Kino Web Other Front - Kino Browser Front-end ends Plugin Kino Search API NCBO Ontology Access SOLRJ API Kino Index API NCBO Ontology SOLR Web Interface Repository NCBO REST Kino Back-end Lucene Index Service Oct 21 2011 11
  • 12.
  • 13. KinoW (Web Edition) A more general annotator • SA-REST Service and Schema.org concepts • Mechanism can be Microdata or SA-REST • Only Microdata at the moments •Publishing targeted towards the original content providers • Use WebDAV / Drupal Plugin / Wiki plugin etc Oct 21 2011 13
  • 14. KinoW Architecture Search Browser based annotation Custom Front -ends Web Pages Kino Browser Plugin WebDAV /CMS plugins crawling Hosted Site Schema.org / LOD / Other third party concept providers Enhancement Oct 21 2011 14
  • 15. What is possible with this approach? •General search engine based service discovery o Annotation driven service discoveries  Issue queries in Google to find the services you are interstested (provided Google supports filtering by annotations) •Formal structures (WSDL / WADL) can be gleaned from the human readable pages o Both humans and machines can make use o More opportunities for composition tools Oct 21 2011 15
  • 18. Extra : Role of LOD? Act as a huge third party data repository? Oct 21 2011 18