SlideShare a Scribd company logo
Search Web Services
Ralph LeVan
Senior Research Scientist
NISO Discovery Tools Forum 2008
OASIS Search Web Services
Technical Committee
http://www.oasis-open.org/committees/search-ws
To define Search and Retrieval Web Services, combining
various current and ongoing web service activities.
OASIS Search Web Services
TC
Ray Denenberg—Library of Congress—Co-Chair
Matthew Dovey—JISC Executive—Co-Chair
Larry Dixson—Library of Congress—Voting Member
Janifer Gatenby—OCLC—Voting Member
Ralph LeVan—OCLC—Voting Member
Ashley Sanders—Univ. of Manchester—Voting Member
Robert Sanderson—Univ. of Liverpool—Voting Member
Sri Gopalan—Booz Allen Hamilton—Member
MacKenzie Smith—M.I.T. —Member
Who is OASIS
OASIS is a non-profit, international consortium that creates
interoperable industry specifications based on public
standards such as XML and SGML.
The ebXML suite of standards is probably their most famous
product
http://www.oasis-open.org
Why are we there?
We were hoping to reach a broader audience than we
normally see in NISO
We were hoping that there would be synergies with the
other XML-based standards groups. After all, most of them
have searching requirements.
Where We’ve Come From
Pros, Cons and What We’ve Learned
• Z39.50
• SRW/U
• OpenSearch
Z39.50
Pros
• High Functionality
• High Interoperability
Cons
• Complicated
• Binary encoding over raw tcp/ip
Lesson Learned
• There’s a need for a high functionality interface
• If people are desperate enough, they’ll do anything
SRU
Pros
• XML-based web service
• High Interoperability
Cons
• Still complicated (but much less than Z39.50!)
• Unheard of outside the library community
Lesson Learned
• There’s still a need for a high functionality interface
• If people aren’t desperate, they’ll live with what they’ve
got
OpenSearch
Pros
• Simple
• Moderate Interoperability
Cons
• Low Functionality
Lesson Learned
• There’s a need for a simple low functionality interface
• Developers prefer to do as little as possible
What We’re Doing
• CQL 1.2
• SRU 2.0
• Abstract Protocol Definition
• Binding to HTTP Get
• Binding to SRU 1.2
• Binding to OpenSearch
• SWS Description Language
CQL 1.2
This is the path to actually standardize CQL
Enhances a couple of features (sort and proximity and the
CQL Context Set)
SRU 2.0?
I wish I had something to say here, but it’s mostly on the
todo list and the SWS Description Language has more
traction in the committee.
Abstract Protocol Description (APD)
This document is an abstract protocol definition for the Search Web
Services (SWS) searchRetrieve operation. It presents the model
for the SearchRetrieve operation and is also intended to serve as
a guideline for the development of application protocol bindings
(hereafter bindings, see definitional note).
A binding describes the capabilities and general characteristic of a
server or search engine, and how it is to be accessed. A binding
may describe a class of servers via a human-readable document
or a binding may be a machine-readable file describing a single
server, provided by that server, according to the description
language described at xxx, which is a fundamental component of
the SWS standard
APD Data Model
A server exposes a datastore for access by a remote client for
purposes of search and retrieval. The datastore is a collection of
units of data. Such a unit is referred to as an item in this model.
For purposes of this model there is a single datastore at any
given server.
Associated with a datastore are one or more formats that may be
used for the transfer of items from the server to the client. Such
a format is referred to as an item type in this model. An item
type represents a common understanding shared by the client
and server of the information contained in the items of the
datastore, to allow the transfer of that information. The item
type identifies an abstract representation of the information. It
does not represent nor does it constrain the internal
representation or storage of that information at the server
APD Processing Model
A client sends a searchRetrieve request to a server, which responds
with a searchRetrieve response. The request includes a search
query to be matched against the items at the server’s datastore.
The server processes the query, creating a result set (see Result
Set Model) of items that match the query.
The request also indicates the desired number of items to be
included in the response and includes information about how the
individual items in the response, as well as the response at large,
are to be formatted.
The response includes items from the result set, diagnostic
information, and a result set identifier that the client may use in
a subsequent request to retrieve additional items.
APD Result Set Model
This is a logical model; support of result sets is not assumed
nor required by this standard
From the client's point of view, the result set is a set of
items each referenced by an ordinal number, beginning
with 1. The client may request a given item from a result
set according to a specific format. For example the client
may request item 1 in Dublin Core, and subsequently
request item 1 in MODS. The format in which items are
supplied is not a property of the result set, nor is it a
property of the requested items as a member of the result
set; the result set is simply the ordered list of items.
APD Request Parameters
Abstract Parameter
Name
Description
responseType e.g. 'text/html', ‘application/atom+xml’ ,
application/x+sru
query The search query of the request.
startPosition The position within the result set of the first item to be
returned.
maximumItems The number of items requested to be returned.
itemType e.g. string, jpeg, dc, iso2709. From list provided by
server.
sortOrder The requested order of the result set.
APD Response Parameters
Abstract Element Name Description
numberOfItems The number of items matched by the query.
resultSetId The identifier for the result set created by the
query.
items a sequence of items.
nextPosition The next position within the result set following
the final returned item.
Diagnostics Error message and/or diagnostics.
echoedSearchRetrieveRequest The server may echo the request back to the
client.
HTTP Get Binding
Syntax
The client sends a request via the HTTP GET method
Specifically it is an HTTP URL of the form:
<base URL>?<searchpart>
Encoding
Convert the value to UTF-8. Percent-encode characters
as necessary within the value. Construct a URI from the
parameter names and encoded values.
SRU 1.2 Binding
The APD + the HTTP Get Binding + new request parameters
(operation, version, recordPacking resultSetTTL stylesheet
extraRequestData) – unused base parameters (responseType,
sortOrder) + new response elements (version,
resultSetIdleTime, extraResponseData) and a shiny XML
encoding.
SWS Description Language
What do we think we’ve learned?
Developers are tired of being
told how to do their business!
Unless they have a business reason to worry about
interoperability, they won’t. Third party interoperability
needs to be something they can add on when they do
discover they need it. Better yet, let someone else add it
on.
Prescriptive vs Descriptive Standards
A prescriptive standard (Z39.50, SRU and the response part
of OpenSearch) causes interoperability by telling you how
to construct your interface, allowing for simple clients that
know how to talk to you. The hard work of interface is
done by the server.
A descriptive standard (WSDL and the request part of
OpenSearch) causes interoperability by allowing you to
describe your interface in such a way that clients can be
created dynamically to talk to you. The hard work of
interface is done by the client.
Who Wants This?
• Anyone who wants access to content that doesn’t adhere
to any search standards: Web 2.0 and NISO Metasearch!
• Anyone with content to provide who doesn’t know what
clients might want to search that content
Essentially OpenSearch…
<op>
<request type="template"
href="http://copac.ac.uk/wzgw?rsn={resultSetName}&amp;
format=XML+-+MODS&amp;id={sessionID}&amp;
fs=Download+records"/>
<response type="XML" schema=“AtomResponse“/>
</op>
… On Steroids!
<request href="http://copac.ac.uk/">
<form action="/wzgw" method="get“
name="Copac Quick Search">
<param name="au" semantics="au"/>
<param name="ti" semantics="ti"/>
<param name="any" semantics="kw"/>
<param name="form" value="qs"/>
<param name="fs" value="Search"/>
</form>
</request>
… On Steriods (cont.)
<response>
<set name="numberOfItems">
<regexp regexp="&lt;span
id=&quot;num_hits&quot;&gt;([0-9]+)&lt;"/>
</set>
</response>
P.S., Bibliographic Context Set Anyone?
SRU depends on context sets. The SRU Editorial Board
recognizes the need for context set for bibliographic
searching (equivalent to Bib-1 in the Z39.50 universe).
But, they don’t feel that they are the appropriate body.
Anyone in the NISO community interested?
Questions?
http://staff.oclc.org/~levan/docs/SearchWebService.ppt
levan@oclc.org

More Related Content

What's hot

NLIDB(Natural Language Interface to DataBases)
NLIDB(Natural Language Interface to DataBases)NLIDB(Natural Language Interface to DataBases)
NLIDB(Natural Language Interface to DataBases)
Swetha Pallati
 
Technical Whitepaper: A Knowledge Correlation Search Engine
Technical Whitepaper: A Knowledge Correlation Search EngineTechnical Whitepaper: A Knowledge Correlation Search Engine
Technical Whitepaper: A Knowledge Correlation Search Engine
s0P5a41b
 
The introduction of RESTful
The introduction of RESTful The introduction of RESTful
The introduction of RESTful
Jon Chen
 
Hypothesis testing and confidence intervals
Hypothesis testing and confidence intervalsHypothesis testing and confidence intervals
Hypothesis testing and confidence intervals
collinsaura
 
Searching and Querying Knowledge Graphs with Solr/SIREn - A Reference Archite...
Searching and Querying Knowledge Graphs with Solr/SIREn - A Reference Archite...Searching and Querying Knowledge Graphs with Solr/SIREn - A Reference Archite...
Searching and Querying Knowledge Graphs with Solr/SIREn - A Reference Archite...
Lucidworks
 
Search explained T3DD15
Search explained T3DD15Search explained T3DD15
Search explained T3DD15
Hans Höchtl
 
Web_Mining_Overview_Nfaoui_El_Habib
Web_Mining_Overview_Nfaoui_El_HabibWeb_Mining_Overview_Nfaoui_El_Habib
Web_Mining_Overview_Nfaoui_El_Habib
El Habib NFAOUI
 
Aggregation for searching complex information spaces
Aggregation for searching complex information spacesAggregation for searching complex information spaces
Aggregation for searching complex information spaces
Mounia Lalmas-Roelleke
 
Database management system session 6
Database management system session 6Database management system session 6
Database management system session 6
Infinity Tech Solutions
 
A Conversation About REST
A Conversation About RESTA Conversation About REST
A Conversation About REST
Mike Wilcox
 

What's hot (10)

NLIDB(Natural Language Interface to DataBases)
NLIDB(Natural Language Interface to DataBases)NLIDB(Natural Language Interface to DataBases)
NLIDB(Natural Language Interface to DataBases)
 
Technical Whitepaper: A Knowledge Correlation Search Engine
Technical Whitepaper: A Knowledge Correlation Search EngineTechnical Whitepaper: A Knowledge Correlation Search Engine
Technical Whitepaper: A Knowledge Correlation Search Engine
 
The introduction of RESTful
The introduction of RESTful The introduction of RESTful
The introduction of RESTful
 
Hypothesis testing and confidence intervals
Hypothesis testing and confidence intervalsHypothesis testing and confidence intervals
Hypothesis testing and confidence intervals
 
Searching and Querying Knowledge Graphs with Solr/SIREn - A Reference Archite...
Searching and Querying Knowledge Graphs with Solr/SIREn - A Reference Archite...Searching and Querying Knowledge Graphs with Solr/SIREn - A Reference Archite...
Searching and Querying Knowledge Graphs with Solr/SIREn - A Reference Archite...
 
Search explained T3DD15
Search explained T3DD15Search explained T3DD15
Search explained T3DD15
 
Web_Mining_Overview_Nfaoui_El_Habib
Web_Mining_Overview_Nfaoui_El_HabibWeb_Mining_Overview_Nfaoui_El_Habib
Web_Mining_Overview_Nfaoui_El_Habib
 
Aggregation for searching complex information spaces
Aggregation for searching complex information spacesAggregation for searching complex information spaces
Aggregation for searching complex information spaces
 
Database management system session 6
Database management system session 6Database management system session 6
Database management system session 6
 
A Conversation About REST
A Conversation About RESTA Conversation About REST
A Conversation About REST
 

Similar to LeVan, "Search Web Services"

REST & RESTful Web Services
REST & RESTful Web ServicesREST & RESTful Web Services
REST & RESTful Web Services
Halil Burak Cetinkaya
 
Semantic Web Servers
Semantic Web ServersSemantic Web Servers
Semantic Web Serverswebhostingguy
 
professional fuzzy type-ahead rummage around in xml type-ahead search techni...
professional fuzzy type-ahead rummage around in xml  type-ahead search techni...professional fuzzy type-ahead rummage around in xml  type-ahead search techni...
professional fuzzy type-ahead rummage around in xml type-ahead search techni...
Kumar Goud
 
Unit 2
Unit 2Unit 2
Unit 2
Ravi Kumar
 
Innovate2011 Keys to Building OSLC Integrations
Innovate2011 Keys to Building OSLC IntegrationsInnovate2011 Keys to Building OSLC Integrations
Innovate2011 Keys to Building OSLC Integrations
Steve Speicher
 
How to Find a Needle in the Haystack
How to Find a Needle in the HaystackHow to Find a Needle in the Haystack
How to Find a Needle in the Haystack
Adrian Stevenson
 
Creating Web Services with Zend Framework - Matthew Turland
Creating Web Services with Zend Framework - Matthew TurlandCreating Web Services with Zend Framework - Matthew Turland
Creating Web Services with Zend Framework - Matthew TurlandMatthew Turland
 
The ‘discovery to delivery’ DLF reference model
The ‘discovery to delivery’ DLF reference modelThe ‘discovery to delivery’ DLF reference model
The ‘discovery to delivery’ DLF reference model
Andy Powell
 
Intelligent query converter a domain independent interfacefor conversion
Intelligent query converter a domain independent interfacefor conversionIntelligent query converter a domain independent interfacefor conversion
Intelligent query converter a domain independent interfacefor conversionIAEME Publication
 
Rest web service
Rest web serviceRest web service
Rest web service
Hamid Ghorbani
 
Phalcon 2 High Performance APIs - DevWeekPOA 2015
Phalcon 2 High Performance APIs - DevWeekPOA 2015Phalcon 2 High Performance APIs - DevWeekPOA 2015
Phalcon 2 High Performance APIs - DevWeekPOA 2015
Jackson F. de A. Mafra
 
Swap For Dummies Rsp 2007 11 29
Swap For Dummies Rsp 2007 11 29Swap For Dummies Rsp 2007 11 29
Swap For Dummies Rsp 2007 11 29Julie Allinson
 
R01765113122
R01765113122R01765113122
R01765113122
IOSR Journals
 
Design and Implementation of SOA Enhanced Semantic Information Retrieval web ...
Design and Implementation of SOA Enhanced Semantic Information Retrieval web ...Design and Implementation of SOA Enhanced Semantic Information Retrieval web ...
Design and Implementation of SOA Enhanced Semantic Information Retrieval web ...
iosrjce
 
Overview of java web services
Overview of java web servicesOverview of java web services
Rest api best practices – comprehensive handbook
Rest api best practices – comprehensive handbookRest api best practices – comprehensive handbook
Rest api best practices – comprehensive handbook
Katy Slemon
 
Software performance testing_overview
Software performance testing_overviewSoftware performance testing_overview
Software performance testing_overviewRohan Bhattarai
 
Switch to Backend 2023
Switch to Backend 2023Switch to Backend 2023

Similar to LeVan, "Search Web Services" (20)

REST & RESTful Web Services
REST & RESTful Web ServicesREST & RESTful Web Services
REST & RESTful Web Services
 
PDFArticle
PDFArticlePDFArticle
PDFArticle
 
Semantic Web Servers
Semantic Web ServersSemantic Web Servers
Semantic Web Servers
 
professional fuzzy type-ahead rummage around in xml type-ahead search techni...
professional fuzzy type-ahead rummage around in xml  type-ahead search techni...professional fuzzy type-ahead rummage around in xml  type-ahead search techni...
professional fuzzy type-ahead rummage around in xml type-ahead search techni...
 
Unit 2
Unit 2Unit 2
Unit 2
 
Innovate2011 Keys to Building OSLC Integrations
Innovate2011 Keys to Building OSLC IntegrationsInnovate2011 Keys to Building OSLC Integrations
Innovate2011 Keys to Building OSLC Integrations
 
How to Find a Needle in the Haystack
How to Find a Needle in the HaystackHow to Find a Needle in the Haystack
How to Find a Needle in the Haystack
 
Creating Web Services with Zend Framework - Matthew Turland
Creating Web Services with Zend Framework - Matthew TurlandCreating Web Services with Zend Framework - Matthew Turland
Creating Web Services with Zend Framework - Matthew Turland
 
Restful web services
Restful web servicesRestful web services
Restful web services
 
The ‘discovery to delivery’ DLF reference model
The ‘discovery to delivery’ DLF reference modelThe ‘discovery to delivery’ DLF reference model
The ‘discovery to delivery’ DLF reference model
 
Intelligent query converter a domain independent interfacefor conversion
Intelligent query converter a domain independent interfacefor conversionIntelligent query converter a domain independent interfacefor conversion
Intelligent query converter a domain independent interfacefor conversion
 
Rest web service
Rest web serviceRest web service
Rest web service
 
Phalcon 2 High Performance APIs - DevWeekPOA 2015
Phalcon 2 High Performance APIs - DevWeekPOA 2015Phalcon 2 High Performance APIs - DevWeekPOA 2015
Phalcon 2 High Performance APIs - DevWeekPOA 2015
 
Swap For Dummies Rsp 2007 11 29
Swap For Dummies Rsp 2007 11 29Swap For Dummies Rsp 2007 11 29
Swap For Dummies Rsp 2007 11 29
 
R01765113122
R01765113122R01765113122
R01765113122
 
Design and Implementation of SOA Enhanced Semantic Information Retrieval web ...
Design and Implementation of SOA Enhanced Semantic Information Retrieval web ...Design and Implementation of SOA Enhanced Semantic Information Retrieval web ...
Design and Implementation of SOA Enhanced Semantic Information Retrieval web ...
 
Overview of java web services
Overview of java web servicesOverview of java web services
Overview of java web services
 
Rest api best practices – comprehensive handbook
Rest api best practices – comprehensive handbookRest api best practices – comprehensive handbook
Rest api best practices – comprehensive handbook
 
Software performance testing_overview
Software performance testing_overviewSoftware performance testing_overview
Software performance testing_overview
 
Switch to Backend 2023
Switch to Backend 2023Switch to Backend 2023
Switch to Backend 2023
 

More from National Information Standards Organization (NISO)

Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"
Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"
Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"
National Information Standards Organization (NISO)
 
Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
National Information Standards Organization (NISO)
 
Mattingly "AI and Prompt Design: LLMs with NER"
Mattingly "AI and Prompt Design: LLMs with NER"Mattingly "AI and Prompt Design: LLMs with NER"
Mattingly "AI and Prompt Design: LLMs with NER"
National Information Standards Organization (NISO)
 
Mattingly "AI & Prompt Design: Named Entity Recognition"
Mattingly "AI & Prompt Design: Named Entity Recognition"Mattingly "AI & Prompt Design: Named Entity Recognition"
Mattingly "AI & Prompt Design: Named Entity Recognition"
National Information Standards Organization (NISO)
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
National Information Standards Organization (NISO)
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
National Information Standards Organization (NISO)
 
Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"
National Information Standards Organization (NISO)
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
National Information Standards Organization (NISO)
 
Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"
National Information Standards Organization (NISO)
 
Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"
National Information Standards Organization (NISO)
 
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
National Information Standards Organization (NISO)
 
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
National Information Standards Organization (NISO)
 
Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"
National Information Standards Organization (NISO)
 
Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"
National Information Standards Organization (NISO)
 
Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"
National Information Standards Organization (NISO)
 
Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"
National Information Standards Organization (NISO)
 
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
National Information Standards Organization (NISO)
 
Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"
National Information Standards Organization (NISO)
 
Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"
National Information Standards Organization (NISO)
 
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
National Information Standards Organization (NISO)
 

More from National Information Standards Organization (NISO) (20)

Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"
Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"
Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"
 
Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
 
Mattingly "AI and Prompt Design: LLMs with NER"
Mattingly "AI and Prompt Design: LLMs with NER"Mattingly "AI and Prompt Design: LLMs with NER"
Mattingly "AI and Prompt Design: LLMs with NER"
 
Mattingly "AI & Prompt Design: Named Entity Recognition"
Mattingly "AI & Prompt Design: Named Entity Recognition"Mattingly "AI & Prompt Design: Named Entity Recognition"
Mattingly "AI & Prompt Design: Named Entity Recognition"
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
 
Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"
 
Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"
 
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
 
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
 
Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"
 
Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"
 
Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"
 
Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"
 
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
 
Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"
 
Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"
 
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
 

Recently uploaded

Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
beazzy04
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
RaedMohamed3
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS Module
Celine George
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
EduSkills OECD
 
How to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPHow to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERP
Celine George
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
MIRIAMSALINAS13
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
Vivekanand Anglo Vedic Academy
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
AzmatAli747758
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 

Recently uploaded (20)

Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS Module
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
 
How to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPHow to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERP
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 

LeVan, "Search Web Services"

  • 1. Search Web Services Ralph LeVan Senior Research Scientist NISO Discovery Tools Forum 2008
  • 2. OASIS Search Web Services Technical Committee http://www.oasis-open.org/committees/search-ws To define Search and Retrieval Web Services, combining various current and ongoing web service activities.
  • 3. OASIS Search Web Services TC Ray Denenberg—Library of Congress—Co-Chair Matthew Dovey—JISC Executive—Co-Chair Larry Dixson—Library of Congress—Voting Member Janifer Gatenby—OCLC—Voting Member Ralph LeVan—OCLC—Voting Member Ashley Sanders—Univ. of Manchester—Voting Member Robert Sanderson—Univ. of Liverpool—Voting Member Sri Gopalan—Booz Allen Hamilton—Member MacKenzie Smith—M.I.T. —Member
  • 4. Who is OASIS OASIS is a non-profit, international consortium that creates interoperable industry specifications based on public standards such as XML and SGML. The ebXML suite of standards is probably their most famous product http://www.oasis-open.org
  • 5. Why are we there? We were hoping to reach a broader audience than we normally see in NISO We were hoping that there would be synergies with the other XML-based standards groups. After all, most of them have searching requirements.
  • 6. Where We’ve Come From Pros, Cons and What We’ve Learned • Z39.50 • SRW/U • OpenSearch
  • 7. Z39.50 Pros • High Functionality • High Interoperability Cons • Complicated • Binary encoding over raw tcp/ip Lesson Learned • There’s a need for a high functionality interface • If people are desperate enough, they’ll do anything
  • 8. SRU Pros • XML-based web service • High Interoperability Cons • Still complicated (but much less than Z39.50!) • Unheard of outside the library community Lesson Learned • There’s still a need for a high functionality interface • If people aren’t desperate, they’ll live with what they’ve got
  • 9. OpenSearch Pros • Simple • Moderate Interoperability Cons • Low Functionality Lesson Learned • There’s a need for a simple low functionality interface • Developers prefer to do as little as possible
  • 10. What We’re Doing • CQL 1.2 • SRU 2.0 • Abstract Protocol Definition • Binding to HTTP Get • Binding to SRU 1.2 • Binding to OpenSearch • SWS Description Language
  • 11. CQL 1.2 This is the path to actually standardize CQL Enhances a couple of features (sort and proximity and the CQL Context Set)
  • 12. SRU 2.0? I wish I had something to say here, but it’s mostly on the todo list and the SWS Description Language has more traction in the committee.
  • 13. Abstract Protocol Description (APD) This document is an abstract protocol definition for the Search Web Services (SWS) searchRetrieve operation. It presents the model for the SearchRetrieve operation and is also intended to serve as a guideline for the development of application protocol bindings (hereafter bindings, see definitional note). A binding describes the capabilities and general characteristic of a server or search engine, and how it is to be accessed. A binding may describe a class of servers via a human-readable document or a binding may be a machine-readable file describing a single server, provided by that server, according to the description language described at xxx, which is a fundamental component of the SWS standard
  • 14. APD Data Model A server exposes a datastore for access by a remote client for purposes of search and retrieval. The datastore is a collection of units of data. Such a unit is referred to as an item in this model. For purposes of this model there is a single datastore at any given server. Associated with a datastore are one or more formats that may be used for the transfer of items from the server to the client. Such a format is referred to as an item type in this model. An item type represents a common understanding shared by the client and server of the information contained in the items of the datastore, to allow the transfer of that information. The item type identifies an abstract representation of the information. It does not represent nor does it constrain the internal representation or storage of that information at the server
  • 15. APD Processing Model A client sends a searchRetrieve request to a server, which responds with a searchRetrieve response. The request includes a search query to be matched against the items at the server’s datastore. The server processes the query, creating a result set (see Result Set Model) of items that match the query. The request also indicates the desired number of items to be included in the response and includes information about how the individual items in the response, as well as the response at large, are to be formatted. The response includes items from the result set, diagnostic information, and a result set identifier that the client may use in a subsequent request to retrieve additional items.
  • 16. APD Result Set Model This is a logical model; support of result sets is not assumed nor required by this standard From the client's point of view, the result set is a set of items each referenced by an ordinal number, beginning with 1. The client may request a given item from a result set according to a specific format. For example the client may request item 1 in Dublin Core, and subsequently request item 1 in MODS. The format in which items are supplied is not a property of the result set, nor is it a property of the requested items as a member of the result set; the result set is simply the ordered list of items.
  • 17. APD Request Parameters Abstract Parameter Name Description responseType e.g. 'text/html', ‘application/atom+xml’ , application/x+sru query The search query of the request. startPosition The position within the result set of the first item to be returned. maximumItems The number of items requested to be returned. itemType e.g. string, jpeg, dc, iso2709. From list provided by server. sortOrder The requested order of the result set.
  • 18. APD Response Parameters Abstract Element Name Description numberOfItems The number of items matched by the query. resultSetId The identifier for the result set created by the query. items a sequence of items. nextPosition The next position within the result set following the final returned item. Diagnostics Error message and/or diagnostics. echoedSearchRetrieveRequest The server may echo the request back to the client.
  • 19. HTTP Get Binding Syntax The client sends a request via the HTTP GET method Specifically it is an HTTP URL of the form: <base URL>?<searchpart> Encoding Convert the value to UTF-8. Percent-encode characters as necessary within the value. Construct a URI from the parameter names and encoded values.
  • 20. SRU 1.2 Binding The APD + the HTTP Get Binding + new request parameters (operation, version, recordPacking resultSetTTL stylesheet extraRequestData) – unused base parameters (responseType, sortOrder) + new response elements (version, resultSetIdleTime, extraResponseData) and a shiny XML encoding.
  • 21. SWS Description Language What do we think we’ve learned? Developers are tired of being told how to do their business! Unless they have a business reason to worry about interoperability, they won’t. Third party interoperability needs to be something they can add on when they do discover they need it. Better yet, let someone else add it on.
  • 22. Prescriptive vs Descriptive Standards A prescriptive standard (Z39.50, SRU and the response part of OpenSearch) causes interoperability by telling you how to construct your interface, allowing for simple clients that know how to talk to you. The hard work of interface is done by the server. A descriptive standard (WSDL and the request part of OpenSearch) causes interoperability by allowing you to describe your interface in such a way that clients can be created dynamically to talk to you. The hard work of interface is done by the client.
  • 23. Who Wants This? • Anyone who wants access to content that doesn’t adhere to any search standards: Web 2.0 and NISO Metasearch! • Anyone with content to provide who doesn’t know what clients might want to search that content
  • 25. … On Steroids! <request href="http://copac.ac.uk/"> <form action="/wzgw" method="get“ name="Copac Quick Search"> <param name="au" semantics="au"/> <param name="ti" semantics="ti"/> <param name="any" semantics="kw"/> <param name="form" value="qs"/> <param name="fs" value="Search"/> </form> </request>
  • 26. … On Steriods (cont.) <response> <set name="numberOfItems"> <regexp regexp="&lt;span id=&quot;num_hits&quot;&gt;([0-9]+)&lt;"/> </set> </response>
  • 27. P.S., Bibliographic Context Set Anyone? SRU depends on context sets. The SRU Editorial Board recognizes the need for context set for bibliographic searching (equivalent to Bib-1 in the Z39.50 universe). But, they don’t feel that they are the appropriate body. Anyone in the NISO community interested?