The stairs evaluation

•Download as PPTX, PDF•

1 like•432 views

maruthimlis

Technology

M.MARUTHI
DLIS 2ND YEAR
PONDICHERRY UNIVERSITY
PONDICHERRY

 In 1985,blair and mornon published a report
on a large
scale experiment aimed at evaluating the
retrieval effectiveness of a full-text search
and retrieval system.
 This is known as STAIRS (storage and
information retrieval system) study.

 The database examined in the STAIRS study
consisted
of nearly 40,000 documents, representing
rouhgly 3,50,000 pages of hard copy text used
in the defence of a large corporate lawsuit.
 The full texts of all the pages where available
online texts and could be retrieved where
specified words appeared either simply or in
Boolean combinations.
 the major objective of the STAIRS evaluation
was to asses how well the system could retrieve
all the documents (and only those) relevant to a
given request, and measures of recall and
precisitions were used this purpose.

 The lawyers generated a total of 51 general
requests.
 The lawyers evaluated those documents and
grouped them into ‘vital’, ‘satisfactory’,
marginally relevant’ , or ‘irrelevant’ in relation
to the request.
 A sampling techniques was adapted .
 Random samples were taken and these were
evaluated by the lawyers .
 The total number of relevant of documents
that existed in these subsets was estimated.

 Out of the 51 requests, values for recall and
Presidion were calculated for 40 and the
remaining 11 were used to check the
sampling techniques and control for possible
bias in the evaluation of retrieved and sample
tests.
 The value of precision ranged in percentage
terms from 100%.

 In attempting to find out why STAIRS could
retrieve only one out of five relevant items in
response to a request.
 They point out that a retrieved set of several
thousand documents is impractical to browse
on the part of the user, and quit naturally the
user in such cases wants to reformulate the
query by adding more and more search terms
to bring the out put size to a manageable
limit.

 Early evaluation experiments have produced a
number of facts and figures that can be
utilized in many ways –in designing a new
system.
 The STAIRS study reported retrieval results
that are contradictory to the earlier studies.
 In 1986 salton published a paper
commenting on the major points of objection
raised by salton.

 There is evidence for output overload in large
systems.
 The effectiveness of automatic indexing.
 Four major methodological problems of the
previous studies.
 Small database unreliable techniques for judging
the relevance.
 The first concern is with the size and
composition of the collections used for testing in
research .

 The second concern is with the nature of the
queries used in research collections. for
example: looking at the INSPEC test collection
of 12,684documents, if a typical query maps
to 33 relevant documents, this would
extrapolate to an STN user retrieving over
24,000 documents.
 In chemical abstracts database containing 9.5
million documents.

 The third points raises the issue of whether the
issues the performance of a ranked retrieval
system is a large enough improvement over the
Boolean search model search model to represent
a cost-effective alternative.
 The use of research collections with larger
vocabularies and more records.
 the investigation of retrieval schemes that
incorporate proximity information
 The use of test collection that contain more
specific queries.

What's hot

Systematic reviewsFowler Susan

Open Research DataOkanagan College Library

Text-Mining PubMed Search Results to Identify Emerging Technologies Relevant ...University of Michigan Taubman Health Sciences Library

Analyzing dataAdan Rodriguez

Cies 2010 literature searching pubmedPatrice Chalon

Presentation2John Pell

A simplified example of searching systematicallyLinda_Kelly

Prisma s manuscript preprintdaisyfloresc

Review of literature HEMANT SHARMA

2015 GU-ICBI Poster (third printing)Michael Atkins

Core hom-a-powerful-and-exhaustive-database-of-clinical-trials-in-homeopathy ...home

Beyond traditional metrics at the University of São Paulo: scientific product...sfausto

Historians Vs Social Scientistsjgerber

Testing Reviewer Suggestions Derived from Bibliometric Specialty Approximatio...Nadine Rons

Scientometrics MD AZIZUR RAHMAN

4D Specialty Approximation: Ability to Distinguish between Related SpecialtiesNadine Rons

Nucl. Acids Res.-2014-Howe-nar-gku1244Yasel Cruz

MEDLARS - Medical Literature Analysis And Retrieval SystemPALLAB DAS

PubMedAlicia Tiny

What's hot (19)

Systematic reviews

Open Research Data

Text-Mining PubMed Search Results to Identify Emerging Technologies Relevant ...

Analyzing data

Cies 2010 literature searching pubmed

Presentation2

A simplified example of searching systematically

Prisma s manuscript preprint

Review of literature

2015 GU-ICBI Poster (third printing)

Core hom-a-powerful-and-exhaustive-database-of-clinical-trials-in-homeopathy ...

Beyond traditional metrics at the University of São Paulo: scientific product...

Historians Vs Social Scientists

Testing Reviewer Suggestions Derived from Bibliometric Specialty Approximatio...

Scientometrics

4D Specialty Approximation: Ability to Distinguish between Related Specialties

Nucl. Acids Res.-2014-Howe-nar-gku1244

MEDLARS - Medical Literature Analysis And Retrieval System

PubMed

Similar to The stairs evaluation

Case study finalhinanwr

The effect of socio-demographiccharacteristics on theinf.docxmehek4

Evaluation of medlarssilambu111

Unit 1 business researchpraveen3030

Embi cri review-2012-finalPeter Embi

The Systematic Review of Literature in LIS: an approachGrial - University of Salamanca

How to conduct_a_systematic_or_evidence_reviewEaglefly Fly

A Qualititative Approach To HCI ResearchNathan Mathis

Finding articles and books using database for your discipline pubricaPubrica

A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS AIRCC Publishing Corporation

A Semantic Retrieval System for Extracting Relationships from Biological CorpusAIRCC Publishing Corporation

A Semantic Retrieval System for Extracting Relationships from Biological Corpusijcsit

Systematic literature review technique.pptxTANMAY DAS GUPTA

Building theory from case studyKrite Infotech

Assignment 2 Case StudyRead the following articleAgostinhodesteinbrook

La & edm in practicebharati k

Experimental research data quality inijait

Standard Datasets in Information Retrieval Jean Brenda

Clicking Past GoogleDouglas Joubert

Assignment 6.1Scott Bohlin

Similar to The stairs evaluation (20)

Case study final

The effect of socio-demographiccharacteristics on theinf.docx

Evaluation of medlars

Unit 1 business research

Embi cri review-2012-final

The Systematic Review of Literature in LIS: an approach

How to conduct_a_systematic_or_evidence_review

A Qualititative Approach To HCI Research

Finding articles and books using database for your discipline pubrica

A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS

A Semantic Retrieval System for Extracting Relationships from Biological Corpus

Systematic literature review technique.pptx

Building theory from case study

Assignment 2 Case StudyRead the following articleAgostinho

La & edm in practice

Experimental research data quality in

Standard Datasets in Information Retrieval

Clicking Past Google

Assignment 6.1

Recently uploaded

Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays

Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge

Artificial intelligence in cctv survelliance.pptxhariprasad279825

"ML in Production",Oleksandr BaganFwdays

My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxnull - The Open Security Community

Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service9953056974 Low Rate Call Girls In Saket, Delhi NCR

APIForce Zurich 5 April Automation LPDGMarianaLemus7

Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55

Gen AI in Business - Global Trends Report 2024.pdfAddepto

Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software

Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson

costume and set research powerpoint presentationphoebematthew05

Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation

Understanding the Laravel MVC ArchitecturePixlogix Infotech

"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays

CloudStudio User manual (basic edition):comworks

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski

Recently uploaded (20)

Bun (KitWorks Team Study 노별마루 발표 2024.4.22)

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack

Designing IA for AI - Information Architecture Conference 2024

Artificial intelligence in cctv survelliance.pptx

"ML in Production",Oleksandr Bagan

My INSURER PTE LTD - Insurtech Innovation Award 2024

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx

Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service

APIForce Zurich 5 April Automation LPDG

Advanced Test Driven-Development @ php[tek] 2024

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...

Gen AI in Business - Global Trends Report 2024.pdf

Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation

Are Multi-Cloud and Serverless Good or Bad?

costume and set research powerpoint presentation

Connect Wave/ connectwave Pitch Deck Presentation

Understanding the Laravel MVC Architecture

"Debugging python applications inside k8s environment", Andrii Soldatenko

CloudStudio User manual (basic edition):

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...

The stairs evaluation

1. M.MARUTHI DLIS 2ND YEAR PONDICHERRY UNIVERSITY PONDICHERRY

2.  In 1985,blair and mornon published a report on a large scale experiment aimed at evaluating the retrieval effectiveness of a full-text search and retrieval system.  This is known as STAIRS (storage and information retrieval system) study.

3.  The database examined in the STAIRS study consisted of nearly 40,000 documents, representing rouhgly 3,50,000 pages of hard copy text used in the defence of a large corporate lawsuit.  The full texts of all the pages where available online texts and could be retrieved where specified words appeared either simply or in Boolean combinations.  the major objective of the STAIRS evaluation was to asses how well the system could retrieve all the documents (and only those) relevant to a given request, and measures of recall and precisitions were used this purpose.

4.  The lawyers generated a total of 51 general requests.  The lawyers evaluated those documents and grouped them into ‘vital’, ‘satisfactory’, marginally relevant’ , or ‘irrelevant’ in relation to the request.  A sampling techniques was adapted .  Random samples were taken and these were evaluated by the lawyers .  The total number of relevant of documents that existed in these subsets was estimated.

5.  Out of the 51 requests, values for recall and Presidion were calculated for 40 and the remaining 11 were used to check the sampling techniques and control for possible bias in the evaluation of retrieved and sample tests.  The value of precision ranged in percentage terms from 100%.

6.  In attempting to find out why STAIRS could retrieve only one out of five relevant items in response to a request.  They point out that a retrieved set of several thousand documents is impractical to browse on the part of the user, and quit naturally the user in such cases wants to reformulate the query by adding more and more search terms to bring the out put size to a manageable limit.

7.  Early evaluation experiments have produced a number of facts and figures that can be utilized in many ways –in designing a new system.  The STAIRS study reported retrieval results that are contradictory to the earlier studies.  In 1986 salton published a paper commenting on the major points of objection raised by salton.

8.  There is evidence for output overload in large systems.  The effectiveness of automatic indexing.  Four major methodological problems of the previous studies.  Small database unreliable techniques for judging the relevance.  The first concern is with the size and composition of the collections used for testing in research .

9.  The second concern is with the nature of the queries used in research collections. for example: looking at the INSPEC test collection of 12,684documents, if a typical query maps to 33 relevant documents, this would extrapolate to an STN user retrieving over 24,000 documents.  In chemical abstracts database containing 9.5 million documents.

10.  The third points raises the issue of whether the issues the performance of a ranked retrieval system is a large enough improvement over the Boolean search model search model to represent a cost-effective alternative.  The use of research collections with larger vocabularies and more records.  the investigation of retrieval schemes that incorporate proximity information  The use of test collection that contain more specific queries.

11. THANK YOU

The stairs evaluation

Recommended

Recommended

More Related Content

What's hot

What's hot (19)

Similar to The stairs evaluation

Similar to The stairs evaluation (20)

Recently uploaded

Recently uploaded (20)

The stairs evaluation