SlideShare a Scribd company logo
1
Search term recommendation and non-
textual ranking evaluated
Philipp Mayr, Peter Mutschke, Philipp Schaer, York Sure
GESIS Leibniz Institute for the Social Sciences
9th European NKOS Workshop at the 14th ECDL Conference
Glasgow, Scotland
9th September 2010
2
Topic of the talk
1. Value-added services to improve the search in
Digital Libraries (DL)
• Short introduction of the services
2. Results from a retrieval evaluation exercise
• Evaluation of a KOS-based Search Term
Recommender (STR)
• Evaluation of re-ranking approaches:
Bradfordizing and author centrality
3
Situation
Major difficulties in most text-oriented DL
1. Vagueness between search and indexing
terms (language problem)
2. Too much documents and unstructured result
sets (information overload)
3. Pure text-statistical rankings (e.g. tf-idf) have
serious weaknesses (short document problem)
Consequence: Users are unsatisfied with a retrieval
system and search is far beyond its possibilities
4
IRM project
Motivation:
• Implementation and evaluation of value-added services
for retrieval in DL
• Using science models (specialties in languages,
networks of authors, centrality of journals) (Bates, 1990)
• Improvement of search results/experience:
• Search term recommendation (bridge between user
and controlled specialized vocabulary/ies)
• Non-textual ranking techniques (providing computational
approaches to rank documents, provide context)
Funded by DFG (grant number INST 658/6-1)
5
Search Term Recommender (STR)
Recommendation of highly associated controlled terms
Expansion of the top n ranked controlled terms
Statistical relations:
• Co-word analysis
• Complete database and
KOS as training set
• Automatic mapping of terms
Controlled context:
descriptors as tagcloud
Make use of knowledge/specialties in a KOS
and the statistical relations computed by
the co
Make use of knowledge/specialties in a KOS
and the statistical relations computed by
the co-word analysis http://www.gesis.org/beta/prototypen/irm/
6
Search Term
Recommendation (STR)
Example
• Starting with a unspecific term “Unemployment”
• Pure text-based standard ranking produces many and relatively poor results
• STR recommends controlled terms from a pre-selected KOS (thesaurus)
Interaction: User refines search with a
recommended controlled term e.g. „Job Search“
7
Service: Reranking (Bradfordizing)
Sorting central papers in core journals/publishers on top
Journal context
Core
Zone 2
Periphery
• A build-in functionality of the SOLR system aggregates articles
• The frequency counts of the journals are used to boost documents
Make use of publication regularities on
a
Make use of publication regularities on
a topic (coreness of journals)
Interaction: User selects a certain journal
8
Re-Ranking
Author networks
(AUTH)
Service: Reranking (Author centrality)
• A background procedure computes a co-authorship network for the whole result
• The centrality values of the computed authors are used to boost documents
Co-authorship network
Make use of knowledge about the
interaction and cooperation of authors
Make use of knowledge about the
interaction and cooperation of authors
Interaction: User selects a certain author
9
Main questions
• What is examined?
– the quality of different services to improve search
– or the quality of the associated search of the services
• Can we improve search in a DL with different
KOS-enhanced databases?
– In one discipline
– Between at least two disciplines
• The services are helpful to whom?
– Experts/scientists
– Students
10
Evaluation
10 different topics from the CLEF conferences (title and
descriptions), 73 LIS students, German SOLIS database
as test collection
1. Selection of a topic
2. Assessment of documents: top
10 ranked documents for each
service
• STR
• BRAD
• AUTH
• SOLR (tf-idf as baseline)
3. Evaluation of the relevance
assessments (MAP, P@10)
Pooled document set in the assessment tool
11
Operationalization
All terms and topics are translated from German
Original query for the service SOLR:
povert* AND german*
STR: Expanded STR-enhanced query:
(povert* AND german*) OR "poverty" OR "Federal
Republic of Germany" OR "social assistance" OR
"immiseration“ (highest confidence value)
BRAD: top 10 documents after BRAD-reranking (most
frequent core journal)
AUTH: top 10 documents after AUTH-reranking (highest
betweeness value)
12
Topics
- 73 assessors did 3,196 single relevance judgments
- topic 83 was picked 15 times – topic 96 twice
- fair and moderate inter-rater agreement
Schaer et al.
13
Number of documents judged relevant by the assessors per topic
• STR by adding the 4 descriptors with the highest confidence
• STR was best service (topics 88, 110, 166 and 173 with an
improvement of 20% at least compared to the baseline)
Topics
Relevantdocuments
10 topics
1 database
top 10 docs
4 services
73 students
Mayr et al., submitted
Results
14
Results
Schaer et al.
Precision for each topic and service
-Weaknesses of each service on a concrete topic can be observed
-Different thresholds applied make a more diverse result
15
14
36
STR
AUTHBRAD
5 5
3
SOLR
0
• 36 intersections of suggested top n=10 documents over all topics
and services
• Total of 400 documents: (4 services * 10 per service * 10 topics)
• Result sets are close to disjoint, services provide quite different
views onto the document space
Mayr et al., submitted
Results
Read: SOLR and STR
have 14 document
intersections from 100
possible (for all 10
topics)
16
Implications
1. Precision values of the retrieval services are
the same or better than standard text-
statistical ranking (tf-idf)
2. Each service produces its own document
space (measured in intersections)
3. Each service has weaknesses which can have
implication on the relevance ratio
• STR: misleading search terms (statistical artefacts) but high
descriptive power of controlled terms
• BRAD: high frequent but some irrelevant journals (outdated)
• AUTH: central but misleading authors
17
Next steps
Searching and evaluating in a multi-thesauri-
scenario (connecting STR services)
Interactive retrieval with services support
Evaluation of scientists on their specific research
topics
Evaluation of combination of services
Reimplementation of the commercial categorizer
18
Short conclusions
1. Valuable alternative views on document
spaces -> especially in interactive retrieval (?)
2. Improved relevance ratio and low overlap of
the services results
3. Contexts matter!
• Providing context to stimulate and support
users
• Support interactive systems for exploration
19
References
1. Philipp Schaer, Philipp Mayr, Peter Mutschke (accepted for ASIST
2010): Demonstrating a Service-Enhanced Retrieval System
2. Philipp Schaer, Philipp Mayr, Peter Mutschke (accepted for LWA
2010): Implications of Inter-Rater Agreement on a Student Information
Retrieval Evaluation
3. Mayr, Philipp; Mutschke, Peter; Schaer, Philipp; Sure, York (Abstract
accepted for Scientometrics Special Issue on Science Models):
Science Models as Value-Added Services for Scholarly Information
Systems.
20
Thank you!
Dr. Philipp Mayr
GESIS
Lennéstr. 30
53113 Bonn
mailto:philipp.mayr@gesis.org
IRM project
http://www.gesis.org/index.php?id=2479
IRM prototype
http://www.gesis.org/beta/prototypen/irm/

More Related Content

What's hot

محاضرة برنامج Endnote لتبويب المراجع العلمية د.غادة باوزير
محاضرة برنامج Endnote لتبويب المراجع العلمية د.غادة باوزيرمحاضرة برنامج Endnote لتبويب المراجع العلمية د.غادة باوزير
محاضرة برنامج Endnote لتبويب المراجع العلمية د.غادة باوزير
مركز البحوث الأقسام العلمية
 
Brooking Ingesting Metadata - FINAL
Brooking Ingesting Metadata - FINALBrooking Ingesting Metadata - FINAL
Brooking Ingesting Metadata - FINAL
National Information Standards Organization (NISO)
 
Dataset reuse: An analysis of references in community discussions, publicatio...
Dataset reuse: An analysis of references in community discussions, publicatio...Dataset reuse: An analysis of references in community discussions, publicatio...
Dataset reuse: An analysis of references in community discussions, publicatio...
Kemele M. Endris
 
Hansen Metadata for Institutional Repositories
Hansen Metadata for Institutional RepositoriesHansen Metadata for Institutional Repositories
Hansen Metadata for Institutional Repositories
National Information Standards Organization (NISO)
 
Workflows for Publishing Data; Scientific Data's experience as an early adopter
Workflows for Publishing Data; Scientific Data's experience as an early adopterWorkflows for Publishing Data; Scientific Data's experience as an early adopter
Workflows for Publishing Data; Scientific Data's experience as an early adopter
Varsha Khodiyar
 
Data sharing as part of the research ecosystem
Data sharing as part of the research ecosystemData sharing as part of the research ecosystem
Data sharing as part of the research ecosystem
Varsha Khodiyar
 
Citation Analysis for the Free, Online Literature
Citation Analysis for the Free, Online LiteratureCitation Analysis for the Free, Online Literature
Citation Analysis for the Free, Online Literature
Balachandar Radhakrishnan
 
Google Scholar and Web of Science: Similarities and Differences in Citation A...
Google Scholar and Web of Science: Similarities and Differences in Citation A...Google Scholar and Web of Science: Similarities and Differences in Citation A...
Google Scholar and Web of Science: Similarities and Differences in Citation A...
Balachandar Radhakrishnan
 
Barcelona 2014: CrossRef System and Support Update by Chuck Koscher
Barcelona 2014: CrossRef System and Support Update by Chuck KoscherBarcelona 2014: CrossRef System and Support Update by Chuck Koscher
Barcelona 2014: CrossRef System and Support Update by Chuck Koscher
Crossref
 
Bibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
Bibliometric-enhanced Retrieval Models for Big Scholarly Information SystemsBibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
Bibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
GESIS
 
ESWC 2019 - A Software Framework and Datasets for the Analysis of Graphs Meas...
ESWC 2019 - A Software Framework and Datasets for the Analysis of Graphs Meas...ESWC 2019 - A Software Framework and Datasets for the Analysis of Graphs Meas...
ESWC 2019 - A Software Framework and Datasets for the Analysis of Graphs Meas...
Matthäus Zloch
 
Martin Rasmussen: Ensuring availability and quality of research data through ...
Martin Rasmussen: Ensuring availability and quality of research data through ...Martin Rasmussen: Ensuring availability and quality of research data through ...
Martin Rasmussen: Ensuring availability and quality of research data through ...
"Open Access - Open Data" conference, 13th/14th December, 2010
 
Clicking Past Google
Clicking Past GoogleClicking Past Google
Clicking Past Google
Douglas Joubert
 
Text Data Mining: Unlocking the hidden potential from scholarly content.
Text Data Mining: Unlocking the hidden potential from scholarly content.Text Data Mining: Unlocking the hidden potential from scholarly content.
Text Data Mining: Unlocking the hidden potential from scholarly content.
Emma Warren-Jones
 
Open data sources in VOSviewer
Open data sources in VOSviewerOpen data sources in VOSviewer
Open data sources in VOSviewer
Nees Jan van Eck
 
Text mining
Text miningText mining
Text mining
Koshy Geoji
 
RLG Shared Print Update For ALA MW 2009
RLG Shared Print Update For ALA MW 2009RLG Shared Print Update For ALA MW 2009
RLG Shared Print Update For ALA MW 2009
OCLC Research
 
Data Stories: Using Narratives to Reflect on a Data Purchase Pilot Program
Data Stories: Using Narratives to Reflect on a Data Purchase Pilot ProgramData Stories: Using Narratives to Reflect on a Data Purchase Pilot Program
Data Stories: Using Narratives to Reflect on a Data Purchase Pilot Program
NASIG
 
Giving researchers credit for data
Giving researchers credit for dataGiving researchers credit for data
Giving researchers credit for data
Jisc
 
Tovek Presentation by Livio Costantini
Tovek Presentation by Livio CostantiniTovek Presentation by Livio Costantini
Tovek Presentation by Livio Costantini
maxfalc
 

What's hot (20)

محاضرة برنامج Endnote لتبويب المراجع العلمية د.غادة باوزير
محاضرة برنامج Endnote لتبويب المراجع العلمية د.غادة باوزيرمحاضرة برنامج Endnote لتبويب المراجع العلمية د.غادة باوزير
محاضرة برنامج Endnote لتبويب المراجع العلمية د.غادة باوزير
 
Brooking Ingesting Metadata - FINAL
Brooking Ingesting Metadata - FINALBrooking Ingesting Metadata - FINAL
Brooking Ingesting Metadata - FINAL
 
Dataset reuse: An analysis of references in community discussions, publicatio...
Dataset reuse: An analysis of references in community discussions, publicatio...Dataset reuse: An analysis of references in community discussions, publicatio...
Dataset reuse: An analysis of references in community discussions, publicatio...
 
Hansen Metadata for Institutional Repositories
Hansen Metadata for Institutional RepositoriesHansen Metadata for Institutional Repositories
Hansen Metadata for Institutional Repositories
 
Workflows for Publishing Data; Scientific Data's experience as an early adopter
Workflows for Publishing Data; Scientific Data's experience as an early adopterWorkflows for Publishing Data; Scientific Data's experience as an early adopter
Workflows for Publishing Data; Scientific Data's experience as an early adopter
 
Data sharing as part of the research ecosystem
Data sharing as part of the research ecosystemData sharing as part of the research ecosystem
Data sharing as part of the research ecosystem
 
Citation Analysis for the Free, Online Literature
Citation Analysis for the Free, Online LiteratureCitation Analysis for the Free, Online Literature
Citation Analysis for the Free, Online Literature
 
Google Scholar and Web of Science: Similarities and Differences in Citation A...
Google Scholar and Web of Science: Similarities and Differences in Citation A...Google Scholar and Web of Science: Similarities and Differences in Citation A...
Google Scholar and Web of Science: Similarities and Differences in Citation A...
 
Barcelona 2014: CrossRef System and Support Update by Chuck Koscher
Barcelona 2014: CrossRef System and Support Update by Chuck KoscherBarcelona 2014: CrossRef System and Support Update by Chuck Koscher
Barcelona 2014: CrossRef System and Support Update by Chuck Koscher
 
Bibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
Bibliometric-enhanced Retrieval Models for Big Scholarly Information SystemsBibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
Bibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
 
ESWC 2019 - A Software Framework and Datasets for the Analysis of Graphs Meas...
ESWC 2019 - A Software Framework and Datasets for the Analysis of Graphs Meas...ESWC 2019 - A Software Framework and Datasets for the Analysis of Graphs Meas...
ESWC 2019 - A Software Framework and Datasets for the Analysis of Graphs Meas...
 
Martin Rasmussen: Ensuring availability and quality of research data through ...
Martin Rasmussen: Ensuring availability and quality of research data through ...Martin Rasmussen: Ensuring availability and quality of research data through ...
Martin Rasmussen: Ensuring availability and quality of research data through ...
 
Clicking Past Google
Clicking Past GoogleClicking Past Google
Clicking Past Google
 
Text Data Mining: Unlocking the hidden potential from scholarly content.
Text Data Mining: Unlocking the hidden potential from scholarly content.Text Data Mining: Unlocking the hidden potential from scholarly content.
Text Data Mining: Unlocking the hidden potential from scholarly content.
 
Open data sources in VOSviewer
Open data sources in VOSviewerOpen data sources in VOSviewer
Open data sources in VOSviewer
 
Text mining
Text miningText mining
Text mining
 
RLG Shared Print Update For ALA MW 2009
RLG Shared Print Update For ALA MW 2009RLG Shared Print Update For ALA MW 2009
RLG Shared Print Update For ALA MW 2009
 
Data Stories: Using Narratives to Reflect on a Data Purchase Pilot Program
Data Stories: Using Narratives to Reflect on a Data Purchase Pilot ProgramData Stories: Using Narratives to Reflect on a Data Purchase Pilot Program
Data Stories: Using Narratives to Reflect on a Data Purchase Pilot Program
 
Giving researchers credit for data
Giving researchers credit for dataGiving researchers credit for data
Giving researchers credit for data
 
Tovek Presentation by Livio Costantini
Tovek Presentation by Livio CostantiniTovek Presentation by Livio Costantini
Tovek Presentation by Livio Costantini
 

Viewers also liked

Considering Dynamic Non-Textual Content when Migrating Digital Asset Managem...
Considering Dynamic Non-Textual Content when Migrating Digital Asset Managem...Considering Dynamic Non-Textual Content when Migrating Digital Asset Managem...
Considering Dynamic Non-Textual Content when Migrating Digital Asset Managem...
Ayla Stein
 
Multimodal Social Book Search
Multimodal Social Book SearchMultimodal Social Book Search
Multimodal Social Book Search
Ismail BADACHE
 
Data for nuclear non-proliferation
Data for nuclear non-proliferation Data for nuclear non-proliferation
Data for nuclear non-proliferation
fisherali
 
Retrieval of textual and non textual information in
Retrieval of textual and non textual information inRetrieval of textual and non textual information in
Retrieval of textual and non textual information in
eSAT Publishing House
 
Para-textual Information: An interrogation of the non-textual properties of t...
Para-textual Information: An interrogation of the non-textual properties of t...Para-textual Information: An interrogation of the non-textual properties of t...
Para-textual Information: An interrogation of the non-textual properties of t...
Jessica Rogers
 
ICIC 2013 Conference Proceedings Uwe Rosemann TIB
ICIC 2013 Conference Proceedings Uwe Rosemann TIBICIC 2013 Conference Proceedings Uwe Rosemann TIB
ICIC 2013 Conference Proceedings Uwe Rosemann TIB
Dr. Haxel Consult
 

Viewers also liked (6)

Considering Dynamic Non-Textual Content when Migrating Digital Asset Managem...
Considering Dynamic Non-Textual Content when Migrating Digital Asset Managem...Considering Dynamic Non-Textual Content when Migrating Digital Asset Managem...
Considering Dynamic Non-Textual Content when Migrating Digital Asset Managem...
 
Multimodal Social Book Search
Multimodal Social Book SearchMultimodal Social Book Search
Multimodal Social Book Search
 
Data for nuclear non-proliferation
Data for nuclear non-proliferation Data for nuclear non-proliferation
Data for nuclear non-proliferation
 
Retrieval of textual and non textual information in
Retrieval of textual and non textual information inRetrieval of textual and non textual information in
Retrieval of textual and non textual information in
 
Para-textual Information: An interrogation of the non-textual properties of t...
Para-textual Information: An interrogation of the non-textual properties of t...Para-textual Information: An interrogation of the non-textual properties of t...
Para-textual Information: An interrogation of the non-textual properties of t...
 
ICIC 2013 Conference Proceedings Uwe Rosemann TIB
ICIC 2013 Conference Proceedings Uwe Rosemann TIBICIC 2013 Conference Proceedings Uwe Rosemann TIB
ICIC 2013 Conference Proceedings Uwe Rosemann TIB
 

Similar to Search term recommendation and non-textual ranking evaluated

Intra- and interdisciplinary cross-concordances for information retrieval
Intra- and interdisciplinary cross-concordances for information retrieval Intra- and interdisciplinary cross-concordances for information retrieval
Intra- and interdisciplinary cross-concordances for information retrieval
GESIS
 
Rec4LRW – Scientific Paper Recommender System for Literature Review and Writing
Rec4LRW – Scientific Paper Recommender System for Literature Review and WritingRec4LRW – Scientific Paper Recommender System for Literature Review and Writing
Rec4LRW – Scientific Paper Recommender System for Literature Review and Writing
Aravind Sesagiri Raamkumar
 
Philosophy of IR Evaluation Ellen Voorhees
Philosophy of IR Evaluation Ellen VoorheesPhilosophy of IR Evaluation Ellen Voorhees
Philosophy of IR Evaluation Ellen Voorhees
k21jag
 
Comparison of Techniques for Measuring Research Coverage of Scientific Papers...
Comparison of Techniques for Measuring Research Coverage of Scientific Papers...Comparison of Techniques for Measuring Research Coverage of Scientific Papers...
Comparison of Techniques for Measuring Research Coverage of Scientific Papers...
Aravind Sesagiri Raamkumar
 
Elsevier - Smart Data and Algorithms for the Publishing Industry
Elsevier - Smart Data and Algorithms for the Publishing IndustryElsevier - Smart Data and Algorithms for the Publishing Industry
Elsevier - Smart Data and Algorithms for the Publishing Industry
Antonio Gulli
 
Non-textual ranking in Digital Libraries
Non-textual ranking in Digital LibrariesNon-textual ranking in Digital Libraries
Non-textual ranking in Digital Libraries
GESIS
 
Proposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender FrameworkProposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender Framework
Aravind Sesagiri Raamkumar
 
Sub1579
Sub1579Sub1579
intro.ppt
intro.pptintro.ppt
intro.ppt
UbaidURRahman78
 
Introduction.pptx
Introduction.pptxIntroduction.pptx
Introduction.pptx
Mahsadelavari
 
Advantages of Query Biased Summaries in Information Retrieval
Advantages of Query Biased Summaries in Information RetrievalAdvantages of Query Biased Summaries in Information Retrieval
Advantages of Query Biased Summaries in Information Retrieval
Onur Yılmaz
 
A task-based scientific paper recommender system for literature review and ma...
A task-based scientific paper recommender system for literature review and ma...A task-based scientific paper recommender system for literature review and ma...
A task-based scientific paper recommender system for literature review and ma...
Aravind Sesagiri Raamkumar
 
Semantics-enhanced Cyberinfrastructure for ICMSE : Interoperability, Analyti...
Semantics-enhanced Cyberinfrastructure for ICMSE :  Interoperability, Analyti...Semantics-enhanced Cyberinfrastructure for ICMSE :  Interoperability, Analyti...
Semantics-enhanced Cyberinfrastructure for ICMSE : Interoperability, Analyti...
Artificial Intelligence Institute at UofSC
 
Benchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program CommitteesBenchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program Committees
Toine Bogers
 
Demonstrating a Framework for KOS-based Recommendations Systems
Demonstrating a Framework for KOS-based Recommendations SystemsDemonstrating a Framework for KOS-based Recommendations Systems
Demonstrating a Framework for KOS-based Recommendations Systems
GESIS
 
Social Phrases Having Impact in Altmetrics - SOPHIA
Social Phrases Having Impact in Altmetrics - SOPHIASocial Phrases Having Impact in Altmetrics - SOPHIA
Social Phrases Having Impact in Altmetrics - SOPHIA
Insight_Altmetrics
 
research unveiling connections and recommendations.pptx
research unveiling connections and recommendations.pptxresearch unveiling connections and recommendations.pptx
research unveiling connections and recommendations.pptx
Rishikeshreddy25
 
Inverted files for text search engines
Inverted files for text search enginesInverted files for text search engines
Inverted files for text search engines
unyil96
 
Introduction to Systematic Literature Review method
Introduction to Systematic Literature Review methodIntroduction to Systematic Literature Review method
Introduction to Systematic Literature Review method
Norsaremah Salleh
 
Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...
Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...
Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...
NASIG
 

Similar to Search term recommendation and non-textual ranking evaluated (20)

Intra- and interdisciplinary cross-concordances for information retrieval
Intra- and interdisciplinary cross-concordances for information retrieval Intra- and interdisciplinary cross-concordances for information retrieval
Intra- and interdisciplinary cross-concordances for information retrieval
 
Rec4LRW – Scientific Paper Recommender System for Literature Review and Writing
Rec4LRW – Scientific Paper Recommender System for Literature Review and WritingRec4LRW – Scientific Paper Recommender System for Literature Review and Writing
Rec4LRW – Scientific Paper Recommender System for Literature Review and Writing
 
Philosophy of IR Evaluation Ellen Voorhees
Philosophy of IR Evaluation Ellen VoorheesPhilosophy of IR Evaluation Ellen Voorhees
Philosophy of IR Evaluation Ellen Voorhees
 
Comparison of Techniques for Measuring Research Coverage of Scientific Papers...
Comparison of Techniques for Measuring Research Coverage of Scientific Papers...Comparison of Techniques for Measuring Research Coverage of Scientific Papers...
Comparison of Techniques for Measuring Research Coverage of Scientific Papers...
 
Elsevier - Smart Data and Algorithms for the Publishing Industry
Elsevier - Smart Data and Algorithms for the Publishing IndustryElsevier - Smart Data and Algorithms for the Publishing Industry
Elsevier - Smart Data and Algorithms for the Publishing Industry
 
Non-textual ranking in Digital Libraries
Non-textual ranking in Digital LibrariesNon-textual ranking in Digital Libraries
Non-textual ranking in Digital Libraries
 
Proposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender FrameworkProposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender Framework
 
Sub1579
Sub1579Sub1579
Sub1579
 
intro.ppt
intro.pptintro.ppt
intro.ppt
 
Introduction.pptx
Introduction.pptxIntroduction.pptx
Introduction.pptx
 
Advantages of Query Biased Summaries in Information Retrieval
Advantages of Query Biased Summaries in Information RetrievalAdvantages of Query Biased Summaries in Information Retrieval
Advantages of Query Biased Summaries in Information Retrieval
 
A task-based scientific paper recommender system for literature review and ma...
A task-based scientific paper recommender system for literature review and ma...A task-based scientific paper recommender system for literature review and ma...
A task-based scientific paper recommender system for literature review and ma...
 
Semantics-enhanced Cyberinfrastructure for ICMSE : Interoperability, Analyti...
Semantics-enhanced Cyberinfrastructure for ICMSE :  Interoperability, Analyti...Semantics-enhanced Cyberinfrastructure for ICMSE :  Interoperability, Analyti...
Semantics-enhanced Cyberinfrastructure for ICMSE : Interoperability, Analyti...
 
Benchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program CommitteesBenchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program Committees
 
Demonstrating a Framework for KOS-based Recommendations Systems
Demonstrating a Framework for KOS-based Recommendations SystemsDemonstrating a Framework for KOS-based Recommendations Systems
Demonstrating a Framework for KOS-based Recommendations Systems
 
Social Phrases Having Impact in Altmetrics - SOPHIA
Social Phrases Having Impact in Altmetrics - SOPHIASocial Phrases Having Impact in Altmetrics - SOPHIA
Social Phrases Having Impact in Altmetrics - SOPHIA
 
research unveiling connections and recommendations.pptx
research unveiling connections and recommendations.pptxresearch unveiling connections and recommendations.pptx
research unveiling connections and recommendations.pptx
 
Inverted files for text search engines
Inverted files for text search enginesInverted files for text search engines
Inverted files for text search engines
 
Introduction to Systematic Literature Review method
Introduction to Systematic Literature Review methodIntroduction to Systematic Literature Review method
Introduction to Systematic Literature Review method
 
Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...
Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...
Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...
 

More from GESIS

10th BIR Workshop @ECIR 2020: introduction
10th  BIR Workshop @ECIR 2020: introduction10th  BIR Workshop @ECIR 2020: introduction
10th BIR Workshop @ECIR 2020: introduction
GESIS
 
From closed to open access: A case study of flipped journals
From closed to open access: A case study of flipped journalsFrom closed to open access: A case study of flipped journals
From closed to open access: A case study of flipped journals
GESIS
 
Highly cited references in PLOS ONE and their in-text usage over time
Highly cited references in PLOS ONE and their in-text usage over timeHighly cited references in PLOS ONE and their in-text usage over time
Highly cited references in PLOS ONE and their in-text usage over time
GESIS
 
4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...
4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...
4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...
GESIS
 
Bibliometric-enhanced Information Retrieval: Connecting IR with Bibliometrics
Bibliometric-enhanced Information Retrieval: Connecting IR with BibliometricsBibliometric-enhanced Information Retrieval: Connecting IR with Bibliometrics
Bibliometric-enhanced Information Retrieval: Connecting IR with Bibliometrics
GESIS
 
Analyzing the network structure and gender differences of the “NKOS community”
Analyzing the network structure and gender differences of the “NKOS community”Analyzing the network structure and gender differences of the “NKOS community”
Analyzing the network structure and gender differences of the “NKOS community”
GESIS
 
Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...
Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...
Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...
GESIS
 
Searching beyond datasets in the Social Sciences
Searching beyond datasets in the Social SciencesSearching beyond datasets in the Social Sciences
Searching beyond datasets in the Social Sciences
GESIS
 
Bedeutung von Text Mining am Beispiel der Sozialwissenschaften
Bedeutung von Text Mining am Beispiel der SozialwissenschaftenBedeutung von Text Mining am Beispiel der Sozialwissenschaften
Bedeutung von Text Mining am Beispiel der Sozialwissenschaften
GESIS
 
Contextualised Browsing in a Digital Library’s Living Lab
Contextualised Browsing in a Digital Library’s Living LabContextualised Browsing in a Digital Library’s Living Lab
Contextualised Browsing in a Digital Library’s Living Lab
GESIS
 
41st European Conference on Information Retrieval (ECIR 2019)
41st European Conference on Information Retrieval (ECIR 2019)41st European Conference on Information Retrieval (ECIR 2019)
41st European Conference on Information Retrieval (ECIR 2019)
GESIS
 
Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...
Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...
Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...
GESIS
 
A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...
A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...
A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...
GESIS
 
Challenges in Extracting and Managing References
Challenges in Extracting and Managing ReferencesChallenges in Extracting and Managing References
Challenges in Extracting and Managing References
GESIS
 
Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...
Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...
Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...
GESIS
 
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Measuring the usefulness of Knowledge Organization Systems in Information Ret...Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
GESIS
 
Recent Advances in Bibliometric-Enhanced Information Retrieval
Recent Advances in Bibliometric-Enhanced Information RetrievalRecent Advances in Bibliometric-Enhanced Information Retrieval
Recent Advances in Bibliometric-Enhanced Information Retrieval
GESIS
 
Analyzing the research output presented at European Networked Knowledge Organ...
Analyzing the research output presented at European Networked Knowledge Organ...Analyzing the research output presented at European Networked Knowledge Organ...
Analyzing the research output presented at European Networked Knowledge Organ...
GESIS
 
Introduction to the 15th NKOS workshop @TPDL2016
Introduction to the 15th NKOS workshop @TPDL2016Introduction to the 15th NKOS workshop @TPDL2016
Introduction to the 15th NKOS workshop @TPDL2016
GESIS
 
Recent applications of Knowledge Organization Systems
Recent applications of Knowledge Organization SystemsRecent applications of Knowledge Organization Systems
Recent applications of Knowledge Organization Systems
GESIS
 

More from GESIS (20)

10th BIR Workshop @ECIR 2020: introduction
10th  BIR Workshop @ECIR 2020: introduction10th  BIR Workshop @ECIR 2020: introduction
10th BIR Workshop @ECIR 2020: introduction
 
From closed to open access: A case study of flipped journals
From closed to open access: A case study of flipped journalsFrom closed to open access: A case study of flipped journals
From closed to open access: A case study of flipped journals
 
Highly cited references in PLOS ONE and their in-text usage over time
Highly cited references in PLOS ONE and their in-text usage over timeHighly cited references in PLOS ONE and their in-text usage over time
Highly cited references in PLOS ONE and their in-text usage over time
 
4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...
4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...
4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...
 
Bibliometric-enhanced Information Retrieval: Connecting IR with Bibliometrics
Bibliometric-enhanced Information Retrieval: Connecting IR with BibliometricsBibliometric-enhanced Information Retrieval: Connecting IR with Bibliometrics
Bibliometric-enhanced Information Retrieval: Connecting IR with Bibliometrics
 
Analyzing the network structure and gender differences of the “NKOS community”
Analyzing the network structure and gender differences of the “NKOS community”Analyzing the network structure and gender differences of the “NKOS community”
Analyzing the network structure and gender differences of the “NKOS community”
 
Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...
Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...
Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...
 
Searching beyond datasets in the Social Sciences
Searching beyond datasets in the Social SciencesSearching beyond datasets in the Social Sciences
Searching beyond datasets in the Social Sciences
 
Bedeutung von Text Mining am Beispiel der Sozialwissenschaften
Bedeutung von Text Mining am Beispiel der SozialwissenschaftenBedeutung von Text Mining am Beispiel der Sozialwissenschaften
Bedeutung von Text Mining am Beispiel der Sozialwissenschaften
 
Contextualised Browsing in a Digital Library’s Living Lab
Contextualised Browsing in a Digital Library’s Living LabContextualised Browsing in a Digital Library’s Living Lab
Contextualised Browsing in a Digital Library’s Living Lab
 
41st European Conference on Information Retrieval (ECIR 2019)
41st European Conference on Information Retrieval (ECIR 2019)41st European Conference on Information Retrieval (ECIR 2019)
41st European Conference on Information Retrieval (ECIR 2019)
 
Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...
Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...
Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...
 
A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...
A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...
A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...
 
Challenges in Extracting and Managing References
Challenges in Extracting and Managing ReferencesChallenges in Extracting and Managing References
Challenges in Extracting and Managing References
 
Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...
Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...
Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...
 
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Measuring the usefulness of Knowledge Organization Systems in Information Ret...Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
 
Recent Advances in Bibliometric-Enhanced Information Retrieval
Recent Advances in Bibliometric-Enhanced Information RetrievalRecent Advances in Bibliometric-Enhanced Information Retrieval
Recent Advances in Bibliometric-Enhanced Information Retrieval
 
Analyzing the research output presented at European Networked Knowledge Organ...
Analyzing the research output presented at European Networked Knowledge Organ...Analyzing the research output presented at European Networked Knowledge Organ...
Analyzing the research output presented at European Networked Knowledge Organ...
 
Introduction to the 15th NKOS workshop @TPDL2016
Introduction to the 15th NKOS workshop @TPDL2016Introduction to the 15th NKOS workshop @TPDL2016
Introduction to the 15th NKOS workshop @TPDL2016
 
Recent applications of Knowledge Organization Systems
Recent applications of Knowledge Organization SystemsRecent applications of Knowledge Organization Systems
Recent applications of Knowledge Organization Systems
 

Recently uploaded

Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
TIPNGVN2
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Zilliz
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Large Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial ApplicationsLarge Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial Applications
Rohit Gautam
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 

Recently uploaded (20)

Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Large Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial ApplicationsLarge Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial Applications
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 

Search term recommendation and non-textual ranking evaluated

  • 1. 1 Search term recommendation and non- textual ranking evaluated Philipp Mayr, Peter Mutschke, Philipp Schaer, York Sure GESIS Leibniz Institute for the Social Sciences 9th European NKOS Workshop at the 14th ECDL Conference Glasgow, Scotland 9th September 2010
  • 2. 2 Topic of the talk 1. Value-added services to improve the search in Digital Libraries (DL) • Short introduction of the services 2. Results from a retrieval evaluation exercise • Evaluation of a KOS-based Search Term Recommender (STR) • Evaluation of re-ranking approaches: Bradfordizing and author centrality
  • 3. 3 Situation Major difficulties in most text-oriented DL 1. Vagueness between search and indexing terms (language problem) 2. Too much documents and unstructured result sets (information overload) 3. Pure text-statistical rankings (e.g. tf-idf) have serious weaknesses (short document problem) Consequence: Users are unsatisfied with a retrieval system and search is far beyond its possibilities
  • 4. 4 IRM project Motivation: • Implementation and evaluation of value-added services for retrieval in DL • Using science models (specialties in languages, networks of authors, centrality of journals) (Bates, 1990) • Improvement of search results/experience: • Search term recommendation (bridge between user and controlled specialized vocabulary/ies) • Non-textual ranking techniques (providing computational approaches to rank documents, provide context) Funded by DFG (grant number INST 658/6-1)
  • 5. 5 Search Term Recommender (STR) Recommendation of highly associated controlled terms Expansion of the top n ranked controlled terms Statistical relations: • Co-word analysis • Complete database and KOS as training set • Automatic mapping of terms Controlled context: descriptors as tagcloud Make use of knowledge/specialties in a KOS and the statistical relations computed by the co Make use of knowledge/specialties in a KOS and the statistical relations computed by the co-word analysis http://www.gesis.org/beta/prototypen/irm/
  • 6. 6 Search Term Recommendation (STR) Example • Starting with a unspecific term “Unemployment” • Pure text-based standard ranking produces many and relatively poor results • STR recommends controlled terms from a pre-selected KOS (thesaurus) Interaction: User refines search with a recommended controlled term e.g. „Job Search“
  • 7. 7 Service: Reranking (Bradfordizing) Sorting central papers in core journals/publishers on top Journal context Core Zone 2 Periphery • A build-in functionality of the SOLR system aggregates articles • The frequency counts of the journals are used to boost documents Make use of publication regularities on a Make use of publication regularities on a topic (coreness of journals) Interaction: User selects a certain journal
  • 8. 8 Re-Ranking Author networks (AUTH) Service: Reranking (Author centrality) • A background procedure computes a co-authorship network for the whole result • The centrality values of the computed authors are used to boost documents Co-authorship network Make use of knowledge about the interaction and cooperation of authors Make use of knowledge about the interaction and cooperation of authors Interaction: User selects a certain author
  • 9. 9 Main questions • What is examined? – the quality of different services to improve search – or the quality of the associated search of the services • Can we improve search in a DL with different KOS-enhanced databases? – In one discipline – Between at least two disciplines • The services are helpful to whom? – Experts/scientists – Students
  • 10. 10 Evaluation 10 different topics from the CLEF conferences (title and descriptions), 73 LIS students, German SOLIS database as test collection 1. Selection of a topic 2. Assessment of documents: top 10 ranked documents for each service • STR • BRAD • AUTH • SOLR (tf-idf as baseline) 3. Evaluation of the relevance assessments (MAP, P@10) Pooled document set in the assessment tool
  • 11. 11 Operationalization All terms and topics are translated from German Original query for the service SOLR: povert* AND german* STR: Expanded STR-enhanced query: (povert* AND german*) OR "poverty" OR "Federal Republic of Germany" OR "social assistance" OR "immiseration“ (highest confidence value) BRAD: top 10 documents after BRAD-reranking (most frequent core journal) AUTH: top 10 documents after AUTH-reranking (highest betweeness value)
  • 12. 12 Topics - 73 assessors did 3,196 single relevance judgments - topic 83 was picked 15 times – topic 96 twice - fair and moderate inter-rater agreement Schaer et al.
  • 13. 13 Number of documents judged relevant by the assessors per topic • STR by adding the 4 descriptors with the highest confidence • STR was best service (topics 88, 110, 166 and 173 with an improvement of 20% at least compared to the baseline) Topics Relevantdocuments 10 topics 1 database top 10 docs 4 services 73 students Mayr et al., submitted Results
  • 14. 14 Results Schaer et al. Precision for each topic and service -Weaknesses of each service on a concrete topic can be observed -Different thresholds applied make a more diverse result
  • 15. 15 14 36 STR AUTHBRAD 5 5 3 SOLR 0 • 36 intersections of suggested top n=10 documents over all topics and services • Total of 400 documents: (4 services * 10 per service * 10 topics) • Result sets are close to disjoint, services provide quite different views onto the document space Mayr et al., submitted Results Read: SOLR and STR have 14 document intersections from 100 possible (for all 10 topics)
  • 16. 16 Implications 1. Precision values of the retrieval services are the same or better than standard text- statistical ranking (tf-idf) 2. Each service produces its own document space (measured in intersections) 3. Each service has weaknesses which can have implication on the relevance ratio • STR: misleading search terms (statistical artefacts) but high descriptive power of controlled terms • BRAD: high frequent but some irrelevant journals (outdated) • AUTH: central but misleading authors
  • 17. 17 Next steps Searching and evaluating in a multi-thesauri- scenario (connecting STR services) Interactive retrieval with services support Evaluation of scientists on their specific research topics Evaluation of combination of services Reimplementation of the commercial categorizer
  • 18. 18 Short conclusions 1. Valuable alternative views on document spaces -> especially in interactive retrieval (?) 2. Improved relevance ratio and low overlap of the services results 3. Contexts matter! • Providing context to stimulate and support users • Support interactive systems for exploration
  • 19. 19 References 1. Philipp Schaer, Philipp Mayr, Peter Mutschke (accepted for ASIST 2010): Demonstrating a Service-Enhanced Retrieval System 2. Philipp Schaer, Philipp Mayr, Peter Mutschke (accepted for LWA 2010): Implications of Inter-Rater Agreement on a Student Information Retrieval Evaluation 3. Mayr, Philipp; Mutschke, Peter; Schaer, Philipp; Sure, York (Abstract accepted for Scientometrics Special Issue on Science Models): Science Models as Value-Added Services for Scholarly Information Systems.
  • 20. 20 Thank you! Dr. Philipp Mayr GESIS Lennéstr. 30 53113 Bonn mailto:philipp.mayr@gesis.org IRM project http://www.gesis.org/index.php?id=2479 IRM prototype http://www.gesis.org/beta/prototypen/irm/