SlideShare a Scribd company logo
Università degli studi di Bari “Aldo Moro” Dipartimento di Informatica   A Domain Based Approach to Information Retrieval in Digital Libraries F. Rotella , S. Ferilli, F. Leuzzi [email_address] , {fabio.leuzzi, rotella.fulvio}@gmail.com 8th Italian Research Conference on Digital Libraries Bari, Italy, February 9-10, 2012 L.A.C.A.M.  http://lacam.di.uniba.it:8000
Overview ,[object Object]
Keyword Extraction
Word Sense Disambiguation
Synset Clustering
A Multistrategy Similarity Measure
Document Partitioning
User Query Processing
A Preliminary Evaluation
Conclusions & Future Works A Domain Based Approach to Information Retrieval in Digital Libraries - F. Rotella, S. Ferilli, F. Leuzzi
Some repositories leave the  responsibility of quality  to the authors. + Anybody can produce and distribute documents. = Possible low average quality of the repository contents. Users are often overwhelmed by documents that only  apparently  are suitable for  satisfying  their  information needs . Introduction A Domain Based Approach to Information Retrieval in Digital Libraries - F. Rotella, S. Ferilli, F. Leuzzi
Introduction ,[object Object]
Numerical/statistical manipulation of (key)words has been widely explored in the literature ,[object Object],[object Object],[object Object],[object Object],[object Object]
WordNet Domains A Domain Based Approach to Information Retrieval in Digital Libraries - F. Rotella, S. Ferilli, F. Leuzzi
Objectives Improving fruition of a DL  ,[object Object]
Try to overcome the ambiguity of natural language
Inspired by the typical behavior of humans: ,[object Object]
select the  most appropriate one  according to the context of the discourse A Domain Based Approach to Information Retrieval in Digital Libraries - F. Rotella, S. Ferilli, F. Leuzzi
Keyword Extraction ,[object Object]

More Related Content

Similar to A Domain Based Approach to Information Retrieval in Digital Libraries - Rotella, Ferilli, Leuzzi

Chapter 2 Text Operation and Term Weighting.pdf
Chapter 2 Text Operation and Term Weighting.pdfChapter 2 Text Operation and Term Weighting.pdf
Chapter 2 Text Operation and Term Weighting.pdfJemalNesre1
 
Chapter 2: Text Operation in information stroage and retrieval
Chapter 2: Text Operation in information stroage and retrievalChapter 2: Text Operation in information stroage and retrieval
Chapter 2: Text Operation in information stroage and retrievalcaptainmactavish1996
 
Listening comprehension in efl teaching
Listening comprehension in efl teachingListening comprehension in efl teaching
Listening comprehension in efl teachingmora-deyanira
 
Listening Comprehension in EFL Teaching
Listening Comprehension in EFL TeachingListening Comprehension in EFL Teaching
Listening Comprehension in EFL Teachingmora-deyanira
 
Topic models, vector semantics and applications
Topic models, vector semantics and applicationsTopic models, vector semantics and applications
Topic models, vector semantics and applicationsVasileios Lampos
 
An Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical SemanticsAn Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical SemanticsTye Rausch
 
Automatic Profiling Of Learner Texts
Automatic Profiling Of Learner TextsAutomatic Profiling Of Learner Texts
Automatic Profiling Of Learner TextsJeff Nelson
 
Computer assisted text and corpus analysis
Computer assisted text and corpus analysisComputer assisted text and corpus analysis
Computer assisted text and corpus analysisRubyaShaheen
 
Chapter 2 Text Operation.pdf
Chapter 2 Text Operation.pdfChapter 2 Text Operation.pdf
Chapter 2 Text Operation.pdfHabtamu100
 
Your Annotated Bibliography must have 8 sources. Please go back to t.docx
Your Annotated Bibliography must have 8 sources. Please go back to t.docxYour Annotated Bibliography must have 8 sources. Please go back to t.docx
Your Annotated Bibliography must have 8 sources. Please go back to t.docxbudbarber38650
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingSaurabh Kaushik
 
Establishment of a List of Non-Compositional Multi-Word Combinations for Eng...
 Establishment of a List of Non-Compositional Multi-Word Combinations for Eng... Establishment of a List of Non-Compositional Multi-Word Combinations for Eng...
Establishment of a List of Non-Compositional Multi-Word Combinations for Eng...Research Journal of Education
 
How do we generate spoken words This issue is a fasci-natin.docx
How do we generate spoken words This issue is a fasci-natin.docxHow do we generate spoken words This issue is a fasci-natin.docx
How do we generate spoken words This issue is a fasci-natin.docxwellesleyterresa
 
Deep misconceptions and the myth of data driven NLU
Deep misconceptions and the myth of data driven NLUDeep misconceptions and the myth of data driven NLU
Deep misconceptions and the myth of data driven NLUWalid Saba
 
What can a corpus tell us about discourse
What can a corpus tell us about discourseWhat can a corpus tell us about discourse
What can a corpus tell us about discoursePascual Pérez-Paredes
 

Similar to A Domain Based Approach to Information Retrieval in Digital Libraries - Rotella, Ferilli, Leuzzi (20)

Chapter 2 Text Operation and Term Weighting.pdf
Chapter 2 Text Operation and Term Weighting.pdfChapter 2 Text Operation and Term Weighting.pdf
Chapter 2 Text Operation and Term Weighting.pdf
 
Chapter 2: Text Operation in information stroage and retrieval
Chapter 2: Text Operation in information stroage and retrievalChapter 2: Text Operation in information stroage and retrieval
Chapter 2: Text Operation in information stroage and retrieval
 
Listening comprehension in efl teaching
Listening comprehension in efl teachingListening comprehension in efl teaching
Listening comprehension in efl teaching
 
Listening Comprehension in EFL Teaching
Listening Comprehension in EFL TeachingListening Comprehension in EFL Teaching
Listening Comprehension in EFL Teaching
 
Topic models, vector semantics and applications
Topic models, vector semantics and applicationsTopic models, vector semantics and applications
Topic models, vector semantics and applications
 
Edinburgh
EdinburghEdinburgh
Edinburgh
 
An Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical SemanticsAn Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical Semantics
 
Barbiers iclave-fr
Barbiers iclave-frBarbiers iclave-fr
Barbiers iclave-fr
 
Automatic Profiling Of Learner Texts
Automatic Profiling Of Learner TextsAutomatic Profiling Of Learner Texts
Automatic Profiling Of Learner Texts
 
Computer assisted text and corpus analysis
Computer assisted text and corpus analysisComputer assisted text and corpus analysis
Computer assisted text and corpus analysis
 
Chapter 2 Text Operation.pdf
Chapter 2 Text Operation.pdfChapter 2 Text Operation.pdf
Chapter 2 Text Operation.pdf
 
Reading report
Reading reportReading report
Reading report
 
Your Annotated Bibliography must have 8 sources. Please go back to t.docx
Your Annotated Bibliography must have 8 sources. Please go back to t.docxYour Annotated Bibliography must have 8 sources. Please go back to t.docx
Your Annotated Bibliography must have 8 sources. Please go back to t.docx
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Ny3424442448
Ny3424442448Ny3424442448
Ny3424442448
 
Establishment of a List of Non-Compositional Multi-Word Combinations for Eng...
 Establishment of a List of Non-Compositional Multi-Word Combinations for Eng... Establishment of a List of Non-Compositional Multi-Word Combinations for Eng...
Establishment of a List of Non-Compositional Multi-Word Combinations for Eng...
 
Distributional semantics
Distributional semanticsDistributional semantics
Distributional semantics
 
How do we generate spoken words This issue is a fasci-natin.docx
How do we generate spoken words This issue is a fasci-natin.docxHow do we generate spoken words This issue is a fasci-natin.docx
How do we generate spoken words This issue is a fasci-natin.docx
 
Deep misconceptions and the myth of data driven NLU
Deep misconceptions and the myth of data driven NLUDeep misconceptions and the myth of data driven NLU
Deep misconceptions and the myth of data driven NLU
 
What can a corpus tell us about discourse
What can a corpus tell us about discourseWhat can a corpus tell us about discourse
What can a corpus tell us about discourse
 

More from University of Bari (Italy)

ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned fro...
ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned fro...ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned fro...
ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned fro...University of Bari (Italy)
 
Improving Robustness and Flexibility of Concept Taxonomy Learning from Text
Improving Robustness and Flexibility of Concept Taxonomy Learning from Text Improving Robustness and Flexibility of Concept Taxonomy Learning from Text
Improving Robustness and Flexibility of Concept Taxonomy Learning from Text University of Bari (Italy)
 
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document SegmentationA Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document SegmentationUniversity of Bari (Italy)
 
Recognising the Social Attitude in Natural Interaction with Pedagogical Agents
Recognising the Social Attitude in Natural Interaction with Pedagogical AgentsRecognising the Social Attitude in Natural Interaction with Pedagogical Agents
Recognising the Social Attitude in Natural Interaction with Pedagogical AgentsUniversity of Bari (Italy)
 
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...University of Bari (Italy)
 
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextCooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextUniversity of Bari (Italy)
 
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextCooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextUniversity of Bari (Italy)
 

More from University of Bari (Italy) (8)

ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned fro...
ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned fro...ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned fro...
ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned fro...
 
Improving Robustness and Flexibility of Concept Taxonomy Learning from Text
Improving Robustness and Flexibility of Concept Taxonomy Learning from Text Improving Robustness and Flexibility of Concept Taxonomy Learning from Text
Improving Robustness and Flexibility of Concept Taxonomy Learning from Text
 
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document SegmentationA Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
 
Recognising the Social Attitude in Natural Interaction with Pedagogical Agents
Recognising the Social Attitude in Natural Interaction with Pedagogical AgentsRecognising the Social Attitude in Natural Interaction with Pedagogical Agents
Recognising the Social Attitude in Natural Interaction with Pedagogical Agents
 
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
 
Technical report jada
Technical report jadaTechnical report jada
Technical report jada
 
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextCooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
 
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextCooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
 

Recently uploaded

Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfThiyagu K
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfThiyagu K
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...EugeneSaldivar
 
Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...
Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...
Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...SachinKumar945617
 
NLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptxNLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptxssuserbdd3e8
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxbennyroshan06
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxEduSkills OECD
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxJisc
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxJisc
 
plant breeding methods in asexually or clonally propagated crops
plant breeding methods in asexually or clonally propagated cropsplant breeding methods in asexually or clonally propagated crops
plant breeding methods in asexually or clonally propagated cropsparmarsneha2
 
Solid waste management & Types of Basic civil Engineering notes by DJ Sir.pptx
Solid waste management & Types of Basic civil Engineering notes by DJ Sir.pptxSolid waste management & Types of Basic civil Engineering notes by DJ Sir.pptx
Solid waste management & Types of Basic civil Engineering notes by DJ Sir.pptxDenish Jangid
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersPedroFerreira53928
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345beazzy04
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaasiemaillard
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXMIRIAMSALINAS13
 
Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chipsGeoBlogs
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
 
50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...
50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...
50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...Nguyen Thanh Tu Collection
 

Recently uploaded (20)

Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...
Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...
Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...
 
NLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptxNLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptx
 
Introduction to Quality Improvement Essentials
Introduction to Quality Improvement EssentialsIntroduction to Quality Improvement Essentials
Introduction to Quality Improvement Essentials
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
plant breeding methods in asexually or clonally propagated crops
plant breeding methods in asexually or clonally propagated cropsplant breeding methods in asexually or clonally propagated crops
plant breeding methods in asexually or clonally propagated crops
 
Solid waste management & Types of Basic civil Engineering notes by DJ Sir.pptx
Solid waste management & Types of Basic civil Engineering notes by DJ Sir.pptxSolid waste management & Types of Basic civil Engineering notes by DJ Sir.pptx
Solid waste management & Types of Basic civil Engineering notes by DJ Sir.pptx
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumers
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
 
Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chips
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...
50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...
50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...
 
B.ed spl. HI pdusu exam paper-2023-24.pdf
B.ed spl. HI pdusu exam paper-2023-24.pdfB.ed spl. HI pdusu exam paper-2023-24.pdf
B.ed spl. HI pdusu exam paper-2023-24.pdf
 

A Domain Based Approach to Information Retrieval in Digital Libraries - Rotella, Ferilli, Leuzzi

  • 1. Università degli studi di Bari “Aldo Moro” Dipartimento di Informatica A Domain Based Approach to Information Retrieval in Digital Libraries F. Rotella , S. Ferilli, F. Leuzzi [email_address] , {fabio.leuzzi, rotella.fulvio}@gmail.com 8th Italian Research Conference on Digital Libraries Bari, Italy, February 9-10, 2012 L.A.C.A.M. http://lacam.di.uniba.it:8000
  • 2.
  • 10. Conclusions & Future Works A Domain Based Approach to Information Retrieval in Digital Libraries - F. Rotella, S. Ferilli, F. Leuzzi
  • 11. Some repositories leave the responsibility of quality to the authors. + Anybody can produce and distribute documents. = Possible low average quality of the repository contents. Users are often overwhelmed by documents that only apparently are suitable for satisfying their information needs . Introduction A Domain Based Approach to Information Retrieval in Digital Libraries - F. Rotella, S. Ferilli, F. Leuzzi
  • 12.
  • 13.
  • 14. WordNet Domains A Domain Based Approach to Information Retrieval in Digital Libraries - F. Rotella, S. Ferilli, F. Leuzzi
  • 15.
  • 16. Try to overcome the ambiguity of natural language
  • 17.
  • 18. select the most appropriate one according to the context of the discourse A Domain Based Approach to Information Retrieval in Digital Libraries - F. Rotella, S. Ferilli, F. Leuzzi
  • 19.
  • 20. To limit the possibility of including non-discriminative and very general words in the representation of a document A Domain Based Approach to Information Retrieval in Digital Libraries - F. Rotella, S. Ferilli, F. Leuzzi
  • 21. Word Sense Disambiguation Domain Driven One Domain per Discourse assumption: many uses of a word in a coherent portion of text tend to share the same domain. A Domain Based Approach to Information Retrieval in Digital Libraries - F. Rotella, S. Ferilli, F. Leuzzi Prevalent domain individuation Extraction of all synsets for each term Extraction of all domains for each synset Choice of prevalent domain synset
  • 22.
  • 23.
  • 24.
  • 26. breadth (inverse neighbors) WordNet relationship are considered
  • 27. A Multistrategy Similarity Measure Cosidered Relationship member meronimy : the latter synset is a member meronym of the former; substance meronimy : the latter synset is a substance meronym of the former; part meronimy : the latter synset is a part meronym of the former; similarity : the latter synset is similar in meaning to the former; antonym : specifies antonymous word; attribute : defines the attribute relation between noun and adjective synset pairs in which the adjective is a value of the noun; additional information : additional information about the first word can be obtained by seeing the second word; part of speech based : specifies two different relations based on the parts of speech involved; participle : the adjective first word is a participle of the verb second word; hyperonymy : the latter synset is a hypernym of the former. A Domain Based Approach to Information Retrieval in Digital Libraries - F. Rotella, S. Ferilli, F. Leuzzi
  • 28.
  • 30.
  • 31.
  • 32.
  • 33.
  • 34.
  • 35.
  • 36.
  • 37.
  • 38.
  • 39. obtaining the most similar cluster to the involved words A Domain Based Approach to Information Retrieval in Digital Libraries - F. Rotella, S. Ferilli, F. Leuzzi
  • 40. Users Query Elaboration Query Results The best combination is used to obtain the list of clusters ranked by descending relevance, that can be used as an answer to the user search . The results are then displayed to the user, in particular are displayed the first n sets of document such that n is the minimum value that shows at least 10 results. A Domain Based Approach to Information Retrieval in Digital Libraries - F. Rotella, S. Ferilli, F. Leuzzi
  • 41.
  • 42.
  • 43. synset: 105943300; lemmas: doctrine, philosophical system, philosophy and school of thought; gloss: a belief (or system of beliefs) accepted as authoritative by some group or school;
  • 44. synset: 105941423; lemmas: belief ; gloss: any cognitive content held as true.
  • 45.
  • 46.
  • 47. synset: 107033753; lemmas: mass ; gloss: a musical setting for a Mass;
  • 48. synset: 107026352; lemmas: opera ; gloss: a drama set to music, consists of singing with orchestral accompaniment and an orchestral overture and interludes;
  • 49. synset: 107071942; lemmas: genre, music genre , musical genre and musical style; gloss: an expressive style of music;
  • 50. synset: 107064715; lemmas: rock , rock ’n’ roll, rock and roll, rock music, rock’n’roll and rock-and-roll; gloss: a genre of popular music originating in the 1950s, a blend of black rhythm-and-blues with white country-and-western. A Domain Based Approach to Information Retrieval in Digital Libraries - F. Rotella, S. Ferilli, F. Leuzzi A Preliminary Evaluation The Quality of Clusters
  • 51. A Domain Based Approach to Information Retrieval in Digital Libraries - F. Rotella, S. Ferilli, F. Leuzzi # Query Outcomes Precision Recall 1 Ornaments and melodies [1 to 9] music [10 to 11] religion 0.82 (1.0) 0.43 (9/21) 2 Reincarnation and eternal life [1 to 9] religion [10] science 0.9 (1.0) 0.39 (9/23) 3 Traditions and folks [1 to 4] music [5 to 6] religion [7 to 10] music 0.8 (1.0) 0.38 (8/21) 4 Limits of theory of relativity [1 to 2] science [3] politics [4 to 5] religion [6 to 15] science 0.8 0.44 (12/27) 5 Capitalism vs communism [1 to 3] politics [4] science [5 to 6] religion [7 to 11] politics [12] science [13] music 0.61 (0.77) 0.53 (8/15) 6 Markets and new economy [1] politics [2] music [3] science [4 to 8] politics [9 to 10] religion 0.6 (0.7) 0.4 (6/15) 7 Relationship between democracy and parliament [1 to 3] politics [4] science [5 to 6] politics [7 to 10] religion 0.5 (0.6) 0.33 (5/15) A Preliminary Evaluation Synthesis of Outcomes
  • 52.
  • 53.
  • 54.
  • 55. To explore other approaches to choose better intensional descriptions of each document A Domain Based Approach to Information Retrieval in Digital Libraries - F. Rotella, S. Ferilli, F. Leuzzi