Your SlideShare is downloading. ×
Sdi
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Sdi

326
views

Published on

Published in: Education

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
326
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
0
Comments
0
Likes
0
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide
  • 12/01/11
  • 12/01/11
  • 12/01/11
  • 12/01/11
  • 12/01/11
  • 12/01/11
  • 12/01/11
  • 12/01/11
  • 12/01/11
  • 12/01/11
  • Transcript

    • 1. A Semantic Model of Selective Dissemination of Information for Digital Libraries Authors: J. M. Morales-del-Castillo¹, R. Pedraza- Jiménez², A. A. Ruiz³, E. Peis ⁴ , and E. Herrera-Viedma ⁵©www.sti-innsbruck.at INNSBRUCK www.sti-innsbruck.at Copyright 2008 STI
    • 2. Basic Ideea – Develop a multi-agent Selective Dissmination of Information (SDI) platform capable of generating alerts and recommandations of documents for users, according to their personal profiles – Appling Semantic Web technologies for achiving more efficient information managment and improving agent-agent and user-agent communicationwww.sti-innsbruck.at 2
    • 3. SDI Components • Thesaurus – Enables organizing the most relevant concepts in a specific domain, by defining semantic relations between them. • User profiles – Structured representations that contain personal data, interest and preferences of users. • RSS feeds – Used as “current awareness bulletins” to generate personalized bibliographic alerts • Recommendation log file – Each document in the repository has an associated log file that includes the listing of evaluations assigned to that resource by different userswww.sti-innsbruck.at 3
    • 4. Thesaurus The creation of a thesaurus includes four phases: • Pre-processing of documents – Prepare the document parametrization by removing the elements regarded as superfluous in 3 stages: • Eliminate all the tags (HTML, XML, etc) • Standardization of the words in the document including removing texts articles, determiners, auxiliary verbs, conjunctions, prepositions, … • Stemming all the terms left using the WordNet algorithm(Morphy) • Parameterizing the selected terms – Final terms are quantified by assigning weights obtained by the application of the scheme term frequency – inverse document frequency (tf-idf)www.sti-innsbruck.at
    • 5. Thesaurus • Conceptualizing their lexical stems – The associated meaning of each term (lemma) are extract by searching them on WordNet, which returns a group of synsets associated to each word (including hypernyms and hyperonyms) • Generating a lattice or graph that shows the relation between the identified concepts – Using formal concept analysis techniques for finding relations from the generated groups, where each node in the graph represents a descriptor(namely a group of synonyms terms) – Clustering of documents depending on the terms(and synonyms) including links to those with which has any relation(hyponymy or hyperonymy) Once the thesaurus is obtained by identifying its terms and the underlying relation between them, it is represented using SKOS vocabulary.www.sti-innsbruck.at
    • 6. User profiles • Defined with Friend of a Friend(FOAF) vocabulary (generated at registration time) – Containing personal data, interests and preferences of users • 2 Parts: – Public profile: data related to the users identity and affiliation – Private profile: user interests and preferences about the topic of the alerts he or she wishes to receive • Users must specify keywords and concepts that best define their information needs • This keywords are then compared with the concepts in the thesaurus; if there is an exact math, the introduced term will be return, otherwise the lexically most similar term. • The return term will be suggested to the user and added to its preferences, if this term satisfy he user expectations.www.sti-innsbruck.at
    • 7. Profile and RSS feeds generation processwww.sti-innsbruck.at
    • 8. Alert generation processwww.sti-innsbruck.at
    • 9. Questionswww.sti-innsbruck.at
    • 10. References1. J. M. Morales-del-Castillo: Assistant Professor of Information Science, Libraryand Information Science Department, University of Granada, Spain2. R. Pedraza-Jiménez: Assistant Professor of Information Science, Journalismand Audiovisual Communication Department, Pompeu Fabra University,Barcelona, Spain3. A. A. Ruíz: Full Professor of Information Science, Library and InformationScience Department, University of Granada.4. E. Peis: is Full Professor of Information Science, Library and InformationScience Department, University of Granada.5. E. Herrera-Viedma: Senior Lecturer in Computer Science, Computer Scienceand Artificial Intelligence Department, University of Granada.www.sti-innsbruck.at

    ×