SlideShare a Scribd company logo
1 of 35
Download to read offline
Folksonomies
Inhaltserschließung und Retrieval
           im Web 2.0
               und
         in Bibliotheken

             Dr. phil. Isabella Peters
     Heinrich-Heine-Universität Düsseldorf
     Abteilung für Informationswissenschaft

                                              1
        Uni Graz – 17. Dezember 2009
Folksonomies: Indexing without Rules

  “Anything goes”
  “Against method”, 1975 (Paul K. Feyerabend, Austro-American
  philosopher)


Tagging
• no rules
• no methods – or even against methods
• indexing a single document
   –   synonyms – why not? (New York – NY – Big Apple – … )
   –   homonyms – never heard! (not: Java [Programming Language] – Java
       [Island], but Java)
   –   translations – why not? (Singapore – Singapur – …)
   –   typing errors – nobody is perfect (Syngapur)
   –   hierarchical relations (hyponymy) – why not? (Düsseldorf –
       North Rhine-Westfalia – Germany)
   –   hierarchical relations (meronymy) – why not? (tree – branch – leaf)
                                                                             2
Indexing – in general




                        3
Tri-partite System of Folksonomies
Folksonomies consist always of 3 parts
1) document (resource)
2) prosumer (user)
3) tag




                                         4
Users – Tags - Documents


 shared users              thematically linked




 thematically linked       shared documents
                                             5
Shared Documents & Thematically
Linked Users


 more like this ...         thematically linked
   similar documents

 detection of documents


 more like me ...
   similar users

 detection of communities


                            shared documents
                                           6
More like me! Or: More like This User!
• starting point: single user (ego)
• processing
   – (1) tag-specific similarity
       • all tags of ego: a(t)
       • all tags of another user B: b(t)
       • common tags of ego and another user B: g(t)
   – (2) document-specific similarity
       • all tagged documents of ego: a(d)
       • all tagged documents of another user B: b(d)
       • common tagged documents of ego and another user B: g(d)
   – calculation of similarity
       • tag-specific: Jaccard-Sneath: Sim(tag; Ego,B) = g(t) / [a(t) + b(t) – g(t)]
       • document-specific: Jaccard-Sneath: Sim(doc; Ego,B) = g(d) / [a(d) + b(d) – g(d)]
       • ranking of Bi by similarity to ego (say, top 10 tag-specific and top 10 document-
         specific users)
       • merging of both lists (exclusion of duplicates)
       • cluster analysis (k-nearest neighbours, single linkage, complete linkage, group
         average linkage)
   – result presentation: social network of ego in the centre

                                                                                       7
More like me! Or: More like This User!


                                                        Sim(tag) = 0.45
                                                        Sim(doc) = 0.36

 Sim(tag) = 0.21
 Sim(doc) = 0.25
                    Sim(tag) = 0.33                                               Sim(tag) = 0.15
                    Sim(doc) = 0.29                                               Sim(doc) = 0.17
                                                                Sim(tag) = 0.08
                                                                Sim(doc) = 0.11


                                      Sim(tag) = 0.17
                                      Sim(doc) = 0.23




single linkage clustering        Sim(tag) = 0.65
(fictitious example)             Sim(doc) = 0.55                                              8
Narrow Folksonomies


• only one
tagger (the
content creator)
• no multiple
tagging




• example:
YouTube

                      Tags



                             9
Extended Narrow Folksonomies
• more than one tagger
• no multiple tagging
• example: Flickr                    Tags




 Source: Vander Wal (2005)     Add Tags Option
                                            10
Broad Folksonomies

•   more than one tagger
•   multiple tagging
•   example: Delicious




                                Tags
    Source: Vander Wal (2005)

                                       11
Folksonomies make use of
Collective Intelligence
Collective Intelligence
• “Wisdom of the Crowds” (Surowiecki)
• “Hive Minds” (Kroski) – “Vox populi” (Galton) – “Crowdsourcing”
• no discussions, diversity of opinions, decentralisation
• users tag a document independently from each other
• statistical aggregation of data

Collaborative Intelligence
• discussions and consensus
• prototype service: Wikipedia (but: 90 + 9 + 1 – rule)

“Madness of the Crowds”
• e.g., soccer fans – hooligans
• no diversity of opinion – no independence – no decentralisation –
  no (statistical) aggregation
                                                               12
Power Tags

•   Power Law Distribution   •    Inverse-logistic Distribution




Power Tags                       Power Tags



                                                                  13
Power Law Tag Distribution

                               Tags zu w w
                                        w .visitlondon.com
  Users
    70

    60                            Power Tags           f (x)= C / xa
    50

    40                                                 80/20-Rule

    30

    20
                                                       Long Tail

    10

     0




                                                                 t
                                                                n

                                                               en
                                     m
                                    nd




                                                             re




                                                               s
       on




                                                             ay
                                                              io




                                                             ra
                                            de
                  el


                           K




                                                            re
                                                            m
                                   is
                av




                                                           tu


                                                           at




                                                          nd
                                  la




                                                          id
                          U
     nd




                                                        nd
                                          ui




                                                         in
                                 ur




                                                       rm
                                ng




                                                        ul
              Tr




                                                        ol
                                         G




                                                      Lo
   Lo




                                                      ta
                               To




                                                      C




                                                     Lo
                                                      H
                                                     fo
                               E




                                                    er
                                                   In

                                                  nt

                                                                       Tags
                                                 E



                                                                         14
Source: http:// del.icio.us
As




                                                             0
                                                                 5
                                                                              10
                                                                                   15
                                                                                        20
                                                                                                     25
                                                                                                           30
                                                                                                                         35
                                           so
                                              cia
                                                  tio
                                                     ns
                                               Lib


                                                                                                                              Users
                                                   ra
                                          In          ry
                                   In       fo
                                     fo        r
                                       rm mat
                                         at          ion
                                           io
                                             ns
                                                 cie
                                                     nc
                                                        e




                                                                 Long Trunk
                                            Te        IA
                                              ch




Source: http:// del.icio.us
                                                 no
                                           Pr       lo
                                             of         gy
                                                es
                                                  sio
                                                      n
                                              Re al
                                                 se
                                                    ar
                                                       ch
                                               Us
                                                  ab
                                                     ilit
                                                         y
                                               Sc
                                                  ien
                                                       ce
                                                                                                            Power Tags




                                               Lib
                                                  ra
                               In                    rie
                                 fo                      s
                                   rm
                                     at
                                       ion We
                                          ar        b
                                             ch
                                               ite
                                                  ctu
                                                     re
                                        Or
                                           ga           IT
                                               niz
                                                                                                                                 Tags zu www.asis.org




                                                   at
                                                                                                                                                        Inverse-logistic Tag Distribution




                                          Ar          io
                                                         ns
                                              ch
                                                 ite
                                                     ct
                                          Or           u
                                              ga re
                                                  nz
                                                     at
                                                        ion
                                                                                         Long Tail




                                            Co
                                                mp
                                                                                                          f (x)=




                              In                     ut
                                fo                      e
                                  rm Con rs
                                     at           fe
                                                                                                          e




                                       io
                                         n_ renc
                                  In         ar            e
                                    fo         ch
                                      rm           ite
                                         at            ct
                                                          ur
                                            ion
                                                _s          e
                                                    cie
                                                        nc
                                                            e
                                                So
                                                    cie
                                                                                                           -C‘(x-1)b




                                                         ty
                      15
                              Tags
Use of Power Tags


• Power Tags as factor in relevance ranking
  documents tagged with Power Tags appear higher in
  ranking

• Power Tags as candidate tags for Tag Gardening
  which (semantic) relation do they have with co
                                               -
  occuring tags?




                                                   16
Benefits of Indexing with Folksonomies
• authentic user language – solution of the “vocabulary problem”
• actuality
• multiple interpretations – many perspectives – bridging the semantic gap
•   raise access to information resources
•   follow “desire lines” of users
•   cheap indexing method – shared indexing
•   the more taggers, the more the system becomes better – network effects
•   capable of indexing mass information on the Web
•   resources for development of knowledge organization systems
•   mass quality “control”
•   searching - browsing – serendipity
•   neologisms
•   identify communities and “small worlds”
•   collaborative recommender system
• make people sensitive to information indexing

                                                                      17
Disadvantages of Indexing with
Folksonomies
• absence of controlled vocabulary
• different basic levels (in the sense of Eleanor Rosch)
• different interests – loss of context information
•   language merging
•   hidden paradigmatic relations
•   merging of formal (bibliographical) and aboutness tags
•   no specific fields
•   tags make evaluations (“stupid”)
•   spam-tags
•   syncategoremata (user-specific tags, “me”)
•   performative tags (“to do”, “to read”)
•   other misleading keywords


    solution: Tag Gardening with methods of Information Linguistics, user
    collaboration in giving meaning to tags and combination with existing
    knowledge organization systems                                       18
Goal of Tag Gardening: Emergent
Semantics




Quelle: Peters, I., & Weller, K. (2008). Tag Gardening for Folksonomy Enrichment and     19
Maintenance. Webology, 5(3), Article 58, from http://www.webology.ir/2008/v5n3/a58.html.
Maintenance of KOS and Folksonomy

                                new terms – new relations




    Folksonomy                                                                            KOS




                                            Tag Gardening
Quelle: Christiaens, S. (2006). Metadata Mechanism: From Ontology to Folksonomy…and Back. Lecture   20
Notes in Computer Science, 4277, 199–207.
Feedback Loop in Practice:
Tagging of OPACs


2 possibilities:

• 1) tagging of resources within the library’s website

• 2) tagging of resources outside the library’s firewall




                                                         21
Tagging of OPACS: Within Library’s
Website: PennTags




                                     22
http://tags.library.upenn.edu/
Tagging of OPACS: Within Library’s
Website: Ann Arbor District Library




                                      23
http://www.aadl.org/catalog
Tagging of OPACS: Within Library’s
Website: University Library Hildesheim




                                              24
http://www.uni-hildesheim.de/mybib/all_tags
Tagging of OPACS: Within Library’s
Website

• advantages:

   – user behaviour can be directly observed and
     exploited for own applications

   – used knowledge organization system (KOS) can
     profit from user behaviour and user language

   – users will be “attracted” to the library

   – library will appear “trendy”
                                                    25
Tagging of OPACS: Within Library’s
Website

• disadvantages:

   – development and implementation (costs and
     manpower) of the tagging service have to be taken
     over from the library

   – if only users may tag: librarians may loose their
     work motivation or may have a feeling of
     uselessness

   – “lock in” effect of users
         -   -                   no “fresh” ideas
                                                         26
Tagging of Resources Outside the
Library‘s Firewall: LibraryThing




 http://www.librarything.com/search   27
Tagging of Resources Outside the
Library‘s Firewall: BibSonomy




 http://www.bibsonomy.org/         28
Tagging of Resources Outside the
Library‘s Firewall
• advantages:

   – development and implementation (costs and
     manpower) of the tagging service haven‘t to be
     taken over from the library

   – the library may profit from the “know- how” of the
     provider of the tagging system

   – users may profit from tagging activities of
     hundreds of other users    no lock in
                                      -

   – library appears “trendy”                         29
Tagging of Resources Outside the
Library‘s Firewall

• disadvantages

   – user behaviour cannot be observed or exploited

   – your users support other tagging service

   – used KOS cannot profit from user behaviour




                                                      30
Exkurs: Sentiment Tags
 • negative tags: “awful” – “foolish”, …
 • positive tags: “amazing” – “useful”, …
 • applicable for sentiment analysis of documents




Quelle: Yanbe, Y., Jatowt, A., Nakamura, S., & Tanaka, K. (2007). Can Social Bookmarking Enhance Search in the
                                                                                                          31
Web? In Proceedings of the 7th ACM/IEEE Joint Conference on Digital Libraries, Vancouver, Canada (pp. 107–116).
Summary

• knowing how folksonomies work is important for their
  adequate application in both

  – knowledge representation and

  – information retrieval

• knowing why folksonomies work is a secret ☺




                                                    32
Knowledge Representation and
Information Retrieval
• two sides of the same coin
• Immanuel Kant: Thoughts without content are
  empty, intuitions without concepts are blind...




                            Feedback
                              Loop




    Knowledge Representation            Information Retrieval
  without Information Retrieval is       without Knowledge
              empty.                   Representation is blind. 33
Folksonomies and
Knowledge Organization Systems
• two sides of the same coin
• no rivals- work best in combination!




                               Feedback
                                 Loop




 flexible, up-to-date, user-centric       precise, rigid, complete
                                                                     34
Viele Grüße aus Düsseldorf.




  Erschienen 2009 im
Verlag Saur, de Gruyter




    Kontakt: isabella.peters@uni duesseldorf.de
                               -
                                                  35

More Related Content

Similar to Folksonomies Indexing Und Retrieval In Bibliotheken

Gremlin: A Graph-Based Programming Language
Gremlin: A Graph-Based Programming LanguageGremlin: A Graph-Based Programming Language
Gremlin: A Graph-Based Programming LanguageMarko Rodriguez
 
Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...
Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...
Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...Andrea Wiggins
 
Graph Processing Applications @ HUG
Graph Processing Applications @ HUGGraph Processing Applications @ HUG
Graph Processing Applications @ HUGPraveen Sripati
 
KDD 2014 Presentation (Best Research Paper Award): Alias Topic Modelling (Red...
KDD 2014 Presentation (Best Research Paper Award): Alias Topic Modelling (Red...KDD 2014 Presentation (Best Research Paper Award): Alias Topic Modelling (Red...
KDD 2014 Presentation (Best Research Paper Award): Alias Topic Modelling (Red...Aaron Li
 
Graph Databases: Trends in the Web of Data
Graph Databases: Trends in the Web of DataGraph Databases: Trends in the Web of Data
Graph Databases: Trends in the Web of DataMarko Rodriguez
 
Problem-Solving using Graph Traversals: Searching, Scoring, Ranking, and Reco...
Problem-Solving using Graph Traversals: Searching, Scoring, Ranking, and Reco...Problem-Solving using Graph Traversals: Searching, Scoring, Ranking, and Reco...
Problem-Solving using Graph Traversals: Searching, Scoring, Ranking, and Reco...Marko Rodriguez
 

Similar to Folksonomies Indexing Und Retrieval In Bibliotheken (7)

Gremlin: A Graph-Based Programming Language
Gremlin: A Graph-Based Programming LanguageGremlin: A Graph-Based Programming Language
Gremlin: A Graph-Based Programming Language
 
Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...
Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...
Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...
 
Graph Processing Applications @ HUG
Graph Processing Applications @ HUGGraph Processing Applications @ HUG
Graph Processing Applications @ HUG
 
KDD 2014 Presentation (Best Research Paper Award): Alias Topic Modelling (Red...
KDD 2014 Presentation (Best Research Paper Award): Alias Topic Modelling (Red...KDD 2014 Presentation (Best Research Paper Award): Alias Topic Modelling (Red...
KDD 2014 Presentation (Best Research Paper Award): Alias Topic Modelling (Red...
 
Graph Databases: Trends in the Web of Data
Graph Databases: Trends in the Web of DataGraph Databases: Trends in the Web of Data
Graph Databases: Trends in the Web of Data
 
measuring carbon footprint_richards
measuring carbon footprint_richardsmeasuring carbon footprint_richards
measuring carbon footprint_richards
 
Problem-Solving using Graph Traversals: Searching, Scoring, Ranking, and Reco...
Problem-Solving using Graph Traversals: Searching, Scoring, Ranking, and Reco...Problem-Solving using Graph Traversals: Searching, Scoring, Ranking, and Reco...
Problem-Solving using Graph Traversals: Searching, Scoring, Ranking, and Reco...
 

More from Isabella Peters

Open Science & Altmetrics
Open Science & AltmetricsOpen Science & Altmetrics
Open Science & AltmetricsIsabella Peters
 
Impulsstatement barcamp wissenschaft 20_forschung neu denken
Impulsstatement barcamp wissenschaft 20_forschung neu denkenImpulsstatement barcamp wissenschaft 20_forschung neu denken
Impulsstatement barcamp wissenschaft 20_forschung neu denkenIsabella Peters
 
Crowdsourcing in Article Evaluation
Crowdsourcing in Article EvaluationCrowdsourcing in Article Evaluation
Crowdsourcing in Article EvaluationIsabella Peters
 
Folksonomies und Unternehmenskommunikation peters final
Folksonomies und Unternehmenskommunikation peters finalFolksonomies und Unternehmenskommunikation peters final
Folksonomies und Unternehmenskommunikation peters finalIsabella Peters
 
Folksonomies: In General and in Libraries
Folksonomies: In General and in LibrariesFolksonomies: In General and in Libraries
Folksonomies: In General and in LibrariesIsabella Peters
 
Web 2 0 Im Unternehmen Und Wissensmanagement
Web 2 0 Im Unternehmen Und WissensmanagementWeb 2 0 Im Unternehmen Und Wissensmanagement
Web 2 0 Im Unternehmen Und WissensmanagementIsabella Peters
 

More from Isabella Peters (6)

Open Science & Altmetrics
Open Science & AltmetricsOpen Science & Altmetrics
Open Science & Altmetrics
 
Impulsstatement barcamp wissenschaft 20_forschung neu denken
Impulsstatement barcamp wissenschaft 20_forschung neu denkenImpulsstatement barcamp wissenschaft 20_forschung neu denken
Impulsstatement barcamp wissenschaft 20_forschung neu denken
 
Crowdsourcing in Article Evaluation
Crowdsourcing in Article EvaluationCrowdsourcing in Article Evaluation
Crowdsourcing in Article Evaluation
 
Folksonomies und Unternehmenskommunikation peters final
Folksonomies und Unternehmenskommunikation peters finalFolksonomies und Unternehmenskommunikation peters final
Folksonomies und Unternehmenskommunikation peters final
 
Folksonomies: In General and in Libraries
Folksonomies: In General and in LibrariesFolksonomies: In General and in Libraries
Folksonomies: In General and in Libraries
 
Web 2 0 Im Unternehmen Und Wissensmanagement
Web 2 0 Im Unternehmen Und WissensmanagementWeb 2 0 Im Unternehmen Und Wissensmanagement
Web 2 0 Im Unternehmen Und Wissensmanagement
 

Recently uploaded

ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
Basic Intentional Injuries Health Education
Basic Intentional Injuries Health EducationBasic Intentional Injuries Health Education
Basic Intentional Injuries Health EducationNeilDeclaro1
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jisc
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
OSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & SystemsOSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & SystemsSandeep D Chaudhary
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxEsquimalt MFRC
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structuredhanjurrannsibayan2
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxannathomasp01
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxJisc
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
latest AZ-104 Exam Questions and Answers
latest AZ-104 Exam Questions and Answerslatest AZ-104 Exam Questions and Answers
latest AZ-104 Exam Questions and Answersdalebeck957
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the ClassroomPooky Knightsmith
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...pradhanghanshyam7136
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 

Recently uploaded (20)

ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Basic Intentional Injuries Health Education
Basic Intentional Injuries Health EducationBasic Intentional Injuries Health Education
Basic Intentional Injuries Health Education
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
OSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & SystemsOSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & Systems
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
latest AZ-104 Exam Questions and Answers
latest AZ-104 Exam Questions and Answerslatest AZ-104 Exam Questions and Answers
latest AZ-104 Exam Questions and Answers
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 

Folksonomies Indexing Und Retrieval In Bibliotheken

  • 1. Folksonomies Inhaltserschließung und Retrieval im Web 2.0 und in Bibliotheken Dr. phil. Isabella Peters Heinrich-Heine-Universität Düsseldorf Abteilung für Informationswissenschaft 1 Uni Graz – 17. Dezember 2009
  • 2. Folksonomies: Indexing without Rules “Anything goes” “Against method”, 1975 (Paul K. Feyerabend, Austro-American philosopher) Tagging • no rules • no methods – or even against methods • indexing a single document – synonyms – why not? (New York – NY – Big Apple – … ) – homonyms – never heard! (not: Java [Programming Language] – Java [Island], but Java) – translations – why not? (Singapore – Singapur – …) – typing errors – nobody is perfect (Syngapur) – hierarchical relations (hyponymy) – why not? (Düsseldorf – North Rhine-Westfalia – Germany) – hierarchical relations (meronymy) – why not? (tree – branch – leaf) 2
  • 3. Indexing – in general 3
  • 4. Tri-partite System of Folksonomies Folksonomies consist always of 3 parts 1) document (resource) 2) prosumer (user) 3) tag 4
  • 5. Users – Tags - Documents shared users thematically linked thematically linked shared documents 5
  • 6. Shared Documents & Thematically Linked Users more like this ... thematically linked similar documents detection of documents more like me ... similar users detection of communities shared documents 6
  • 7. More like me! Or: More like This User! • starting point: single user (ego) • processing – (1) tag-specific similarity • all tags of ego: a(t) • all tags of another user B: b(t) • common tags of ego and another user B: g(t) – (2) document-specific similarity • all tagged documents of ego: a(d) • all tagged documents of another user B: b(d) • common tagged documents of ego and another user B: g(d) – calculation of similarity • tag-specific: Jaccard-Sneath: Sim(tag; Ego,B) = g(t) / [a(t) + b(t) – g(t)] • document-specific: Jaccard-Sneath: Sim(doc; Ego,B) = g(d) / [a(d) + b(d) – g(d)] • ranking of Bi by similarity to ego (say, top 10 tag-specific and top 10 document- specific users) • merging of both lists (exclusion of duplicates) • cluster analysis (k-nearest neighbours, single linkage, complete linkage, group average linkage) – result presentation: social network of ego in the centre 7
  • 8. More like me! Or: More like This User! Sim(tag) = 0.45 Sim(doc) = 0.36 Sim(tag) = 0.21 Sim(doc) = 0.25 Sim(tag) = 0.33 Sim(tag) = 0.15 Sim(doc) = 0.29 Sim(doc) = 0.17 Sim(tag) = 0.08 Sim(doc) = 0.11 Sim(tag) = 0.17 Sim(doc) = 0.23 single linkage clustering Sim(tag) = 0.65 (fictitious example) Sim(doc) = 0.55 8
  • 9. Narrow Folksonomies • only one tagger (the content creator) • no multiple tagging • example: YouTube Tags 9
  • 10. Extended Narrow Folksonomies • more than one tagger • no multiple tagging • example: Flickr Tags Source: Vander Wal (2005) Add Tags Option 10
  • 11. Broad Folksonomies • more than one tagger • multiple tagging • example: Delicious Tags Source: Vander Wal (2005) 11
  • 12. Folksonomies make use of Collective Intelligence Collective Intelligence • “Wisdom of the Crowds” (Surowiecki) • “Hive Minds” (Kroski) – “Vox populi” (Galton) – “Crowdsourcing” • no discussions, diversity of opinions, decentralisation • users tag a document independently from each other • statistical aggregation of data Collaborative Intelligence • discussions and consensus • prototype service: Wikipedia (but: 90 + 9 + 1 – rule) “Madness of the Crowds” • e.g., soccer fans – hooligans • no diversity of opinion – no independence – no decentralisation – no (statistical) aggregation 12
  • 13. Power Tags • Power Law Distribution • Inverse-logistic Distribution Power Tags Power Tags 13
  • 14. Power Law Tag Distribution Tags zu w w w .visitlondon.com Users 70 60 Power Tags f (x)= C / xa 50 40 80/20-Rule 30 20 Long Tail 10 0 t n en m nd re s on ay io ra de el K re m is av tu at nd la id U nd nd ui in ur rm ng ul Tr ol G Lo Lo ta To C Lo H fo E er In nt Tags E 14 Source: http:// del.icio.us
  • 15. As 0 5 10 15 20 25 30 35 so cia tio ns Lib Users ra In ry In fo fo r rm mat at ion io ns cie nc e Long Trunk Te IA ch Source: http:// del.icio.us no Pr lo of gy es sio n Re al se ar ch Us ab ilit y Sc ien ce Power Tags Lib ra In rie fo s rm at ion We ar b ch ite ctu re Or ga IT niz Tags zu www.asis.org at Inverse-logistic Tag Distribution Ar io ns ch ite ct Or u ga re nz at ion Long Tail Co mp f (x)= In ut fo e rm Con rs at fe e io n_ renc In ar e fo ch rm ite at ct ur ion _s e cie nc e So cie -C‘(x-1)b ty 15 Tags
  • 16. Use of Power Tags • Power Tags as factor in relevance ranking documents tagged with Power Tags appear higher in ranking • Power Tags as candidate tags for Tag Gardening which (semantic) relation do they have with co - occuring tags? 16
  • 17. Benefits of Indexing with Folksonomies • authentic user language – solution of the “vocabulary problem” • actuality • multiple interpretations – many perspectives – bridging the semantic gap • raise access to information resources • follow “desire lines” of users • cheap indexing method – shared indexing • the more taggers, the more the system becomes better – network effects • capable of indexing mass information on the Web • resources for development of knowledge organization systems • mass quality “control” • searching - browsing – serendipity • neologisms • identify communities and “small worlds” • collaborative recommender system • make people sensitive to information indexing 17
  • 18. Disadvantages of Indexing with Folksonomies • absence of controlled vocabulary • different basic levels (in the sense of Eleanor Rosch) • different interests – loss of context information • language merging • hidden paradigmatic relations • merging of formal (bibliographical) and aboutness tags • no specific fields • tags make evaluations (“stupid”) • spam-tags • syncategoremata (user-specific tags, “me”) • performative tags (“to do”, “to read”) • other misleading keywords solution: Tag Gardening with methods of Information Linguistics, user collaboration in giving meaning to tags and combination with existing knowledge organization systems 18
  • 19. Goal of Tag Gardening: Emergent Semantics Quelle: Peters, I., & Weller, K. (2008). Tag Gardening for Folksonomy Enrichment and 19 Maintenance. Webology, 5(3), Article 58, from http://www.webology.ir/2008/v5n3/a58.html.
  • 20. Maintenance of KOS and Folksonomy new terms – new relations Folksonomy KOS Tag Gardening Quelle: Christiaens, S. (2006). Metadata Mechanism: From Ontology to Folksonomy…and Back. Lecture 20 Notes in Computer Science, 4277, 199–207.
  • 21. Feedback Loop in Practice: Tagging of OPACs 2 possibilities: • 1) tagging of resources within the library’s website • 2) tagging of resources outside the library’s firewall 21
  • 22. Tagging of OPACS: Within Library’s Website: PennTags 22 http://tags.library.upenn.edu/
  • 23. Tagging of OPACS: Within Library’s Website: Ann Arbor District Library 23 http://www.aadl.org/catalog
  • 24. Tagging of OPACS: Within Library’s Website: University Library Hildesheim 24 http://www.uni-hildesheim.de/mybib/all_tags
  • 25. Tagging of OPACS: Within Library’s Website • advantages: – user behaviour can be directly observed and exploited for own applications – used knowledge organization system (KOS) can profit from user behaviour and user language – users will be “attracted” to the library – library will appear “trendy” 25
  • 26. Tagging of OPACS: Within Library’s Website • disadvantages: – development and implementation (costs and manpower) of the tagging service have to be taken over from the library – if only users may tag: librarians may loose their work motivation or may have a feeling of uselessness – “lock in” effect of users - - no “fresh” ideas 26
  • 27. Tagging of Resources Outside the Library‘s Firewall: LibraryThing http://www.librarything.com/search 27
  • 28. Tagging of Resources Outside the Library‘s Firewall: BibSonomy http://www.bibsonomy.org/ 28
  • 29. Tagging of Resources Outside the Library‘s Firewall • advantages: – development and implementation (costs and manpower) of the tagging service haven‘t to be taken over from the library – the library may profit from the “know- how” of the provider of the tagging system – users may profit from tagging activities of hundreds of other users no lock in - – library appears “trendy” 29
  • 30. Tagging of Resources Outside the Library‘s Firewall • disadvantages – user behaviour cannot be observed or exploited – your users support other tagging service – used KOS cannot profit from user behaviour 30
  • 31. Exkurs: Sentiment Tags • negative tags: “awful” – “foolish”, … • positive tags: “amazing” – “useful”, … • applicable for sentiment analysis of documents Quelle: Yanbe, Y., Jatowt, A., Nakamura, S., & Tanaka, K. (2007). Can Social Bookmarking Enhance Search in the 31 Web? In Proceedings of the 7th ACM/IEEE Joint Conference on Digital Libraries, Vancouver, Canada (pp. 107–116).
  • 32. Summary • knowing how folksonomies work is important for their adequate application in both – knowledge representation and – information retrieval • knowing why folksonomies work is a secret ☺ 32
  • 33. Knowledge Representation and Information Retrieval • two sides of the same coin • Immanuel Kant: Thoughts without content are empty, intuitions without concepts are blind... Feedback Loop Knowledge Representation Information Retrieval without Information Retrieval is without Knowledge empty. Representation is blind. 33
  • 34. Folksonomies and Knowledge Organization Systems • two sides of the same coin • no rivals- work best in combination! Feedback Loop flexible, up-to-date, user-centric precise, rigid, complete 34
  • 35. Viele Grüße aus Düsseldorf. Erschienen 2009 im Verlag Saur, de Gruyter Kontakt: isabella.peters@uni duesseldorf.de - 35