Your SlideShare is downloading. ×
0
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Semantics in Social Tagging Systems
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Semantics in Social Tagging Systems

1,719

Published on

Presentation by Andreas Hotho about Bibsonomy at the DC-2008 Wikimedia Workshop on User Generated Metadata

Presentation by Andreas Hotho about Bibsonomy at the DC-2008 Wikimedia Workshop on User Generated Metadata

Published in: Technology
0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
1,719
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
57
Comments
0
Likes
2
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. Semantics in Social Tagging Systems Andreas Hotho Dominik Benz, Robert Jäschke, Beate Krause, Christoph Schmitz, Gerd Stumme Hertie-Lehrstuhl für Wissensverarbeitung Universität Kassel & Forschungszentrum L3S C. Cattuto, A. Baldassarri, V. Loreto, V. D. P. Servedio Physics Department, University of Roma “La Sapienza”, Italy
  • 2. Map of Web 2.0 artwork by R. Munroe http://xkcd.com/ Andreas Hotho 27.09.08 2
  • 3. Everybody is tagging…  simple and intuitive way to organize resources, immediately useful  uncontrolled vocabulary  however: evidence for converging vocabulary / emergent semantics due to  shared implicit knowledge  mutual influence of users  underlying social networks http://xkcd.com/ resource tag user Andreas Hotho 27.09.08 3
  • 4. Agenda  BibSonomy – a social bookmark and publication sharing system  0.4  Overview Tagging Systems quot;blogquot; quot;cssquot; quot;designquot; quot;linuxquot;  0.35 quot;musicquot; quot;newsquot; quot;programmingquot; quot;softwarequot;  0.3 quot;webquot;  0.25 rank  0.2  Semantics between Tags  0.15  0.1  0.05  0  2  4  6  8  10  12  14 month  Summary and Outlook Andreas Hotho 27.09.08 4
  • 5. BibSonomy ― a cooperative publication management system Large User Basis: We use the system  100.051 registered users  for our daily scientific work,  288.849 bookmarks  in European and other projects  258.633 publications  and for evaluating our algorithms.  + 986.458 publications from DBLP.  Integrated a.o. in Citavi and JabRef. http://www.bibsonomy.org Andreas Hotho 27.09.08 5
  • 6. Topic-specific collection of references (here: Social Network Analysis) Andreas Hotho 27.09.08 6
  • 7. Export in over 30 formats, including BibTeX and Endnote Andreas Hotho 27.09.08 7
  • 8. Generates publication lists for individuals, research groups, and projects Andreas Hotho 27.09.08 8
  • 9. Entry point for conference proceedings Andreas Hotho 27.09.08 9
  • 10. Basket functionality for libraries Andreas Hotho 27.09.08 10
  • 11. Back reference to the library Andreas Hotho 27.09.08 11
  • 12. Posting a new publication is easy:  Highlight reference  Click on “Post Publication” button Andreas Hotho 27.09.08 12
  • 13. Posting a new bookmark/publication:  Information Extraction (Mallet) fills form for you.  Just add your favorite tags. Andreas Hotho 27.09.08 13
  • 14. Posting a new bookmark/publication:  That’s it!  Other options:  Scrapers (> 60), eg for Citeseer, ACM Upload BibTeX  Enter information manually JabRef interface Andreas Hotho 27.09.08 14
  • 15. Agenda  BibSonomy – a social bookmark and publication sharing system  0.4  Overview Tagging Systems quot;blogquot; quot;cssquot; quot;designquot; quot;linuxquot;  0.35 quot;musicquot; quot;newsquot; quot;programmingquot; quot;softwarequot;  0.3 quot;webquot;  0.25 rank  0.2  Semantics between Tags  0.15  0.1  0.05  0  2  4  6  8  10  12  14 month  Summary and Outlook Andreas Hotho 27.09.08 15
  • 16. Social Tagging Systems / Delicious.com Andreas Hotho 27.09.08 16
  • 17. Social Tagging Systems  Simpy:  free, “nicer” design  special function: groups, a bookmark history function  Mister Wong:  Most popular system in Germany  special function: every post has links to „recommended“ web sites.  FURL and blinklist has a special rating function.  Feed Me Links has a function to add bookmarks by mail.  RawSugar provides an automatically generated hierarchy.  backflip and AllMyFavorites.net uses folders.  Chipmark, Spurl and Netvouz has tags and folders.  http://www.simpy.com/, http://www.mister-wong.de/, http://www.furl.net/, http:// www.blinklist.com/, http://feedmelinks.com/portal, http://www.rawsugar.com/, http:// www.backflip.com/, http://www.allmyfavorites.net/, https://www.chipmark.com/Main, http://www.spurl.net/, http://www.netvouz.com/ Andreas Hotho 27.09.08 17
  • 18. Social Cataloging Systems Andreas Hotho 27.09.08 18
  • 19. Social Cataloging Systems Andreas Hotho 27.09.08 19
  • 20. Social Cataloging Systems Andreas Hotho 27.09.08 20
  • 21. Social Cataloging Systems Andreas Hotho 27.09.08 21
  • 22. Social Cataloging Systems Andreas Hotho 27.09.08 22
  • 23. Social Cataloging Systems Andreas Hotho 27.09.08 23
  • 24. Agenda  BibSonomy – a social bookmark and publication sharing system  0.4  Overview Tagging Systems quot;blogquot; quot;cssquot; quot;designquot; quot;linuxquot;  0.35 quot;musicquot; quot;newsquot; quot;programmingquot; quot;softwarequot;  0.3 quot;webquot;  0.25 rank  0.2  Semantics between Tags  0.15  0.1  0.05  0  2  4  6  8  10  12  14 month  Summary and Outlook Andreas Hotho 27.09.08 24
  • 25. Andreas Hotho 27.09.08 25
  • 26. Most related tags by cooccurrence / cosine simlarity art web2.0 design photography illustration blog graphics ajax web tools blog webdesign freq news blog technology politics media daily howto tutorial reference tips linux programming video music funny tv software media ajax javascript web2.0 web programming webdesign tutorial howto programming reference design css javascript ajax programming css web webdesign art graphic creative print portfolios nice cosine web2.0 web2 web-2.0 webapp “web web_2.0 news blogs people weblog culture future howto how-to guide tutorials help how_to video entertainment awesome fun cool random ajax dhtml dom js ecmascript webdev tutorial tutorials tips coding code examples javascript webdevelopment webdev example examples webprogramming Andreas Hotho 27.09.08 26
  • 27. Semantic Grounding in WordNet  WordNet is a large lexical database for English.  Words with same meaning are grouped in synsets, which are ordered by an is-a hierarchy.  Introduction of single artificial root node enables application of graph-based similarity metrics between pairs of nouns / pairs of verbs.  Inclusion of top n del.icio.us tags in WordNet:  100: 82%  1,000: 79%  5,000: 69%  10,000: 61% Andreas Hotho 27.09.08 27
  • 28. Example of Semantic Grounding Wordnet Synset Hierarchy:  Original tag:  „java“ computers  Most similar tag: programming  Freq, folkrank: map design_patterns languages „programming“  Cosine: „python“ java python Grounded similarity Andreas Hotho 27.09.08 28
  • 29. shortest paths in WordNet random siblings length of shortest path to most related tag Andreas Hotho 27.09.08 29
  • 30. Results for delicious together with similarity pruning Andreas Hotho 27.09.08 30
  • 31. Results for delicious together with similarity pruning Andreas Hotho 27.09.08 31
  • 32. Association Rules ≅ transactions ≅ items  K1 = (U £ R, T, I1)  If users tag some resource with tag ti, they frequently also use tj for it.  Usage:  tag recommendations  learning implications (tag hierarchy) Andreas Hotho 27.09.08 32
  • 33. Association Rules  K2 = (T £ U, R, I2)  If users tag a resource ri with a particular tag, they frequently also use this tag for rj .  Usage:  finding communities  resource recommendations Andreas Hotho 27.09.08 33
  • 34. Association Rules  K2 = (T £ U, R, I2)  If users tag a resource ri with a particular tag, they frequently also use this tag for rj .  Usage:  finding communities Andreas Hotho resource recommendations  27.09.08 34
  • 35. Agenda  BibSonomy – a social bookmark and publication sharing system  0.4  Overview Tagging Systems quot;blogquot; quot;cssquot; quot;designquot; quot;linuxquot;  0.35 quot;musicquot; quot;newsquot; quot;programmingquot; quot;softwarequot;  0.3 quot;webquot;  0.25 rank  0.2  Semantics between Tags  0.15  0.1  0.05  0  2  4  6  8  10  12  14 month  Summary and Outlook Andreas Hotho 27.09.08 35
  • 36. Summary and Outlook  Our FolkRank algorithm supports search in folksonomies.  Relatedness measures on tags in folksonomies are a good basis to extract semantic relations  Trend detection in Social Bookmarking Systems  Tag Recommender allows to recommend user specific tags for new post  Detecting Spam is a major challenge  LogSonomies - analysing the structure of search engine query log files  Learning some kind of synsets, relations and hierarchy of tags Andreas Hotho 27.09.08 36
  • 37. Similar tags live on www.bibsonomy.org Thanks for your attention! contact: hotho@cs.uni-kassel.de Andreas Hotho 27.09.08 37

×