Slideshow ire

Team:
Anand Agrawal [201205671]
Ashutosh Singla [201101106]
Diwas Joshi [201305573]
D Anil Kumar [201250812]

Abstract
 We present a tag recommendation model for collaborative
bookmarking systems.
 Suggesting most relevant tags for a given URL and its
description.
 We are using Lucene index and clustering based approach
to determine the same.

Problem Statement
 Design a tag recommendation system which will form a tag
cloud from a given corpus.
 The tag recommendation problem can be described as follows:
For a given post P whose user is U and resource is R, a set of tags
are suggested as tags for the post.
 The commonly used approach to choose the tags is rule-based
and classification-based methods, but both of them have
defects: rule-based approach relies on expert experience and
manual efforts to set up the rules and tuning the parameters;
classification-based is restrict to the fix of tag space and is
inefficient when it is treated as a multi-label problem.

Related Work
 Some of the previous work in tag recommendation area has been done in
content-based and collaborative approach.
 In the content-based approach, a system exploits some textual source with
Information Retrieval-related techniques in order to extract relevant N-
grams from the text.

Approach
 We started with some pre-processing of given training to make it
more suitable for crawling purposes.
 Then we crawl the URLs from given training dataset to extract
the web content like text, pdf, html document etc. and
normalize it to remove unwanted tags.
 Then we use Lucene to Index the crawled data.
 We are using similarity score based approach and clustering
based approach. We created clusters of similar links such that
each link in each cluster is similar to each other link the cluster
based on a pre-determined approach.
 Finally for each group we find the most popular tags.

Approach… (continued)
For the Extraction of candidate tags we are using
following sources::
 URL given by the user
 From the user's previously tagged resources
 From the given description
 Word related tags which are extracted from description
 For Ranking we are using user history. The groups most
similar to the link and description by user are identified and
the tags to these groups are become tag for the link.

Theory
As a part of our clustering model we are calculating clusters on
following different events:
 Grouping the links based on their similarity to other links
 Weighing the groups on their popularity in user's link and
description
 Giving more weight to title tag in over all data
 How much tag is related to words in given description

References
 Tag recommendation by machine learning with textual and social
features Xian Chen · Hyoseop Shin Received: 5 February 2011 /
Revised: 4 January 2012 / Accepted: 5 March 2012 / Published online: 1
April 2012 © Springer Science+Business Media, LLC 2012
 A Tag Recommendation System based on contents Ning Zhang, Yuan
Zhang, and Jie Tang Knowledge Engineering Group Department of
Computer Science and Technology, Tsinghua University, Beijing,
China zntsinghua1117@gmail.com, fancyzy0526@gmail.com and
jietang@tsinghua.edu.cn
 AutoTag: A Collaborative Approach to Automated Tag Assignment for
Weblog Posts Gilad Mishne ISLA, University of Amsterdam Kruislaan
403, 1098SJ Amsterdam, The Netherlands
 STaR: a Social Tag Recommender System Cataldo Musto, Fedelucio
Narducci, Marco de Gemmis, Pasquale Lops, and Giovanni Semeraro
Department of Computer Science, University of Bari “Aldo Moro”, Italy
{musto,narducci,degemmis,lops,semeraro}@di.uniba.it

Slideshow ire

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (12)

Similar to Slideshow ire

Similar to Slideshow ire (20)

Recently uploaded

Recently uploaded (20)

Slideshow ire