SlideShare a Scribd company logo
1 of 39
Tag Based Social
Recommender System(RS)
Project Mentor
Ms Pragya Dwivedi
By
Aditi Gupta
Anirudh kanjani
Abhinav Vasu Rawat
Kapil kumar
Ashutosh Singh
Agenda







Recommender systems- overview
Usefulness of Recommender Systems(RS)
Types of RS
Relation with information architecture
Limitations and possible improvements
Relation with Social Networking
What are they and Why are
they
Recommender systems provide a way for information
filtering that attempts to present information that are
likely of interest to the user. Its advantages are:
 Enhances user experience
◦ Assists users in finding information
◦ Reduces search and navigation time
 Increases productivity
 Increases credibility
 Mutually beneficial proposition
Types of Recommender
Systems(RS)
Content based RS
•
•
•
•
•

Highlights
Recommend items similar to those users
preferred in the past
User profiling is the key
Items/content usually denoted by keywords
Matching “user preferences” with “item
characteristics” works for textual information
Vector Space Model widely used
Content based RS
•
•
•
•
•
•

Limitations
Not all content is well represented by
keywords, e.g. images
Items represented by same set of features are
indistinguishable
Overspecialization: unrated items not shown
Users with thousands of purchases is a
problem
New user: No history available
Shouldn’t show items that are too different, or
too similar
Collaborative RS
•
•
•
•
•

Highlights
Use other users’ recommendations (ratings) to
judge item’s utility
Key is to find users/user groups whose interests
match with the current user
Vector Space model widely used (directions of
vectors are user specified ratings)
More users, more ratings: better results
Can account for items dissimilar to the ones seen
in the past too...ovielens.org
Collaborative RS
•

•
•
•

Limitations
Different users might use different scales.
Possible solution: weighted ratings, i.e.
deviations from average rating .
Finding similar users/user groups isn’t very easy.
New user: No preferences available.
New item: No ratings available.
Hybrid RS




Uses both content based and collaborative filtering.
Introduced to avoid the limitations found in both
content and collaborative methods.
Example: Netflix- makes recommendations by
comparing the watching and searching habits of
similar users (i.e. collaborative filtering) as well as
by offering movies that share characteristics with
films that a user has rated highly (content-based
filtering).
Other Variations of RS
Cluster Models
• Create clusters or groups.
• Put a customer into a category.
• Classification simplifies the task of user
matching.
• More scalability and performance.
• Lesser accuracy than normal collaborative
filtering method.
Possible Improvement in RS
Better understanding of users and items
–
Social network (social RS)
1. User level
• Highlighting interests, hobbies, and keywords
people have in common
2. Item level
• link the keywords to ecommerce (by RS
algorithms)
What is tag?
A tag is a piece of information that describes the
data or content that it is assigned to. Tags are nonhierarchical keywords used for Internet bookmarks,
digital images, videos, files and so on. A tag doesn't
carry any information or semantics.

Tagging serves many functions, including:

Classification

Marking ownership

Describing content type

Online identity
About tagging
Labeling and Tagging are done to aid in
classification, marking, ownership, noting boundaries
and indicating online identity. They may take the form
of words, images or marks.
Online & internet databases deploy them as a way
for publishers to help users to find content.
Where they are used?
Social bookmarking :- provides users to add tags to
their bookmarks.
 Flickr :- allows users to add their own text tags to
each of their pictures, constructing flexible & easy
metadata that makes pictures highly searchable.
 YouTube :- also implements tagging. They
categorise content using simple keywords. The
users add tags which are visible and themselves
link to other items that share that keyword tag.

Examples






Within a Blog : - Many blog systems allow authors
to add free-form tags to a post. For example, a post
may display that it has been tagged with baseball
and tickets.
For an event :- An official tag is a keyword adopted
by events to use in their web applications, such as
blog entries, photos of the event and persentation
slides.
In research :- Associate an item with a small no of
themes, then a group of tags for these themes can
be attached. In this way free form classification
allows author to manage large amounts of
information.
Tag types


Triple Tags : - Triple tag or Machine tag
uses a special tag to define extra semantics
information about the tag, making it more
meaningful for interpretation.
Triple tags comprise of - a namespace ,
a predicate & a value .
Tag types




Hash Tag : - Word or phrase prefixed with #. Form
of metadata tag. Short messages on social
networking such as twitter , facebook may be
tagged by putting #.
before important words.
Hash tag provides a means of grouping such
messages since one can search for hash tags and
get the set of messages that contain it.
Knowledge tag : - it is a type of meta information
that describes or defines some aspect of
information resource. They are
the type of
metadata that captures knowledge in the form of
descriptions, classification, comments, notes,
hyperlinks etc.
Information Retrieval Systems
Information retrieval is the activity of obtaining
information resources relevant to an information
need from collection of information resources.
Searches can be based on metadata or on full text.
The Information Retrieval
Cycle
Source
Selection

Resource
Query
Formulation

Query
Search

Ranked List
Selection

Documents

query reformulation,
relevance feedback

result

11/27/2013

Introduction to Information Retrieval

19
Search Process
Source
Selection

Resource
Query
Formulation

Query

Search

Indexing

Index

Ranked List

Selection

Documents

Results
Document Collection
Slide is from Jimmy Lin’s tutorial
11/27/2013

Introduction to Information Retrieval

20
Implementation-How
Recommender System Works
In case we use content based filtering
Cosine similarity formula is utilized as follows

Where wc and ws are TF-IDF weight vectors
Implementation-How
Recommender System Works
In case we use collaborative filtering Pearson similarity
formula is used as follows







sim(x,y)-similarity between user x and y
rx,s – rating for item “s” given by user “x”
ry,s – rating for item “s” given by user “y”
ry- mean of all ratings by user “y”
rx- mean of all ratings by user “x”
Implementation-How
Recommender System Works

Similarity Model
Vector-space model
This is a model that allows us to extract documents
based on the tags given by a user through a query.
Vector space model uses TF-IDF weights to
categorise the documents into relevant and nonrelevant ones. The end result is the document(s)
having best similarity with the tags given in the query.

11/27/2013

Introduction to Information Retrieval

24
The Vector-Space Model
Assume t distinct terms remain after preprocessing;
call them index terms or the vocabulary.
 These “orthogonal” terms form a vector space.
Dimension = t = |vocabulary|
 Each term, i, in a document or query, j, is given a
real-valued weight, wij.
 Both documents and queries are expressed as
t-dimensional vectors:
dj = (w1j, w2j, …, wtj)


25
Document Collection
A

collection of n documents can be represented in the
vector space model by a term-document matrix.
 An entry in the matrix corresponds to the “weight” of a
term in the document; zero means the term has no
significance in the document or it simply doesn’t exist in
the document.

T1 T2 ….
w11 w21 …

D1
wt1
D2 w12
wt2
:
:
:
:
Dn w1n
wtn

Tt

w22 …

:
:
w2n …

:
:
26
Issues for Vector Space Model
How to determine important words in a document?
◦ Word sense?
◦ Word n-grams (and phrases, idioms,…)  terms
 How to determine the degree of importance of a
term within a document and within the entire
collection?
 How to determine the degree of similarity between
a document and the query?
 In the case of the web, what is a collection and
what are the effects of links, formatting information,
etc.?


27
Term Weights: Term Frequency


More frequent terms in a document are more
important, i.e. more indicative of the topic.
fij = frequency of term i in document j



May want to normalize term frequency (TF) by
dividing by the frequency of the most common term
in the document:
TFij = fij / maxi{fij}

28
Term Weights: Inverse Document
Frequency





Terms that appear in many different documents are
less indicative of overall topic.
df i = document frequency of term i
= number of documents containing term i
IDFi = inverse document frequency of term i,
= log2 (N/ df i)
(N: total number of documents)
An indication of a term’s discrimination power.
Log used to dampen the effect relative to tf.

29
TF-IDF Weighting






A typical combined term importance indicator is TFIDF weighting:
wij = TFij -IDFi = TFij log2 (N/ dfi)
A term occurring frequently in the document but
rarely in the rest of the collection is given high weight.
Many other ways of determining term weights have
been proposed.
Experimentally, TF-IDF has been found to work well.

30
Computing TF-IDF - An
Example
Given a document containing terms with given
frequencies:
A(3), B(2), C(1)
Assume collection contains 10,000 documents and
document frequencies of these terms are:
A(50), B(1300), C(250)
Then:
A: TF = 3/3; IDF = log2(10000/50) = 7.6; TF-IDF =
7.6
B: TF= 2/3; IDF = log2 (10000/1300) = 2.9; TF-IDF =
2.0
C: TF= 1/3; IDF= log2 (10000/250) = 5.3; TF-IDF = 31
Performance and Correction
Measures
Precision- is the fraction of documents retrieved
that are relevant to the user’s information need.
 Recall- Recall is the fraction of the documents that
are relevant to the query that are successfully
retrieved
 F-Measure
 Mean Absolute Error(MAE)

Precision vs. Recall
All docs

Retrieved

Recall

| RelRetriev ed |
| Rel in Collection |

Relevant

Precision
11/27/2013

| RelRetriev ed |
| Retrieved |

Introduction to Information Retrieval

33
F-Measure
The weighted harmonic mean of precision and
recall , the traditional f- measure or balanced Fsource is
F-measure =

2 *precision*recall
(precision+recall)
Mean Absolute Error(MAE)
Mean absolute error for a set of queries is calculated
as average of the absolute difference between the
predicted rating and the actual rating for each query.

Where n is the total number of queries,
is the
prediction and is the true value and the absolute
error is
Datasets


We have studied the datasets of some popular
sites and have implemented basic functions like
Pearson similarity, Cosine similarity, Resnick
prediction formula and Tf-Idf model on them. The
datasets we studied are as follows:



MovieLens Dataset
Flickr Dataset


MovieLens Dataset
MovieLens is a recommender system and virtual
community website that recommends films based
on user-provided ratings.
 The dataset on which we have worked contains a
total of 1,00,000 ratings from 943 users on 1682
movie items.
 It was collected from September 19th, 1997 to April
22nd, 1998.
 The dataset includes file that has every entry in 4tuples <user_id><item_id><rating><timestamp>.

Flickr Dataset
Flickr is an image hosting and video hosting
website where people host images that they
embed in blogs and social media.
 The dataset we have used is MRFLICKR-25000
and it is a collection of 25000 images downloaded
from the social photography site Flickr through its
public API.
 The average number of tags per image is 8.94. In
the collection there are 1386 tags which occur in at
least 20 images.
 The dataset includes a meta-data folder named
“meta” that contains all the tags associated with a
particular image in a respective file.

Visit my blog for more
www.csekapil.wordpress.com
Motilal Nehru National institute of Tech.
Allahabad.(india)

More Related Content

What's hot

Recommender systems using collaborative filtering
Recommender systems using collaborative filteringRecommender systems using collaborative filtering
Recommender systems using collaborative filteringD Yogendra Rao
 
Email Classification
Email ClassificationEmail Classification
Email ClassificationXi Chen
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architectureLiang Xiang
 
Prediction of Reaction towards Textual Posts in Social Networks
Prediction of Reaction towards Textual Posts in Social NetworksPrediction of Reaction towards Textual Posts in Social Networks
Prediction of Reaction towards Textual Posts in Social NetworksMohamed El-Geish
 
Recommendation systems
Recommendation systems  Recommendation systems
Recommendation systems Badr Hirchoua
 
Email Classification - Why Should it Matter to You?
Email Classification - Why Should it Matter to You?Email Classification - Why Should it Matter to You?
Email Classification - Why Should it Matter to You?Sherpa Software
 
Social Recommender Systems
Social Recommender SystemsSocial Recommender Systems
Social Recommender Systemsguest77b0cd12
 
Selection of Tags for Tag Clouds
Selection of Tags for Tag CloudsSelection of Tags for Tag Clouds
Selection of Tags for Tag CloudsAakash Gupta
 
Recommending tags for new resources
Recommending tags for new resourcesRecommending tags for new resources
Recommending tags for new resourcesIJDKP
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender systemStanley Wang
 
Recommender systems
Recommender systemsRecommender systems
Recommender systemsTamer Rezk
 
IJSRED-V2I2P09
IJSRED-V2I2P09IJSRED-V2I2P09
IJSRED-V2I2P09IJSRED
 
Preference Elicitation Interface
Preference Elicitation InterfacePreference Elicitation Interface
Preference Elicitation Interface晓愚 孟
 
Recommendation engines
Recommendation enginesRecommendation engines
Recommendation enginesGeorgian Micsa
 
Summary of a Recommender Systems Survey paper
Summary of a Recommender Systems Survey paperSummary of a Recommender Systems Survey paper
Summary of a Recommender Systems Survey paperChangsung Moon
 
A Proposal on Social Tagging Systems Using Tensor Reduction and Controlling R...
A Proposal on Social Tagging Systems Using Tensor Reduction and Controlling R...A Proposal on Social Tagging Systems Using Tensor Reduction and Controlling R...
A Proposal on Social Tagging Systems Using Tensor Reduction and Controlling R...ijcsa
 
Collaborative Filtering 1: User-based CF
Collaborative Filtering 1: User-based CFCollaborative Filtering 1: User-based CF
Collaborative Filtering 1: User-based CFYusuke Yamamoto
 

What's hot (19)

Recommender systems using collaborative filtering
Recommender systems using collaborative filteringRecommender systems using collaborative filtering
Recommender systems using collaborative filtering
 
Collaborative filtering
Collaborative filteringCollaborative filtering
Collaborative filtering
 
Email Classification
Email ClassificationEmail Classification
Email Classification
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architecture
 
Prediction of Reaction towards Textual Posts in Social Networks
Prediction of Reaction towards Textual Posts in Social NetworksPrediction of Reaction towards Textual Posts in Social Networks
Prediction of Reaction towards Textual Posts in Social Networks
 
Recommendation systems
Recommendation systems  Recommendation systems
Recommendation systems
 
Email Classification - Why Should it Matter to You?
Email Classification - Why Should it Matter to You?Email Classification - Why Should it Matter to You?
Email Classification - Why Should it Matter to You?
 
Social Recommender Systems
Social Recommender SystemsSocial Recommender Systems
Social Recommender Systems
 
Selection of Tags for Tag Clouds
Selection of Tags for Tag CloudsSelection of Tags for Tag Clouds
Selection of Tags for Tag Clouds
 
Recommending tags for new resources
Recommending tags for new resourcesRecommending tags for new resources
Recommending tags for new resources
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender system
 
Recommender systems
Recommender systemsRecommender systems
Recommender systems
 
IJSRED-V2I2P09
IJSRED-V2I2P09IJSRED-V2I2P09
IJSRED-V2I2P09
 
Preference Elicitation Interface
Preference Elicitation InterfacePreference Elicitation Interface
Preference Elicitation Interface
 
Recommendation engines
Recommendation enginesRecommendation engines
Recommendation engines
 
Summary of a Recommender Systems Survey paper
Summary of a Recommender Systems Survey paperSummary of a Recommender Systems Survey paper
Summary of a Recommender Systems Survey paper
 
A Proposal on Social Tagging Systems Using Tensor Reduction and Controlling R...
A Proposal on Social Tagging Systems Using Tensor Reduction and Controlling R...A Proposal on Social Tagging Systems Using Tensor Reduction and Controlling R...
A Proposal on Social Tagging Systems Using Tensor Reduction and Controlling R...
 
Collaborative Filtering 1: User-based CF
Collaborative Filtering 1: User-based CFCollaborative Filtering 1: User-based CF
Collaborative Filtering 1: User-based CF
 
Collaborative filtering
Collaborative filteringCollaborative filtering
Collaborative filtering
 

Viewers also liked

Probablistic sampling group 3 assighnment
Probablistic sampling group 3 assighnmentProbablistic sampling group 3 assighnment
Probablistic sampling group 3 assighnmentShimelis Birhanu
 
RESLVE: Leveraging User Interest to Improve Entity Disambiguation on Short Text
RESLVE: Leveraging User Interest to Improve Entity Disambiguation on Short TextRESLVE: Leveraging User Interest to Improve Entity Disambiguation on Short Text
RESLVE: Leveraging User Interest to Improve Entity Disambiguation on Short TextElizabeth Murnane
 
Skew-symmetric matrix completion for rank aggregation
Skew-symmetric matrix completion for rank aggregationSkew-symmetric matrix completion for rank aggregation
Skew-symmetric matrix completion for rank aggregationDavid Gleich
 
Query formulation (chapter 1)
Query formulation (chapter 1)Query formulation (chapter 1)
Query formulation (chapter 1)Mohamed Rafique
 
15 μαρτιου 2010 ημεριδα γγκ ομιλια π. καλαποθαρακου
15 μαρτιου 2010 ημεριδα γγκ  ομιλια π. καλαποθαρακου15 μαρτιου 2010 ημεριδα γγκ  ομιλια π. καλαποθαρακου
15 μαρτιου 2010 ημεριδα γγκ ομιλια π. καλαποθαρακουΕ.Κ.ΠΟΙ.ΖΩ.
 
Query formulation process
Query formulation processQuery formulation process
Query formulation processmalathimurugan
 
Data Mining: an Introduction
Data Mining: an IntroductionData Mining: an Introduction
Data Mining: an IntroductionAli Abbasi
 
Cluster & multi satge random sampling
Cluster & multi satge random samplingCluster & multi satge random sampling
Cluster & multi satge random samplingrifansahDua1
 
Cluster and multistage sampling
Cluster and multistage samplingCluster and multistage sampling
Cluster and multistage samplingsuncil0071
 
Matrix Factorization In Recommender Systems
Matrix Factorization In Recommender SystemsMatrix Factorization In Recommender Systems
Matrix Factorization In Recommender SystemsYONG ZHENG
 
Branch And Bound and Beam Search Feature Selection Algorithms
Branch And Bound and Beam Search Feature Selection AlgorithmsBranch And Bound and Beam Search Feature Selection Algorithms
Branch And Bound and Beam Search Feature Selection AlgorithmsChamin Nalinda Loku Gam Hewage
 
Space Exploration Merit Badge Slide Show
Space Exploration Merit Badge Slide ShowSpace Exploration Merit Badge Slide Show
Space Exploration Merit Badge Slide ShowTodd Cella
 

Viewers also liked (16)

Html
HtmlHtml
Html
 
Week 4 Lecture Part 1
Week 4 Lecture Part 1Week 4 Lecture Part 1
Week 4 Lecture Part 1
 
Probablistic sampling group 3 assighnment
Probablistic sampling group 3 assighnmentProbablistic sampling group 3 assighnment
Probablistic sampling group 3 assighnment
 
RESLVE: Leveraging User Interest to Improve Entity Disambiguation on Short Text
RESLVE: Leveraging User Interest to Improve Entity Disambiguation on Short TextRESLVE: Leveraging User Interest to Improve Entity Disambiguation on Short Text
RESLVE: Leveraging User Interest to Improve Entity Disambiguation on Short Text
 
Skew-symmetric matrix completion for rank aggregation
Skew-symmetric matrix completion for rank aggregationSkew-symmetric matrix completion for rank aggregation
Skew-symmetric matrix completion for rank aggregation
 
Query formulation (chapter 1)
Query formulation (chapter 1)Query formulation (chapter 1)
Query formulation (chapter 1)
 
15 μαρτιου 2010 ημεριδα γγκ ομιλια π. καλαποθαρακου
15 μαρτιου 2010 ημεριδα γγκ  ομιλια π. καλαποθαρακου15 μαρτιου 2010 ημεριδα γγκ  ομιλια π. καλαποθαρακου
15 μαρτιου 2010 ημεριδα γγκ ομιλια π. καλαποθαρακου
 
Query formulation process
Query formulation processQuery formulation process
Query formulation process
 
IR
IRIR
IR
 
Data Mining: an Introduction
Data Mining: an IntroductionData Mining: an Introduction
Data Mining: an Introduction
 
Cluster & multi satge random sampling
Cluster & multi satge random samplingCluster & multi satge random sampling
Cluster & multi satge random sampling
 
Cluster and multistage sampling
Cluster and multistage samplingCluster and multistage sampling
Cluster and multistage sampling
 
Matrix Factorization In Recommender Systems
Matrix Factorization In Recommender SystemsMatrix Factorization In Recommender Systems
Matrix Factorization In Recommender Systems
 
Branch And Bound and Beam Search Feature Selection Algorithms
Branch And Bound and Beam Search Feature Selection AlgorithmsBranch And Bound and Beam Search Feature Selection Algorithms
Branch And Bound and Beam Search Feature Selection Algorithms
 
Space Exploration Merit Badge Slide Show
Space Exploration Merit Badge Slide ShowSpace Exploration Merit Badge Slide Show
Space Exploration Merit Badge Slide Show
 
Slideshare ppt
Slideshare pptSlideshare ppt
Slideshare ppt
 

Similar to Social recommender system

CS8091_BDA_Unit_III_Content_Based_Recommendation
CS8091_BDA_Unit_III_Content_Based_RecommendationCS8091_BDA_Unit_III_Content_Based_Recommendation
CS8091_BDA_Unit_III_Content_Based_RecommendationPalani Kumar
 
A Survey on Decision Support Systems in Social Media
A Survey on Decision Support Systems in Social MediaA Survey on Decision Support Systems in Social Media
A Survey on Decision Support Systems in Social MediaEditor IJCATR
 
A Survey on Decision Support Systems in Social Media
A Survey on Decision Support Systems in Social MediaA Survey on Decision Support Systems in Social Media
A Survey on Decision Support Systems in Social MediaEditor IJCATR
 
Projection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional DatasetsProjection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional DatasetsIRJET Journal
 
Context based Document Indexing and Retrieval using Big Data Analytics - A Re...
Context based Document Indexing and Retrieval using Big Data Analytics - A Re...Context based Document Indexing and Retrieval using Big Data Analytics - A Re...
Context based Document Indexing and Retrieval using Big Data Analytics - A Re...rahulmonikasharma
 
Context based Document Indexing and Retrieval using Big Data Analytics - A Re...
Context based Document Indexing and Retrieval using Big Data Analytics - A Re...Context based Document Indexing and Retrieval using Big Data Analytics - A Re...
Context based Document Indexing and Retrieval using Big Data Analytics - A Re...rahulmonikasharma
 
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...IRJET Journal
 
Text mining and analytics v6 - p1
Text mining and analytics   v6 - p1Text mining and analytics   v6 - p1
Text mining and analytics v6 - p1Dave King
 
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...IRJET Journal
 
VIT336 – Recommender System - Unit 3.pdf
VIT336 – Recommender System - Unit 3.pdfVIT336 – Recommender System - Unit 3.pdf
VIT336 – Recommender System - Unit 3.pdfArthyR3
 
An Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search TechniqueAn Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search Techniquepaperpublications3
 
IRJET- Finding Related Forum Posts through Intention-Based Segmentation
IRJET-  	  Finding Related Forum Posts through Intention-Based SegmentationIRJET-  	  Finding Related Forum Posts through Intention-Based Segmentation
IRJET- Finding Related Forum Posts through Intention-Based SegmentationIRJET Journal
 
Improving Annotations in Digital Documents using Document Features and Fuzzy ...
Improving Annotations in Digital Documents using Document Features and Fuzzy ...Improving Annotations in Digital Documents using Document Features and Fuzzy ...
Improving Annotations in Digital Documents using Document Features and Fuzzy ...IRJET Journal
 
Implementing Semantic Search
Implementing Semantic SearchImplementing Semantic Search
Implementing Semantic SearchPaul Wlodarczyk
 
Discovering User's Topics of Interest in Recommender Systems
Discovering User's Topics of Interest in Recommender SystemsDiscovering User's Topics of Interest in Recommender Systems
Discovering User's Topics of Interest in Recommender SystemsGabriel Moreira
 
Information Architecture
Information ArchitectureInformation Architecture
Information ArchitectureOlivier Tripet
 
Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...
Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...
Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...Gabriel Moreira
 
DITA, Semantics, Content Management, Dynamic Documents, and Linked Data – A M...
DITA, Semantics, Content Management, Dynamic Documents, and Linked Data – A M...DITA, Semantics, Content Management, Dynamic Documents, and Linked Data – A M...
DITA, Semantics, Content Management, Dynamic Documents, and Linked Data – A M...Paul Wlodarczyk
 

Similar to Social recommender system (20)

CS8091_BDA_Unit_III_Content_Based_Recommendation
CS8091_BDA_Unit_III_Content_Based_RecommendationCS8091_BDA_Unit_III_Content_Based_Recommendation
CS8091_BDA_Unit_III_Content_Based_Recommendation
 
A Survey on Decision Support Systems in Social Media
A Survey on Decision Support Systems in Social MediaA Survey on Decision Support Systems in Social Media
A Survey on Decision Support Systems in Social Media
 
A Survey on Decision Support Systems in Social Media
A Survey on Decision Support Systems in Social MediaA Survey on Decision Support Systems in Social Media
A Survey on Decision Support Systems in Social Media
 
Projection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional DatasetsProjection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional Datasets
 
Context based Document Indexing and Retrieval using Big Data Analytics - A Re...
Context based Document Indexing and Retrieval using Big Data Analytics - A Re...Context based Document Indexing and Retrieval using Big Data Analytics - A Re...
Context based Document Indexing and Retrieval using Big Data Analytics - A Re...
 
Context based Document Indexing and Retrieval using Big Data Analytics - A Re...
Context based Document Indexing and Retrieval using Big Data Analytics - A Re...Context based Document Indexing and Retrieval using Big Data Analytics - A Re...
Context based Document Indexing and Retrieval using Big Data Analytics - A Re...
 
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
 
Text mining and analytics v6 - p1
Text mining and analytics   v6 - p1Text mining and analytics   v6 - p1
Text mining and analytics v6 - p1
 
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...
 
VIT336 – Recommender System - Unit 3.pdf
VIT336 – Recommender System - Unit 3.pdfVIT336 – Recommender System - Unit 3.pdf
VIT336 – Recommender System - Unit 3.pdf
 
An Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search TechniqueAn Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search Technique
 
IRJET- Finding Related Forum Posts through Intention-Based Segmentation
IRJET-  	  Finding Related Forum Posts through Intention-Based SegmentationIRJET-  	  Finding Related Forum Posts through Intention-Based Segmentation
IRJET- Finding Related Forum Posts through Intention-Based Segmentation
 
Improving Annotations in Digital Documents using Document Features and Fuzzy ...
Improving Annotations in Digital Documents using Document Features and Fuzzy ...Improving Annotations in Digital Documents using Document Features and Fuzzy ...
Improving Annotations in Digital Documents using Document Features and Fuzzy ...
 
Implementing Semantic Search
Implementing Semantic SearchImplementing Semantic Search
Implementing Semantic Search
 
Discovering User's Topics of Interest in Recommender Systems
Discovering User's Topics of Interest in Recommender SystemsDiscovering User's Topics of Interest in Recommender Systems
Discovering User's Topics of Interest in Recommender Systems
 
Information Architecture
Information ArchitectureInformation Architecture
Information Architecture
 
Hc3612711275
Hc3612711275Hc3612711275
Hc3612711275
 
Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...
Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...
Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...
 
PhD defense
PhD defense PhD defense
PhD defense
 
DITA, Semantics, Content Management, Dynamic Documents, and Linked Data – A M...
DITA, Semantics, Content Management, Dynamic Documents, and Linked Data – A M...DITA, Semantics, Content Management, Dynamic Documents, and Linked Data – A M...
DITA, Semantics, Content Management, Dynamic Documents, and Linked Data – A M...
 

Recently uploaded

How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfakmcokerachita
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 

Recently uploaded (20)

How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.Compdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdf
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 

Social recommender system

  • 1. Tag Based Social Recommender System(RS) Project Mentor Ms Pragya Dwivedi By Aditi Gupta Anirudh kanjani Abhinav Vasu Rawat Kapil kumar Ashutosh Singh
  • 2. Agenda       Recommender systems- overview Usefulness of Recommender Systems(RS) Types of RS Relation with information architecture Limitations and possible improvements Relation with Social Networking
  • 3. What are they and Why are they Recommender systems provide a way for information filtering that attempts to present information that are likely of interest to the user. Its advantages are:  Enhances user experience ◦ Assists users in finding information ◦ Reduces search and navigation time  Increases productivity  Increases credibility  Mutually beneficial proposition
  • 5. Content based RS • • • • • Highlights Recommend items similar to those users preferred in the past User profiling is the key Items/content usually denoted by keywords Matching “user preferences” with “item characteristics” works for textual information Vector Space Model widely used
  • 6. Content based RS • • • • • • Limitations Not all content is well represented by keywords, e.g. images Items represented by same set of features are indistinguishable Overspecialization: unrated items not shown Users with thousands of purchases is a problem New user: No history available Shouldn’t show items that are too different, or too similar
  • 7. Collaborative RS • • • • • Highlights Use other users’ recommendations (ratings) to judge item’s utility Key is to find users/user groups whose interests match with the current user Vector Space model widely used (directions of vectors are user specified ratings) More users, more ratings: better results Can account for items dissimilar to the ones seen in the past too...ovielens.org
  • 8. Collaborative RS • • • • Limitations Different users might use different scales. Possible solution: weighted ratings, i.e. deviations from average rating . Finding similar users/user groups isn’t very easy. New user: No preferences available. New item: No ratings available.
  • 9. Hybrid RS    Uses both content based and collaborative filtering. Introduced to avoid the limitations found in both content and collaborative methods. Example: Netflix- makes recommendations by comparing the watching and searching habits of similar users (i.e. collaborative filtering) as well as by offering movies that share characteristics with films that a user has rated highly (content-based filtering).
  • 10. Other Variations of RS Cluster Models • Create clusters or groups. • Put a customer into a category. • Classification simplifies the task of user matching. • More scalability and performance. • Lesser accuracy than normal collaborative filtering method.
  • 11. Possible Improvement in RS Better understanding of users and items – Social network (social RS) 1. User level • Highlighting interests, hobbies, and keywords people have in common 2. Item level • link the keywords to ecommerce (by RS algorithms)
  • 12. What is tag? A tag is a piece of information that describes the data or content that it is assigned to. Tags are nonhierarchical keywords used for Internet bookmarks, digital images, videos, files and so on. A tag doesn't carry any information or semantics. Tagging serves many functions, including:  Classification  Marking ownership  Describing content type  Online identity
  • 13. About tagging Labeling and Tagging are done to aid in classification, marking, ownership, noting boundaries and indicating online identity. They may take the form of words, images or marks. Online & internet databases deploy them as a way for publishers to help users to find content.
  • 14. Where they are used? Social bookmarking :- provides users to add tags to their bookmarks.  Flickr :- allows users to add their own text tags to each of their pictures, constructing flexible & easy metadata that makes pictures highly searchable.  YouTube :- also implements tagging. They categorise content using simple keywords. The users add tags which are visible and themselves link to other items that share that keyword tag. 
  • 15. Examples    Within a Blog : - Many blog systems allow authors to add free-form tags to a post. For example, a post may display that it has been tagged with baseball and tickets. For an event :- An official tag is a keyword adopted by events to use in their web applications, such as blog entries, photos of the event and persentation slides. In research :- Associate an item with a small no of themes, then a group of tags for these themes can be attached. In this way free form classification allows author to manage large amounts of information.
  • 16. Tag types  Triple Tags : - Triple tag or Machine tag uses a special tag to define extra semantics information about the tag, making it more meaningful for interpretation. Triple tags comprise of - a namespace , a predicate & a value .
  • 17. Tag types   Hash Tag : - Word or phrase prefixed with #. Form of metadata tag. Short messages on social networking such as twitter , facebook may be tagged by putting #. before important words. Hash tag provides a means of grouping such messages since one can search for hash tags and get the set of messages that contain it. Knowledge tag : - it is a type of meta information that describes or defines some aspect of information resource. They are the type of metadata that captures knowledge in the form of descriptions, classification, comments, notes, hyperlinks etc.
  • 18. Information Retrieval Systems Information retrieval is the activity of obtaining information resources relevant to an information need from collection of information resources. Searches can be based on metadata or on full text.
  • 19. The Information Retrieval Cycle Source Selection Resource Query Formulation Query Search Ranked List Selection Documents query reformulation, relevance feedback result 11/27/2013 Introduction to Information Retrieval 19
  • 20. Search Process Source Selection Resource Query Formulation Query Search Indexing Index Ranked List Selection Documents Results Document Collection Slide is from Jimmy Lin’s tutorial 11/27/2013 Introduction to Information Retrieval 20
  • 21. Implementation-How Recommender System Works In case we use content based filtering Cosine similarity formula is utilized as follows Where wc and ws are TF-IDF weight vectors
  • 22. Implementation-How Recommender System Works In case we use collaborative filtering Pearson similarity formula is used as follows      sim(x,y)-similarity between user x and y rx,s – rating for item “s” given by user “x” ry,s – rating for item “s” given by user “y” ry- mean of all ratings by user “y” rx- mean of all ratings by user “x”
  • 24. Similarity Model Vector-space model This is a model that allows us to extract documents based on the tags given by a user through a query. Vector space model uses TF-IDF weights to categorise the documents into relevant and nonrelevant ones. The end result is the document(s) having best similarity with the tags given in the query. 11/27/2013 Introduction to Information Retrieval 24
  • 25. The Vector-Space Model Assume t distinct terms remain after preprocessing; call them index terms or the vocabulary.  These “orthogonal” terms form a vector space. Dimension = t = |vocabulary|  Each term, i, in a document or query, j, is given a real-valued weight, wij.  Both documents and queries are expressed as t-dimensional vectors: dj = (w1j, w2j, …, wtj)  25
  • 26. Document Collection A collection of n documents can be represented in the vector space model by a term-document matrix.  An entry in the matrix corresponds to the “weight” of a term in the document; zero means the term has no significance in the document or it simply doesn’t exist in the document. T1 T2 …. w11 w21 … D1 wt1 D2 w12 wt2 : : : : Dn w1n wtn Tt w22 … : : w2n … : : 26
  • 27. Issues for Vector Space Model How to determine important words in a document? ◦ Word sense? ◦ Word n-grams (and phrases, idioms,…)  terms  How to determine the degree of importance of a term within a document and within the entire collection?  How to determine the degree of similarity between a document and the query?  In the case of the web, what is a collection and what are the effects of links, formatting information, etc.?  27
  • 28. Term Weights: Term Frequency  More frequent terms in a document are more important, i.e. more indicative of the topic. fij = frequency of term i in document j  May want to normalize term frequency (TF) by dividing by the frequency of the most common term in the document: TFij = fij / maxi{fij} 28
  • 29. Term Weights: Inverse Document Frequency    Terms that appear in many different documents are less indicative of overall topic. df i = document frequency of term i = number of documents containing term i IDFi = inverse document frequency of term i, = log2 (N/ df i) (N: total number of documents) An indication of a term’s discrimination power. Log used to dampen the effect relative to tf. 29
  • 30. TF-IDF Weighting     A typical combined term importance indicator is TFIDF weighting: wij = TFij -IDFi = TFij log2 (N/ dfi) A term occurring frequently in the document but rarely in the rest of the collection is given high weight. Many other ways of determining term weights have been proposed. Experimentally, TF-IDF has been found to work well. 30
  • 31. Computing TF-IDF - An Example Given a document containing terms with given frequencies: A(3), B(2), C(1) Assume collection contains 10,000 documents and document frequencies of these terms are: A(50), B(1300), C(250) Then: A: TF = 3/3; IDF = log2(10000/50) = 7.6; TF-IDF = 7.6 B: TF= 2/3; IDF = log2 (10000/1300) = 2.9; TF-IDF = 2.0 C: TF= 1/3; IDF= log2 (10000/250) = 5.3; TF-IDF = 31
  • 32. Performance and Correction Measures Precision- is the fraction of documents retrieved that are relevant to the user’s information need.  Recall- Recall is the fraction of the documents that are relevant to the query that are successfully retrieved  F-Measure  Mean Absolute Error(MAE) 
  • 33. Precision vs. Recall All docs Retrieved Recall | RelRetriev ed | | Rel in Collection | Relevant Precision 11/27/2013 | RelRetriev ed | | Retrieved | Introduction to Information Retrieval 33
  • 34. F-Measure The weighted harmonic mean of precision and recall , the traditional f- measure or balanced Fsource is F-measure = 2 *precision*recall (precision+recall)
  • 35. Mean Absolute Error(MAE) Mean absolute error for a set of queries is calculated as average of the absolute difference between the predicted rating and the actual rating for each query. Where n is the total number of queries, is the prediction and is the true value and the absolute error is
  • 36. Datasets  We have studied the datasets of some popular sites and have implemented basic functions like Pearson similarity, Cosine similarity, Resnick prediction formula and Tf-Idf model on them. The datasets we studied are as follows:  MovieLens Dataset Flickr Dataset 
  • 37. MovieLens Dataset MovieLens is a recommender system and virtual community website that recommends films based on user-provided ratings.  The dataset on which we have worked contains a total of 1,00,000 ratings from 943 users on 1682 movie items.  It was collected from September 19th, 1997 to April 22nd, 1998.  The dataset includes file that has every entry in 4tuples <user_id><item_id><rating><timestamp>. 
  • 38. Flickr Dataset Flickr is an image hosting and video hosting website where people host images that they embed in blogs and social media.  The dataset we have used is MRFLICKR-25000 and it is a collection of 25000 images downloaded from the social photography site Flickr through its public API.  The average number of tags per image is 8.94. In the collection there are 1386 tags which occur in at least 20 images.  The dataset includes a meta-data folder named “meta” that contains all the tags associated with a particular image in a respective file. 
  • 39. Visit my blog for more www.csekapil.wordpress.com Motilal Nehru National institute of Tech. Allahabad.(india)