Starke2017 - Effective User Interface Designs to Increase Energy-efficient Be...Alain Starke
Presentation on our long paper for the #RecSys2017 conference on Recommender Systems, Como, Presented by Alain Starke. It shows how the psychometric Rasch model can enhance user recommendations in the energy domain.
In collaboration with Martijn Willemsen & Chris Snijders - Eindhoven University of Technology.
ACM ICTIR 2019 Slides - Santa Clara, USAIadh Ounis
Talk entitled "Unifying Explicit and Implicit Feedback for Rating Prediction and Ranking Recommendation Tasks" presented at the ACM ICTIR 2019, Santa Clara. 2019.
Reference:
Jadidinejad, A. , Macdonald, C. and Ounis, I. (2019) Unifying Explicit and Implicit Feedback for Rating Prediction and Ranking Recommendation Tasks. In: 5th ACM SIGIR International Conference on the Theory of Information Retrieval, Santa Clara, CA, USA, 02-05 Oct 2019
URL: https://dl.acm.org/citation.cfm?id=3344225
bulk ieee projects in pondicherry,ieee projects in pondicherry,final year ieee projects in pondicherry
Nexgen Technology Address:
Nexgen Technology
No :66,4th cross,Venkata nagar,
Near SBI ATM,
Puducherry.
Email Id: praveen@nexgenproject.com.
www.nexgenproject.com
Mobile: 9751442511,9791938249
Telephone: 0413-2211159.
NEXGEN TECHNOLOGY as an efficient Software Training Center located at Pondicherry with IT Training on IEEE Projects in Android,IEEE IT B.Tech Student Projects, Android Projects Training with Placements Pondicherry, IEEE projects in pondicherry, final IEEE Projects in Pondicherry , MCA, BTech, BCA Projects in Pondicherry, Bulk IEEE PROJECTS IN Pondicherry.So far we have reached almost all engineering colleges located in Pondicherry and around 90km
A Survey on Sentiment Analysis and Opinion MiningIJSRD
In Today’s world, the social media has given web users a place for expressing and sharing their thoughts and opinions on different topics or events. For this purpose, the opinion mining has gained the importance. Sentiment classification and Opinion Mining is the study of people’s opinion, emotions, attitude towards the product, services, etc. Sentiment Analysis and Opinion Mining are the two interchangeable terms. There are various approaches and techniques exist for Sentiment Analysis like Naïve Bayes, Decision Trees, Support Vector Machines, Random Forests, Maximum Entropy, etc. Opinion mining is a useful and beneficial way to scientific surveys, political polls, market research and business intelligence, etc. This paper presents a literature review of various techniques used for opinion mining and sentiment analysis.
Starke2017 - Effective User Interface Designs to Increase Energy-efficient Be...Alain Starke
Presentation on our long paper for the #RecSys2017 conference on Recommender Systems, Como, Presented by Alain Starke. It shows how the psychometric Rasch model can enhance user recommendations in the energy domain.
In collaboration with Martijn Willemsen & Chris Snijders - Eindhoven University of Technology.
ACM ICTIR 2019 Slides - Santa Clara, USAIadh Ounis
Talk entitled "Unifying Explicit and Implicit Feedback for Rating Prediction and Ranking Recommendation Tasks" presented at the ACM ICTIR 2019, Santa Clara. 2019.
Reference:
Jadidinejad, A. , Macdonald, C. and Ounis, I. (2019) Unifying Explicit and Implicit Feedback for Rating Prediction and Ranking Recommendation Tasks. In: 5th ACM SIGIR International Conference on the Theory of Information Retrieval, Santa Clara, CA, USA, 02-05 Oct 2019
URL: https://dl.acm.org/citation.cfm?id=3344225
bulk ieee projects in pondicherry,ieee projects in pondicherry,final year ieee projects in pondicherry
Nexgen Technology Address:
Nexgen Technology
No :66,4th cross,Venkata nagar,
Near SBI ATM,
Puducherry.
Email Id: praveen@nexgenproject.com.
www.nexgenproject.com
Mobile: 9751442511,9791938249
Telephone: 0413-2211159.
NEXGEN TECHNOLOGY as an efficient Software Training Center located at Pondicherry with IT Training on IEEE Projects in Android,IEEE IT B.Tech Student Projects, Android Projects Training with Placements Pondicherry, IEEE projects in pondicherry, final IEEE Projects in Pondicherry , MCA, BTech, BCA Projects in Pondicherry, Bulk IEEE PROJECTS IN Pondicherry.So far we have reached almost all engineering colleges located in Pondicherry and around 90km
A Survey on Sentiment Analysis and Opinion MiningIJSRD
In Today’s world, the social media has given web users a place for expressing and sharing their thoughts and opinions on different topics or events. For this purpose, the opinion mining has gained the importance. Sentiment classification and Opinion Mining is the study of people’s opinion, emotions, attitude towards the product, services, etc. Sentiment Analysis and Opinion Mining are the two interchangeable terms. There are various approaches and techniques exist for Sentiment Analysis like Naïve Bayes, Decision Trees, Support Vector Machines, Random Forests, Maximum Entropy, etc. Opinion mining is a useful and beneficial way to scientific surveys, political polls, market research and business intelligence, etc. This paper presents a literature review of various techniques used for opinion mining and sentiment analysis.
Analysis of Textual Data Classification with a Reddit Comments DatasetAdamBab
McGill COMP551, Applied Machine Learning Course
The main objective of this project is to categorize comments from the American social
news aggregation, web content rating, and discussion website – Reddit.
FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSISmlaij
Sentiment analysis and Opinion mining has emerged as a popular and efficient technique for information retrieval and web data analysis. The exponential growth of the user generated content has opened new horizons for research in the field of sentiment analysis. This paper proposes a model for sentiment analysis of movie reviews using a combination of natural language processing and machine learning approaches. Firstly, different data pre-processing schemes are applied on the dataset. Secondly, the behaviour of twoclassifiers, Naive Bayes and SVM, is investigated in combination with different feature selection schemes to
obtain the results for sentiment analysis. Thirdly, the proposed model for sentiment analysis is extended to
obtain the results for higher order n-grams.
Data analysis chapter 18 from the companion website for educational researchYamith José Fandiño Parra
This is a slide show of chapter 18 from Educational Research: Competencies for Analysis and Applications. Primarily intended for instructor use in the classroom, it is also available for students’ study use or to review as an advance organizer before class lectures or discussions.
Vinayaka : A Semi-Supervised Projected Clustering Method Using Differential E...ijseajournal
Differential Evolution (DE) is an algorithm for evolutionary optimization. Clustering problems have been
solved by using DE based clustering methods but these methods may fail to find clusters hidden in
subspaces of high dimensional datasets. Subspace and projected clustering methods have been proposed in
literature to find subspace clusters that are present in subspaces of dataset. In this paper we propose
VINAYAKA, a semi-supervised projected clustering method based on DE. In this method DE optimizes a
hybrid cluster validation index. Subspace Clustering Quality Estimate index (SCQE index) is used for
internal cluster validation and Gini index gain is used for external cluster validation in the proposed hybrid
cluster validation index. Proposed method is applied on Wisconsin breast cancer dataset
I am Joshua M. I am a Statistics Assignment Expert at statisticsassignmenthelp.com. I hold a Masters in Statistics from, Michigan State University, USA
I have been helping students with their homework for the past 5 years. I solve assignments related to Statistics.
Visit statisticsassignmenthelp.com or email info@statisticsassignmenthelp.com.
You can also call on +1 678 648 4277 for any assistance with Statistics Assignments.
A guide to understand and application of Research Methodology for a research paper writing. This presentation has been prepared for a live webinar organised on 8th May, 2021.
Statistics is all about facts and ratio, well it is a lot more than that and understanding every bit of it requires right statistics assignment help from some expert sources. We at helpmeinhomework are one such homework helping source providing adequate help whenever necessary.
A Decision Tree Based Classifier for Classification & Prediction of Diseasesijsrd.com
In this paper, we are proposing a modified algorithm for classification. This algorithm is based on the concept of the decision trees. The proposed algorithm is better then the previous algorithms. It provides more accurate results. We have tested the proposed method on the example of patient data set. Our proposed methodology uses greedy approach to select the best attribute. To do so the information gain is used. The attribute with highest information gain is selected. If information gain is not good then again divide attributes values into groups. These steps are done until we get good classification/misclassification ratio. The proposed algorithms classify the data sets more accurately and efficiently.
Modern Perspectives on Recommender Systems and their Applications in MendeleyKris Jack
Presentation given for one of Pearson's Data Research teams. It motivates the use of recommender systems, describes common approaches to building and evaluating them and gives examples of how they are used in Mendeley. Thanks to Maya Hristakeva for creating some of the slides.
Analysis of Textual Data Classification with a Reddit Comments DatasetAdamBab
McGill COMP551, Applied Machine Learning Course
The main objective of this project is to categorize comments from the American social
news aggregation, web content rating, and discussion website – Reddit.
FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSISmlaij
Sentiment analysis and Opinion mining has emerged as a popular and efficient technique for information retrieval and web data analysis. The exponential growth of the user generated content has opened new horizons for research in the field of sentiment analysis. This paper proposes a model for sentiment analysis of movie reviews using a combination of natural language processing and machine learning approaches. Firstly, different data pre-processing schemes are applied on the dataset. Secondly, the behaviour of twoclassifiers, Naive Bayes and SVM, is investigated in combination with different feature selection schemes to
obtain the results for sentiment analysis. Thirdly, the proposed model for sentiment analysis is extended to
obtain the results for higher order n-grams.
Data analysis chapter 18 from the companion website for educational researchYamith José Fandiño Parra
This is a slide show of chapter 18 from Educational Research: Competencies for Analysis and Applications. Primarily intended for instructor use in the classroom, it is also available for students’ study use or to review as an advance organizer before class lectures or discussions.
Vinayaka : A Semi-Supervised Projected Clustering Method Using Differential E...ijseajournal
Differential Evolution (DE) is an algorithm for evolutionary optimization. Clustering problems have been
solved by using DE based clustering methods but these methods may fail to find clusters hidden in
subspaces of high dimensional datasets. Subspace and projected clustering methods have been proposed in
literature to find subspace clusters that are present in subspaces of dataset. In this paper we propose
VINAYAKA, a semi-supervised projected clustering method based on DE. In this method DE optimizes a
hybrid cluster validation index. Subspace Clustering Quality Estimate index (SCQE index) is used for
internal cluster validation and Gini index gain is used for external cluster validation in the proposed hybrid
cluster validation index. Proposed method is applied on Wisconsin breast cancer dataset
I am Joshua M. I am a Statistics Assignment Expert at statisticsassignmenthelp.com. I hold a Masters in Statistics from, Michigan State University, USA
I have been helping students with their homework for the past 5 years. I solve assignments related to Statistics.
Visit statisticsassignmenthelp.com or email info@statisticsassignmenthelp.com.
You can also call on +1 678 648 4277 for any assistance with Statistics Assignments.
A guide to understand and application of Research Methodology for a research paper writing. This presentation has been prepared for a live webinar organised on 8th May, 2021.
Statistics is all about facts and ratio, well it is a lot more than that and understanding every bit of it requires right statistics assignment help from some expert sources. We at helpmeinhomework are one such homework helping source providing adequate help whenever necessary.
A Decision Tree Based Classifier for Classification & Prediction of Diseasesijsrd.com
In this paper, we are proposing a modified algorithm for classification. This algorithm is based on the concept of the decision trees. The proposed algorithm is better then the previous algorithms. It provides more accurate results. We have tested the proposed method on the example of patient data set. Our proposed methodology uses greedy approach to select the best attribute. To do so the information gain is used. The attribute with highest information gain is selected. If information gain is not good then again divide attributes values into groups. These steps are done until we get good classification/misclassification ratio. The proposed algorithms classify the data sets more accurately and efficiently.
Modern Perspectives on Recommender Systems and their Applications in MendeleyKris Jack
Presentation given for one of Pearson's Data Research teams. It motivates the use of recommender systems, describes common approaches to building and evaluating them and gives examples of how they are used in Mendeley. Thanks to Maya Hristakeva for creating some of the slides.
Mobile Recommendation Engine
collaborative filtering and content based approach in hybrid manner then Genetic Algorithm for Enhancement of the Recommendation Engine. by this marketers also will get the unique characteristics of the product that must be created and also recommend to the user.
Best Practices in Recommender System ChallengesAlan Said
Recommender System Challenges such as the Netflix Prize, KDD Cup, etc. have contributed vastly to the development and adoptability of recommender systems. Each year a number of challenges or contests are organized covering different aspects of recommendation. In this tutorial and panel, we present some of the factors involved in successfully organizing a challenge, whether for reasons purely related to research, industrial challenges, or to widen the scope of recommender systems applications.
The International Journal of Engineering and Science (The IJES)theijes
The International Journal of Engineering & Science is aimed at providing a platform for researchers, engineers, scientists, or educators to publish their original research results, to exchange new ideas, to disseminate information in innovative designs, engineering experiences and technological skills. It is also the Journal's objective to promote engineering and technology education. All papers submitted to the Journal will be blind peer-reviewed. Only original articles will be published.
Similar to Reputation Model Based on Rating Data and Application in Recommender Systems (20)
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Round table discussion of vector databases, unstructured data, ai, big data, real-time, robots and Milvus.
A lively discussion with NJ Gen AI Meetup Lead, Prasad and Procure.FYI's Co-Found
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
Adjusting OpenMP PageRank : SHORT REPORT / NOTESSubhajit Sahu
For massive graphs that fit in RAM, but not in GPU memory, it is possible to take
advantage of a shared memory system with multiple CPUs, each with multiple cores, to
accelerate pagerank computation. If the NUMA architecture of the system is properly taken
into account with good vertex partitioning, the speedup can be significant. To take steps in
this direction, experiments are conducted to implement pagerank in OpenMP using two
different approaches, uniform and hybrid. The uniform approach runs all primitives required
for pagerank in OpenMP mode (with multiple threads). On the other hand, the hybrid
approach runs certain primitives in sequential mode (i.e., sumAt, multiply).
Reputation Model Based on Rating Data and Application in Recommender Systems
1. Reputation Model Based On
Rating Data and Application in
Recommender Systems
PhD Final Seminar
Student Name: Ahmad Abdel-Hafez
Principle Supervisor: Assoc. Prof. Yue Xu
Associate Supervisor: Assoc. Prof. Dian Tjondronegoro
Panel Members:
Assoc. Prof. Yue Xu (Chair)
Assoc. Prof. Richi Nayak (EECS)
Dr. Ernest Foo (EECS)
Assoc. Prof. Dian Tjondronegoro (IS)
2. Presentation Contents
• Background on reputation models and
reputation-aware recommender systems.
• Research problems and objectives.
• Contributions.
• Normal distribution-based reputation model.
• Beta distribution-based reputation model.
• Reputation-aware recommender system.
• Evaluating the proposed methods
• Conclusions.
3. Background - Reputation Models
• Many websites nowadays provide rating
systems for customers in order to rate
available products.
• Reputation systems provide methods for
collecting and aggregating users’ opinions in
order to calculate products’ reputations.
• The Naïve aggregation method is the
arithmetic mean of ratings method.
6. Background - Reputation Models
1. Weighted Mean:
– uses weight for ratings which represent:
• Reviewer reputation, expertise, or reliability [1,2,5,7].
• The time when the rating was given, assuming that
newer ratings should have more weight [1,3].
2. Bayesian Reputation Models:
– Jøsang and Haller [3] introduced a multinomial
Bayesian probability distribution reputation
system based on Dirichlet probability distribution.
7. Background - Reputation Models
3. Fuzzy Reputation Models:
– Bharadwaj and Al-Shamri [4] proposed a fuzzy
computational model for trust and reputation, which
uses fuzzy rules in order to aggregate the calculated
reputation values.
4. Other Reputation Models
– Flow Reputation Models: The Google PageRank
calculates page reputation using the number of links
to this page from other pages [5]
– Probabilistic reputation models: TRAVOS calculates
user trust based on past direct transactions between
users. [6]
8. Background - Reputation-Aware
Recommender Systems
• On the other hand, recommender systems are
widely used on websites that contain massive
number of elements where personalisation
becomes a necessity.
• Most of the available recommender systems
focus on users’ preferences and rarely involve
products reputation in the recommendation
process.
9. Literature Review – Reputation-Aware
Recommender Systems
1. Recently, Ku and Tai [15] investigated the effect of
recommender system and reputation system on
purchase intentions regarding recommended
products.
2. Jøsang et al. [11] Proposed Cascading Minimum
Common Belief Fusion method to combine
reputation scores with recommendation scores.
3. Most recently, Wang et al. [8] proposed a trust-
based probabilistic recommendation model. The
trust used is for products, which is obtained
based on product reputations and purchase
frequencies
10. Research Problems - Problem 1:
Sparse Dataset
• Sparse dataset is the dataset with the majority of
its items having small rating count.
• Reputation systems depend on the historical
feedback to generate reputation scores for items.
Therefore, when the available feedback is sparse
it becomes more difficult to produce accurate
reputation scores for items.
Research Question 1:
How to use statistical data in the rating aggregation process to
enhance the accuracy of reputation scores over sparse dataset?
11. Research Problems - Problem 2:
Item Popularity
• An item is considered popular if it has large
ratings count in comparison to other items
ratings count in the same dataset.
• A reputation score is assumed to reflect the
popularity of an item, specifically; unpopular
items should not have high reputation scores.
Research Question 2:
How to reflect the item popularity, presented by the count of
ratings of an item in the reputation calculation process?
12. Research Problems - Problem 3:
Reputation and Recommendation
• In most Collaborative Filtering (CF) based
recommender systems, items’ reputations
were not considered as part of the
recommendation process. And that may
produce a recommendation for an item with
low reputation score, which is not likely to be
consumed by the user.
Research Question 3:
How to incorporate item reputation into recommendation process to
provide more accurate product recommendations for users?
13. Research Objectives
• Objective 1: To provide a literature review
about the reputation system, and reputation-
aware recommender systems.
• Objective 2: To propose a new reputation
model that deals with the sparsity problem.
• Objective 3: To propose a new reputation
model which considers items popularity in the
item reputation score.
14. Research Objectives
• Objective 4: To propose a new method for
merging items reputation with the
recommender system generated lists.
• Objective 5: To conduct experiments and
evaluate the performance of the proposed
approaches.
15. Contributions- In Reputation Models
• We propose two novel reputation models:
– The normal distribution-based reputation model
with uncertainty (NDRU), which employs rating
level frequencies, and uncertainty factors and
produces more accurate reputation scores over
sparse datasets.
– The beta distribution based reputation model
(BetaDR), which uses the item relative ratings
count in its rating aggregation process and beats
the state-of-the-art methods in reputation scores
accuracy using several well-known datasets. .
16. Contributions- In Recommender
Systems
• We propose reputation-aware top-n
recommender system:
– We adopt the concept of voting systems and propose
the weighted Borda count (WBC) method to combine
item reputations with item recommendation scores.
– We propose using personalised reputation generated
list of items.
– We propose a method to calculate user coherence,
and use it to determine the contribution weights of
item reputations and recommendation scores.
17. Definitions
• Rating level represents the number of possible
rating values that can be assigned to a specific
item by a user.
• In the five stars rating system we have 5 rating
levels. More instances of rating levels 1 and 2
than rating level 4 and 5 indicate that the item is
not favoured by a larger number of customers.
• Usually the middle rating levels such as 3 in a
rating scale [1-5] system is the most frequent
rating level (we call these rating levels “Popular
Rating Levels”) and 1 and 5 are the least frequent
levels (we call these levels “Rare Rating Levels”).
18. Motivation
• Using weighted mean reputation models, if we
don’t consider other factors such as time and
user credibility, then the weight for each rating is
1
𝑛
, where 𝑛 is the number of ratings to an item.
Weights for 7 ratings
Ratings
Rating
Weight
Level
Weight
2 0.1429
0.5714
2 0.1429
2 0.1429
2 0.1429
3 0.1429 0.1429
5 0.1429
0.2857
5 0.1429
3.0
19. Normal Distribution-based
Reputation Model (NDR)
Weights for 100 ratings Unified Weights for 100 ratings with each score has same frequency (=20)
• Our method can be described as weighted mean;
where the weights will be generated using normal
distribution function.
21. Weighting Based on the Normal
Distribution
• The weights to the ratings will be calculated using
the normal distribution density function
• where 𝑎𝑖 is the weight for the rating at index 𝑖 ,
𝑖 = 0, … , 𝑛 − 1 , 𝜇 is the mean, 𝜎 is the standard
deviation and . 𝑘 is the number of levels in the
rating system
𝑎𝑖 =
1
𝜎 2𝜋
𝑒
−
𝑥 𝑖−𝜇
2
2𝜎2
𝑥𝑖 =
(𝑘−1)×𝑖
𝑛−1
+ 1
22. Calculating The NDR Reputation Score
• The final reputation score is calculated as
weighted mean for each rating level. Where
𝐿𝑊 𝑙
is called level weight
𝑤𝑖 =
𝑎𝑖
𝑗=0
𝑛−1
𝑎𝑗
,
𝑖=0
𝑛−1
𝑤𝑖 = 1
𝑁𝐷𝑅 𝑝 =
𝑙=1
𝑘
𝑙 × 𝐿𝑊 𝑙
𝐿𝑊 𝑙
=
𝑗=0
𝑅 𝑙 −1
𝑤𝑗
𝑙
23. Enhanced NDR model by adding
uncertainty (NDRU)
• We do a slight modification to our proposed
NDR method by combining uncertainty factor,
which is important to deal with sparse
dataset.
𝑁𝐷𝑅𝑈 𝑝1 =
𝑙=1
𝑘
𝑙 ×
𝑛 × 𝐿𝑊 𝑙
+ 𝐶 × 𝑏
𝐶 + 𝑛
• 𝐶 = 2 is a priori constant and 𝑏 =
1
𝑘
is a base
rate for any of the 𝑘 rating values.
24. Motivation - Item Popularity Problem
• Less popular items (item 1) is expected to
have lower reputation scores than popular
items (item 2) with similar ratings distribution.
Rating
Frequency
Item 1 Item 2
1 1 25
2 1 25
3 1 25
4 1 25
5 5 125
Count 9 225
Mean 3.89 3.89
NDR 4.089 4.052
Median 5 5
25. Beta Distribution-Based
Reputation Model (BetaDR)
• A weighted mean method where the weights
will be generated using the beta distribution
probability density function.
• The beta distribution is flexible to produce
different distribution shapes which reflect the
weighting tendency suitable to every item.
• The proposed model considers statistical
information of the dataset, including the
ratings count.
26. Weighting Using the BetaDR
• The weights to the ratings will be calculated
using the beta-distribution probability density
function
• Equation (2) is used to evenly deploy the
values of 𝑥𝑖; providing that 0 < 𝑥𝑖< 1, where
𝑥0 = 0.01 and 𝑥 𝑛−1 = 0.99.
Beta(𝑥𝑖) =
Γ 𝛼+𝛽
Γ 𝛼 Γ 𝛽
𝑥𝑖
𝛼−1
1 − 𝑥𝑖
𝛽−1
(1)
𝑥𝑖 =
0.98×𝑖
𝑛−1
+ 0.01 (2)
27. Symmetric vs Asymmetric Shapes
• The symmetric shapes will ensure the reputation
model is fair and unbiased. Occurs when ( = )
• The chart bellow represents examples of the beta
distributions, where and > 1
0
0.5
1
1.5
2
2.5
3
1 2 3 4 5 6 7
Index of ratings
Shape 1
Shape 2
Shape 3
𝛼
< 𝛽
𝛼
= 𝛽
𝛼
> 𝛽
28. Symmetric Beta-Shapes
• The different symmetric shapes of the Beta distribution
• The x-axis represents ratings indexes and y-axis
represents weights.
0
0.04
0.08
0.12
0.16
1 3 5 7 9 11 13 15 17 19
Beta Distribution Probability Density Function (PDF)
U Shape
(α=ß=0.5)
Universal
Distribution
(α=ß=1)
Bell Shape
(α=ß=5)
29. Shape Parameters Equation
𝛼 = 𝛽 =
𝜇
𝜎
2
× 𝐼𝑅𝑅𝐶 , 𝜎 ≠ 0
𝐼𝑅𝑅𝐶 , 𝜎 = 0
(3)
• We use symmetric shape for the beta
distribution all the times. 𝛼 = 𝛽
• 𝛼 = 𝛽 < 1 ∶ The Beta-distribution will
generate “U” shape
• 𝛼 = 𝛽 > 1 ∶ The Beta-distribution will
generate “Bell” shape
30. IRRC
• 𝐼𝑅𝑅𝐶 (Item Relative Rating Count): is a ratio of
the ratings count of item 𝑃 to the average of
ratings for all items in the dataset.
𝐼𝑅𝑅𝐶 =
𝑛 𝑖
𝑛 𝑖
, 𝑛𝑖 = 𝑖∈𝑃 𝑛 𝑖
𝑃
33. • Recommenders focus on generating personalized
results, without perceiving the global opinions of
users about the recommended items.
• The reputation of an item reflects the quality of
an item, which could affect a user opinion.
• We propose a method to combine the
recommender and reputation systems to
enhance the accuracy of the top-n recommender
results.
Reputation-Aware Recommender
System
34. A Block Diagram of the Proposed
Reputation-Aware Recommender System
User
Profiles
User Profile
Ratings
Users Similarity
Generating Nearest
Neighbours
Historical Data
Personalised Items’
Reputations Recommendation
ranked list of items
Combined list of Item
Recommendations
Item Reputations
Item
Profiles
Item Profile
Ratings
Reputation ranked
list of items
Item Clustering
35. Personalized Item Reputation
• The top items on the reputation-based list are not
necessarily the items that a particular user likes.
• We cluster items based on user ratings or based
on item categories.
• The personalized reputation is defined as the
degrading process for all the items in the
reputation-ranked list that do not belong to the
user preference.
𝑃𝐼𝑅 𝑢,𝑝 =
𝑆(𝑝), 𝑝 ∈ 𝐶𝑖, 𝐶𝑖 ∈ 𝐹𝑢
0, 𝑂𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
36. Weighted Borda-Count
• The BC method gives each item a number of
points corresponding to his rank in each list.
• We introduce a weight to the traditional BC
method to emphasize difference between the
two lists.
𝑊𝐵𝐶 𝑝 = 𝜔 × 𝐵𝐶𝑟𝑒𝑐 𝑝 + 1 − 𝜔 × 𝐵𝐶𝑟𝑒𝑝 𝑝
𝐵𝐶(𝑝) = 𝐵𝐶𝑟𝑒𝑐(𝑝) + 𝐵𝐶𝑟𝑒𝑝(𝑝)
38. User Coherence
ω 𝑢 =
𝑐∈𝐶
𝑝∈𝐼(𝑢,𝑐) 𝑟𝑢,𝑝 − 𝑟𝑢,𝑐
2
𝐼(𝑢, 𝑐)
𝐶
𝐶𝑜ℎ𝑒𝑟𝑒𝑛𝑐𝑒 𝑢 = −
𝑐∈𝐶
𝑝∈𝐼(𝑢,𝑐)
𝑟𝑢,𝑝 − 𝑟𝑢,𝑐
2
• In order to define the value of ω which will
enhance the accuracy of the recommender
system, we propose to use user coherence.
• User coherence is defined as the stability of user
ratings levels given to items with the same
category [14].
• A coherent user has no problem with getting
accurate recommendations, unlike incoherent
users
39. Evaluating the Proposed Reputation
Models
• Hypothesis 1: Embedding item relative rating
count, and the standard deviation of items
ratings in calculating items reputations will
produce more accurate reputation scores
using dense datasets.
• Hypothesis 2: Using uncertainty parameter,
alongside with the rating levels frequencies in
calculating items reputations will produce
more accurate reputation scores when the
dataset is sparse.
40. Experiment 1: Ratings prediction
• The first experiment is to predict an item rating
using the item reputation score.
• The hypothesis is that the more accurate the
reputation model the closer the scores it
generates to actual users’ ratings.
• The mean absolute error (MAE) metric will be
used to measure the prediction accuracy.
𝑀𝐴𝐸 =
1
𝑃 𝑖=1
𝑚 𝑟∈𝑅 𝑝
𝑟𝑝 − 𝑟
𝑅 𝑝
41. Experiment 2: Item Ranking List
Similarity
• We compare two lists of items ranked based on
their reputation scores generated using different
methods.
• 𝑛 𝑑 and 𝑛 𝑐 is the number of discordant and
concordant pairs between the two lists
respectively.
𝜏 =
𝑛 𝑐 − 𝑛 𝑑
1
2
𝑛 𝑛 − 1
𝑛 𝑑 = 𝑖, 𝑗 𝐴 𝑖 < 𝐴 𝑗 , 𝐵 𝑖 > 𝐵(𝑗)}
𝑛 𝑐 = 𝑖, 𝑗 𝐴 𝑖 < 𝐴 𝑗 , 𝐵 𝑖 < 𝐵(𝑗)}
42. Experiment 3: Item Ranking Accuracy
• The IMDb website provides a special
calculation for the top-250 movies of all time.
• We compare the top-250 movies generated by
each one of the implemented reputation
models with the IMDb produced top-250
movies
𝐴𝑃 =
𝑖∈𝐶 𝑃@𝑖
𝐸
𝐺 𝑝@𝑡 =
5 × 1 −
𝐼 𝑝,𝑖𝑚𝑑𝑏 − 𝐼 𝑝,𝑟𝑒𝑝
𝑡
, 𝑝 ∈ Top−t𝑖𝑚𝑑𝑏
0, 𝑝 ∉ Top−t𝑖𝑚𝑑𝑏
𝐷𝐶𝐺 𝑝@𝑡 =
𝑥=1
𝑡
2 𝐺 𝑝@𝑡
− 1
log2 𝑥 + 1
𝑛𝐷𝐶𝐺 𝑝@𝑡 =
𝐷𝐶𝐺 𝑝@𝑡
𝐼𝐷𝐶𝐺 𝑝@𝑡
43. Experiment 4: Reputation-Aware
Recommender Accuracy
• We implement the traditional
user-based CF as the top-n
recommender system.
• The method we use to
combine the reputation
models with recommender
system is the proposed
weighted Borda count method
(WBC) presented in Chapter 5.
𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛 =
Relevant ⋂ Recommended
Recommended
𝑅𝑒𝑐𝑎𝑙𝑙 =
Relevant ⋂ Recommended
Relevant
𝐹1−𝑆𝑐𝑜𝑟𝑒 = 2 ×
𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛 × 𝑅𝑒𝑐𝑎𝑙𝑙
𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛 + 𝑅𝑒𝑐𝑎𝑙𝑙
Merging Method
Reputation Model
Recommender
System
44. Baseline Models
• Naïve method: arithmetic mean.
• IMDb: a true Bayesian estimation.
• Dirichlet reputation model [3] (2007).
• Fuzzy reputation model [5] (2009).
• PerContRep model [7] (2014).
• Trusti Model [8] (2015).
45. Datasets
• Dense
• Sparse
Dataset #Users #Items #Ratings #Rating Levels
Average of Ratings Count
Per Item (ARCPI)
ML-100K 943 1682 100,000 5 59.453
ML-1M 6,040 3,952 1,000,209 5 253.089
ML-10M 71,567 10,681 10,000,054 10 936.246
IMDb - 13,479 385,581,991 10 28,606.127
Dataset #Users #Items #Ratings
#Rating
Levels
Sparsity ARCPI
Book Crossing
Dataset
17,854 113,481 277,906 10 0.99998 2.44892
Only 4 ratings per
movie (4RPM)
1361 3,706 14261 5 0.99717 3.84808
Only 6 ratings per
movie (6RPM)
1760 3,706 21054 5 0.99677 5.68105
Only 8 ratings per
movie (8RPM)
2098 3,706 27723 5 0.99643 7.48057
47. Results of Item Ranking List Similarity Experiment
0
0.2
0.4
0.6
0.8
1
0% 20% 40% 60% 80% 100%
Top X% of the ranked lists used in similarity calculation
NDR
NDRU
BetaDR
0
0.2
0.4
0.6
0.8
1
0% 20% 40% 60% 80% 100%
Top X% of the ranked lists used in similarity calculation
NDR
NDRU
BetaDR
0
0.2
0.4
0.6
0.8
1
0% 20% 40% 60% 80% 100%
Top X% of the ranked lists used in similarity calculation
NDR
NDRU
BetaDR
0
0.2
0.4
0.6
0.8
1
0% 20% 40% 60% 80% 100%
Top X% of the ranked lists used in similarity calculation
NDR
NDRU
BetaDR
Kendall similarities with the naïve method using 100K-ML dataset
Kendall similarities with the naïve method using 100K-ML dataset
Kendall similarities with the Trusti method using 8RPM dataset
Kendall similarities with the Trusti method using 8RPM dataset
50. Evaluating the Reputation-Aware
Recommender System
• Hypothesis 3: The accuracy of recommender
systems can be enhanced by combining the
item reputation factor using WBC as a merging
method. The accuracy enhancement takes
place with both sparse and dense datasets.
51. Recommender Systems and
Reputation Models Employed
• We use the BetaDR reputation model for the
dense dataset evaluation part. In the sparse
evaluation part we used the NDRU reputation
model.
• We use two recommender systems for the top-n
recommendation experiment:
– The traditional user-based CF [12].
– The reliability-aware recommender system [13].
Merging Method
Reputation Model
Recommender
System
52. Datasets
• For the dense datasets, we used the ML-100K
and the ML-1M datasets described before.
MovieLens 5% (ML5) MovieLens 10% (ML10)
Number of ratings 6,515 13,077
Sparsity 0.99589 0.99175
Minimum number of ratings per
user
5 10
Maximum number of ratings per
user
36 73
Average number of ratings per
user
6.849 13.867
Minimum number of ratings per
movie
0 0
Maximum number of ratings per
movie
59 114
Average number of ratings per
movie
3.840 7.774
56. Conclusions
• The first reputation model we propose is the NDRU
model.
– It uses the normal distribution to generate ratings weights.
– It employs the uncertainty factor.
– It is more accurate when used with sparse datasets.
• The second reputation model we propose is the
BetaDR model.
– It uses the beta distribution to generate the weights for
the ratings.
– It uses the item relative rating count (IRRC), and the
standard deviation of item’s ratings and its mean.
– This model proved to produce more accurate results when
used with dense dataset.
57. Conclusions (Cont.)
• We propose the weighted Borda count (WBC)
method to combine reputation scores with
recommender systems in order to enhance the
accuracy of the top-n recommendations.
• We propose to generate personalized reputation
scores for each user to purify reputation scores
and make them useful in recommender systems.
• We noticed that merging reputation models with
recommenders has the potential to enhance the
recommendation accuracy.
58. Relevant Publications
• Journal Papers
1. Abdel-Hafez, A., & Xu, Y. (2013). A survey of user modelling in social media websites. In Computer and Information
Science, 6(4), pp. 59-71.
2. Abdel-Hafez, A., Xu, Y., & Jøsang, A. (2015). A normal-distribution based rating aggregation method for generating
product reputations. In Web Intelligence. 13(1), pp. 43-51. IOS Press.
• Conference Papers
3. Abdel-Hafez, A., Xu, Y., & Tjondronegoro, D. (2012). Product reputation model: an opinion mining based approach.
Paper presented at the 1st International Workshop on Sentiment Discovery from Affective Data (SDAD’12), p16-20,
CEUR workshop.
4. Abdel-Hafez, A., & Xu, Y. (2013). Ontology-based product's reputation model. Paper presented at the 2013
IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT),
pp. 37-40. IEEE.
5. Abdel-Hafez, A., Tang, X., Tian, N., & Xu, Y. (2014). A reputation-enhanced recommender system. Paper presented
at Advanced Data Mining and Applications (ADMA’14), pp. 185-198. Springer International Publishing.
6. Abdel-Hafez, A., Xu, Y., & Jøsang, A. (2014). A normal-distribution based reputation model. Paper presented at the
Trust, Privacy, and Security in Digital Business (TrustBus’14), pp. 144-155. Springer International Publishing.
7. Abdel-Hafez, A., Phung, Q. V., & Xu, Y. (2014). Utilizing voting systems for ranking user tweets. Paper presented at
the 2014 Recommender Systems Challenge (RecSysChallenge’14), pp. 23-28. ACM.
8. Abdel-Hafez, A., Xu, Y., & Tian, N. (2014). Item reputation-aware recommender systems. Paper presented at the
16th International Conference on Information Integration and Web-based Applications & Services (iiWAS’14), pp.
79-86. ACM.
9. Abdel-Hafez, A., Xu, Y., & Jøsang, A. (2014). A rating aggregation method for generating product reputations. Paper
presented at the 25th ACM conference on Hypertext and social media (Hypertext’14), pp. 291-293.ACM.
10. Abdel-Hafez, A., Xu, Y., & Jøsang, A. (2015). An accurate rating aggregation method for generating item reputation.
Paper to be presented at the 2015 IEEE International Conference on Data Science and Advanced Analytics
(DSAA'15). IEEE. (Accepted)
11. Abdel-Hafez, A., & Xu, Y. (2015). Exploiting the beta distribution-based reputation model in recommender system.
Paper to be presented at the 28th Australasian Joint Conference on Artificial Intelligence. (Accepted)
59. References
[1] Ayday, E., Lee, H., & Fekri, F. (2009). An iterative algorithm for trust and reputation management. In IEEE International Symposium on
Information Theory (ISIT), 2051-2055, IEEE.
[2] Riggs, T., & Wilensky, R. (2001). An algorithm for automated rating of reviewers. Paper presented at the Proceedings of the 1st
ACM/IEEE-CS joint conference on Digital libraries.
[3] A. Jøsang and J. Haller, "Dirichlet reputation systems," in Availability, Reliability and Security, 2007. ARES 2007. The Second International
Conference on, 2007, pp. 112-119.
[4] Bharadwaj, K. K., & Al-Shamri, M. Y. H. (2009). Fuzzy computational models for trust and reputation systems. Electronic Commerce
Research and Applications, 8(1), 37-47.
[5] Page, L., Brin, S., Motwani, R., & Winograd, T. (1999). The PageRank citation ranking: bringing order to the web.
[6] Teacy, W. L., Patel, J., Jennings, N. R., & Luck, M. (2006). Travos: Trust and reputation in the context of inaccurate information sources.
Autonomous Agents and Multi-Agent Systems, 12(2), 183-198.
[7] Yan, Z., Chen, Y., & Shen, Y. (2014). PerContRep: a practical reputation system for pervasive content services. The Journal of
Supercomputing, 70(3), 1051-1074.
[8] Wang, Y., Yin, G., Cai, Z., Dong, Y., & Dong, H. (2015). A trust-based probabilistic recommendation model for social networks. Journal of
Network and Computer Applications, 55(2015), 59-67.
[9] De Grazia, A. (1953). Mathematical derivation of an election system. Isis, 44(1/2), 42-51.
[10] Coombs, C. H. (1964). A theory of data. New York: Wiley.
[11] Jøsang, A., Pini, M. S., Santini, F., & Xu, Y. (2013). Combining Recommender and Reputation Systems to Produce Better Online Advice.
Modeling Decisions for Artificial Intelligence, 8234, 126-138.
[12] Sarwar, B., Karypis, G., Konstan, J., & Riedl, J. (2000). Analysis of recommendation algorithms for e-commerce. In Proceedings of the
2nd ACM Conference on Electronic Commerce, 158-167, ACM.
[13] Hernando, A., Bobadilla, J., Ortega, F., & Tejedor, J. (2013). Incorporating reliability measurements into the predictions of a
recommender system. Information Sciences, 218(2013), 1-16.
[14] Said, A., Jain, B. J., Narr, S., & Plumbaum, T. (2012). Users and noise: The magic barrier of recommender systems User Modeling,
Adaptation, and Personalization (pp. 237-248): Springer International Publishing.
[15] Ku, Y.-C., & Tai, Y.-M. (2013). What Happens When Recommendation System Meets Reputation System? The Impact of
Recommendation Information on Purchase Intention. In System Sciences (HICSS), 2013 46th Hawaii International Conference on,
1376-1383, IEEE.6