SlideShare a Scribd company logo
1 of 29
Nhận viết đề tài trọn gói – ZL: 0909 23 26
20– Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909
23 26 20
MINISTRY EDUCATION AND
TRAINING UNIVERSITY OF
DANANG
-----

-----
PHAN QUOC NGHIA
RECOMMENDER SYSTEM BASED ON
STATISTICAL IMPLICATIVE ANALYSIS
Speciality: Computer Science
Code: 62 48 01 01
DOCTORAL THESIS SUMMARY
Danang – 2018
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
The dissertation is completed at:
UNIVERSITY OF DANANG
Academic Instructors:
1. Associate Professor Huynh Xuan Hiep, PhD.
2. Dang Hoai Phuong, PhD.
Opponent 1:……………………………..……………
Opponent 2:………………...………...………………
Opponent 3:………………...……...…………………
The dissertation will be defended before the Board of
thesis review established by University of Da Nang
At ... ..... hour ......... day ....... month ....... year .......
The dissertation can be found at:
- National Library
- Information and Learning Center, University of Da Nang
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
1
PREFACE
1. The urgency of the thesis
The information overload problem really became popular
with the rise of the Internet and social networks, the amount of
information that people are approaching is expanding ever more.
Everyday, we are exposed to a multitude of types of
information: email communications, articles in Internet, social
media postings, advertising information from e-commerce sites.
With this huge amount of information, choosing the right
information for the decision-making of computer users and
smart devices users will be increasingly difficult. The
recommender model is considered as solution to support users to
select information effectively and is widely used in many fields.
Recommender model is a system capable of automatically
analyze, classify, select and provide users with the information,
goods or services that users are interested by application of
statistical techniques and artificial intelligence. In particular,
machine learning algorithms play an important role. In order to
provide the information that users need to support, many
recommender models have been proposed such as Collaborative
filtering recommender models, Content-based recommender
models, Demographic recommender models, Knowledge-based
recommender models, Hybrid recommender models.
However, due to the information explosion on social
networking sites and the spread of products on e-commerce
sites today, the current recommender models have not yet met
the complex requirements of the users. Therefore, the study of
recommender models continue to be interested in such research
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
both advanced methods and algorithms to improve the accuracy
of the current recommender models, research to improve the
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
2
systems to adapt for the problem of information explosion and
research to propose new recommender model.
Starting from this practical situation, the topic
"Recommender system based on statistical implicative analysis"
is conducted within the framework of a doctoral dissertation in
computer science with the desire to contribute a part to the
recommender model of research. Specifically, it is a
collaborative filtering recommender model.
2. Objectives, objects and scope of research of the thesis
2.1. Research objectives
The objective of the thesis is to propose collaborative
filtering recommender models that apply the proposed measures
from the statistical implicative analysis method, tendency of
variation in statistical implications, and association rules.
2.2. Research objects
The objective interestingness measures, statistical
implicative analysis method, recommender models.
2.3. Research scopes
Focus on Statistical implication analysis method, Tendency
of variation in statistical implications, Association rules, and
Recommender models.
3. Research methods
Analysis and synthesis of theory combined with experiment.
4. Thesis structure
Preface
Chapter 1: An overview.
Chapter 2: Classification objective interestingness measures
based on statistical implication parameters.
Chapter 3: Recommender model based on Implication index.
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
3
Chapter 4: Collaborative filtering recommender model based
on Implication intensity.
Chapter 5: Collaborative filtering recommender model based
on statistical implicative similarity measures.
Appendix
5. Contribution of the thesis
- Propose a new method for classification objective
interestingness measures based on statistical implication
parameters.
- Propose recommender model based on Implication index.
- Propose a collaborative filtering recommender model
based on Implication intensity.
- Propose a collaborative filtering recommender model
based on statistical implicative similarity measures.
- Develop empirical toolkit (ARQAT) on the R language.
CHAPTER 1: AN OVERVIEW
The main content of this chapter studies an overview of
objective interestingness measures, statistical implicative
analysis method, tendency of variation in statistical
implications, and recommender models. Research on the
proposed recommender models and analysis of advantages and
disadvantages of each model. On the basis of these studies,
clearly define the research content of the thesis.
1.1. Statistical implicative analysis
Statistical implicative analysis is the method of data analysis
studying implicative relationships between variables or data
attributes, allowing detecting the asymmetrical rules a → b in the
form "if a then that almost b" or "consider to what extent that b
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
4
will meet implication of a". The purpose of this method is to
detect trends in a set of attributes (variables) by using statistical
implication measures.
Figure 1.1. The model represents statistical implication
analysis method
Let E be a set of n objects or individuals described by a finite set of binary variables (property).
A (A ⊂ E) is a subset of objects that meet the property a; B (B ⊂ E) is a subset of objects that
meet the property b; ̅(resp. ̅) is the complement of A (resp. B);
A B
= ( ) is the number of elements of set A; = ( ) is the number of elements of set B; and the counter-
examples ( ̅ = ( ∩ ̅)) is the number of objects that satisfy the attribute a but does not satisfy the property b. Let X
and Y be two random sets with the number = and = respectively.
For a certain process of sampling, the random variable ( ∩
̅) follows the Poisson distribution with the parameter
= ̅
.
The rule → is said to be admissible for a given
threshold α if
̅ ̅
1.2
Pr[card(X ∩ Y) ≤ card(A ∩ B)] ≤ α
Let us consider the case where n̅ ≠ 0. In this case, the
B
Poisson random variable
̅
can be standardized
card(X ∩ Y)
random as:
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
5
̅ nA(n − n )
̅
card(X ∩ Y) − n
Q(A, B) =
1.3
√
nA(n − n )
n
In experimental realization, the observed value
Q(A, B̅) is defined by:
n ̅ −
nA(n − n )
̅
AB
n
q(A, B) =
√nA(n − n )
n
q(A, B̅) of
1.4
This value measures a deviation between the contingency
and expected value when a and b are independent.
When the approximation is justified (e.g. > 4) the random variable q(A, B
̅) is approximatively N(0,1)-distributed. The implication intensity φ(a,b) of the rule → is defined by:
1
∞
2
̅ ̅
∫
−
nếu ≠
2
1 − Pr( ( , ) ≤ ( , )) =
2 1.5
φ(A,B)= { ( , ̅)
0 ngược lại
This measures is used to determine the unlikehood of the counter-example ̅ in the set . The implication
intensity φ(a,b) is admissible for a given threshold α if φ(a,b) ≥ 1 − .
1.2. Tendency of variation in statistical implications
The tendency of variation in statistical implications is a
research directions to examine the stability of the implication
intensity to observe small variations of measures in the
surrounding space of parameters , , , . To clarify the tendency of variation in statistical implications, we examine the
implication index measures under 4 parameters , , , with formula defined (1.4).
To observe the variation of q from the variability of the parameters , , , , Let us consider the parameters
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
6
, , ,
as real numbers which satisfy the following
inequalities:
≤ inf( , ) và sup( , ) ≤ 1.6
In this case, q can be considered as a continuous
differentiable function:
= + + + ̅ = grad q. dM
1.7
̅
The ( , , , ) s a function has 4 parameters. To observe the variation of q according to the parameters we
calculated the partial derivative for each parameter. In fact, this variation is
estimated rising of the function q with variation according to the variation of q
corresponding components ∆ , ∆ ,∆ ,∆ . Therefore, we have the formula:
∆ = ∆ + ∆ + ∆ + ∆ ̅
1.8
̅
Let us take the partial derivatives of q under ̅ we have the
following formula:
= 1
> 0
̅
√ ( − )
1.12
Equation 1.12 shows that if the ∆ tends to increase, then the q tends to increase.
1.3. Recommender models
1.3.1. The basic concepts
1.3.2. Content-based recommender models
1.3.3. Collaborative filtering recommender models
1.3.4. Demographic recommender models
1.3.5. Knowledge-based recommender models
1.3.6. Recommender based on association rule models
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
7
1.3.7. Recommender model based on statistical
implicative analysis
1.3.8. Hybrid recommender models
1.4. Evaluating recommender models
1.4.1. Method for developing evaluation data
1.4.2. Method for Evaluating the recommender models
1.5. Application of recommender models
1.6. Development trends of recommender models
1.7. Conclusion Chapter 1
The contribution of this chapter studies objective
interestingness measures, statistical implicative analysis
method. Study recommender models, analyze advantages and
disadvantages of each model. This is the basis for determining
the research contents of the thesis.
CHAPTER 2: CLASSIFICATION OBJECTIVE
INTERESTINGNESS MEASURES BASED ON
STATISTICAL IMPLICATION PARAMETERS
The main content of this chapter presents objective
interestingness measures, methods of classifying objective
interestingness measures, and proposing a method for
classifying measures based on an asymmetric approach using
statistical implication parameters.
The research results of this chapter have been published in
works (3), (4) in the published list by author.
2.1. An objective interestingness measures
An objective interestingness measures is the measurement of
knowledge patterns based on the distribution of data. Assume
that we have a finite set of transactions, with each transaction
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
8
contained in item set I. An association rule → where A and B are two disjoint sets of items ( ∩ = ∅) where a are
attributes of the objects of the set A, b are attributes of the objects of the set B. Item set A (resp. B) is associated
with a subset of transactions with = ( ) = { ∈ , ⊆ } (resp. = ( )), item set ̅ (resp. ̅ ) is associated with a
subset of transactions with ̅= ( ̅)= − ( ) = { ∈ , ⊊ } (resp. ̅ = ( ̅) ). The
rule can be described by four cardinalities ( , , , ̅ ) where = | |, = | |, = | |, ̅ = | ∩ ̅|. The interestingness value
of an association rule based on an objective interestingness
measures will then be calculated by using the cardinality of a rule ( → ) = ( , , , ̅).
n
T A
̅
B
̅
Figure 2.1. The cardinality of an association rule ⟶ 2.2. Classify the objective interestingness measures
2.2.1. Classification based on examining of
measures properties
2.2.2. Classification based on measures of behavior
2.3. Classifying objective interestingness measures based
on statistical implication parameters
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
9
2.3.1. The principles define the variance of the measure
based on the partial derivative
The principles used to investigate the objective
interestingness measures based on the partial derivative value
according to 4 parameters:
- If the partial derivative values of corresponding parameter
is positive, the property of measures in the corresponding
parameter is labeled as 1.
- If the partial derivative values of corresponding parameter
is negative, the property of measures in the corresponding
parameter is labeled as -1.
- If the partial derivative values of corresponding parameter
is zero, the property of measures in the corresponding
parameter is labeled as 0.
2.3.2. The rules for classification measures based on
the variable attribute of measures
Measures are classified according to the following rules:
- If the value of the partial survey has label 1, then put it in
the class of measures vary increasing with the corresponding
parameter;
- If the value of the partial survey has label -1, then put it in
the class of measures vary decreasing with the corresponding
parameter;
- If the value of the partial survey has label 0, then put it in the
class of measures is independent on corresponding parameter;
- If the value of the partial survey has label more than one
value (1, 0, -1), then put it in the other class.
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
10
2.4. Classification results of asymmetric
objective interestingness measures
2.4.1. Classification result of measures based on
partial derivative under the parameter n
2.4.2. Classification result of measures based on
partial derivative under the parameter
2.4.3. Classification result of measures based on
partial derivative under the parameter
2.4.4. Classification result of measures based on
partial derivative under the parameter
2.5. Comparison and evaluation of classification results by
statistical implication parameters
- Class of measures independent of the parameter n by the
classification method based on tendency of variation in
statistical implications fall in the class of measures have
descriptive property by the classification method based on
properties of measures.
- The majority of measures have asymmetric properties
increase with the parameter and decrease with the parameter
when calculating the value based on the association rules.
- The class measures has statistical property is always
increasing or decreasing with statistical implication parameters.
2.6. Conclusion chapter 2
The main contribution of this chapter is to propose new
method for classifying objective interestingness measures based
on statistical implication parameters. Then apply this method to
conduct classification 39 asymmetric objective interestingness
measures. This result can help users choose the appropriate
measures for the collaborative filtering recommender model.
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
11
CHAPTER 3: RECOMMENDER MODEL BASED ON
IMPLICATION INDEX
The main content of this chapter proposed recommender
model based on asymmetric approach using association rules,
Implication index, and partial derivatives under statistical
implication parameters. This model is particularly interested in
the relationship between the condition attributes and decision
attributes on the same object to give the recommendation
results for users.
The research results of this chapter have been published in
works (1), (2) in the published list by author.
3.1. An association rules based on decision attributes
3.1.1. Definition of association rule based on decision attributes
Let = { 1, 2, … , } is a set of n users, where each user is stored as a transaction, U is considered the transaction database; = { 1, 2, … , ,
+1, … , } is the set of m attributes of each user, where Đ = { 1, 2, … , } is the set of condition attributes, Đ = { +1, … , } is the set of decision
attributes.
An association rule based on decision attributes generated from the transaction database U is an
implicative expression of the form: a → b, with ⊆ Đ , ⊆ Đ, | | ≥ 1, | | ≥ 1.
3.1.2. Algorithm for generating association rule based on
decision attributes
Input:User transaction dataset ( ).
Output: Set of association rules for recommender models.
Begin
Step 1: Scan transaction database (U) to determine Support of each candidate 1-itemset, compare candidate Support with min_sup to find frequent 1-itemset (
1).
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
12
Step 2: Use −1 join −1 to generate a candidate set of candidate k-itemset. Prune not frequent itemsets to determine candidate k-itemset.
Step 3: Scan transaction database (U) to determine Support of each candidate k-itemset, compare candidate Support with min_sup to find
frequent k-itemset ( ).
Step 4: Repeat from step 2 until the candidate set is empty.
Step 5: For each frequent itemset I, generate all nonempty s subsets of I.
Step 6: For every nonempty subset s of l, generate the rules:
Đ= { → ( − )|( − ) ∈ { + , … , }, Confidence(( −
) → ) ≥ min_conf}.
End.
3.2. Statistical implication parameters of association rules
3.2.1. Statistical implication parameters
3.2.2. Statistical implication parameters based on binary matrix
3.3. Calculate Implication index and partial
derivatives based on statistical implication parameters
3.4. Recommender model based on Implication index
3.4.1. Definition of recommender model based on Implication index
The recommender model based on Implication index is
defined as follows:
= < , , , >
Where:
- = { 1, 2, … , } is a set of n users;
- = { 1, 2, … , , +1, … , } is the set of m attributes of each user, where Đ = { 1, 2, … , } is the set of condition attributes, Đ = { +1, … , } is the
set of decision attributes;
- = { 1, 2, … , } is the association rule set for the model;
- ∶ → are computational functions for finding association rules RUa = {r1,r2, … , rN} which have valuable
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
13
recommendation for user u based on Implication index and partial derivative under statistical implication parameters.
3.4.2. Recommendation algorithm based on Implication index
Input: - Transaction dataset;
- Value set of condition attributes ( Đ ) of the user ;
Output: The association rule set supports for users to choice the value of the decision attributes;
Begin
Step 1: Generate association rules based on decision attributes from transaction dataset.
Step 2: Determine the value of statistical implication parameters for association rule set:
For each association rule based on decision attributes of rule set do <Determine value of parameter >;
<Determine value of parameter >;
<Determine value of parameter >;
<Determine value of parameter ̅ >;
Step 3: Calculate value of Implication index and partial derivatives based on statistical implication parameters:
For each association rule based on decision attributes in RU set do
<Calculate value of Implication index of rule >;
<Calculate value of partial derivatives on parameter n of rule >; <Calculate value of partial derivatives on parameter of rule >;
<Calculate value of partial derivatives on parameter of rule >; <Calculate value of partial derivatives on parameter ̅ of rule >;
Step 4: Choose a recommendation rule set for the user :
For each association rule based on decision attributes in RU set do if (<Value attributes left side of rule > ⊆ < Value set of Đ >) then
<Put association rule into RU set for user >;
<Sort association rules of RU set by value of Implication index and value of partial derivatives on statistical implication parameters>;
<Select N rules with the highest Implication index value and the highest partial derivative value to recommend user >;
End.
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
14
3.5. Experiment
3.5.1. Experimental data
The model has been tested on two datasets: Lenses dataset of
UCI and admissions dataset of Tra Vinh University (DVT-Data).
3.5.2. Evaluate the model's accuracy on standard dataset
Table 3.17. The recommendation result with the condition
attributes {i1 = 1, i2 = 2, i3 = 2, i4 = 1}
From the recommendation results in Table 3.17, the users
can consider value of decision attributes to make a final
decision to use contact lenses or not.
3.5.3. Evaluate the model's accuracy on real dataset
The model has 8 results that contain the course code
registered by the candidate over 10 tests. In particular, most
recommendation results are based on rules related to the scores
of subjects in the examination. This is a characteristic quite
suited to the actual requirements of admission consultants.
3.6. Conclusion chapter 3
The contribution of this chapter is to propose recommender
model based on Implication index and association rules. The
difference of this model is based on the value of the condition
attributes to calculate the values of the decision attributes on the
same object.
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
15
CHAPTER 4. COLLABORATIVE FILTERING
RECOMMENDER MODEL BASED ON IMPLICATION
INTENSTY
This chapter proposes a new model for collaborative filtering
recommender model based on asymmetric approach. The model
uses association rules and Implication intensity. In particular, the
model using association rules with high surprisingness generated
based on rating matrix and Implication intensity.
The research results of this chapter have been published in
work (12) in the published list by author.
4.1. Association rules based on Implication intensity
4.1.1. Definition of association rules based on Implication
intensity
Let = { 1, 2, … , } is a set of n users; = { 1, 2, … , } is a set of m items; = { , } is a rating matrix of n
users for m items with each row representing a user (1 ≤ ≤ ); each column represents an item (1 ≤ ≤ ); , is
the rating value of user for item ; is a set of items rated by , is a set of items rated by , and , .
The association rule based on Implication intensity is defined as: → where , and ∩ = ∅;
and is accepted with threshold α (0 ≤ ≤ 1) if φ(a,b) ≥ 1 − α where φ(a,b) is defined in formula (1.5).
4.1.2. Algorithm for generating association rule
based on Implication intensity
Input: Training dataset and the threshold value of Implication intensity; Output: the association rule set based on Implication intensity;
Begin
Step 1: Generate a candidate set from 1-itemset to k-itemset <Generate a candidate set 1-itemset from training dataset >;
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
16
<Generate a candidate set 2-itemset from candidate set 1-itemset>;
…
<Generate a candidate set k-itemset from candidate set k-1-itemset >;
Step 2: Generate nonempty subsets of candidate set For each candidate set I do
<Generate nonempty subsets s of I >;
Step 3: Generate association rule with threshold value of Implication intensity
For each nonempty subsets s of I do <Generate association rule r : → ( − )>;
If (value of Implication intensity of rule r ≥ threshold α) then <Select association rule r for recommender model>;
End.
4.2. The collaborative filtering recommender model
based on Implication intensity
4.2.1. Definition of recommender model based on
Implication intensity
Suppose that = { 1, 2, … , } is a set of n users; = { 1, 2, … , } is a set of m items; = { , } is a training dataset, with , is
the rating value of user for item ;
= { , } is a testing dataset, with , is the rating value of user for item ; = { 1, 2, … , } is a set of
association rules generated from training dataset based on Implication
intensity; = { , } is logical matrix, with , is logical value between rule and user , if user has ratings for items of the left side of rule then , = " ",
otherwise
, = " " ; = { 1, 2, … , } is a set of association rules selected for user . The recommendation results for user is
a set of items belong to right side of
that user has not rated = { 1, 2, … , }.
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
17
4.2.2. Recommendation algorithm based on
Implication intensity
Input:- Set of association rules generated from training dataset;
- Testing dataset;
Output: Recommendation results for each user in the testing dataset;
Begin
Step 1: Create a logical matrix based on association rule set and testing dataset
t = <The number of association rules>;
l = <The number of users in testing dataset>;
= { }, với = {1,2, … , }, = {1,2, … , }; For each association rule in RU do
For each user in testing dataset do
(( )
∩ ( )
≠ ∅)) ℎ
[ , ] = " "
Otherwise
[ , ] = " "
Step 2: Select association rules to recommend for each user For each user in testing dataset do
For each association rule selected do
If ( [ , ] = " ") then
<Select association rule on row i for user on column j>; Step 3: Select items to recommend for each user For
each user in testing dataset do
<Sort descending rule set based on Implication intensity values >; <Select N items from right side of rules with the highest Implication
intensity value that user j has not rated to recommend for user j>; End.
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
18
4.2.3. Evaluate model accuracy
4.3. Experiment
4.3.1. Experimental data
The experiment is performed on two different datasets: real
rating dataset (MovieLens) and binary rating dataset (MSWeb).
4.3.2. Compare the model's accuracy on binary rating
dataset and real rating dataset
The accuracy of model is based on three metrics Precision,
Recall and Fmeasure (Figure 4.2) shows the recommendation
results on binary rating dataset higher precision than the
recommendation results on real rating dataset.
Figure 4.2. The diagram shows the pattern with high
precision on binary rating dataset
4.3.3. Compare the accuracy of the model with
other collaborative recommender models
The accuracy of the proposed model is compared to the
accuracy of the collaborative filtering recommender models:
User-based collaborative filtering (UBCF), Item-based
collaborative filtering (IBCF) and Collaborative filtering
recommender based on association rules (AR). Kết quả so sánh
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
19
được trình bày trong hình 4.3. The comparison results are
presented in Figure 4.3.
Figure 4.3. Compare the accuracy of the recommender
models
This result shows that the collaborative filtering
recommender model based on Implication intensity is highly
accurate on two empirical datasets. In particular, on binary
rating dataset (MSWeb), the model has far more precision than
the other models.
4.4. Conclusion Chapter 4
The contribution of this chapter is to propose collaborative
filtering recommender model based on Implication intensity.
Experiment the model on two standard datasets and compare
the evaluation results with other effective collaborative filtering
recommender models. The experimental results show that using
of asymmetric association rules based on statistical implication
analysis method can increase the accuracy of the collaborative
filtering recommender model.
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
20
CHAPTER 5: COLLABORATIVE FILTERING
RECOMMENDER MODEL BASED ON STATISTICAL
IMPLICATIVE SIMILARITY MEASURES
This chapter presents steps to develop a similarity measures
based on the asymmetric approach for the user-based
collaborative filtering recommender model. The similarity
measures between the two users is based on association rules
and Implication intensity.
The research results of this chapter have been published in
works (7), (8) in the published list by author.
5.1. Similarity measures based on Implication intensity
5.1.1. The statistical implicative similarity measures
between two users
The statistical implicative similarity value between two
users , is determined by the following formula:
∑k II(r )
SIS(u , u ) = 1 −
i=1 i
k
Of which:
- SIS(u ,u ): is similarity value between two users u ,u ;
- II(ri): is implication intensity value of association rule ri;
- k: is the number of association rule set that generated from rating dataset of two users u ,u .
5.1.2. Algorithm to measure statistical implicative similarity
value between two users
Input:Rating dataset for items of two users u , u ;
Output: Similarity value between two users u , u ;
Begin
Step 1: Generate association rules from user's rating matrix
Step 2:Select association rules for two users u , u
< Select items are rated by user u : u >;
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
21
<Select items are not rated by user u : u
̅ >;
<Select association rules of the form: → Y with ∈ u ; ∈ u
̅ và ∩ =∅>;
Step 3: Define the parameters , , ,̅ for selected rules For each selected rule in selected association rule set do
<Define the parameters , , , ̅ >;
Step 4: Calculate implication intensity value for selected set of association rules
For each selected rule in selected association rule set do <Calculate implication intensity value>;
Step 5: Calculate similarity value between two users u , u ( (u , u ))
<Calculate mean of implication intensity values of rule set: ̅>;
<Similarity value between two users u , u : (u , u ) = 1 − ̅>;
End.
5.2. Collaborative filtering recommender model based
on statistical implicative similarity measures
5.2.1. Definition of recommender model based on
statistical implicative similarity measures
The collaborative filtering recommender model based on
statistical implicative similarity measures is defined as follows:
=< , , , >
Of which:
- = { 1, 2, … , } is a set n users in system;
- = { 1, 2, … , } is a set m items in system;
- = { , } is rating matrix of users for items, , is rating value of user for item ;
- ∶ → is a function to find out the items that need to recommend for user ∈ U (with = { 1, 2, … , }).
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
22
5.2.2. Recommendation algorithm based on on statistical
implicative similarity measures
Input: - User set U; Item set I, Rating matrix R;
- User need recommendation result ;
Output: Recommendation items for user : Iua = {i1, i2, … , iN}; Begin
Step 1: Determine the list of k users who are similar to user For each user ∈ do
<Determine similarity value between and by statistical implicative similarity measures: ( , )>;
<Sort descending user list by similarity values>;
<Select first k users with highest similarity values: ( )>;
Step 2: Calculate the predicted rating value for items
<Identify items that user has not rated yet ( , = ∅)>;
<Calculate the predicted rating value for items by the formula: ̂ =
1
,
∑ ∈ ( ) , , , with , is similarity value between user and
∑ ∈ ( ) ,
user ; , is rating value of user for item >;
Step 3: Select recommendation items for user <Sort descending item list by predicted rating values>;
<Select N items with highest predicted rating values to recommend to user >;
End.
5.3. Experiment
5.3.1. Experimental data
The experiment is performed on two datasets: the
MovieLense dataset and the MSWeb dataset.
4.3.2. Evaluate the model on real rating dataset
Table 5.4. Present comparison error parameters of two
models
RMSE MSE MAE
Model using statistical
0.8961562 0.8030960 0.7077939
implicative similarity measures
Model using similarity Pearson
0.9796664 0.9597462 0.7704055
measures
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
23
Table 5.4 shows that values of error parameters (RMSE,
MSE, MAE) of the model are lower than the model using
similarity Pearson measures.
4.3.3. Evaluate the model on binary rating dataset
The results of the model was compared with the model using
Jaccard by drawing the ROC diagram for Precision and Recall
ratios in figure 5.8. The diagram shows that the value of
Precision and Recall ratios on a model using statistical
implicative similarity measures are higher than the value of
Precision and Recall ratios on a model using Jaccard measures.
Figure 5.8. ROC diagram for Precision and Recall ratios of
two models
4.4. Conclusion chapter 5
The contribution of this chapter presents steps for developing a
new similarity measures for User-based collaborative filtering
recommender model. The similarity measure between two users is
based on association rules and Implication intensity. It calls
statistical implicative similarity measures.
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
24
CONCLUSION AND FUTURE WORKS
1. The results of the thesis
- Research overview of objective interestingness measures,
statistical implication analysis method, tendency of variation in
statistical implications, and recommender models.
- Propose a new method for classification objective
interestingness measures based on asymmetric approach using
statistical implication parameters.
- Propose recommender model based on Implication index.
This model solves for the class of recommendation problems
where the condition attributes and decision attributes on the
same object.
- Propose a collaborative filtering recommender model based
on Implication intensity. This model based on association rules and
Implication intensity to provide recommendation results.
- Propose statistical implicative similarity measures based on
asymmetric approaches to improve the accuracy of the User-
based collaborative filtering recommender models.
- Develop experimental toolkit (ARQAT) on R language
including functionality to meet the experimental deployment of
the thesis.
2. Future works
- Build similarity kernel based on statistical implicative
analysis method for collaborative filtering recommender model
to reduce the computational time for online recommender
systems that respond quickly to user requests.
- Develop collaborative filtering recommender model based
on implicative hierarchy graph.
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
THE PUBLISHED ARTICLES
1. Phan Quốc Nghĩa, Nguyễn Minh Kỳ, Nguyễn Tấn Hoàng,
Huỳnh Xuân Hiệp, “Hệ tư vấn dựa trên tiếp cận hàm ý thống kê”,
Kỷ yếu Hội nghị Quốc gia lần thứ VIII về Nghiên cứu cơ bản và
ứng dụng Công Nghệ thông tin (FAIR), pp. 297-308, 2015.
2. Phan Quốc Nghĩa và Huỳnh Xuân Hiệp, “Hệ tư vấn dựa trên
khuynh hướng biến thiên hàm ý thống kê”, Hội thảo quốc gia
lần thứ XVIII: Một số vấn đề chọn lọc của Công nghệ thông tin
và truyền thông, Chủ đề: Xử lý dữ liệu lớn, pp. 93-99, 2015.
3. Nghia Quoc Phan, Hiep Xuan Huynh, Fabrice Guillet, Régis
Gras, “Classifying objective interestingness measures based on
the tendency of value variation”, Quaderni di Ricerca in
Didattica, ISSN 1592-4424, pp. 143-172, 2015.
4. Nghia Quoc Phan, Vinh Cong Phan, Hung Huu Huynh, Hiep
Xuan Huynh, “Clustering the objective interestingness
measures based on tendency of variation in statistical
implications”, EAI Endorsed Transactions on Context-Aware
Systems and Applications, Vol. 3, No. 9, pp. 1-8, 2016.
5. Nghia Quoc Phan, Phuong Hoai Dang, Hiep Xuan Huynh,
“Similarity Kernel for User-based Collaborative Filtering
Recommendation System”, The 2016 IEEE RIVF International
Conference on Computing & Communication Technologies,
Addendum proceeding, pp. 40-46, 2016.
6. Phan Quốc Nghĩa, Đặng Hoài Phương, Huỳnh Xuân Hiệp,
“Lựa chọn mô hình và tham số cho bài toán tư vấn lọc cộng tác
dựa trên đồ thị đánh giá”, Tạp chí Khoa học Trường Đại học Cần
Thơ, Số chuyên đề: Công nghệ thông tin (2017), ISSN 1859-
2333, pp. 171-178, 2017.
7. Phan Quốc Nghĩa, Nguyễn Minh Kỳ, Đặng Hoài Phương,
Huỳnh Xuân Hiệp, “Hệ tư vấn lọc cộng tác theo người dùng dựa
Nhận viết đề tài trọn gói – ZL: 0909 23 26 20–
Luanvanmaster.com
TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26
20
trên độ đo hàm ý thống kê”, Kỷ yếu Hội nghị Quốc gia lần thứ
IX về Nghiên cứu cơ bản và ứng dụng Công Nghệ thông tin
(FAIR), pp. 231-239, 2016.
8. Nghia Quoc Phan, Phuong Hoai Dang, Hiep Xuan Huynh,
“Statistical Implicative Similarity Measures for Userbased
Collaborative Filtering Recommender System”, (IJACSA)
International Journal of Advanced Computer Science and
Applications, Vol. 7, No. 11 (ISI/WoS ESCI index), pp. 140-
146, 2016.
9. Lan Phuong Phan, Nghia Quoc Phan, Vinh Cong Phan,
Hung Huu Huynh, Hiep Xuan Huynh, and Fabrice Guillet,
“Classification of objective interestingness measures”, EAI
Endorsed Transactions on Context-Aware Systems and
Applications, Vol. 3, No. 10, pp. 1-13, 2016.
10.Lan Phuong Phan, Nghia Quoc Phan, Ky Minh Nguyen,
Hung Huu Huynh, Hiep Xuan Huynh, and Fabrice Guillet,
“Interestingnesslab: A Framework for Developing and Using
Objective Interestingness Measures”, In ICTA 2016:
International Conference on Advances in Information and
Communication Technology, Springer-Verlag, pp. 302-311,
2016.
11.Phan Quốc Nghĩa, Đặng Hoài Phương, Huỳnh Xuân Hiệp,
“Mô hình tư vấn lọc cộng tác tích hợp dựa trên tương đồng sản
phẩm”, Tạp chí Khoa học và Công nghệ - Đại học Đà Nẵng,
Vol. 1(110), pp. 55-58, 2017.
12.Nghia Quoc Phan, Phuong Hoai Dang, Hiep Xuan Huynh,
“Collaborative recommenderation based on statistical
implication rules”, Journal of Computer Science and
Cybernetics, Vol. 33, No. 3, pp. 1-16, 2017.

More Related Content

Similar to Recommender System Based On Statistical Implicative Analysis.doc

Homework 21. Complete Chapter 3, Problem #1 under Project.docx
Homework 21. Complete Chapter 3, Problem #1 under Project.docxHomework 21. Complete Chapter 3, Problem #1 under Project.docx
Homework 21. Complete Chapter 3, Problem #1 under Project.docx
adampcarr67227
 
Thesis_NickyGrant_2013
Thesis_NickyGrant_2013Thesis_NickyGrant_2013
Thesis_NickyGrant_2013
Nicky Grant
 

Similar to Recommender System Based On Statistical Implicative Analysis.doc (20)

Anomaly detection
Anomaly detectionAnomaly detection
Anomaly detection
 
Homework 21. Complete Chapter 3, Problem #1 under Project.docx
Homework 21. Complete Chapter 3, Problem #1 under Project.docxHomework 21. Complete Chapter 3, Problem #1 under Project.docx
Homework 21. Complete Chapter 3, Problem #1 under Project.docx
 
Data mining 8 estimasi linear regression
Data mining 8   estimasi linear regressionData mining 8   estimasi linear regression
Data mining 8 estimasi linear regression
 
Week_3_Lecture.pdf
Week_3_Lecture.pdfWeek_3_Lecture.pdf
Week_3_Lecture.pdf
 
Data classification sammer
Data classification sammer Data classification sammer
Data classification sammer
 
Mathematical modeling
Mathematical modelingMathematical modeling
Mathematical modeling
 
Statistics for entrepreneurs
Statistics for entrepreneurs Statistics for entrepreneurs
Statistics for entrepreneurs
 
Regression
RegressionRegression
Regression
 
Econometrics of High-Dimensional Sparse Models
Econometrics of High-Dimensional Sparse ModelsEconometrics of High-Dimensional Sparse Models
Econometrics of High-Dimensional Sparse Models
 
Thesis_NickyGrant_2013
Thesis_NickyGrant_2013Thesis_NickyGrant_2013
Thesis_NickyGrant_2013
 
Applied Mathematics and Sciences: An International Journal (MathSJ)
Applied Mathematics and Sciences: An International Journal (MathSJ)Applied Mathematics and Sciences: An International Journal (MathSJ)
Applied Mathematics and Sciences: An International Journal (MathSJ)
 
Scalable Dynamic Graph Summarization
Scalable Dynamic Graph SummarizationScalable Dynamic Graph Summarization
Scalable Dynamic Graph Summarization
 
Petrini - MSc Thesis
Petrini - MSc ThesisPetrini - MSc Thesis
Petrini - MSc Thesis
 
A NEW STUDY OF TRAPEZOIDAL, SIMPSON’S1/3 AND SIMPSON’S 3/8 RULES OF NUMERICAL...
A NEW STUDY OF TRAPEZOIDAL, SIMPSON’S1/3 AND SIMPSON’S 3/8 RULES OF NUMERICAL...A NEW STUDY OF TRAPEZOIDAL, SIMPSON’S1/3 AND SIMPSON’S 3/8 RULES OF NUMERICAL...
A NEW STUDY OF TRAPEZOIDAL, SIMPSON’S1/3 AND SIMPSON’S 3/8 RULES OF NUMERICAL...
 
KIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdfKIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdf
 
Cb36469472
Cb36469472Cb36469472
Cb36469472
 
MSE.pptx
MSE.pptxMSE.pptx
MSE.pptx
 
Essentials of machine learning algorithms
Essentials of machine learning algorithmsEssentials of machine learning algorithms
Essentials of machine learning algorithms
 
Time series.ppt
Time series.pptTime series.ppt
Time series.ppt
 
cannonicalpresentation-110505114327-phpapp01.pdf
cannonicalpresentation-110505114327-phpapp01.pdfcannonicalpresentation-110505114327-phpapp01.pdf
cannonicalpresentation-110505114327-phpapp01.pdf
 

More from Dịch vụ viết đề tài trọn gói 0934.573.149

More from Dịch vụ viết đề tài trọn gói 0934.573.149 (20)

Luận Văn Một Số Biện Pháp Nhằm Quản Lý Hàng Hóa Tốt Hơn Tại Công Ty Tnhh.doc
Luận Văn Một Số Biện Pháp Nhằm Quản Lý Hàng Hóa Tốt Hơn Tại Công Ty Tnhh.docLuận Văn Một Số Biện Pháp Nhằm Quản Lý Hàng Hóa Tốt Hơn Tại Công Ty Tnhh.doc
Luận Văn Một Số Biện Pháp Nhằm Quản Lý Hàng Hóa Tốt Hơn Tại Công Ty Tnhh.doc
 
Luận Văn Một Số Biện Pháp Nâng Cao Hiệu Quả Sử Dụng Nguồn Nhân Lực Tại Công T...
Luận Văn Một Số Biện Pháp Nâng Cao Hiệu Quả Sử Dụng Nguồn Nhân Lực Tại Công T...Luận Văn Một Số Biện Pháp Nâng Cao Hiệu Quả Sử Dụng Nguồn Nhân Lực Tại Công T...
Luận Văn Một Số Biện Pháp Nâng Cao Hiệu Quả Sử Dụng Nguồn Nhân Lực Tại Công T...
 
Luận Văn Một Số Biện Pháp Maketing Nhằm Nâng Cao Hiệu Quả Kinh Doanh Tại Công...
Luận Văn Một Số Biện Pháp Maketing Nhằm Nâng Cao Hiệu Quả Kinh Doanh Tại Công...Luận Văn Một Số Biện Pháp Maketing Nhằm Nâng Cao Hiệu Quả Kinh Doanh Tại Công...
Luận Văn Một Số Biện Pháp Maketing Nhằm Nâng Cao Hiệu Quả Kinh Doanh Tại Công...
 
Luận Văn Lý Luận Chung Về Doanh Thu, Chi Phí, Kết Quả Kinh Doanh Và Kế Toán D...
Luận Văn Lý Luận Chung Về Doanh Thu, Chi Phí, Kết Quả Kinh Doanh Và Kế Toán D...Luận Văn Lý Luận Chung Về Doanh Thu, Chi Phí, Kết Quả Kinh Doanh Và Kế Toán D...
Luận Văn Lý Luận Chung Về Doanh Thu, Chi Phí, Kết Quả Kinh Doanh Và Kế Toán D...
 
Luận Văn Lễ Hội Hoa Phượng Đỏ Hải Phòng Và Thực Trạng Tổ Chức Lễ Hội Lần Thứ ...
Luận Văn Lễ Hội Hoa Phượng Đỏ Hải Phòng Và Thực Trạng Tổ Chức Lễ Hội Lần Thứ ...Luận Văn Lễ Hội Hoa Phượng Đỏ Hải Phòng Và Thực Trạng Tổ Chức Lễ Hội Lần Thứ ...
Luận Văn Lễ Hội Hoa Phượng Đỏ Hải Phòng Và Thực Trạng Tổ Chức Lễ Hội Lần Thứ ...
 
Luận Văn Lễ Hội Báo Slao Xã Quốc Khánh Với Phát Triển Du Lịch Văn Hóa Ở Tràng...
Luận Văn Lễ Hội Báo Slao Xã Quốc Khánh Với Phát Triển Du Lịch Văn Hóa Ở Tràng...Luận Văn Lễ Hội Báo Slao Xã Quốc Khánh Với Phát Triển Du Lịch Văn Hóa Ở Tràng...
Luận Văn Lễ Hội Báo Slao Xã Quốc Khánh Với Phát Triển Du Lịch Văn Hóa Ở Tràng...
 
Luận Văn Kỹ Thuật Giấu Tin Thuận Nghịch Trong Ảnh Bằng Hiệu Chỉnh Hệ Số Wavel...
Luận Văn Kỹ Thuật Giấu Tin Thuận Nghịch Trong Ảnh Bằng Hiệu Chỉnh Hệ Số Wavel...Luận Văn Kỹ Thuật Giấu Tin Thuận Nghịch Trong Ảnh Bằng Hiệu Chỉnh Hệ Số Wavel...
Luận Văn Kỹ Thuật Giấu Tin Thuận Nghịch Trong Ảnh Bằng Hiệu Chỉnh Hệ Số Wavel...
 
Luận Văn Khu Đô Thị Dành Cho Ngƣời Thu Nhập Trung Bình Tại Kiến An Hải Phõng.doc
Luận Văn Khu Đô Thị Dành Cho Ngƣời Thu Nhập Trung Bình Tại Kiến An Hải Phõng.docLuận Văn Khu Đô Thị Dành Cho Ngƣời Thu Nhập Trung Bình Tại Kiến An Hải Phõng.doc
Luận Văn Khu Đô Thị Dành Cho Ngƣời Thu Nhập Trung Bình Tại Kiến An Hải Phõng.doc
 
Luận Văn Khai Thác Tiềm Năng Phát Triển Du Lịch Ven Biển Tiền Hải - Thái Binh...
Luận Văn Khai Thác Tiềm Năng Phát Triển Du Lịch Ven Biển Tiền Hải - Thái Binh...Luận Văn Khai Thác Tiềm Năng Phát Triển Du Lịch Ven Biển Tiền Hải - Thái Binh...
Luận Văn Khai Thác Tiềm Năng Phát Triển Du Lịch Ven Biển Tiền Hải - Thái Binh...
 
Luận Văn Khai Thác Giá Trị Hệ Thống Chùa Tại Thủy Nguyên Phục Vụ Phát Triển D...
Luận Văn Khai Thác Giá Trị Hệ Thống Chùa Tại Thủy Nguyên Phục Vụ Phát Triển D...Luận Văn Khai Thác Giá Trị Hệ Thống Chùa Tại Thủy Nguyên Phục Vụ Phát Triển D...
Luận Văn Khai Thác Giá Trị Hệ Thống Chùa Tại Thủy Nguyên Phục Vụ Phát Triển D...
 
Luận Văn Khai thác các giá trị lịch sử - văn hóa làng cổ loa, xã cổ loa, huyệ...
Luận Văn Khai thác các giá trị lịch sử - văn hóa làng cổ loa, xã cổ loa, huyệ...Luận Văn Khai thác các giá trị lịch sử - văn hóa làng cổ loa, xã cổ loa, huyệ...
Luận Văn Khai thác các giá trị lịch sử - văn hóa làng cổ loa, xã cổ loa, huyệ...
 
Luận Văn Khách Sạn Hòn Dáu.doc
Luận Văn Khách Sạn Hòn Dáu.docLuận Văn Khách Sạn Hòn Dáu.doc
Luận Văn Khách Sạn Hòn Dáu.doc
 
Luận Văn Khách Sạn Hoàng Anh.doc
Luận Văn Khách Sạn Hoàng Anh.docLuận Văn Khách Sạn Hoàng Anh.doc
Luận Văn Khách Sạn Hoàng Anh.doc
 
Luận Văn Khách Sạn Hoa Phuợng Đồ Sơn.doc
Luận Văn Khách Sạn Hoa Phuợng Đồ Sơn.docLuận Văn Khách Sạn Hoa Phuợng Đồ Sơn.doc
Luận Văn Khách Sạn Hoa Phuợng Đồ Sơn.doc
 
Luận Văn Khách Sạn Đồ Sơn.doc
Luận Văn Khách Sạn Đồ Sơn.docLuận Văn Khách Sạn Đồ Sơn.doc
Luận Văn Khách Sạn Đồ Sơn.doc
 
Luận Văn Học Viện Văn Hóa Nghệ Thuật Dân Tộc Việt Bắc.doc
Luận Văn Học Viện Văn Hóa Nghệ Thuật Dân Tộc Việt Bắc.docLuận Văn Học Viện Văn Hóa Nghệ Thuật Dân Tộc Việt Bắc.doc
Luận Văn Học Viện Văn Hóa Nghệ Thuật Dân Tộc Việt Bắc.doc
 
Luận Văn Hoàn Thiện Tổ Chức Kế Toán Hàng Hóa Tại Công Ty Tnhh An Phöc Thịnh.doc
Luận Văn Hoàn Thiện Tổ Chức Kế Toán Hàng Hóa Tại Công Ty Tnhh An Phöc Thịnh.docLuận Văn Hoàn Thiện Tổ Chức Kế Toán Hàng Hóa Tại Công Ty Tnhh An Phöc Thịnh.doc
Luận Văn Hoàn Thiện Tổ Chức Kế Toán Hàng Hóa Tại Công Ty Tnhh An Phöc Thịnh.doc
 
Luận Văn Hoàn Thiện Tổ Chức Công Tác Kế Toán Doanh Thu, Chi Phí Bán Hàng Và X...
Luận Văn Hoàn Thiện Tổ Chức Công Tác Kế Toán Doanh Thu, Chi Phí Bán Hàng Và X...Luận Văn Hoàn Thiện Tổ Chức Công Tác Kế Toán Doanh Thu, Chi Phí Bán Hàng Và X...
Luận Văn Hoàn Thiện Tổ Chức Công Tác Kế Toán Doanh Thu, Chi Phí Bán Hàng Và X...
 
Luận Văn Hoàn Thiện Tổ Chức Công Tác Hạch Toán Kế Toán Tiền Lương Và Các Khoả...
Luận Văn Hoàn Thiện Tổ Chức Công Tác Hạch Toán Kế Toán Tiền Lương Và Các Khoả...Luận Văn Hoàn Thiện Tổ Chức Công Tác Hạch Toán Kế Toán Tiền Lương Và Các Khoả...
Luận Văn Hoàn Thiện Tổ Chức Công Tác Hạch Toán Kế Toán Tiền Lương Và Các Khoả...
 
Luận Văn Hoàn Thiện Công Tác Kế Toán Vốn Bằng Tiền Tại Công Ty Cổ Phần Bao Bì...
Luận Văn Hoàn Thiện Công Tác Kế Toán Vốn Bằng Tiền Tại Công Ty Cổ Phần Bao Bì...Luận Văn Hoàn Thiện Công Tác Kế Toán Vốn Bằng Tiền Tại Công Ty Cổ Phần Bao Bì...
Luận Văn Hoàn Thiện Công Tác Kế Toán Vốn Bằng Tiền Tại Công Ty Cổ Phần Bao Bì...
 

Recently uploaded

Recently uploaded (20)

Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 

Recommender System Based On Statistical Implicative Analysis.doc

  • 1. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 MINISTRY EDUCATION AND TRAINING UNIVERSITY OF DANANG -----  ----- PHAN QUOC NGHIA RECOMMENDER SYSTEM BASED ON STATISTICAL IMPLICATIVE ANALYSIS Speciality: Computer Science Code: 62 48 01 01 DOCTORAL THESIS SUMMARY Danang – 2018
  • 2. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 The dissertation is completed at: UNIVERSITY OF DANANG Academic Instructors: 1. Associate Professor Huynh Xuan Hiep, PhD. 2. Dang Hoai Phuong, PhD. Opponent 1:……………………………..…………… Opponent 2:………………...………...……………… Opponent 3:………………...……...………………… The dissertation will be defended before the Board of thesis review established by University of Da Nang At ... ..... hour ......... day ....... month ....... year ....... The dissertation can be found at: - National Library - Information and Learning Center, University of Da Nang
  • 3. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 1 PREFACE 1. The urgency of the thesis The information overload problem really became popular with the rise of the Internet and social networks, the amount of information that people are approaching is expanding ever more. Everyday, we are exposed to a multitude of types of information: email communications, articles in Internet, social media postings, advertising information from e-commerce sites. With this huge amount of information, choosing the right information for the decision-making of computer users and smart devices users will be increasingly difficult. The recommender model is considered as solution to support users to select information effectively and is widely used in many fields. Recommender model is a system capable of automatically analyze, classify, select and provide users with the information, goods or services that users are interested by application of statistical techniques and artificial intelligence. In particular, machine learning algorithms play an important role. In order to provide the information that users need to support, many recommender models have been proposed such as Collaborative filtering recommender models, Content-based recommender models, Demographic recommender models, Knowledge-based recommender models, Hybrid recommender models. However, due to the information explosion on social networking sites and the spread of products on e-commerce sites today, the current recommender models have not yet met the complex requirements of the users. Therefore, the study of recommender models continue to be interested in such research
  • 4. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 both advanced methods and algorithms to improve the accuracy of the current recommender models, research to improve the
  • 5. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 2 systems to adapt for the problem of information explosion and research to propose new recommender model. Starting from this practical situation, the topic "Recommender system based on statistical implicative analysis" is conducted within the framework of a doctoral dissertation in computer science with the desire to contribute a part to the recommender model of research. Specifically, it is a collaborative filtering recommender model. 2. Objectives, objects and scope of research of the thesis 2.1. Research objectives The objective of the thesis is to propose collaborative filtering recommender models that apply the proposed measures from the statistical implicative analysis method, tendency of variation in statistical implications, and association rules. 2.2. Research objects The objective interestingness measures, statistical implicative analysis method, recommender models. 2.3. Research scopes Focus on Statistical implication analysis method, Tendency of variation in statistical implications, Association rules, and Recommender models. 3. Research methods Analysis and synthesis of theory combined with experiment. 4. Thesis structure Preface Chapter 1: An overview. Chapter 2: Classification objective interestingness measures based on statistical implication parameters. Chapter 3: Recommender model based on Implication index.
  • 6. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 3 Chapter 4: Collaborative filtering recommender model based on Implication intensity. Chapter 5: Collaborative filtering recommender model based on statistical implicative similarity measures. Appendix 5. Contribution of the thesis - Propose a new method for classification objective interestingness measures based on statistical implication parameters. - Propose recommender model based on Implication index. - Propose a collaborative filtering recommender model based on Implication intensity. - Propose a collaborative filtering recommender model based on statistical implicative similarity measures. - Develop empirical toolkit (ARQAT) on the R language. CHAPTER 1: AN OVERVIEW The main content of this chapter studies an overview of objective interestingness measures, statistical implicative analysis method, tendency of variation in statistical implications, and recommender models. Research on the proposed recommender models and analysis of advantages and disadvantages of each model. On the basis of these studies, clearly define the research content of the thesis. 1.1. Statistical implicative analysis Statistical implicative analysis is the method of data analysis studying implicative relationships between variables or data attributes, allowing detecting the asymmetrical rules a → b in the form "if a then that almost b" or "consider to what extent that b
  • 7. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 4 will meet implication of a". The purpose of this method is to detect trends in a set of attributes (variables) by using statistical implication measures. Figure 1.1. The model represents statistical implication analysis method Let E be a set of n objects or individuals described by a finite set of binary variables (property). A (A ⊂ E) is a subset of objects that meet the property a; B (B ⊂ E) is a subset of objects that meet the property b; ̅(resp. ̅) is the complement of A (resp. B); A B = ( ) is the number of elements of set A; = ( ) is the number of elements of set B; and the counter- examples ( ̅ = ( ∩ ̅)) is the number of objects that satisfy the attribute a but does not satisfy the property b. Let X and Y be two random sets with the number = and = respectively. For a certain process of sampling, the random variable ( ∩ ̅) follows the Poisson distribution with the parameter = ̅ . The rule → is said to be admissible for a given threshold α if ̅ ̅ 1.2 Pr[card(X ∩ Y) ≤ card(A ∩ B)] ≤ α Let us consider the case where n̅ ≠ 0. In this case, the B Poisson random variable ̅ can be standardized card(X ∩ Y) random as:
  • 8. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 5 ̅ nA(n − n ) ̅ card(X ∩ Y) − n Q(A, B) = 1.3 √ nA(n − n ) n In experimental realization, the observed value Q(A, B̅) is defined by: n ̅ − nA(n − n ) ̅ AB n q(A, B) = √nA(n − n ) n q(A, B̅) of 1.4 This value measures a deviation between the contingency and expected value when a and b are independent. When the approximation is justified (e.g. > 4) the random variable q(A, B ̅) is approximatively N(0,1)-distributed. The implication intensity φ(a,b) of the rule → is defined by: 1 ∞ 2 ̅ ̅ ∫ − nếu ≠ 2 1 − Pr( ( , ) ≤ ( , )) = 2 1.5 φ(A,B)= { ( , ̅) 0 ngược lại This measures is used to determine the unlikehood of the counter-example ̅ in the set . The implication intensity φ(a,b) is admissible for a given threshold α if φ(a,b) ≥ 1 − . 1.2. Tendency of variation in statistical implications The tendency of variation in statistical implications is a research directions to examine the stability of the implication intensity to observe small variations of measures in the surrounding space of parameters , , , . To clarify the tendency of variation in statistical implications, we examine the implication index measures under 4 parameters , , , with formula defined (1.4). To observe the variation of q from the variability of the parameters , , , , Let us consider the parameters
  • 9. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 6 , , , as real numbers which satisfy the following inequalities: ≤ inf( , ) và sup( , ) ≤ 1.6 In this case, q can be considered as a continuous differentiable function: = + + + ̅ = grad q. dM 1.7 ̅ The ( , , , ) s a function has 4 parameters. To observe the variation of q according to the parameters we calculated the partial derivative for each parameter. In fact, this variation is estimated rising of the function q with variation according to the variation of q corresponding components ∆ , ∆ ,∆ ,∆ . Therefore, we have the formula: ∆ = ∆ + ∆ + ∆ + ∆ ̅ 1.8 ̅ Let us take the partial derivatives of q under ̅ we have the following formula: = 1 > 0 ̅ √ ( − ) 1.12 Equation 1.12 shows that if the ∆ tends to increase, then the q tends to increase. 1.3. Recommender models 1.3.1. The basic concepts 1.3.2. Content-based recommender models 1.3.3. Collaborative filtering recommender models 1.3.4. Demographic recommender models 1.3.5. Knowledge-based recommender models 1.3.6. Recommender based on association rule models
  • 10. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 7 1.3.7. Recommender model based on statistical implicative analysis 1.3.8. Hybrid recommender models 1.4. Evaluating recommender models 1.4.1. Method for developing evaluation data 1.4.2. Method for Evaluating the recommender models 1.5. Application of recommender models 1.6. Development trends of recommender models 1.7. Conclusion Chapter 1 The contribution of this chapter studies objective interestingness measures, statistical implicative analysis method. Study recommender models, analyze advantages and disadvantages of each model. This is the basis for determining the research contents of the thesis. CHAPTER 2: CLASSIFICATION OBJECTIVE INTERESTINGNESS MEASURES BASED ON STATISTICAL IMPLICATION PARAMETERS The main content of this chapter presents objective interestingness measures, methods of classifying objective interestingness measures, and proposing a method for classifying measures based on an asymmetric approach using statistical implication parameters. The research results of this chapter have been published in works (3), (4) in the published list by author. 2.1. An objective interestingness measures An objective interestingness measures is the measurement of knowledge patterns based on the distribution of data. Assume that we have a finite set of transactions, with each transaction
  • 11. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 8 contained in item set I. An association rule → where A and B are two disjoint sets of items ( ∩ = ∅) where a are attributes of the objects of the set A, b are attributes of the objects of the set B. Item set A (resp. B) is associated with a subset of transactions with = ( ) = { ∈ , ⊆ } (resp. = ( )), item set ̅ (resp. ̅ ) is associated with a subset of transactions with ̅= ( ̅)= − ( ) = { ∈ , ⊊ } (resp. ̅ = ( ̅) ). The rule can be described by four cardinalities ( , , , ̅ ) where = | |, = | |, = | |, ̅ = | ∩ ̅|. The interestingness value of an association rule based on an objective interestingness measures will then be calculated by using the cardinality of a rule ( → ) = ( , , , ̅). n T A ̅ B ̅ Figure 2.1. The cardinality of an association rule ⟶ 2.2. Classify the objective interestingness measures 2.2.1. Classification based on examining of measures properties 2.2.2. Classification based on measures of behavior 2.3. Classifying objective interestingness measures based on statistical implication parameters
  • 12. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 9 2.3.1. The principles define the variance of the measure based on the partial derivative The principles used to investigate the objective interestingness measures based on the partial derivative value according to 4 parameters: - If the partial derivative values of corresponding parameter is positive, the property of measures in the corresponding parameter is labeled as 1. - If the partial derivative values of corresponding parameter is negative, the property of measures in the corresponding parameter is labeled as -1. - If the partial derivative values of corresponding parameter is zero, the property of measures in the corresponding parameter is labeled as 0. 2.3.2. The rules for classification measures based on the variable attribute of measures Measures are classified according to the following rules: - If the value of the partial survey has label 1, then put it in the class of measures vary increasing with the corresponding parameter; - If the value of the partial survey has label -1, then put it in the class of measures vary decreasing with the corresponding parameter; - If the value of the partial survey has label 0, then put it in the class of measures is independent on corresponding parameter; - If the value of the partial survey has label more than one value (1, 0, -1), then put it in the other class.
  • 13. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 10 2.4. Classification results of asymmetric objective interestingness measures 2.4.1. Classification result of measures based on partial derivative under the parameter n 2.4.2. Classification result of measures based on partial derivative under the parameter 2.4.3. Classification result of measures based on partial derivative under the parameter 2.4.4. Classification result of measures based on partial derivative under the parameter 2.5. Comparison and evaluation of classification results by statistical implication parameters - Class of measures independent of the parameter n by the classification method based on tendency of variation in statistical implications fall in the class of measures have descriptive property by the classification method based on properties of measures. - The majority of measures have asymmetric properties increase with the parameter and decrease with the parameter when calculating the value based on the association rules. - The class measures has statistical property is always increasing or decreasing with statistical implication parameters. 2.6. Conclusion chapter 2 The main contribution of this chapter is to propose new method for classifying objective interestingness measures based on statistical implication parameters. Then apply this method to conduct classification 39 asymmetric objective interestingness measures. This result can help users choose the appropriate measures for the collaborative filtering recommender model.
  • 14. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 11 CHAPTER 3: RECOMMENDER MODEL BASED ON IMPLICATION INDEX The main content of this chapter proposed recommender model based on asymmetric approach using association rules, Implication index, and partial derivatives under statistical implication parameters. This model is particularly interested in the relationship between the condition attributes and decision attributes on the same object to give the recommendation results for users. The research results of this chapter have been published in works (1), (2) in the published list by author. 3.1. An association rules based on decision attributes 3.1.1. Definition of association rule based on decision attributes Let = { 1, 2, … , } is a set of n users, where each user is stored as a transaction, U is considered the transaction database; = { 1, 2, … , , +1, … , } is the set of m attributes of each user, where Đ = { 1, 2, … , } is the set of condition attributes, Đ = { +1, … , } is the set of decision attributes. An association rule based on decision attributes generated from the transaction database U is an implicative expression of the form: a → b, with ⊆ Đ , ⊆ Đ, | | ≥ 1, | | ≥ 1. 3.1.2. Algorithm for generating association rule based on decision attributes Input:User transaction dataset ( ). Output: Set of association rules for recommender models. Begin Step 1: Scan transaction database (U) to determine Support of each candidate 1-itemset, compare candidate Support with min_sup to find frequent 1-itemset ( 1).
  • 15. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 12 Step 2: Use −1 join −1 to generate a candidate set of candidate k-itemset. Prune not frequent itemsets to determine candidate k-itemset. Step 3: Scan transaction database (U) to determine Support of each candidate k-itemset, compare candidate Support with min_sup to find frequent k-itemset ( ). Step 4: Repeat from step 2 until the candidate set is empty. Step 5: For each frequent itemset I, generate all nonempty s subsets of I. Step 6: For every nonempty subset s of l, generate the rules: Đ= { → ( − )|( − ) ∈ { + , … , }, Confidence(( − ) → ) ≥ min_conf}. End. 3.2. Statistical implication parameters of association rules 3.2.1. Statistical implication parameters 3.2.2. Statistical implication parameters based on binary matrix 3.3. Calculate Implication index and partial derivatives based on statistical implication parameters 3.4. Recommender model based on Implication index 3.4.1. Definition of recommender model based on Implication index The recommender model based on Implication index is defined as follows: = < , , , > Where: - = { 1, 2, … , } is a set of n users; - = { 1, 2, … , , +1, … , } is the set of m attributes of each user, where Đ = { 1, 2, … , } is the set of condition attributes, Đ = { +1, … , } is the set of decision attributes; - = { 1, 2, … , } is the association rule set for the model; - ∶ → are computational functions for finding association rules RUa = {r1,r2, … , rN} which have valuable
  • 16. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 13 recommendation for user u based on Implication index and partial derivative under statistical implication parameters. 3.4.2. Recommendation algorithm based on Implication index Input: - Transaction dataset; - Value set of condition attributes ( Đ ) of the user ; Output: The association rule set supports for users to choice the value of the decision attributes; Begin Step 1: Generate association rules based on decision attributes from transaction dataset. Step 2: Determine the value of statistical implication parameters for association rule set: For each association rule based on decision attributes of rule set do <Determine value of parameter >; <Determine value of parameter >; <Determine value of parameter >; <Determine value of parameter ̅ >; Step 3: Calculate value of Implication index and partial derivatives based on statistical implication parameters: For each association rule based on decision attributes in RU set do <Calculate value of Implication index of rule >; <Calculate value of partial derivatives on parameter n of rule >; <Calculate value of partial derivatives on parameter of rule >; <Calculate value of partial derivatives on parameter of rule >; <Calculate value of partial derivatives on parameter ̅ of rule >; Step 4: Choose a recommendation rule set for the user : For each association rule based on decision attributes in RU set do if (<Value attributes left side of rule > ⊆ < Value set of Đ >) then <Put association rule into RU set for user >; <Sort association rules of RU set by value of Implication index and value of partial derivatives on statistical implication parameters>; <Select N rules with the highest Implication index value and the highest partial derivative value to recommend user >; End.
  • 17. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 14 3.5. Experiment 3.5.1. Experimental data The model has been tested on two datasets: Lenses dataset of UCI and admissions dataset of Tra Vinh University (DVT-Data). 3.5.2. Evaluate the model's accuracy on standard dataset Table 3.17. The recommendation result with the condition attributes {i1 = 1, i2 = 2, i3 = 2, i4 = 1} From the recommendation results in Table 3.17, the users can consider value of decision attributes to make a final decision to use contact lenses or not. 3.5.3. Evaluate the model's accuracy on real dataset The model has 8 results that contain the course code registered by the candidate over 10 tests. In particular, most recommendation results are based on rules related to the scores of subjects in the examination. This is a characteristic quite suited to the actual requirements of admission consultants. 3.6. Conclusion chapter 3 The contribution of this chapter is to propose recommender model based on Implication index and association rules. The difference of this model is based on the value of the condition attributes to calculate the values of the decision attributes on the same object.
  • 18. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 15 CHAPTER 4. COLLABORATIVE FILTERING RECOMMENDER MODEL BASED ON IMPLICATION INTENSTY This chapter proposes a new model for collaborative filtering recommender model based on asymmetric approach. The model uses association rules and Implication intensity. In particular, the model using association rules with high surprisingness generated based on rating matrix and Implication intensity. The research results of this chapter have been published in work (12) in the published list by author. 4.1. Association rules based on Implication intensity 4.1.1. Definition of association rules based on Implication intensity Let = { 1, 2, … , } is a set of n users; = { 1, 2, … , } is a set of m items; = { , } is a rating matrix of n users for m items with each row representing a user (1 ≤ ≤ ); each column represents an item (1 ≤ ≤ ); , is the rating value of user for item ; is a set of items rated by , is a set of items rated by , and , . The association rule based on Implication intensity is defined as: → where , and ∩ = ∅; and is accepted with threshold α (0 ≤ ≤ 1) if φ(a,b) ≥ 1 − α where φ(a,b) is defined in formula (1.5). 4.1.2. Algorithm for generating association rule based on Implication intensity Input: Training dataset and the threshold value of Implication intensity; Output: the association rule set based on Implication intensity; Begin Step 1: Generate a candidate set from 1-itemset to k-itemset <Generate a candidate set 1-itemset from training dataset >;
  • 19. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 16 <Generate a candidate set 2-itemset from candidate set 1-itemset>; … <Generate a candidate set k-itemset from candidate set k-1-itemset >; Step 2: Generate nonempty subsets of candidate set For each candidate set I do <Generate nonempty subsets s of I >; Step 3: Generate association rule with threshold value of Implication intensity For each nonempty subsets s of I do <Generate association rule r : → ( − )>; If (value of Implication intensity of rule r ≥ threshold α) then <Select association rule r for recommender model>; End. 4.2. The collaborative filtering recommender model based on Implication intensity 4.2.1. Definition of recommender model based on Implication intensity Suppose that = { 1, 2, … , } is a set of n users; = { 1, 2, … , } is a set of m items; = { , } is a training dataset, with , is the rating value of user for item ; = { , } is a testing dataset, with , is the rating value of user for item ; = { 1, 2, … , } is a set of association rules generated from training dataset based on Implication intensity; = { , } is logical matrix, with , is logical value between rule and user , if user has ratings for items of the left side of rule then , = " ", otherwise , = " " ; = { 1, 2, … , } is a set of association rules selected for user . The recommendation results for user is a set of items belong to right side of that user has not rated = { 1, 2, … , }.
  • 20. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 17 4.2.2. Recommendation algorithm based on Implication intensity Input:- Set of association rules generated from training dataset; - Testing dataset; Output: Recommendation results for each user in the testing dataset; Begin Step 1: Create a logical matrix based on association rule set and testing dataset t = <The number of association rules>; l = <The number of users in testing dataset>; = { }, với = {1,2, … , }, = {1,2, … , }; For each association rule in RU do For each user in testing dataset do (( ) ∩ ( ) ≠ ∅)) ℎ [ , ] = " " Otherwise [ , ] = " " Step 2: Select association rules to recommend for each user For each user in testing dataset do For each association rule selected do If ( [ , ] = " ") then <Select association rule on row i for user on column j>; Step 3: Select items to recommend for each user For each user in testing dataset do <Sort descending rule set based on Implication intensity values >; <Select N items from right side of rules with the highest Implication intensity value that user j has not rated to recommend for user j>; End.
  • 21. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 18 4.2.3. Evaluate model accuracy 4.3. Experiment 4.3.1. Experimental data The experiment is performed on two different datasets: real rating dataset (MovieLens) and binary rating dataset (MSWeb). 4.3.2. Compare the model's accuracy on binary rating dataset and real rating dataset The accuracy of model is based on three metrics Precision, Recall and Fmeasure (Figure 4.2) shows the recommendation results on binary rating dataset higher precision than the recommendation results on real rating dataset. Figure 4.2. The diagram shows the pattern with high precision on binary rating dataset 4.3.3. Compare the accuracy of the model with other collaborative recommender models The accuracy of the proposed model is compared to the accuracy of the collaborative filtering recommender models: User-based collaborative filtering (UBCF), Item-based collaborative filtering (IBCF) and Collaborative filtering recommender based on association rules (AR). Kết quả so sánh
  • 22. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 19 được trình bày trong hình 4.3. The comparison results are presented in Figure 4.3. Figure 4.3. Compare the accuracy of the recommender models This result shows that the collaborative filtering recommender model based on Implication intensity is highly accurate on two empirical datasets. In particular, on binary rating dataset (MSWeb), the model has far more precision than the other models. 4.4. Conclusion Chapter 4 The contribution of this chapter is to propose collaborative filtering recommender model based on Implication intensity. Experiment the model on two standard datasets and compare the evaluation results with other effective collaborative filtering recommender models. The experimental results show that using of asymmetric association rules based on statistical implication analysis method can increase the accuracy of the collaborative filtering recommender model.
  • 23. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 20 CHAPTER 5: COLLABORATIVE FILTERING RECOMMENDER MODEL BASED ON STATISTICAL IMPLICATIVE SIMILARITY MEASURES This chapter presents steps to develop a similarity measures based on the asymmetric approach for the user-based collaborative filtering recommender model. The similarity measures between the two users is based on association rules and Implication intensity. The research results of this chapter have been published in works (7), (8) in the published list by author. 5.1. Similarity measures based on Implication intensity 5.1.1. The statistical implicative similarity measures between two users The statistical implicative similarity value between two users , is determined by the following formula: ∑k II(r ) SIS(u , u ) = 1 − i=1 i k Of which: - SIS(u ,u ): is similarity value between two users u ,u ; - II(ri): is implication intensity value of association rule ri; - k: is the number of association rule set that generated from rating dataset of two users u ,u . 5.1.2. Algorithm to measure statistical implicative similarity value between two users Input:Rating dataset for items of two users u , u ; Output: Similarity value between two users u , u ; Begin Step 1: Generate association rules from user's rating matrix Step 2:Select association rules for two users u , u < Select items are rated by user u : u >;
  • 24. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 21 <Select items are not rated by user u : u ̅ >; <Select association rules of the form: → Y with ∈ u ; ∈ u ̅ và ∩ =∅>; Step 3: Define the parameters , , ,̅ for selected rules For each selected rule in selected association rule set do <Define the parameters , , , ̅ >; Step 4: Calculate implication intensity value for selected set of association rules For each selected rule in selected association rule set do <Calculate implication intensity value>; Step 5: Calculate similarity value between two users u , u ( (u , u )) <Calculate mean of implication intensity values of rule set: ̅>; <Similarity value between two users u , u : (u , u ) = 1 − ̅>; End. 5.2. Collaborative filtering recommender model based on statistical implicative similarity measures 5.2.1. Definition of recommender model based on statistical implicative similarity measures The collaborative filtering recommender model based on statistical implicative similarity measures is defined as follows: =< , , , > Of which: - = { 1, 2, … , } is a set n users in system; - = { 1, 2, … , } is a set m items in system; - = { , } is rating matrix of users for items, , is rating value of user for item ; - ∶ → is a function to find out the items that need to recommend for user ∈ U (with = { 1, 2, … , }).
  • 25. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 22 5.2.2. Recommendation algorithm based on on statistical implicative similarity measures Input: - User set U; Item set I, Rating matrix R; - User need recommendation result ; Output: Recommendation items for user : Iua = {i1, i2, … , iN}; Begin Step 1: Determine the list of k users who are similar to user For each user ∈ do <Determine similarity value between and by statistical implicative similarity measures: ( , )>; <Sort descending user list by similarity values>; <Select first k users with highest similarity values: ( )>; Step 2: Calculate the predicted rating value for items <Identify items that user has not rated yet ( , = ∅)>; <Calculate the predicted rating value for items by the formula: ̂ = 1 , ∑ ∈ ( ) , , , with , is similarity value between user and ∑ ∈ ( ) , user ; , is rating value of user for item >; Step 3: Select recommendation items for user <Sort descending item list by predicted rating values>; <Select N items with highest predicted rating values to recommend to user >; End. 5.3. Experiment 5.3.1. Experimental data The experiment is performed on two datasets: the MovieLense dataset and the MSWeb dataset. 4.3.2. Evaluate the model on real rating dataset Table 5.4. Present comparison error parameters of two models RMSE MSE MAE Model using statistical 0.8961562 0.8030960 0.7077939 implicative similarity measures Model using similarity Pearson 0.9796664 0.9597462 0.7704055 measures
  • 26. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 23 Table 5.4 shows that values of error parameters (RMSE, MSE, MAE) of the model are lower than the model using similarity Pearson measures. 4.3.3. Evaluate the model on binary rating dataset The results of the model was compared with the model using Jaccard by drawing the ROC diagram for Precision and Recall ratios in figure 5.8. The diagram shows that the value of Precision and Recall ratios on a model using statistical implicative similarity measures are higher than the value of Precision and Recall ratios on a model using Jaccard measures. Figure 5.8. ROC diagram for Precision and Recall ratios of two models 4.4. Conclusion chapter 5 The contribution of this chapter presents steps for developing a new similarity measures for User-based collaborative filtering recommender model. The similarity measure between two users is based on association rules and Implication intensity. It calls statistical implicative similarity measures.
  • 27. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 24 CONCLUSION AND FUTURE WORKS 1. The results of the thesis - Research overview of objective interestingness measures, statistical implication analysis method, tendency of variation in statistical implications, and recommender models. - Propose a new method for classification objective interestingness measures based on asymmetric approach using statistical implication parameters. - Propose recommender model based on Implication index. This model solves for the class of recommendation problems where the condition attributes and decision attributes on the same object. - Propose a collaborative filtering recommender model based on Implication intensity. This model based on association rules and Implication intensity to provide recommendation results. - Propose statistical implicative similarity measures based on asymmetric approaches to improve the accuracy of the User- based collaborative filtering recommender models. - Develop experimental toolkit (ARQAT) on R language including functionality to meet the experimental deployment of the thesis. 2. Future works - Build similarity kernel based on statistical implicative analysis method for collaborative filtering recommender model to reduce the computational time for online recommender systems that respond quickly to user requests. - Develop collaborative filtering recommender model based on implicative hierarchy graph.
  • 28. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 THE PUBLISHED ARTICLES 1. Phan Quốc Nghĩa, Nguyễn Minh Kỳ, Nguyễn Tấn Hoàng, Huỳnh Xuân Hiệp, “Hệ tư vấn dựa trên tiếp cận hàm ý thống kê”, Kỷ yếu Hội nghị Quốc gia lần thứ VIII về Nghiên cứu cơ bản và ứng dụng Công Nghệ thông tin (FAIR), pp. 297-308, 2015. 2. Phan Quốc Nghĩa và Huỳnh Xuân Hiệp, “Hệ tư vấn dựa trên khuynh hướng biến thiên hàm ý thống kê”, Hội thảo quốc gia lần thứ XVIII: Một số vấn đề chọn lọc của Công nghệ thông tin và truyền thông, Chủ đề: Xử lý dữ liệu lớn, pp. 93-99, 2015. 3. Nghia Quoc Phan, Hiep Xuan Huynh, Fabrice Guillet, Régis Gras, “Classifying objective interestingness measures based on the tendency of value variation”, Quaderni di Ricerca in Didattica, ISSN 1592-4424, pp. 143-172, 2015. 4. Nghia Quoc Phan, Vinh Cong Phan, Hung Huu Huynh, Hiep Xuan Huynh, “Clustering the objective interestingness measures based on tendency of variation in statistical implications”, EAI Endorsed Transactions on Context-Aware Systems and Applications, Vol. 3, No. 9, pp. 1-8, 2016. 5. Nghia Quoc Phan, Phuong Hoai Dang, Hiep Xuan Huynh, “Similarity Kernel for User-based Collaborative Filtering Recommendation System”, The 2016 IEEE RIVF International Conference on Computing & Communication Technologies, Addendum proceeding, pp. 40-46, 2016. 6. Phan Quốc Nghĩa, Đặng Hoài Phương, Huỳnh Xuân Hiệp, “Lựa chọn mô hình và tham số cho bài toán tư vấn lọc cộng tác dựa trên đồ thị đánh giá”, Tạp chí Khoa học Trường Đại học Cần Thơ, Số chuyên đề: Công nghệ thông tin (2017), ISSN 1859- 2333, pp. 171-178, 2017. 7. Phan Quốc Nghĩa, Nguyễn Minh Kỳ, Đặng Hoài Phương, Huỳnh Xuân Hiệp, “Hệ tư vấn lọc cộng tác theo người dùng dựa
  • 29. Nhận viết đề tài trọn gói – ZL: 0909 23 26 20– Luanvanmaster.com TẢI TÀI LIỆU KẾT BẠN ZALO : 0909 23 26 20 trên độ đo hàm ý thống kê”, Kỷ yếu Hội nghị Quốc gia lần thứ IX về Nghiên cứu cơ bản và ứng dụng Công Nghệ thông tin (FAIR), pp. 231-239, 2016. 8. Nghia Quoc Phan, Phuong Hoai Dang, Hiep Xuan Huynh, “Statistical Implicative Similarity Measures for Userbased Collaborative Filtering Recommender System”, (IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 7, No. 11 (ISI/WoS ESCI index), pp. 140- 146, 2016. 9. Lan Phuong Phan, Nghia Quoc Phan, Vinh Cong Phan, Hung Huu Huynh, Hiep Xuan Huynh, and Fabrice Guillet, “Classification of objective interestingness measures”, EAI Endorsed Transactions on Context-Aware Systems and Applications, Vol. 3, No. 10, pp. 1-13, 2016. 10.Lan Phuong Phan, Nghia Quoc Phan, Ky Minh Nguyen, Hung Huu Huynh, Hiep Xuan Huynh, and Fabrice Guillet, “Interestingnesslab: A Framework for Developing and Using Objective Interestingness Measures”, In ICTA 2016: International Conference on Advances in Information and Communication Technology, Springer-Verlag, pp. 302-311, 2016. 11.Phan Quốc Nghĩa, Đặng Hoài Phương, Huỳnh Xuân Hiệp, “Mô hình tư vấn lọc cộng tác tích hợp dựa trên tương đồng sản phẩm”, Tạp chí Khoa học và Công nghệ - Đại học Đà Nẵng, Vol. 1(110), pp. 55-58, 2017. 12.Nghia Quoc Phan, Phuong Hoai Dang, Hiep Xuan Huynh, “Collaborative recommenderation based on statistical implication rules”, Journal of Computer Science and Cybernetics, Vol. 33, No. 3, pp. 1-16, 2017.