SlideShare a Scribd company logo
1 of 7
Download to read offline
Department of Geomatics, National Cheng Kung University
[106-2] Data Mining, Homework 3, Instructor: Prof. Hsueh-Chan Lu
Muhammad Irsyadi Firdaus (P66067055)
Using R tool to analyze the association relationships of all multiple check questions from my all collected
data.
 Try to set a suitable min_support and min_confidence threshold to mine the (Maximal / Closed) association
rules and show your mining results.
 Write a short report to summarize what do you get / find after association relationship analysis.
 Hint: You can sort your patterns by support / confidence/ lift and try to explain why these events have
frequent positive / negative association relationships.
Answers
I use data on HW1 and HW2 about hobbies on vacation. The number of recorders in my data is 68. I use
data titled "What is the place you most like to visit?" With lots of options like Beach, Museum, Historic
Heritage, Natural Ecology, Mountains, Events & Festivals, Recreation Area, Department Stores , Night
Market, Jungle.
To try to set min_Support and min_confidence threshold to mine the association rules, first is to store the
data in .txt format or another with the title Place_to_Visit.txt. In this case I will use RStudio software to
do the calculation.
If I use a priori standard, I will get the result as below with the parameters that I use is
The results can be seen below:
οƒ˜ I will use min_support = 20% and min_confidence = 40%, it can be seen below:
The purpose of the above result is for support, s {Night Market} => {Department Stores} has count is 14
so the support value is 20.58% because 14/68 = 0.205. The rule is confidence is obtained by diving the
support count for {Night Market, Department Stores} by the support count for {Night Market}. Since
there are 22 transactions that contain Night Market, the confidence for this rule is 14/22 = 0.636.
An association rule is an implication expression of the form 𝑋𝑋 β†’ π‘Œπ‘Œ, where X and Y are disjoint itemsets,
i.e., 𝑋𝑋 ∩ π‘Œπ‘Œ = βˆ…. The strength of an association rule can be measured in terms of support and confidence.
Support determines how often a rule is applicable to given data set, while confidence determines how
frequently items in Y appear in transactions that contain X. The formal definitions of these metrics are
𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆, 𝑠𝑠(𝑋𝑋 β†’ π‘Œπ‘Œ) =
𝜎𝜎(𝑋𝑋βˆͺπ‘Œπ‘Œ)
𝑁𝑁
....................................... 1
𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢, 𝑐𝑐(𝑋𝑋 β†’ π‘Œπ‘Œ) =
𝜎𝜎(𝑋𝑋βˆͺπ‘Œπ‘Œ)
𝜎𝜎(𝑋𝑋)
..................................2
Support is an important measure because a rule that has very low support may occur simply by chance. A
low support rule is also likely to be uninteresting from a business perspective because it may not be
profitable to promote tourist attractions to tourists. For these reasons, supports is often used to eliminate
uninteresting rules. Support also has a desirable property that can be exploited for the efficient discovery
of association rules.
Confidence, on the other hand, measures the reliability of the inference made by a rule. For a given rule
𝑋𝑋 β†’, the higher the confidence, the more likely it is for Y to be present in transactions that contain X.
Confidence also provides an estimate of the conditional probability of Y given X.
Association analysis results should be interpreted with caution. The inference made by an association rule
does not necessarily imply causality. Instead, it suggests a strong co-occurrence relationship between items
in the antecedent and consequent of the rule. Causality, on the other hand, requires knowledge about the
causal and effect attributes in the data and typically involves relationships occurring over time (e.g., ozone
depletion leads to global warming).
Frequent Itemset Generation
A lattice structure can be used to enumerate the list of all possible itemsets. In general, a data set that
contains k items can potentially generate up to 2k
βˆ’ 1 frequent itemsets, excluding the null set. Because k
can be very large in many practical applications, the search space of itemsets that need to be explored is
exponentially large. A brute-force approach for finding frequent itemsets is to determine the support count
for every candidate itemset in the lattice structure
οƒ˜ I use min_support = 20%, min_confidence = 40%, minl = 2, and target = frequent itemsets. The results
can be seen below:
οƒ˜ I use min_support = 10%, min_confidence = 40%, minl = 3, and target = frequent itemsets. The results
can be seen below:
Maximal Frequent Itemsets
A maximal frequent itemset is defined as a frequent itemset for which none of its immediate supersets
are frequent.
οƒ˜ I use min_support = 20%, min_confidence = 40%, minl = 1, and target = maximal. The results can be
seen below:
Maximal frequent itemsets effectively provide a compact representation of frequent itemsets. In other
words, they form the smallest set of itemsets from which all frequent itemsets can be derived. Maximal
frequent itemsets provide a valuable representation for data sets that can produce very long, frequent
itemsets, as there are exponentially many frequent itemsets in such data. Nevertheless, this approach is
practical only if an efficient algorithm exists to explicitly find the maximal frequent itemsets without
having to enumerate all their subsets. Despite providing a compact representation, maximal frequent
itemsets do not contain the support information of their subsets An additional pass over the data set is
therefore needed to determine the support counts of the non-maximal frequent itemsets. In some cases, it
might be desirable to have a minimal representation of frequent itemsets that preserves the support
information.
Closed Frequent Itemsets
Closed itemsets provide a minimal representation of itemsets without losing their support information. An
itemset X is closed if none of its immediate supersets has exactly the same support count as X. An itemset
is a closed frequent itemset if it is closed and its support is greater than or equal to min_support. We can
use the closed frequent itemsets to determine the support counts for the non-closed frequent itemsets.
οƒ˜ I use min_support = 20%, min_confidence = 40%, minl = 1, and target = closed. The results can be
seen below:
Closed frequent itemsets are useful for removing some of the redundant association rules. An association
rule X βˆ’β†’ Y is redundant if there exists another rule Xβ€² βˆ’β†’ Yβ€², where X is a subset of Xβ€² and Y
is a subset of Yβ€², such that the support and confidence for both rules are identical. Such redundant rules
are not generated if closed frequent itemsets are used for rule generation. Finally, note that all maximal
frequent itemsets are closed because none of the maximal frequent itemsets can have the same support
count as their immediate supersets.
Lift
One way to address this problem is by applying a metric known as lift:
𝐿𝐿𝐿𝐿𝐿𝐿𝐿𝐿 =
𝑐𝑐(𝐴𝐴→𝐡𝐡)
𝑠𝑠(𝐡𝐡)
...................... 3
which computes the ratio between the rule’s confidence and the support of the itemset in the rule
consequent. For binary variables, lift is equivalent to another objective measure called interest factor.
which is defined as follows:
𝐼𝐼( 𝐴𝐴, 𝐡𝐡) =
𝑠𝑠(𝐴𝐴,𝐡𝐡)
𝑠𝑠(𝐴𝐴) Γ— 𝑠𝑠(𝐡𝐡)
=
𝑁𝑁𝑓𝑓11
𝑓𝑓1+ 𝑓𝑓1+
.....................................4
Interest factor compares the frequency of a pattern against a baseline frequency computed under the
statistical independence assumption. The baseline frequency for a pair of mutually independent variables
is
𝑓𝑓11
𝑁𝑁
=
𝑓𝑓1+
𝑁𝑁
Γ—
𝑓𝑓1+
𝑁𝑁
, or equivalently 𝑓𝑓11 =
𝑓𝑓1+ 𝑓𝑓1+
𝑁𝑁
....................5
Using Equations 4 and 5, we can interpret the measure as follows:
𝐼𝐼(𝐴𝐴, 𝐡𝐡) �
= 1, if A and B are independent
> 1, if A and B are positively correlated
< 1, if A and B are negatively correlated
So when using min_support = 20% and min_confidence = 40%, then the result can be concluded:
- (Lift) is equal to 1, there is no association rule/correlation (Independent)
- (Lift) is larger than 1, there is positive association rule
- (Lift) is smaller than 1, there is negative association rule.
No data found has a negative correlation
We can also sort the data patterns by support / confidence / lift as below:
- Support
- Confidence
- Lift
Appendix
Algorithms Association Rule Analysis in RStudio
Association Rule (Data Mining) - Frequent Itemset Generation, Closed Frequent Itemsets, Maximal Frequent Itemsets

More Related Content

What's hot

Implementation of Knowledge-Based Expert System Using Probabilistic Network M...
Implementation of Knowledge-Based Expert System Using Probabilistic Network M...Implementation of Knowledge-Based Expert System Using Probabilistic Network M...
Implementation of Knowledge-Based Expert System Using Probabilistic Network M...rahulmonikasharma
Β 
03 Data Mining Techniques
03 Data Mining Techniques03 Data Mining Techniques
03 Data Mining TechniquesValerii Klymchuk
Β 
Two Layer k-means based Consensus Clustering for Rural Health Information System
Two Layer k-means based Consensus Clustering for Rural Health Information SystemTwo Layer k-means based Consensus Clustering for Rural Health Information System
Two Layer k-means based Consensus Clustering for Rural Health Information SystemIRJET Journal
Β 
Statistics for data scientists
Statistics for  data scientistsStatistics for  data scientists
Statistics for data scientistsAjay Ohri
Β 
Data Science - Part V - Decision Trees & Random Forests
Data Science - Part V - Decision Trees & Random Forests Data Science - Part V - Decision Trees & Random Forests
Data Science - Part V - Decision Trees & Random Forests Derek Kane
Β 
Data Science - Part III - EDA & Model Selection
Data Science - Part III - EDA & Model SelectionData Science - Part III - EDA & Model Selection
Data Science - Part III - EDA & Model SelectionDerek Kane
Β 
Data Mining using SAS
Data Mining using SASData Mining using SAS
Data Mining using SASTanu Puri
Β 
Analyzing Adverse Drug Events Using Data Mining Approach
Analyzing Adverse Drug Events Using Data Mining ApproachAnalyzing Adverse Drug Events Using Data Mining Approach
Analyzing Adverse Drug Events Using Data Mining ApproachRupal7
Β 
Exam Short Preparation on Data Analytics
Exam Short Preparation on Data AnalyticsExam Short Preparation on Data Analytics
Exam Short Preparation on Data AnalyticsHarsh Parekh
Β 
IRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence ChainIRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence ChainIRJET Journal
Β 
A NEW ASSOCIATION RULE MINING BASED ON FREQUENT ITEM SET
A NEW ASSOCIATION RULE MINING BASED  ON FREQUENT ITEM SETA NEW ASSOCIATION RULE MINING BASED  ON FREQUENT ITEM SET
A NEW ASSOCIATION RULE MINING BASED ON FREQUENT ITEM SETcscpconf
Β 
Data analysis_PredictingActivity_SamsungSensorData
Data analysis_PredictingActivity_SamsungSensorDataData analysis_PredictingActivity_SamsungSensorData
Data analysis_PredictingActivity_SamsungSensorDataKaren Yang
Β 
A statistical data fusion technique in virtual data integration environment
A statistical data fusion technique in virtual data integration environmentA statistical data fusion technique in virtual data integration environment
A statistical data fusion technique in virtual data integration environmentIJDKP
Β 
BAS 250 Lecture 3
BAS 250 Lecture 3BAS 250 Lecture 3
BAS 250 Lecture 3Wake Tech BAS
Β 
Hybrid Model using Unsupervised Filtering Based on Ant Colony Optimization an...
Hybrid Model using Unsupervised Filtering Based on Ant Colony Optimization an...Hybrid Model using Unsupervised Filtering Based on Ant Colony Optimization an...
Hybrid Model using Unsupervised Filtering Based on Ant Colony Optimization an...IRJET Journal
Β 
INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...
INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...
INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...IJDKP
Β 
Column store decision tree classification of unseen attribute set
Column store decision tree classification of unseen attribute setColumn store decision tree classification of unseen attribute set
Column store decision tree classification of unseen attribute setijma
Β 

What's hot (20)

Implementation of Knowledge-Based Expert System Using Probabilistic Network M...
Implementation of Knowledge-Based Expert System Using Probabilistic Network M...Implementation of Knowledge-Based Expert System Using Probabilistic Network M...
Implementation of Knowledge-Based Expert System Using Probabilistic Network M...
Β 
02 Related Concepts
02 Related Concepts02 Related Concepts
02 Related Concepts
Β 
03 Data Mining Techniques
03 Data Mining Techniques03 Data Mining Techniques
03 Data Mining Techniques
Β 
Two Layer k-means based Consensus Clustering for Rural Health Information System
Two Layer k-means based Consensus Clustering for Rural Health Information SystemTwo Layer k-means based Consensus Clustering for Rural Health Information System
Two Layer k-means based Consensus Clustering for Rural Health Information System
Β 
G
GG
G
Β 
Statistics for data scientists
Statistics for  data scientistsStatistics for  data scientists
Statistics for data scientists
Β 
Data Science - Part V - Decision Trees & Random Forests
Data Science - Part V - Decision Trees & Random Forests Data Science - Part V - Decision Trees & Random Forests
Data Science - Part V - Decision Trees & Random Forests
Β 
Data Science - Part III - EDA & Model Selection
Data Science - Part III - EDA & Model SelectionData Science - Part III - EDA & Model Selection
Data Science - Part III - EDA & Model Selection
Β 
Data Mining using SAS
Data Mining using SASData Mining using SAS
Data Mining using SAS
Β 
Analyzing Adverse Drug Events Using Data Mining Approach
Analyzing Adverse Drug Events Using Data Mining ApproachAnalyzing Adverse Drug Events Using Data Mining Approach
Analyzing Adverse Drug Events Using Data Mining Approach
Β 
Exam Short Preparation on Data Analytics
Exam Short Preparation on Data AnalyticsExam Short Preparation on Data Analytics
Exam Short Preparation on Data Analytics
Β 
IRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence ChainIRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence Chain
Β 
A NEW ASSOCIATION RULE MINING BASED ON FREQUENT ITEM SET
A NEW ASSOCIATION RULE MINING BASED  ON FREQUENT ITEM SETA NEW ASSOCIATION RULE MINING BASED  ON FREQUENT ITEM SET
A NEW ASSOCIATION RULE MINING BASED ON FREQUENT ITEM SET
Β 
Data analysis_PredictingActivity_SamsungSensorData
Data analysis_PredictingActivity_SamsungSensorDataData analysis_PredictingActivity_SamsungSensorData
Data analysis_PredictingActivity_SamsungSensorData
Β 
A statistical data fusion technique in virtual data integration environment
A statistical data fusion technique in virtual data integration environmentA statistical data fusion technique in virtual data integration environment
A statistical data fusion technique in virtual data integration environment
Β 
BAS 250 Lecture 3
BAS 250 Lecture 3BAS 250 Lecture 3
BAS 250 Lecture 3
Β 
Data Mining Technniques
Data Mining TechnniquesData Mining Technniques
Data Mining Technniques
Β 
Hybrid Model using Unsupervised Filtering Based on Ant Colony Optimization an...
Hybrid Model using Unsupervised Filtering Based on Ant Colony Optimization an...Hybrid Model using Unsupervised Filtering Based on Ant Colony Optimization an...
Hybrid Model using Unsupervised Filtering Based on Ant Colony Optimization an...
Β 
INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...
INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...
INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...
Β 
Column store decision tree classification of unseen attribute set
Column store decision tree classification of unseen attribute setColumn store decision tree classification of unseen attribute set
Column store decision tree classification of unseen attribute set
Β 

Similar to Association Rule (Data Mining) - Frequent Itemset Generation, Closed Frequent Itemsets, Maximal Frequent Itemsets

An apriori based algorithm to mine association rules with inter itemset distance
An apriori based algorithm to mine association rules with inter itemset distanceAn apriori based algorithm to mine association rules with inter itemset distance
An apriori based algorithm to mine association rules with inter itemset distanceIJDKP
Β 
Introduction To Multilevel Association Rule And Its Methods
Introduction To Multilevel Association Rule And Its MethodsIntroduction To Multilevel Association Rule And Its Methods
Introduction To Multilevel Association Rule And Its MethodsIJSRD
Β 
Pattern Discovery Using Apriori and Ch-Search Algorithm
 Pattern Discovery Using Apriori and Ch-Search Algorithm Pattern Discovery Using Apriori and Ch-Search Algorithm
Pattern Discovery Using Apriori and Ch-Search Algorithmijceronline
Β 
5Association AnalysisBasic Concepts an.docx
 5Association AnalysisBasic Concepts an.docx 5Association AnalysisBasic Concepts an.docx
5Association AnalysisBasic Concepts an.docxShiraPrater50
Β 
IRJET- Minning Frequent Patterns,Associations and Correlations
IRJET-  	  Minning Frequent Patterns,Associations and CorrelationsIRJET-  	  Minning Frequent Patterns,Associations and Correlations
IRJET- Minning Frequent Patterns,Associations and CorrelationsIRJET Journal
Β 
A model for profit pattern mining based on genetic algorithm
A model for profit pattern mining based on genetic algorithmA model for profit pattern mining based on genetic algorithm
A model for profit pattern mining based on genetic algorithmeSAT Journals
Β 
A model for profit pattern mining based on genetic algorithm
A model for profit pattern mining based on genetic algorithmA model for profit pattern mining based on genetic algorithm
A model for profit pattern mining based on genetic algorithmeSAT Journals
Β 
A novel association rule mining and clustering based hybrid method for music ...
A novel association rule mining and clustering based hybrid method for music ...A novel association rule mining and clustering based hybrid method for music ...
A novel association rule mining and clustering based hybrid method for music ...eSAT Publishing House
Β 
Mining Negative Association Rules
Mining Negative Association RulesMining Negative Association Rules
Mining Negative Association RulesIOSR Journals
Β 
Top Down Approach to find Maximal Frequent Item Sets using Subset Creation
Top Down Approach to find Maximal Frequent Item Sets using Subset CreationTop Down Approach to find Maximal Frequent Item Sets using Subset Creation
Top Down Approach to find Maximal Frequent Item Sets using Subset Creationcscpconf
Β 
Comparative study of frequent item set in data mining
Comparative study of frequent item set in data miningComparative study of frequent item set in data mining
Comparative study of frequent item set in data miningijpla
Β 
An Optimal Approach to derive Disjunctive Positive and Negative Rules from As...
An Optimal Approach to derive Disjunctive Positive and Negative Rules from As...An Optimal Approach to derive Disjunctive Positive and Negative Rules from As...
An Optimal Approach to derive Disjunctive Positive and Negative Rules from As...IOSR Journals
Β 
RDataMining slides-association-rule-mining-with-r
RDataMining slides-association-rule-mining-with-rRDataMining slides-association-rule-mining-with-r
RDataMining slides-association-rule-mining-with-rYanchang Zhao
Β 
Association Rule.ppt
Association Rule.pptAssociation Rule.ppt
Association Rule.pptSowmyaJyothi3
Β 
Association Rule.ppt
Association Rule.pptAssociation Rule.ppt
Association Rule.pptSowmyaJyothi3
Β 
Data mining approaches and methods
Data mining approaches and methodsData mining approaches and methods
Data mining approaches and methodssonangrai
Β 
A Survey on Identification of Closed Frequent Item Sets Using Intersecting Al...
A Survey on Identification of Closed Frequent Item Sets Using Intersecting Al...A Survey on Identification of Closed Frequent Item Sets Using Intersecting Al...
A Survey on Identification of Closed Frequent Item Sets Using Intersecting Al...IOSR Journals
Β 
Classifiers
ClassifiersClassifiers
ClassifiersAyurdata
Β 

Similar to Association Rule (Data Mining) - Frequent Itemset Generation, Closed Frequent Itemsets, Maximal Frequent Itemsets (20)

An apriori based algorithm to mine association rules with inter itemset distance
An apriori based algorithm to mine association rules with inter itemset distanceAn apriori based algorithm to mine association rules with inter itemset distance
An apriori based algorithm to mine association rules with inter itemset distance
Β 
Introduction To Multilevel Association Rule And Its Methods
Introduction To Multilevel Association Rule And Its MethodsIntroduction To Multilevel Association Rule And Its Methods
Introduction To Multilevel Association Rule And Its Methods
Β 
Pattern Discovery Using Apriori and Ch-Search Algorithm
 Pattern Discovery Using Apriori and Ch-Search Algorithm Pattern Discovery Using Apriori and Ch-Search Algorithm
Pattern Discovery Using Apriori and Ch-Search Algorithm
Β 
5Association AnalysisBasic Concepts an.docx
 5Association AnalysisBasic Concepts an.docx 5Association AnalysisBasic Concepts an.docx
5Association AnalysisBasic Concepts an.docx
Β 
IRJET- Minning Frequent Patterns,Associations and Correlations
IRJET-  	  Minning Frequent Patterns,Associations and CorrelationsIRJET-  	  Minning Frequent Patterns,Associations and Correlations
IRJET- Minning Frequent Patterns,Associations and Correlations
Β 
A model for profit pattern mining based on genetic algorithm
A model for profit pattern mining based on genetic algorithmA model for profit pattern mining based on genetic algorithm
A model for profit pattern mining based on genetic algorithm
Β 
A model for profit pattern mining based on genetic algorithm
A model for profit pattern mining based on genetic algorithmA model for profit pattern mining based on genetic algorithm
A model for profit pattern mining based on genetic algorithm
Β 
A novel association rule mining and clustering based hybrid method for music ...
A novel association rule mining and clustering based hybrid method for music ...A novel association rule mining and clustering based hybrid method for music ...
A novel association rule mining and clustering based hybrid method for music ...
Β 
Mining Negative Association Rules
Mining Negative Association RulesMining Negative Association Rules
Mining Negative Association Rules
Β 
Top Down Approach to find Maximal Frequent Item Sets using Subset Creation
Top Down Approach to find Maximal Frequent Item Sets using Subset CreationTop Down Approach to find Maximal Frequent Item Sets using Subset Creation
Top Down Approach to find Maximal Frequent Item Sets using Subset Creation
Β 
Comparative study of frequent item set in data mining
Comparative study of frequent item set in data miningComparative study of frequent item set in data mining
Comparative study of frequent item set in data mining
Β 
An Optimal Approach to derive Disjunctive Positive and Negative Rules from As...
An Optimal Approach to derive Disjunctive Positive and Negative Rules from As...An Optimal Approach to derive Disjunctive Positive and Negative Rules from As...
An Optimal Approach to derive Disjunctive Positive and Negative Rules from As...
Β 
RDataMining slides-association-rule-mining-with-r
RDataMining slides-association-rule-mining-with-rRDataMining slides-association-rule-mining-with-r
RDataMining slides-association-rule-mining-with-r
Β 
Association Rule.ppt
Association Rule.pptAssociation Rule.ppt
Association Rule.ppt
Β 
Association Rule.ppt
Association Rule.pptAssociation Rule.ppt
Association Rule.ppt
Β 
I43055257
I43055257I43055257
I43055257
Β 
Ijcet 06 06_003
Ijcet 06 06_003Ijcet 06 06_003
Ijcet 06 06_003
Β 
Data mining approaches and methods
Data mining approaches and methodsData mining approaches and methods
Data mining approaches and methods
Β 
A Survey on Identification of Closed Frequent Item Sets Using Intersecting Al...
A Survey on Identification of Closed Frequent Item Sets Using Intersecting Al...A Survey on Identification of Closed Frequent Item Sets Using Intersecting Al...
A Survey on Identification of Closed Frequent Item Sets Using Intersecting Al...
Β 
Classifiers
ClassifiersClassifiers
Classifiers
Β 

More from National Cheng Kung University

Accuracy assessment and 3D Mapping by Consumer Grade Spherical Camera
Accuracy assessment and 3D Mapping by Consumer Grade Spherical CameraAccuracy assessment and 3D Mapping by Consumer Grade Spherical Camera
Accuracy assessment and 3D Mapping by Consumer Grade Spherical CameraNational Cheng Kung University
Β 
3D Rekonstruksi Bangunan Menggunakan Gambar Panorama Sebagai Upaya Untuk Miti...
3D Rekonstruksi Bangunan Menggunakan Gambar Panorama Sebagai Upaya Untuk Miti...3D Rekonstruksi Bangunan Menggunakan Gambar Panorama Sebagai Upaya Untuk Miti...
3D Rekonstruksi Bangunan Menggunakan Gambar Panorama Sebagai Upaya Untuk Miti...National Cheng Kung University
Β 
3D Rekonstruksi Bangunan Menggunakan Gambar Panorama Sebagai Upaya Untuk Miti...
3D Rekonstruksi Bangunan Menggunakan Gambar Panorama Sebagai Upaya Untuk Miti...3D Rekonstruksi Bangunan Menggunakan Gambar Panorama Sebagai Upaya Untuk Miti...
3D Rekonstruksi Bangunan Menggunakan Gambar Panorama Sebagai Upaya Untuk Miti...National Cheng Kung University
Β 
3D Indoor and Outdoor Mapping from Point Cloud Generated by Spherical Camera
3D Indoor and Outdoor Mapping from Point Cloud Generated by Spherical Camera3D Indoor and Outdoor Mapping from Point Cloud Generated by Spherical Camera
3D Indoor and Outdoor Mapping from Point Cloud Generated by Spherical CameraNational Cheng Kung University
Β 
3D Indoor and Outdoor Mapping from Point Cloud Generated by Spherical Camera
3D Indoor and Outdoor Mapping from Point Cloud Generated by Spherical Camera3D Indoor and Outdoor Mapping from Point Cloud Generated by Spherical Camera
3D Indoor and Outdoor Mapping from Point Cloud Generated by Spherical CameraNational Cheng Kung University
Β 
Satellite Image Classification using Decision Tree, SVM and k-Nearest Neighbor
Satellite Image Classification using Decision Tree, SVM and k-Nearest NeighborSatellite Image Classification using Decision Tree, SVM and k-Nearest Neighbor
Satellite Image Classification using Decision Tree, SVM and k-Nearest NeighborNational Cheng Kung University
Β 
Optimal Filtering with Kalman Filters and Smoothers Using AndroSensor IMU Data
Optimal Filtering with Kalman Filters and Smoothers Using AndroSensor IMU DataOptimal Filtering with Kalman Filters and Smoothers Using AndroSensor IMU Data
Optimal Filtering with Kalman Filters and Smoothers Using AndroSensor IMU DataNational Cheng Kung University
Β 
Satellite Image Classification using Decision Tree, SVM and k-Nearest Neighbor
Satellite Image Classification using Decision Tree, SVM and k-Nearest NeighborSatellite Image Classification using Decision Tree, SVM and k-Nearest Neighbor
Satellite Image Classification using Decision Tree, SVM and k-Nearest NeighborNational Cheng Kung University
Β 
A Method of Mining Association Rules for Geographical Points of Interest
A Method of Mining Association Rules for Geographical Points of InterestA Method of Mining Association Rules for Geographical Points of Interest
A Method of Mining Association Rules for Geographical Points of InterestNational Cheng Kung University
Β 
Calibration of Inertial Sensor within Smartphone
Calibration of Inertial Sensor within SmartphoneCalibration of Inertial Sensor within Smartphone
Calibration of Inertial Sensor within SmartphoneNational Cheng Kung University
Β 
Pengukuran GPS Menggunakan Trimble Secara Manual
Pengukuran GPS Menggunakan Trimble Secara ManualPengukuran GPS Menggunakan Trimble Secara Manual
Pengukuran GPS Menggunakan Trimble Secara ManualNational Cheng Kung University
Β 
Accuracy Analysis of Three-Dimensional Model Reconstructed by Spherical Video...
Accuracy Analysis of Three-Dimensional Model Reconstructed by Spherical Video...Accuracy Analysis of Three-Dimensional Model Reconstructed by Spherical Video...
Accuracy Analysis of Three-Dimensional Model Reconstructed by Spherical Video...National Cheng Kung University
Β 
The rotation matrix (DCM) and quaternion in Inertial Survey and Navigation Sy...
The rotation matrix (DCM) and quaternion in Inertial Survey and Navigation Sy...The rotation matrix (DCM) and quaternion in Inertial Survey and Navigation Sy...
The rotation matrix (DCM) and quaternion in Inertial Survey and Navigation Sy...National Cheng Kung University
Β 
SIFT/SURF can achieve scale, rotation and illumination invariant during image...
SIFT/SURF can achieve scale, rotation and illumination invariant during image...SIFT/SURF can achieve scale, rotation and illumination invariant during image...
SIFT/SURF can achieve scale, rotation and illumination invariant during image...National Cheng Kung University
Β 
3D reconstruction by photogrammetry and 4D deformation measurement
3D reconstruction by photogrammetry and 4D deformation measurement3D reconstruction by photogrammetry and 4D deformation measurement
3D reconstruction by photogrammetry and 4D deformation measurementNational Cheng Kung University
Β 

More from National Cheng Kung University (20)

Accuracy assessment and 3D Mapping by Consumer Grade Spherical Camera
Accuracy assessment and 3D Mapping by Consumer Grade Spherical CameraAccuracy assessment and 3D Mapping by Consumer Grade Spherical Camera
Accuracy assessment and 3D Mapping by Consumer Grade Spherical Camera
Β 
3D Rekonstruksi Bangunan Menggunakan Gambar Panorama Sebagai Upaya Untuk Miti...
3D Rekonstruksi Bangunan Menggunakan Gambar Panorama Sebagai Upaya Untuk Miti...3D Rekonstruksi Bangunan Menggunakan Gambar Panorama Sebagai Upaya Untuk Miti...
3D Rekonstruksi Bangunan Menggunakan Gambar Panorama Sebagai Upaya Untuk Miti...
Β 
3D Rekonstruksi Bangunan Menggunakan Gambar Panorama Sebagai Upaya Untuk Miti...
3D Rekonstruksi Bangunan Menggunakan Gambar Panorama Sebagai Upaya Untuk Miti...3D Rekonstruksi Bangunan Menggunakan Gambar Panorama Sebagai Upaya Untuk Miti...
3D Rekonstruksi Bangunan Menggunakan Gambar Panorama Sebagai Upaya Untuk Miti...
Β 
3D Indoor and Outdoor Mapping from Point Cloud Generated by Spherical Camera
3D Indoor and Outdoor Mapping from Point Cloud Generated by Spherical Camera3D Indoor and Outdoor Mapping from Point Cloud Generated by Spherical Camera
3D Indoor and Outdoor Mapping from Point Cloud Generated by Spherical Camera
Β 
3D Indoor and Outdoor Mapping from Point Cloud Generated by Spherical Camera
3D Indoor and Outdoor Mapping from Point Cloud Generated by Spherical Camera3D Indoor and Outdoor Mapping from Point Cloud Generated by Spherical Camera
3D Indoor and Outdoor Mapping from Point Cloud Generated by Spherical Camera
Β 
Handbook PPI Tainan Taiwan 2018
Handbook PPI Tainan Taiwan 2018Handbook PPI Tainan Taiwan 2018
Handbook PPI Tainan Taiwan 2018
Β 
Satellite Image Classification using Decision Tree, SVM and k-Nearest Neighbor
Satellite Image Classification using Decision Tree, SVM and k-Nearest NeighborSatellite Image Classification using Decision Tree, SVM and k-Nearest Neighbor
Satellite Image Classification using Decision Tree, SVM and k-Nearest Neighbor
Β 
Optimal Filtering with Kalman Filters and Smoothers Using AndroSensor IMU Data
Optimal Filtering with Kalman Filters and Smoothers Using AndroSensor IMU DataOptimal Filtering with Kalman Filters and Smoothers Using AndroSensor IMU Data
Optimal Filtering with Kalman Filters and Smoothers Using AndroSensor IMU Data
Β 
Satellite Image Classification using Decision Tree, SVM and k-Nearest Neighbor
Satellite Image Classification using Decision Tree, SVM and k-Nearest NeighborSatellite Image Classification using Decision Tree, SVM and k-Nearest Neighbor
Satellite Image Classification using Decision Tree, SVM and k-Nearest Neighbor
Β 
EKF and RTS smoother toolbox
EKF and RTS smoother toolboxEKF and RTS smoother toolbox
EKF and RTS smoother toolbox
Β 
Kalman Filter Basic
Kalman Filter BasicKalman Filter Basic
Kalman Filter Basic
Β 
A Method of Mining Association Rules for Geographical Points of Interest
A Method of Mining Association Rules for Geographical Points of InterestA Method of Mining Association Rules for Geographical Points of Interest
A Method of Mining Association Rules for Geographical Points of Interest
Β 
DSM Extraction from Pleiades Images Using RSP
DSM Extraction from Pleiades Images Using RSPDSM Extraction from Pleiades Images Using RSP
DSM Extraction from Pleiades Images Using RSP
Β 
Calibration of Inertial Sensor within Smartphone
Calibration of Inertial Sensor within SmartphoneCalibration of Inertial Sensor within Smartphone
Calibration of Inertial Sensor within Smartphone
Β 
Pengukuran GPS Menggunakan Trimble Secara Manual
Pengukuran GPS Menggunakan Trimble Secara ManualPengukuran GPS Menggunakan Trimble Secara Manual
Pengukuran GPS Menggunakan Trimble Secara Manual
Β 
Accuracy Analysis of Three-Dimensional Model Reconstructed by Spherical Video...
Accuracy Analysis of Three-Dimensional Model Reconstructed by Spherical Video...Accuracy Analysis of Three-Dimensional Model Reconstructed by Spherical Video...
Accuracy Analysis of Three-Dimensional Model Reconstructed by Spherical Video...
Β 
The rotation matrix (DCM) and quaternion in Inertial Survey and Navigation Sy...
The rotation matrix (DCM) and quaternion in Inertial Survey and Navigation Sy...The rotation matrix (DCM) and quaternion in Inertial Survey and Navigation Sy...
The rotation matrix (DCM) and quaternion in Inertial Survey and Navigation Sy...
Β 
SIFT/SURF can achieve scale, rotation and illumination invariant during image...
SIFT/SURF can achieve scale, rotation and illumination invariant during image...SIFT/SURF can achieve scale, rotation and illumination invariant during image...
SIFT/SURF can achieve scale, rotation and illumination invariant during image...
Β 
3D reconstruction by photogrammetry and 4D deformation measurement
3D reconstruction by photogrammetry and 4D deformation measurement3D reconstruction by photogrammetry and 4D deformation measurement
3D reconstruction by photogrammetry and 4D deformation measurement
Β 
Shadow Detection Using MatLAB
Shadow Detection Using MatLABShadow Detection Using MatLAB
Shadow Detection Using MatLAB
Β 

Recently uploaded

Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
Β 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
Β 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
Β 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
Β 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
Β 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
Β 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
Β 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
Β 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
Β 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
Β 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
Β 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
Β 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
Β 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
Β 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95
Β 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
Β 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
Β 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
Β 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
Β 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEslot gacor bisa pakai pulsa
Β 

Recently uploaded (20)

Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
Β 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
Β 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
Β 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
Β 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
Β 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
Β 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
Β 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
Β 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
Β 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
Β 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
Β 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
Β 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
Β 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
Β 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
Β 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
Β 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
Β 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
Β 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
Β 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
Β 

Association Rule (Data Mining) - Frequent Itemset Generation, Closed Frequent Itemsets, Maximal Frequent Itemsets

  • 1. Department of Geomatics, National Cheng Kung University [106-2] Data Mining, Homework 3, Instructor: Prof. Hsueh-Chan Lu Muhammad Irsyadi Firdaus (P66067055) Using R tool to analyze the association relationships of all multiple check questions from my all collected data.  Try to set a suitable min_support and min_confidence threshold to mine the (Maximal / Closed) association rules and show your mining results.  Write a short report to summarize what do you get / find after association relationship analysis.  Hint: You can sort your patterns by support / confidence/ lift and try to explain why these events have frequent positive / negative association relationships. Answers I use data on HW1 and HW2 about hobbies on vacation. The number of recorders in my data is 68. I use data titled "What is the place you most like to visit?" With lots of options like Beach, Museum, Historic Heritage, Natural Ecology, Mountains, Events & Festivals, Recreation Area, Department Stores , Night Market, Jungle. To try to set min_Support and min_confidence threshold to mine the association rules, first is to store the data in .txt format or another with the title Place_to_Visit.txt. In this case I will use RStudio software to do the calculation. If I use a priori standard, I will get the result as below with the parameters that I use is The results can be seen below: οƒ˜ I will use min_support = 20% and min_confidence = 40%, it can be seen below:
  • 2. The purpose of the above result is for support, s {Night Market} => {Department Stores} has count is 14 so the support value is 20.58% because 14/68 = 0.205. The rule is confidence is obtained by diving the support count for {Night Market, Department Stores} by the support count for {Night Market}. Since there are 22 transactions that contain Night Market, the confidence for this rule is 14/22 = 0.636. An association rule is an implication expression of the form 𝑋𝑋 β†’ π‘Œπ‘Œ, where X and Y are disjoint itemsets, i.e., 𝑋𝑋 ∩ π‘Œπ‘Œ = βˆ…. The strength of an association rule can be measured in terms of support and confidence. Support determines how often a rule is applicable to given data set, while confidence determines how frequently items in Y appear in transactions that contain X. The formal definitions of these metrics are 𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆, 𝑠𝑠(𝑋𝑋 β†’ π‘Œπ‘Œ) = 𝜎𝜎(𝑋𝑋βˆͺπ‘Œπ‘Œ) 𝑁𝑁 ....................................... 1 𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢𝐢, 𝑐𝑐(𝑋𝑋 β†’ π‘Œπ‘Œ) = 𝜎𝜎(𝑋𝑋βˆͺπ‘Œπ‘Œ) 𝜎𝜎(𝑋𝑋) ..................................2 Support is an important measure because a rule that has very low support may occur simply by chance. A low support rule is also likely to be uninteresting from a business perspective because it may not be profitable to promote tourist attractions to tourists. For these reasons, supports is often used to eliminate uninteresting rules. Support also has a desirable property that can be exploited for the efficient discovery of association rules. Confidence, on the other hand, measures the reliability of the inference made by a rule. For a given rule 𝑋𝑋 β†’, the higher the confidence, the more likely it is for Y to be present in transactions that contain X. Confidence also provides an estimate of the conditional probability of Y given X. Association analysis results should be interpreted with caution. The inference made by an association rule does not necessarily imply causality. Instead, it suggests a strong co-occurrence relationship between items in the antecedent and consequent of the rule. Causality, on the other hand, requires knowledge about the causal and effect attributes in the data and typically involves relationships occurring over time (e.g., ozone depletion leads to global warming). Frequent Itemset Generation A lattice structure can be used to enumerate the list of all possible itemsets. In general, a data set that contains k items can potentially generate up to 2k βˆ’ 1 frequent itemsets, excluding the null set. Because k can be very large in many practical applications, the search space of itemsets that need to be explored is exponentially large. A brute-force approach for finding frequent itemsets is to determine the support count for every candidate itemset in the lattice structure οƒ˜ I use min_support = 20%, min_confidence = 40%, minl = 2, and target = frequent itemsets. The results
  • 3. can be seen below: οƒ˜ I use min_support = 10%, min_confidence = 40%, minl = 3, and target = frequent itemsets. The results can be seen below: Maximal Frequent Itemsets A maximal frequent itemset is defined as a frequent itemset for which none of its immediate supersets are frequent. οƒ˜ I use min_support = 20%, min_confidence = 40%, minl = 1, and target = maximal. The results can be seen below: Maximal frequent itemsets effectively provide a compact representation of frequent itemsets. In other words, they form the smallest set of itemsets from which all frequent itemsets can be derived. Maximal
  • 4. frequent itemsets provide a valuable representation for data sets that can produce very long, frequent itemsets, as there are exponentially many frequent itemsets in such data. Nevertheless, this approach is practical only if an efficient algorithm exists to explicitly find the maximal frequent itemsets without having to enumerate all their subsets. Despite providing a compact representation, maximal frequent itemsets do not contain the support information of their subsets An additional pass over the data set is therefore needed to determine the support counts of the non-maximal frequent itemsets. In some cases, it might be desirable to have a minimal representation of frequent itemsets that preserves the support information. Closed Frequent Itemsets Closed itemsets provide a minimal representation of itemsets without losing their support information. An itemset X is closed if none of its immediate supersets has exactly the same support count as X. An itemset is a closed frequent itemset if it is closed and its support is greater than or equal to min_support. We can use the closed frequent itemsets to determine the support counts for the non-closed frequent itemsets. οƒ˜ I use min_support = 20%, min_confidence = 40%, minl = 1, and target = closed. The results can be seen below: Closed frequent itemsets are useful for removing some of the redundant association rules. An association rule X βˆ’β†’ Y is redundant if there exists another rule Xβ€² βˆ’β†’ Yβ€², where X is a subset of Xβ€² and Y is a subset of Yβ€², such that the support and confidence for both rules are identical. Such redundant rules are not generated if closed frequent itemsets are used for rule generation. Finally, note that all maximal frequent itemsets are closed because none of the maximal frequent itemsets can have the same support count as their immediate supersets. Lift One way to address this problem is by applying a metric known as lift: 𝐿𝐿𝐿𝐿𝐿𝐿𝐿𝐿 = 𝑐𝑐(𝐴𝐴→𝐡𝐡) 𝑠𝑠(𝐡𝐡) ...................... 3 which computes the ratio between the rule’s confidence and the support of the itemset in the rule consequent. For binary variables, lift is equivalent to another objective measure called interest factor. which is defined as follows: 𝐼𝐼( 𝐴𝐴, 𝐡𝐡) = 𝑠𝑠(𝐴𝐴,𝐡𝐡) 𝑠𝑠(𝐴𝐴) Γ— 𝑠𝑠(𝐡𝐡) = 𝑁𝑁𝑓𝑓11 𝑓𝑓1+ 𝑓𝑓1+ .....................................4 Interest factor compares the frequency of a pattern against a baseline frequency computed under the
  • 5. statistical independence assumption. The baseline frequency for a pair of mutually independent variables is 𝑓𝑓11 𝑁𝑁 = 𝑓𝑓1+ 𝑁𝑁 Γ— 𝑓𝑓1+ 𝑁𝑁 , or equivalently 𝑓𝑓11 = 𝑓𝑓1+ 𝑓𝑓1+ 𝑁𝑁 ....................5 Using Equations 4 and 5, we can interpret the measure as follows: 𝐼𝐼(𝐴𝐴, 𝐡𝐡) οΏ½ = 1, if A and B are independent > 1, if A and B are positively correlated < 1, if A and B are negatively correlated So when using min_support = 20% and min_confidence = 40%, then the result can be concluded: - (Lift) is equal to 1, there is no association rule/correlation (Independent) - (Lift) is larger than 1, there is positive association rule - (Lift) is smaller than 1, there is negative association rule. No data found has a negative correlation We can also sort the data patterns by support / confidence / lift as below: - Support - Confidence
  • 6. - Lift Appendix Algorithms Association Rule Analysis in RStudio