The document discusses implementing the Apriori algorithm for association rule mining using the Weka data mining tool. It describes Apriori as a classical bottom-up algorithm for mining frequent itemsets and relevant association rules from transactional databases. It also outlines how to create a sample dataset in Excel, convert it to ARFF format, load it into Weka, apply the Apriori algorithm to generate association rules, and interpret the results.
2. A-PRIORI ALGORITHM
• A classical data mining algorithm.
• Used for mining frequent itemsets and relevant association rules.
• Uses a "bottom-up" approach: frequent itemsets are extended one item at a time.
• Designed to operate on databases containing many transactions.
• Produces association rules that satisfy minimum support and confidence thresholds.
Implementing A-priori algorithm using weka 12/10/2018 2
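The bottom-up search described above can be sketched in a few lines of plain Python. This is an illustrative toy implementation, not the code Weka runs; the basket data and the 0.5 minimum support are made up for the example:

```python
from itertools import combinations

def apriori(transactions, min_support):
    """Return all itemsets whose support (as a fraction) meets min_support."""
    n = len(transactions)
    # Start with frequent 1-itemsets (the "bottom" of the bottom-up search).
    items = {item for t in transactions for item in t}
    current = [frozenset([i]) for i in items
               if sum(i in t for t in transactions) / n >= min_support]
    frequent = list(current)
    k = 2
    while current:
        # Candidate k-itemsets: unions of frequent (k-1)-itemsets.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Keep only candidates whose support clears the threshold.
        current = [c for c in candidates
                   if sum(c <= t for t in transactions) / n >= min_support]
        frequent.extend(current)
        k += 1
    return frequent

# Toy market-basket data (hypothetical):
baskets = [frozenset(t) for t in
           [{"onion", "potato", "burger"},
            {"onion", "potato"},
            {"onion", "potato", "burger", "milk"},
            {"milk", "burger"}]]
print(sorted(tuple(sorted(s)) for s in apriori(baskets, 0.5)))
```

Because every subset of a frequent itemset must itself be frequent, pruning at each level keeps the candidate set small; that observation is the core of Apriori.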
3. ATTRIBUTE TYPES IN A-PRIORI
To run the Apriori algorithm, every attribute must be one of the following types –
Nominal
Binary
Unary
4. ASSOCIATION RULE
• A prominent and well-explored method for determining relations among variables in large databases.
• Helps to uncover relationships between seemingly unrelated data in a relational database.
• It has two parts –
Antecedent (if)
Consequent (then)
• Example –
Consider the association rule
{Onion, Potato} => {Burger}
which means that if onions and potatoes are bought together, customers are also likely to buy a burger.
• Rules are created by analyzing data for frequent if/then patterns and using the support and confidence criteria to identify the most important relationships.
5. SUPPORT
• The support of an itemset X, supp(X), is the proportion of transactions in the database in which the itemset X appears. It signifies the popularity of an itemset.
• supp(X) = (number of transactions in which X appears) / (total number of transactions)
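To make the formula concrete, here is a minimal sketch of the support computation in Python (the basket data and item names are invented for illustration):

```python
def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    hits = sum(set(itemset) <= set(t) for t in transactions)
    return hits / len(transactions)

baskets = [{"onion", "potato", "burger"},
           {"onion", "potato"},
           {"milk", "burger"},
           {"onion", "potato", "burger", "milk"}]
print(support({"onion", "potato"}, baskets))  # 3 of 4 baskets -> 0.75
```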
6. CONFIDENCE
• Signifies the likelihood of item Y being purchased when item X is purchased.
• conf(X → Y) = supp(X ∪ Y) / supp(X)
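A minimal sketch of the confidence computation, with the support helper inlined (toy data; all names are illustrative):

```python
def support(itemset, transactions):
    # Fraction of transactions containing every item in `itemset`.
    return sum(set(itemset) <= set(t) for t in transactions) / len(transactions)

def confidence(antecedent, consequent, transactions):
    # conf(X -> Y) = supp(X union Y) / supp(X)
    return (support(set(antecedent) | set(consequent), transactions)
            / support(antecedent, transactions))

baskets = [{"onion", "potato", "burger"},
           {"onion", "potato"},
           {"milk", "burger"},
           {"onion", "potato", "burger", "milk"}]
# Of the 3 baskets with onion and potato, 2 also contain burger.
print(confidence({"onion", "potato"}, {"burger"}, baskets))
```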
7. AVAILABLE TOOLS
Popular tools used for data mining include –
• Weka
• Keel
In this presentation, we will use WEKA data mining tool.
8. WHAT IS WEKA
• Waikato Environment for Knowledge Analysis (Weka).
• A collection of machine learning algorithms for data mining tasks.
• Contains tools for data –
pre-processing
classification
regression
clustering
association rules
visualization
• Open-source software issued under the GNU General Public License.
9. DATASET IN WEKA
• A data set can be –
CREATED
DOWNLOADED
• For this presentation, we have created our own dataset using Microsoft Excel.
10. CREATING DATASET IN MICROSOFT EXCEL
14. LOADING (.ARFF) FILE IN WEKA
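For reference, an ARFF file is plain text with three parts: a @relation name, @attribute declarations, and the @data rows. A minimal illustrative example using the attributes seen in this presentation (the relation name and attribute values are assumed, not the exact file used) –

```
@relation purchases

@attribute age {young, aged}
@attribute purchase {willBuy, willNotBuy}

@data
aged,willBuy
aged,willBuy
young,willNotBuy
aged,willBuy
```

Declaring nominal values in braces is what makes the attributes usable by Apriori, which cannot handle numeric attributes.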
15. APPLYING ASSOCIATION RULE
In Weka, Apriori is the default association rule algorithm.
Before running the Apriori algorithm, we checked that every attribute is nominal, binary, or unary.
19. INTERPRETATION OF RULES
Let us interpret our first rule –
age = aged 5 ==> purchase = willBuy 5
This means that 5 instances have age = aged, and in all 5 of those instances purchase = willBuy, so the rule holds with confidence 1.
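The two counts can be reproduced by hand: in Weka's rule output, the number after the antecedent is how many instances match it, and the number after the consequent is how many match both sides. A sketch with made-up rows shaped like this dataset:

```python
# Hypothetical rows shaped like the presentation's (age, purchase) dataset.
rows = [("aged", "willBuy")] * 5 + [("young", "willNotBuy")] * 3

# Count instances matching the antecedent, and instances matching both sides.
antecedent_count = sum(age == "aged" for age, _ in rows)
both_count = sum(age == "aged" and buy == "willBuy" for age, buy in rows)
print(antecedent_count, both_count, both_count / antecedent_count)
```

The ratio of the two counts is exactly the rule's confidence, which is why Weka reports conf:(1) for this rule.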