SlideShare a Scribd company logo
1 of 14
DATAMINING
MINING MULTIDIMENSIONAL
ASSOCIATION RULES FROM
RELATIONAL DATABASES AND
DATA WAREHOUSES
M.SUJITHA,
I-M.SC(CS&IT),
NadarSaraswathiCollegeOfArtsAndScience,Theni.
 Mining multidimensional association rules involves more than
one dimension or predicate.
EXAMPLE:
Rules relating what a customer buys as well as the customer's
age.
 These methods can be organized according to their treatment of
quantitative attributes.
MULTIDIMENSIONAL ASSOCIATION
RULES
 It implies a single predicate, for the predicate buys,
 For instance,
 Mining our ABC company database, we may discover the
Boolean association rule "IBM desktop computer“
 Which implies "Sony b/w printer" .
 It can also be written as
 buys(X,"IBM desktop computer:")implies buys(X, "sony b/w
printer")
 where X is a variable representing customers who purchased
items in AB Company transcations.
 It contains a single distinct predicate(e.g..buys) with multiple
occurrences.(i.e..predicate occurs more than once)
 Such rules are commonly mined from transactions data.
 Rather than using a transactional database, sales and related
information are stared in a relational database or data
warehouse.
 Such data stares are multidimensional, by definition.
 To mine association rules containing multiple predicates,
 age(x,"20......29")^occupation(X,“ student") implies buys(X,“
laptop":)
 Association rules that involve two or more dimensions or
predicates can be referred to as multidimensional association
rules.
 The above rule contains three predicates(age, occupation,and
buys),each of which occurs only once in the rule .
 It has no repeated predicates.
 Multidimensional association rules with no repeated predicates
are called inter-dimensional association rules.
MINING MULTIDIMENSIONAL ASSOCIATION
RULES USING STATIC DISCRETIZATION OF
QUANTITATIVE ATTRIBUTES
 Quantitative attributes are discretized prior to mining using
predefined concept hierarchies.
 Numeric values are replaced by ranges.
 Categorical attributes may also be generalized to higher
conceptual levels if desired.
 The resulting task-relavant data are stored in a relational table,
then the a priori algorithm requires a slight modification.
 To find all frequent predicate sets rather than frequent
itemsets(i.e.,by searching through all of the relevant
attributes, instead of searching only one attribute, like buys').
MINING QUANTITATIVE ASSOCIATION
RULES
 Quantitative association rules are multidimensional association
rules in which the numeric attributes are dynamically
discretized during the mining process.
 To satisfy some mining creteria, maximizing the confidence or
compactness 'of the rules mined.
 In this, we will focus specifically to mine quantitative
association rules having two quantitative attributes.
 On the left-hand side of the rule, and one categorical attribute
on the right -hand side of the rule,
 for example,
 Aquan1^Aquan2 implies Acat
 Where Aquan1 and Aquan2 are tests on quantitative attribute
ranges(where the ranges are dynamically Determined)
 Acat tests a categorical attribute form the task relevant data.
 Such rules have been referred to as two-dimensional
quantitative association rules.
 They contain two quantitative dimensions.
 For instance,
 suppose you are curious about the association relationship
between pairs of quantitative attributes, like customer age and
income, and the type of television that customers like to buy.
BINNING:
 Quantitative attributes can have a very wide range of values
denning their domain.
 These intervals are dynamic in that they may be combined
during the mining process.
 The partitioning process is referred to as binning ,where the
intervals are considered "bins.“
 Three common binning strategies are
 Equi width binning:
 Where the interval size of each bin is the same.
Equi depth binning:
where each bin has approximately the same number of
tuples assigned to it.
Homogeneity-based binning:
where bin size is determined so that the tuples in each bin
are uniformly distributed.
Finding frequent predicate sets:
Once the 2-D array containing the count distribution for
each category is set up this can be scanned in order to find the
frequent predicate sets(those satisfying minimum support)that
also satisfy minimum.
MINING DISTANCE-BASED
ASSOCIATION RULES
 Quantitative association rules the quantitative attributes are
discretized initially by methods.
 The resulting intervals an then combined.
 Such an approach, may not capture the semantics of intervals
data-since they do not consider the relative distance between
data points or between intervals.
A DISADVANTAGE OF ASSOCIATION
RULES
 They do not allow for approximations of attribute values.
 Consider the folowing association rule:
 Item_type(x,"electronic")^manufacturer(X,"foreign")implies
price(X,200)
 Where X is a variable describing items at ABCompany .
 In reality, it is more likely that the prices of foreign electronic items are
close to or approximately $200,rather than exactly $20.
 It would be useful to have association rules that can express such a notion
of closeness.
 The support and confidence measures do not consider.
 The closeness of values for a given attribute.
 This motivates the mining of distance-based association rules,
which captures the semantics of interval data while allowing for
approximation in data values.
 A two -phase algorithm can be used to mine distance-based
association rules.
 The first phase employs clustering to find the intervals or
clusters.
 Adapting to the amount of available memory.
 The second phase obtains distance-based association rules by
searching for groups of clusters that occur frequently together.
THANKYOU

More Related Content

What's hot

What's hot (20)

Association rule mining and Apriori algorithm
Association rule mining and Apriori algorithmAssociation rule mining and Apriori algorithm
Association rule mining and Apriori algorithm
 
Advanced data modeling
Advanced data modelingAdvanced data modeling
Advanced data modeling
 
Data Mining: Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
Data Mining:  Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...Data Mining:  Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
Data Mining: Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
 
introduction to data mining tutorial
introduction to data mining tutorial introduction to data mining tutorial
introduction to data mining tutorial
 
Data Reduction
Data ReductionData Reduction
Data Reduction
 
Web content mining
Web content miningWeb content mining
Web content mining
 
2.4 rule based classification
2.4 rule based classification2.4 rule based classification
2.4 rule based classification
 
Mining Frequent Patterns, Association and Correlations
Mining Frequent Patterns, Association and CorrelationsMining Frequent Patterns, Association and Correlations
Mining Frequent Patterns, Association and Correlations
 
Data mining :Concepts and Techniques Chapter 2, data
Data mining :Concepts and Techniques Chapter 2, dataData mining :Concepts and Techniques Chapter 2, data
Data mining :Concepts and Techniques Chapter 2, data
 
Knowledge discovery thru data mining
Knowledge discovery thru data miningKnowledge discovery thru data mining
Knowledge discovery thru data mining
 
Clustering: Large Databases in data mining
Clustering: Large Databases in data miningClustering: Large Databases in data mining
Clustering: Large Databases in data mining
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Data preprocessing in Data Mining
Data preprocessing in Data MiningData preprocessing in Data Mining
Data preprocessing in Data Mining
 
Data mining: Classification and prediction
Data mining: Classification and predictionData mining: Classification and prediction
Data mining: Classification and prediction
 
Web mining slides
Web mining slidesWeb mining slides
Web mining slides
 
Relational Data Model Introduction
Relational Data Model IntroductionRelational Data Model Introduction
Relational Data Model Introduction
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
 
2.3 bayesian classification
2.3 bayesian classification2.3 bayesian classification
2.3 bayesian classification
 
Apriori Algorithm
Apriori AlgorithmApriori Algorithm
Apriori Algorithm
 
Data Mining: Concepts and techniques: Chapter 13 trend
Data Mining: Concepts and techniques: Chapter 13 trendData Mining: Concepts and techniques: Chapter 13 trend
Data Mining: Concepts and techniques: Chapter 13 trend
 

Similar to Data Mining

Reducing False Positives - BSA AML Transaction Monitoring Re-Tuning Approach
Reducing False Positives - BSA AML Transaction Monitoring Re-Tuning ApproachReducing False Positives - BSA AML Transaction Monitoring Re-Tuning Approach
Reducing False Positives - BSA AML Transaction Monitoring Re-Tuning Approach
Erik De Monte
 

Similar to Data Mining (20)

Data mining and its concepts
Data mining and its conceptsData mining and its concepts
Data mining and its concepts
 
Introduction to data mining
Introduction to data miningIntroduction to data mining
Introduction to data mining
 
Association Rule.ppt
Association Rule.pptAssociation Rule.ppt
Association Rule.ppt
 
Association Rule.ppt
Association Rule.pptAssociation Rule.ppt
Association Rule.ppt
 
Association rule mining.pptx
Association rule mining.pptxAssociation rule mining.pptx
Association rule mining.pptx
 
Reducing False Positives
Reducing False PositivesReducing False Positives
Reducing False Positives
 
Data mining
Data miningData mining
Data mining
 
Reducing False Positives - BSA AML Transaction Monitoring Re-Tuning Approach
Reducing False Positives - BSA AML Transaction Monitoring Re-Tuning ApproachReducing False Positives - BSA AML Transaction Monitoring Re-Tuning Approach
Reducing False Positives - BSA AML Transaction Monitoring Re-Tuning Approach
 
Mining Frequent Patterns, Associations, and.pptx
 Mining Frequent Patterns, Associations, and.pptx Mining Frequent Patterns, Associations, and.pptx
Mining Frequent Patterns, Associations, and.pptx
 
Association rule Mining
Association rule MiningAssociation rule Mining
Association rule Mining
 
Data Mining Presentation on Science Day 2023
Data Mining Presentation on Science Day 2023Data Mining Presentation on Science Day 2023
Data Mining Presentation on Science Day 2023
 
Classification on multi label dataset using rule mining technique
Classification on multi label dataset using rule mining techniqueClassification on multi label dataset using rule mining technique
Classification on multi label dataset using rule mining technique
 
Ca25458463
Ca25458463Ca25458463
Ca25458463
 
Datamining
DataminingDatamining
Datamining
 
Computational Methods in Medicine
Computational Methods in MedicineComputational Methods in Medicine
Computational Methods in Medicine
 
Introduction To Multilevel Association Rule And Its Methods
Introduction To Multilevel Association Rule And Its MethodsIntroduction To Multilevel Association Rule And Its Methods
Introduction To Multilevel Association Rule And Its Methods
 
Chapter 1.pdf
Chapter 1.pdfChapter 1.pdf
Chapter 1.pdf
 
Data Mining: Data Preprocessing
Data Mining: Data PreprocessingData Mining: Data Preprocessing
Data Mining: Data Preprocessing
 
App B
App BApp B
App B
 
Chapter24
Chapter24Chapter24
Chapter24
 

More from Abinaya B

More from Abinaya B (18)

Multimedia
MultimediaMultimedia
Multimedia
 
Overview of bigdata
Overview of bigdataOverview of bigdata
Overview of bigdata
 
exception handling in java
exception handling in javaexception handling in java
exception handling in java
 
data structures
data structuresdata structures
data structures
 
graphics programming in java
graphics programming in javagraphics programming in java
graphics programming in java
 
data structures- back tracking
data structures- back trackingdata structures- back tracking
data structures- back tracking
 
exception handling in java
exception handling in javaexception handling in java
exception handling in java
 
digital image processing
digital image processingdigital image processing
digital image processing
 
Image filtering in Digital image processing
Image filtering in Digital image processingImage filtering in Digital image processing
Image filtering in Digital image processing
 
software engineering
software engineeringsoftware engineering
software engineering
 
software cost factor
software cost factorsoftware cost factor
software cost factor
 
Datamining
DataminingDatamining
Datamining
 
Basic topic on os
Basic topic on osBasic topic on os
Basic topic on os
 
Digital principles basic
Digital principles basicDigital principles basic
Digital principles basic
 
Rdbms1
Rdbms1Rdbms1
Rdbms1
 
Managing I/O & String function in C
Managing I/O & String function in CManaging I/O & String function in C
Managing I/O & String function in C
 
Introduction to 80386
Introduction to 80386Introduction to 80386
Introduction to 80386
 
Network standardization
Network standardizationNetwork standardization
Network standardization
 

Recently uploaded

Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
Chris Hunter
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 

Recently uploaded (20)

Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesEnergy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 

Data Mining

  • 1. DATAMINING MINING MULTIDIMENSIONAL ASSOCIATION RULES FROM RELATIONAL DATABASES AND DATA WAREHOUSES M.SUJITHA, I-M.SC(CS&IT), NadarSaraswathiCollegeOfArtsAndScience,Theni.
  • 2.  Mining multidimensional association rules involves more than one dimension or predicate. EXAMPLE: Rules relating what a customer buys as well as the customer's age.  These methods can be organized according to their treatment of quantitative attributes.
  • 3. MULTIDIMENSIONAL ASSOCIATION RULES  It implies a single predicate, for the predicate buys,  For instance,  Mining our ABC company database, we may discover the Boolean association rule "IBM desktop computer“  Which implies "Sony b/w printer" .  It can also be written as  buys(X,"IBM desktop computer:")implies buys(X, "sony b/w printer")
  • 4.  where X is a variable representing customers who purchased items in AB Company transcations.  It contains a single distinct predicate(e.g..buys) with multiple occurrences.(i.e..predicate occurs more than once)  Such rules are commonly mined from transactions data.  Rather than using a transactional database, sales and related information are stared in a relational database or data warehouse.  Such data stares are multidimensional, by definition.
  • 5.  To mine association rules containing multiple predicates,  age(x,"20......29")^occupation(X,“ student") implies buys(X,“ laptop":)  Association rules that involve two or more dimensions or predicates can be referred to as multidimensional association rules.  The above rule contains three predicates(age, occupation,and buys),each of which occurs only once in the rule .  It has no repeated predicates.  Multidimensional association rules with no repeated predicates are called inter-dimensional association rules.
  • 6. MINING MULTIDIMENSIONAL ASSOCIATION RULES USING STATIC DISCRETIZATION OF QUANTITATIVE ATTRIBUTES  Quantitative attributes are discretized prior to mining using predefined concept hierarchies.  Numeric values are replaced by ranges.  Categorical attributes may also be generalized to higher conceptual levels if desired.  The resulting task-relavant data are stored in a relational table, then the a priori algorithm requires a slight modification.  To find all frequent predicate sets rather than frequent itemsets(i.e.,by searching through all of the relevant attributes, instead of searching only one attribute, like buys').
  • 7. MINING QUANTITATIVE ASSOCIATION RULES  Quantitative association rules are multidimensional association rules in which the numeric attributes are dynamically discretized during the mining process.  To satisfy some mining creteria, maximizing the confidence or compactness 'of the rules mined.  In this, we will focus specifically to mine quantitative association rules having two quantitative attributes.  On the left-hand side of the rule, and one categorical attribute on the right -hand side of the rule,
  • 8.  for example,  Aquan1^Aquan2 implies Acat  Where Aquan1 and Aquan2 are tests on quantitative attribute ranges(where the ranges are dynamically Determined)  Acat tests a categorical attribute form the task relevant data.  Such rules have been referred to as two-dimensional quantitative association rules.  They contain two quantitative dimensions.  For instance,  suppose you are curious about the association relationship between pairs of quantitative attributes, like customer age and income, and the type of television that customers like to buy.
  • 9. BINNING:  Quantitative attributes can have a very wide range of values denning their domain.  These intervals are dynamic in that they may be combined during the mining process.  The partitioning process is referred to as binning ,where the intervals are considered "bins.“  Three common binning strategies are  Equi width binning:  Where the interval size of each bin is the same.
  • 10. Equi depth binning: where each bin has approximately the same number of tuples assigned to it. Homogeneity-based binning: where bin size is determined so that the tuples in each bin are uniformly distributed. Finding frequent predicate sets: Once the 2-D array containing the count distribution for each category is set up this can be scanned in order to find the frequent predicate sets(those satisfying minimum support)that also satisfy minimum.
  • 11. MINING DISTANCE-BASED ASSOCIATION RULES  Quantitative association rules the quantitative attributes are discretized initially by methods.  The resulting intervals an then combined.  Such an approach, may not capture the semantics of intervals data-since they do not consider the relative distance between data points or between intervals.
  • 12. A DISADVANTAGE OF ASSOCIATION RULES  They do not allow for approximations of attribute values.  Consider the folowing association rule:  Item_type(x,"electronic")^manufacturer(X,"foreign")implies price(X,200)  Where X is a variable describing items at ABCompany .  In reality, it is more likely that the prices of foreign electronic items are close to or approximately $200,rather than exactly $20.  It would be useful to have association rules that can express such a notion of closeness.  The support and confidence measures do not consider.
  • 13.  The closeness of values for a given attribute.  This motivates the mining of distance-based association rules, which captures the semantics of interval data while allowing for approximation in data values.  A two -phase algorithm can be used to mine distance-based association rules.  The first phase employs clustering to find the intervals or clusters.  Adapting to the amount of available memory.  The second phase obtains distance-based association rules by searching for groups of clusters that occur frequently together.