SlideShare a Scribd company logo
DATAMINING
MINING MULTIDIMENSIONAL
ASSOCIATION RULES FROM
RELATIONAL DATABASES AND
DATA WAREHOUSES
M.SUJITHA,
I-M.SC(CS&IT),
NadarSaraswathiCollegeOfArtsAndScience,Theni.
 Mining multidimensional association rules involves more than
one dimension or predicate.
EXAMPLE:
Rules relating what a customer buys as well as the customer's
age.
 These methods can be organized according to their treatment of
quantitative attributes.
MULTIDIMENSIONAL ASSOCIATION
RULES
 It implies a single predicate, for the predicate buys,
 For instance,
 Mining our ABC company database, we may discover the
Boolean association rule "IBM desktop computer“
 Which implies "Sony b/w printer" .
 It can also be written as
 buys(X,"IBM desktop computer:")implies buys(X, "sony b/w
printer")
 where X is a variable representing customers who purchased
items in AB Company transcations.
 It contains a single distinct predicate(e.g..buys) with multiple
occurrences.(i.e..predicate occurs more than once)
 Such rules are commonly mined from transactions data.
 Rather than using a transactional database, sales and related
information are stared in a relational database or data
warehouse.
 Such data stares are multidimensional, by definition.
 To mine association rules containing multiple predicates,
 age(x,"20......29")^occupation(X,“ student") implies buys(X,“
laptop":)
 Association rules that involve two or more dimensions or
predicates can be referred to as multidimensional association
rules.
 The above rule contains three predicates(age, occupation,and
buys),each of which occurs only once in the rule .
 It has no repeated predicates.
 Multidimensional association rules with no repeated predicates
are called inter-dimensional association rules.
MINING MULTIDIMENSIONAL ASSOCIATION
RULES USING STATIC DISCRETIZATION OF
QUANTITATIVE ATTRIBUTES
 Quantitative attributes are discretized prior to mining using
predefined concept hierarchies.
 Numeric values are replaced by ranges.
 Categorical attributes may also be generalized to higher
conceptual levels if desired.
 The resulting task-relavant data are stored in a relational table,
then the a priori algorithm requires a slight modification.
 To find all frequent predicate sets rather than frequent
itemsets(i.e.,by searching through all of the relevant
attributes, instead of searching only one attribute, like buys').
MINING QUANTITATIVE ASSOCIATION
RULES
 Quantitative association rules are multidimensional association
rules in which the numeric attributes are dynamically
discretized during the mining process.
 To satisfy some mining creteria, maximizing the confidence or
compactness 'of the rules mined.
 In this, we will focus specifically to mine quantitative
association rules having two quantitative attributes.
 On the left-hand side of the rule, and one categorical attribute
on the right -hand side of the rule,
 for example,
 Aquan1^Aquan2 implies Acat
 Where Aquan1 and Aquan2 are tests on quantitative attribute
ranges(where the ranges are dynamically Determined)
 Acat tests a categorical attribute form the task relevant data.
 Such rules have been referred to as two-dimensional
quantitative association rules.
 They contain two quantitative dimensions.
 For instance,
 suppose you are curious about the association relationship
between pairs of quantitative attributes, like customer age and
income, and the type of television that customers like to buy.
BINNING:
 Quantitative attributes can have a very wide range of values
denning their domain.
 These intervals are dynamic in that they may be combined
during the mining process.
 The partitioning process is referred to as binning ,where the
intervals are considered "bins.“
 Three common binning strategies are
 Equi width binning:
 Where the interval size of each bin is the same.
Equi depth binning:
where each bin has approximately the same number of
tuples assigned to it.
Homogeneity-based binning:
where bin size is determined so that the tuples in each bin
are uniformly distributed.
Finding frequent predicate sets:
Once the 2-D array containing the count distribution for
each category is set up this can be scanned in order to find the
frequent predicate sets(those satisfying minimum support)that
also satisfy minimum.
MINING DISTANCE-BASED
ASSOCIATION RULES
 Quantitative association rules the quantitative attributes are
discretized initially by methods.
 The resulting intervals an then combined.
 Such an approach, may not capture the semantics of intervals
data-since they do not consider the relative distance between
data points or between intervals.
A DISADVANTAGE OF ASSOCIATION
RULES
 They do not allow for approximations of attribute values.
 Consider the folowing association rule:
 Item_type(x,"electronic")^manufacturer(X,"foreign")implies
price(X,200)
 Where X is a variable describing items at ABCompany .
 In reality, it is more likely that the prices of foreign electronic items are
close to or approximately $200,rather than exactly $20.
 It would be useful to have association rules that can express such a notion
of closeness.
 The support and confidence measures do not consider.
 The closeness of values for a given attribute.
 This motivates the mining of distance-based association rules,
which captures the semantics of interval data while allowing for
approximation in data values.
 A two -phase algorithm can be used to mine distance-based
association rules.
 The first phase employs clustering to find the intervals or
clusters.
 Adapting to the amount of available memory.
 The second phase obtains distance-based association rules by
searching for groups of clusters that occur frequently together.
THANKYOU

More Related Content

Similar to Dmppt 180312092027

Reducing False Positives - BSA AML Transaction Monitoring Re-Tuning Approach
Reducing False Positives - BSA AML Transaction Monitoring Re-Tuning ApproachReducing False Positives - BSA AML Transaction Monitoring Re-Tuning Approach
Reducing False Positives - BSA AML Transaction Monitoring Re-Tuning Approach
Erik De Monte
 
Mining Frequent Patterns, Associations, and.pptx
 Mining Frequent Patterns, Associations, and.pptx Mining Frequent Patterns, Associations, and.pptx
Mining Frequent Patterns, Associations, and.pptx
RushikeshChikane2
 
Association rule Mining
Association rule MiningAssociation rule Mining
Association rule Mining
afsana40
 
Data Mining Presentation on Science Day 2023
Data Mining Presentation on Science Day 2023Data Mining Presentation on Science Day 2023
Data Mining Presentation on Science Day 2023
SakshiTiwari490123
 
Classification on multi label dataset using rule mining technique
Classification on multi label dataset using rule mining techniqueClassification on multi label dataset using rule mining technique
Classification on multi label dataset using rule mining technique
eSAT Publishing House
 
Ca25458463
Ca25458463Ca25458463
Ca25458463
IJERA Editor
 
Datamining
DataminingDatamining
Datamining
Debashis Pradhan
 
Computational Methods in Medicine
Computational Methods in MedicineComputational Methods in Medicine
Computational Methods in Medicine
Alina Cristian
 
Introduction To Multilevel Association Rule And Its Methods
Introduction To Multilevel Association Rule And Its MethodsIntroduction To Multilevel Association Rule And Its Methods
Introduction To Multilevel Association Rule And Its Methods
IJSRD
 
Chapter 1.pdf
Chapter 1.pdfChapter 1.pdf
Chapter 1.pdf
DrGnaneswariG
 
Data Mining: Data Preprocessing
Data Mining: Data PreprocessingData Mining: Data Preprocessing
Data Mining: Data Preprocessing
Lakshmi Sarvani Videla
 
App B
App BApp B
Chapter24
Chapter24Chapter24
Chapter24
gourab87
 
Data Preprocessing
Data PreprocessingData Preprocessing
Data Preprocessing
T Kavitha
 
Data Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlationsData Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlations
DataminingTools Inc
 
Data Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlationsData Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlations
Datamining Tools
 
Top Down Approach to find Maximal Frequent Item Sets using Subset Creation
Top Down Approach to find Maximal Frequent Item Sets using Subset CreationTop Down Approach to find Maximal Frequent Item Sets using Subset Creation
Top Down Approach to find Maximal Frequent Item Sets using Subset Creation
cscpconf
 
The D-basis Algorithm for Association Rules of High Confidence
The D-basis Algorithm for Association Rules of High ConfidenceThe D-basis Algorithm for Association Rules of High Confidence
The D-basis Algorithm for Association Rules of High Confidence
ITIIIndustries
 
Volume 2-issue-6-2081-2084
Volume 2-issue-6-2081-2084Volume 2-issue-6-2081-2084
Volume 2-issue-6-2081-2084
Editor IJARCET
 
Volume 2-issue-6-2081-2084
Volume 2-issue-6-2081-2084Volume 2-issue-6-2081-2084
Volume 2-issue-6-2081-2084
Editor IJARCET
 

Similar to Dmppt 180312092027 (20)

Reducing False Positives - BSA AML Transaction Monitoring Re-Tuning Approach
Reducing False Positives - BSA AML Transaction Monitoring Re-Tuning ApproachReducing False Positives - BSA AML Transaction Monitoring Re-Tuning Approach
Reducing False Positives - BSA AML Transaction Monitoring Re-Tuning Approach
 
Mining Frequent Patterns, Associations, and.pptx
 Mining Frequent Patterns, Associations, and.pptx Mining Frequent Patterns, Associations, and.pptx
Mining Frequent Patterns, Associations, and.pptx
 
Association rule Mining
Association rule MiningAssociation rule Mining
Association rule Mining
 
Data Mining Presentation on Science Day 2023
Data Mining Presentation on Science Day 2023Data Mining Presentation on Science Day 2023
Data Mining Presentation on Science Day 2023
 
Classification on multi label dataset using rule mining technique
Classification on multi label dataset using rule mining techniqueClassification on multi label dataset using rule mining technique
Classification on multi label dataset using rule mining technique
 
Ca25458463
Ca25458463Ca25458463
Ca25458463
 
Datamining
DataminingDatamining
Datamining
 
Computational Methods in Medicine
Computational Methods in MedicineComputational Methods in Medicine
Computational Methods in Medicine
 
Introduction To Multilevel Association Rule And Its Methods
Introduction To Multilevel Association Rule And Its MethodsIntroduction To Multilevel Association Rule And Its Methods
Introduction To Multilevel Association Rule And Its Methods
 
Chapter 1.pdf
Chapter 1.pdfChapter 1.pdf
Chapter 1.pdf
 
Data Mining: Data Preprocessing
Data Mining: Data PreprocessingData Mining: Data Preprocessing
Data Mining: Data Preprocessing
 
App B
App BApp B
App B
 
Chapter24
Chapter24Chapter24
Chapter24
 
Data Preprocessing
Data PreprocessingData Preprocessing
Data Preprocessing
 
Data Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlationsData Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlations
 
Data Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlationsData Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlations
 
Top Down Approach to find Maximal Frequent Item Sets using Subset Creation
Top Down Approach to find Maximal Frequent Item Sets using Subset CreationTop Down Approach to find Maximal Frequent Item Sets using Subset Creation
Top Down Approach to find Maximal Frequent Item Sets using Subset Creation
 
The D-basis Algorithm for Association Rules of High Confidence
The D-basis Algorithm for Association Rules of High ConfidenceThe D-basis Algorithm for Association Rules of High Confidence
The D-basis Algorithm for Association Rules of High Confidence
 
Volume 2-issue-6-2081-2084
Volume 2-issue-6-2081-2084Volume 2-issue-6-2081-2084
Volume 2-issue-6-2081-2084
 
Volume 2-issue-6-2081-2084
Volume 2-issue-6-2081-2084Volume 2-issue-6-2081-2084
Volume 2-issue-6-2081-2084
 

More from kirupasuchi1996

Rotor machine,subsitution technique
Rotor machine,subsitution techniqueRotor machine,subsitution technique
Rotor machine,subsitution technique
kirupasuchi1996
 
rotor machine
rotor machinerotor machine
rotor machine
kirupasuchi1996
 
DVI,FRACTAL IMAGE,SUB BAND IMAGE,VIDEO CODING AND WAVELET BASED COMPRESSION
DVI,FRACTAL IMAGE,SUB BAND IMAGE,VIDEO CODING AND WAVELET BASED COMPRESSIONDVI,FRACTAL IMAGE,SUB BAND IMAGE,VIDEO CODING AND WAVELET BASED COMPRESSION
DVI,FRACTAL IMAGE,SUB BAND IMAGE,VIDEO CODING AND WAVELET BASED COMPRESSION
kirupasuchi1996
 
Cyper crime
Cyper crimeCyper crime
Cyper crime
kirupasuchi1996
 
DS ppt
DS pptDS ppt
Image compression standards
Image compression standardsImage compression standards
Image compression standards
kirupasuchi1996
 
Language and Processors for Requirements Specification
Language and Processors for Requirements SpecificationLanguage and Processors for Requirements Specification
Language and Processors for Requirements Specification
kirupasuchi1996
 
Software Cost Factor
Software Cost FactorSoftware Cost Factor
Software Cost Factor
kirupasuchi1996
 
GUI components in Java
GUI components in JavaGUI components in Java
GUI components in Java
kirupasuchi1996
 
Designing Techniques in Software Engineering
Designing Techniques in Software EngineeringDesigning Techniques in Software Engineering
Designing Techniques in Software Engineering
kirupasuchi1996
 
Datatransferandmanipulation 180214044522
Datatransferandmanipulation 180214044522Datatransferandmanipulation 180214044522
Datatransferandmanipulation 180214044522
kirupasuchi1996
 
Filesharing 180214044607
Filesharing 180214044607Filesharing 180214044607
Filesharing 180214044607
kirupasuchi1996
 
B tree-180214044656
B tree-180214044656B tree-180214044656
B tree-180214044656
kirupasuchi1996
 
Addressingmodes
Addressingmodes Addressingmodes
Addressingmodes
kirupasuchi1996
 
Managing,working with files
Managing,working with filesManaging,working with files
Managing,working with files
kirupasuchi1996
 

More from kirupasuchi1996 (15)

Rotor machine,subsitution technique
Rotor machine,subsitution techniqueRotor machine,subsitution technique
Rotor machine,subsitution technique
 
rotor machine
rotor machinerotor machine
rotor machine
 
DVI,FRACTAL IMAGE,SUB BAND IMAGE,VIDEO CODING AND WAVELET BASED COMPRESSION
DVI,FRACTAL IMAGE,SUB BAND IMAGE,VIDEO CODING AND WAVELET BASED COMPRESSIONDVI,FRACTAL IMAGE,SUB BAND IMAGE,VIDEO CODING AND WAVELET BASED COMPRESSION
DVI,FRACTAL IMAGE,SUB BAND IMAGE,VIDEO CODING AND WAVELET BASED COMPRESSION
 
Cyper crime
Cyper crimeCyper crime
Cyper crime
 
DS ppt
DS pptDS ppt
DS ppt
 
Image compression standards
Image compression standardsImage compression standards
Image compression standards
 
Language and Processors for Requirements Specification
Language and Processors for Requirements SpecificationLanguage and Processors for Requirements Specification
Language and Processors for Requirements Specification
 
Software Cost Factor
Software Cost FactorSoftware Cost Factor
Software Cost Factor
 
GUI components in Java
GUI components in JavaGUI components in Java
GUI components in Java
 
Designing Techniques in Software Engineering
Designing Techniques in Software EngineeringDesigning Techniques in Software Engineering
Designing Techniques in Software Engineering
 
Datatransferandmanipulation 180214044522
Datatransferandmanipulation 180214044522Datatransferandmanipulation 180214044522
Datatransferandmanipulation 180214044522
 
Filesharing 180214044607
Filesharing 180214044607Filesharing 180214044607
Filesharing 180214044607
 
B tree-180214044656
B tree-180214044656B tree-180214044656
B tree-180214044656
 
Addressingmodes
Addressingmodes Addressingmodes
Addressingmodes
 
Managing,working with files
Managing,working with filesManaging,working with files
Managing,working with files
 

Recently uploaded

Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
writing report business partner b1+ .pdf
writing report business partner b1+ .pdfwriting report business partner b1+ .pdf
writing report business partner b1+ .pdf
VyNguyen709676
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
A presentation that explain the Power BI Licensing
A presentation that explain the Power BI LicensingA presentation that explain the Power BI Licensing
A presentation that explain the Power BI Licensing
AlessioFois2
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
Sachin Paul
 
Monthly Management report for the Month of May 2024
Monthly Management report for the Month of May 2024Monthly Management report for the Month of May 2024
Monthly Management report for the Month of May 2024
facilitymanager11
 
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
wyddcwye1
 
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Kaxil Naik
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
Social Samosa
 
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
a9qfiubqu
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
Sm321
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
taqyea
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
ElizabethGarrettChri
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docxDATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
SaffaIbrahim1
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
y3i0qsdzb
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 

Recently uploaded (20)

Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
writing report business partner b1+ .pdf
writing report business partner b1+ .pdfwriting report business partner b1+ .pdf
writing report business partner b1+ .pdf
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
A presentation that explain the Power BI Licensing
A presentation that explain the Power BI LicensingA presentation that explain the Power BI Licensing
A presentation that explain the Power BI Licensing
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
 
Monthly Management report for the Month of May 2024
Monthly Management report for the Month of May 2024Monthly Management report for the Month of May 2024
Monthly Management report for the Month of May 2024
 
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
 
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
 
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docxDATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 

Dmppt 180312092027

  • 1. DATAMINING MINING MULTIDIMENSIONAL ASSOCIATION RULES FROM RELATIONAL DATABASES AND DATA WAREHOUSES M.SUJITHA, I-M.SC(CS&IT), NadarSaraswathiCollegeOfArtsAndScience,Theni.
  • 2.  Mining multidimensional association rules involves more than one dimension or predicate. EXAMPLE: Rules relating what a customer buys as well as the customer's age.  These methods can be organized according to their treatment of quantitative attributes.
  • 3. MULTIDIMENSIONAL ASSOCIATION RULES  It implies a single predicate, for the predicate buys,  For instance,  Mining our ABC company database, we may discover the Boolean association rule "IBM desktop computer“  Which implies "Sony b/w printer" .  It can also be written as  buys(X,"IBM desktop computer:")implies buys(X, "sony b/w printer")
  • 4.  where X is a variable representing customers who purchased items in AB Company transcations.  It contains a single distinct predicate(e.g..buys) with multiple occurrences.(i.e..predicate occurs more than once)  Such rules are commonly mined from transactions data.  Rather than using a transactional database, sales and related information are stared in a relational database or data warehouse.  Such data stares are multidimensional, by definition.
  • 5.  To mine association rules containing multiple predicates,  age(x,"20......29")^occupation(X,“ student") implies buys(X,“ laptop":)  Association rules that involve two or more dimensions or predicates can be referred to as multidimensional association rules.  The above rule contains three predicates(age, occupation,and buys),each of which occurs only once in the rule .  It has no repeated predicates.  Multidimensional association rules with no repeated predicates are called inter-dimensional association rules.
  • 6. MINING MULTIDIMENSIONAL ASSOCIATION RULES USING STATIC DISCRETIZATION OF QUANTITATIVE ATTRIBUTES  Quantitative attributes are discretized prior to mining using predefined concept hierarchies.  Numeric values are replaced by ranges.  Categorical attributes may also be generalized to higher conceptual levels if desired.  The resulting task-relavant data are stored in a relational table, then the a priori algorithm requires a slight modification.  To find all frequent predicate sets rather than frequent itemsets(i.e.,by searching through all of the relevant attributes, instead of searching only one attribute, like buys').
  • 7. MINING QUANTITATIVE ASSOCIATION RULES  Quantitative association rules are multidimensional association rules in which the numeric attributes are dynamically discretized during the mining process.  To satisfy some mining creteria, maximizing the confidence or compactness 'of the rules mined.  In this, we will focus specifically to mine quantitative association rules having two quantitative attributes.  On the left-hand side of the rule, and one categorical attribute on the right -hand side of the rule,
  • 8.  for example,  Aquan1^Aquan2 implies Acat  Where Aquan1 and Aquan2 are tests on quantitative attribute ranges(where the ranges are dynamically Determined)  Acat tests a categorical attribute form the task relevant data.  Such rules have been referred to as two-dimensional quantitative association rules.  They contain two quantitative dimensions.  For instance,  suppose you are curious about the association relationship between pairs of quantitative attributes, like customer age and income, and the type of television that customers like to buy.
  • 9. BINNING:  Quantitative attributes can have a very wide range of values denning their domain.  These intervals are dynamic in that they may be combined during the mining process.  The partitioning process is referred to as binning ,where the intervals are considered "bins.“  Three common binning strategies are  Equi width binning:  Where the interval size of each bin is the same.
  • 10. Equi depth binning: where each bin has approximately the same number of tuples assigned to it. Homogeneity-based binning: where bin size is determined so that the tuples in each bin are uniformly distributed. Finding frequent predicate sets: Once the 2-D array containing the count distribution for each category is set up this can be scanned in order to find the frequent predicate sets(those satisfying minimum support)that also satisfy minimum.
  • 11. MINING DISTANCE-BASED ASSOCIATION RULES  Quantitative association rules the quantitative attributes are discretized initially by methods.  The resulting intervals an then combined.  Such an approach, may not capture the semantics of intervals data-since they do not consider the relative distance between data points or between intervals.
  • 12. A DISADVANTAGE OF ASSOCIATION RULES  They do not allow for approximations of attribute values.  Consider the folowing association rule:  Item_type(x,"electronic")^manufacturer(X,"foreign")implies price(X,200)  Where X is a variable describing items at ABCompany .  In reality, it is more likely that the prices of foreign electronic items are close to or approximately $200,rather than exactly $20.  It would be useful to have association rules that can express such a notion of closeness.  The support and confidence measures do not consider.
  • 13.  The closeness of values for a given attribute.  This motivates the mining of distance-based association rules, which captures the semantics of interval data while allowing for approximation in data values.  A two -phase algorithm can be used to mine distance-based association rules.  The first phase employs clustering to find the intervals or clusters.  Adapting to the amount of available memory.  The second phase obtains distance-based association rules by searching for groups of clusters that occur frequently together.