SlideShare a Scribd company logo
1 of 6
Download to read offline
Information and Knowledge Management                                                                    www.iiste.org
ISSN 2224-5758 (Paper) ISSN 2224-896X (Online)
                                 896X
Vol 2, No.6, 2012



      Interestingness Measures for Multi Level Association Rules
                                   Multi-Level
                               R Vijaya Prakash1*, Dr. A. Govardhan2, Prof. SSVN. Sarma3
               1.      Dept. of Informatics, Kakatiya University, Warangal, India
               2.      Dept. Of Computer Science & Engineering, JNT University, Hyderabad, India
               3.      Dept. Of Computer Science & Engineering, Vaagdevi College of Engineering
                               *E-mail of the corresponding author: vijprak@hotmail.com
                                   mail

Abstract
Association rule mining is one technique that is widely used to obtain useful associations rules among sets of
items. Much work has been done focusing on efficiency, effectiveness and redundancy. There has also been a
focusing on the quality of rules from single level datasets with many interestingness measures proposed.
However, there is a lack of interestingness measures developed for multi
                                                                   multi-level and cross-
                                                                                        -level Association rules.
Single level measures do not take into account the hierarchy found in a multi
                                                                            multi-level dataset. This leaves the
Support-Confidence approach, which does not consider for the hierarchy. In this paper we propose two
approaches which measure multi-level association rules to help and evaluate their interestingness. These
                                     level
measures of diversity and peculiarity can be used to identify those rules from multi-    -level datasets that are
potentially useful.
Keywords: Information Retrieval, Interestingness Measures, Association Rules, Multi-Level Datasets Itemsets
                                                                                      Level Datasets,

1. Introduction
Interestingness measures are necessary to rank Association rule patterns. Each interestingness measure produces
                                                     ssociation
different results, and experts, Lenca et.al (2008) have different opinions of what constitutes a good rule. The
                               ,
interestingness of discovered association rules is an important and active area within data mining research (L.
Geng & H.J. Hamilton 2006). The primary problem is the selection of interestingness measures for a given
                                 .
application domain. However, there is no formal agreement on a definition for what makes rules interesting.
Association rule algorithms produce thousands of rules, m    many of which are redundant (K. McGarry 2005, Li. J.
et al 2003). In order to filter the rules, the user generally supplies a minimum threshold for support and
             .
confidence. Support and confidence
                            confidence(R. Agrawal et al 1993, G. Dong & J. Li 1998, S. Lallich et al 2006) are
basic measures of association rule interestingness. All of these measures were proposed for association rules
derived from single level or flat datasets, which were most commonly transactional datasets. Today multi-level
datasets are more common in many domains. With this increase in usage there is a big demand for techniques to
discover multi-level and cross-level association rules and also techniques to measure interestingness of rules
                                   level
derived from multi-level datasets. J. Han & Y. Fu (1995, 1999), C.N. Zeigler et al (2005) proposed some
                      level
approaches for multi-level and cross
                       level       cross-level frequent itemset discovery have been proposed. However, multi
                                                                                                        multi-level
datasets are often a source of numerous rules and in fact the rules can be so numerous it can be much more
difficult to determine which ones are interesting (R. Agrawal et al 1993, G. Dong & J. Li 1998) Moreover, the
                                                                                              1998).
existing interestingness measures for single level association rules can not accurately measure the interestingness
of multi-level rules since they do not take into consideration the concept of the hierarchical structure that exists
in multi-level datasets.
     In this paper, we propose measures particularly for assessing the interestingness of multilevel association
rules by examining the diversity and peculiarity among rules. These measures can be determined at rule
discovery phase during post-processing to help users determine the interesting rules. Diversity of a data set is
                              processing
defined as when comparing two data sets, the one with more diverse rules is more interesting. Diversity will be
                                                                      rules
used to compare two data sets to determine which data set contains rules that are more interesting.
     The paper is organized as follows. Section 2 discusses related work. The theory, background and
assumptions behind our proposed interestingness measures are presented in Section 3. Experiments and results
                 ind
are presented in Section 4. Lastly, Section 5 concludes the paper.

2. Related Work
For as long as association rule mining has been around, there has been a need to determine which rules are
interesting. Originally this started with using the concepts of support and confidence (R. Agrawal et al 1993).
                                                                                        R.
Since then, many more measures have been proposed (G. Dong & J. Li 1998, L. Geng & H.J. Hamilton 2006, S.
                                                        G.
Lallich et al 2006). The Support-Confidence approach is appealing due to the anti monotonicity property of the
                                   Confidence
support. However, the support component will ignore itemsets with a low support even though these itemsets
may generate rules with a high confidence (S. Lallich et al 2006). Also, the Support-Confidence approach does
                                                                                      Confidence



                                                        20
Information and Knowledge Management                                                                     www.iiste.org
ISSN 2224-5758 (Paper) ISSN 2224-896X (Online)
                                 896X
Vol 2, No.6, 2012

not necessarily ensure that the rules are truly interesting, especially when the confidence is equal to the marginal
frequency of the consequent (S. Lallich et al 2006). Based on this argument, other measures for determine the
                                 S.                                                ,
interestingness of a rule are needed.
      All of these existing measures fall into three categories; objective based measures (based on the raw data),
        ll
subjective based (based on the raw data and the user) and semantic based measures (based on the semantic and
explanations of the patterns) (L. Geng & H.J. Hamilton 2006) In the survey presented in (L. Geng & H.J.
                                                                2006).
Hamilton 2006) there are nine criteria listed that can be used to determine if a pattern or rule is interesting. These
nine criteria are; conciseness, coverage, reliability, peculiarity, diversity, novelty, surprisingness, utility and
actionability or applicability. The first five criteria are considered to be objective, with the next two, novelty and
surprisingness being considered to be subjective. The final two criteria are considered to be semantic.
      Despite all the different measures, studies and works undertaken, there is no widely agreed upon formal
definition of what interestingness is in the context of patterns and association rules (L. Geng & H.J. Hamilton
2006). More recently several surveys of interestingness measures have been presented (L. Geng & H.J. Hamilton
2006, S. Lallich et al 2006, P. Lenca et al 2007 & K. McGray 2008). In K. McGray (2008) survey evaluated the
strengths and weaknesses of various measures from the point of view of the level or extent of user interaction. P.
Lenca et al (2007) survey looked at classifying various interestingness measures into five formal and five
experimental classes, along with eight evaluation properties. However, all of these surveys result in different
outcomes over how useful, suitable etc., an interestingness measure is. Therefore the usefulness of a measure can
be considered to be subjective.
      All of these measures mentioned above are for rules derived from single level datasets. They work on items
                             s
on a single level but do not have the capacity for comparing different levels or rules containing items from
multiple levels simultaneously.
Hilderman R.J. and Hamilton H.J. (2001) established three primary principles that a good interestingness
measure should satisfy :
     • The minimum value principle, which states that a uniform distribution is the most uninteresting.
                                principle,
     • The maximum value principle which states the most uneven distribution is the most interesting.
                                 principle,
     • The skewness principle, which states that the interestingness measure for the most uneven distribution
                                    ,
           will decrease when then number of classes of tuples increases.
     • The permutation invariance principle, which states that interestingness for diversity is unrelated to the
                                          principle,
           order of the class and it is only determined by the distribution of counts.
     • The transfer principle, which states that interestingness increases when a positive transfer is made from
                                  ,
           the count of one tuple to another whose count is greater.
Here in our work we propose to measure the interestingness of multi-level rules in terms of diversity and
                                                                                level
peculiarity (also known as distance). These measures were chosen as they are considered to be objective (rely on
just the data).

3. Interestingness Measures for Multi
                                    Multi-Level Association Rules
3.1. Diversity-Based Interestingness Measures
               Based
According to L. Geng & H.J. Hamilton 2006, a "pattern is diverse if its elements differ significantly from each
                                            2006,
other, while a set of patterns is diverse if the patterns in the set differ significantly from each other" (L. Geng &
H.J. Hamilton 2006). Summaries can be measured using diversity diversity-based interestingness measures. While there
                                                                              d
has been research on using diversity to measure summaries, there is little, if any, research that focuses on
measuring the interestingness of association or classification rules (L. Geng & H.J. Hamilton 2006) Therefore,
                                                                                                     2006).
this study suggests that diversity can be used to measure association rule interestingness.
     Diversity generally determined by two factors: 1) the proportional distribution of classes in the population,
and 2) the number of classes. Consider two different sets of rules mined from a dataset. We can consider the
                                              different
rules that are more diverse to be more interesting. Additionally, we can consider a set of rules less interesting if
the rules are less diverse. Another way of thinking about this is by considering that too many similar rules will
                                                                     considering
convey less knowledge to a user.
     The equations for variance and the Shannon C.E (1948) measure are shown in the equation (1) and (2)
                                          Shannon,
These are only two measures based on diversity. There are fourteen other measures based on diversity that are
                                                                                       based
available. Most measures are designed to be used with a specific application domain. We chose variance and
Shannon because of their wide-spread use.
                                spread
                                                  ∑
                                                                              (1)

                                                              ∑!$   !   "#    (2)




                                                         21
Information and Knowledge Management                                                                       www.iiste.org
ISSN 2224-5758 (Paper) ISSN 2224-896X (Online)
                                 896X
Vol 2, No.6, 2012

In the above equation for variance, pi is the probability for class i, and is the average probability for all classes.
                                                                     ,
3.2. Peculiarity Based Interestingness Measures
Peculiarity is an objective measure that determines how far away one association rule is from others. The further
away the rule is, the more peculiar. It is usually done through the use of a distance measure to determine how far
apart rules are from each other. Peculiar rules are usually few in number (often generated from outlying data)
and significantly different from the rest of the rule set. It is also possible that these peculiar rules can be
interesting as they may be unknown. G. Dong & J. Li (1998) proposed peculiarity measure, which is
neighborhood-based unexpected measure In this proposal it is argued that a rule’s interestingness is influenced
                                    measure.
by the rules that surround it in its neigh
                                     neighborhood.
     The measure is based on the idea of determining and measuring the symmetric difference between two rules,
which forms the basis of the distance between them. From this G. Dong & J. Li (1998) proposed that
unexpected confidence (where the confiden of a rule R is far from the average confidence of the rules in R’s
                                      confidence
neighborhood) and sparsity (where the number of mined rules in a neighborhood is far less than that of all the
potential rules for that neighborhood) could be determined, measured and used as interestingness measures (G.
                                                                                  ed
Dong & J. Li 1998, L. Geng & H.J. Hamilton 2006) 2006).
     G. Dong & J. Li (1998) measure determines the symmetric difference was developed for single level
datasets where each item was equally weighted. Thus the measure is actually a count of the number of items that
                                                                              lly
are not common between the two rules. In a multi
                                               multi-level dataset, each item cannot be regarded as being equal due
to the hierarchy. Thus the G. Dong & J. Li (1998) measure needs to be enhanced to be useful with these datasets.
Here we will present an enhancement as part of our proposed work.
     We believe it is possible to take the distance measure presented in (G. Dong & J. Li 1998) and enhance it
for multi-level datasets. The original measure is a syntax
                                                    syntax-based distance metric in the following form:
% & , &#      ( )  | + , - ∆ +# , -# | 1 (# ) | + ∆+# | 1 (2 | - ∆-# |                     (3)
The ∆ operator denotes the symmetric difference between two item sets, thus X ∆ Y is equivalent to (X –Y)∪(Y
–X) δ1, δ2 and δ3 are the weighting factors to be applied to different parts of the rule. Equation 3 measures the
peculiarity of two rules by a weighted sum of the cardinalities of the symmetric difference between the two
rule’s antecedents, consequents and the rules themselves.
     We propose an enhancement to this measure to allow it to handle a hierarchy. Under the existing measure,
                             ement
every item is unique and therefore none share any kind of ’syntax’ similarity. However, we argue that the items
1-*-*-*, 1-1-*-*, 1-1-1-* and 1-1-1- (based on Figure 1) all have a relationship with each other. Thus they are
                                     -1
not completely different and should have a ’syntax’ similarity due to their relation through the dataset’s
hierarchy.
     The greater the P(R1,R2) value is, lower similarity and so the greater the distance between those two rules.
Therefore, the further apart the relation is between two items, the greater the difference and distance. Thus if we
have,
R1 : 1 − 1 − 1 − * 1 − * − * − *
R2 : 1 − 1 − * − *     1−*−*−*
R3 : 1 − 1 − 1 − 1     1−*−*−*
We believe that the following should hold; P(R1,R3) < P(R2,R3) as 1     1-1-*-* and 1-1-1 are further removed
                                                                                           1-1
from each other than 1-1-1-* and 1
                             *      1-1-1-1. The difference between any two hierarchically related items nodes
                                          1.                                                            items
must be less than 1. Thus, for the above rules, 1 > P(R2,R3) > P(R1,R2) > 0. In order to achieve this we modify
Equation 3 by calculating the diversity of the symmetric difference between two rules instead of the cardinality
of the symmetric difference. The cardinality of the symmetric difference measures the difference between two
rules in terms of the number of different items in the rules. The diversity of the symmetric difference takes into
consideration the hierarchical difference of the items in the symmetric difference to measure the differ
                                                                                                    difference of the
two rules. We recite Equation 2 in terms of a set of items below, where S is a set containing n ite
                                                                                                items:
                                          3 ∑8; ∑8 : 456 !,7
                                                 9                  = ∑8; ∑8 : >6 !,7
                                                                            9
                            %/                                 1                         (4)
                                                < <                       < <

Where HRD is The Hierarchical Relationship Distance between two items is defined as the ratio between the
average number of levels between the two items and their common ancestor and the height of the tree. In general
HRD is defined as width or horizontal distance, which is defined as

                                                 ?>6 < ,@A B?>6 < ,@A
                             0&/      ,    #                                            (5)
                                                     #)CDEE4E!FGH




                                                               22
Information and Knowledge Management                                                                       www.iiste.org
ISSN 2224-5758 (Paper) ISSN 2224-896X (Online)
                                 896X
Vol 2, No.6, 2012

LD is the Level Distance which measures the distance between two items in terms of their height. This is also
called as height distance or vertical distance, which is defined as
                                                  ?>6 < ,<
                                 I/    ,   #                                           (6)
                                                 CDEE4E!FGH

NLD is Number of levels NLD(x,y) = |hierarchy level of x| - |hierarchy level y|.
N1 and n2 are two items, ca is common ancestor. The α, β are the user threshold values, where α+β=1. In our
experiment we set these values as α=β=0.5.
Thus the neighbourhood-based distance measure between two rules shown in Equation 3 now becomes;
                       based
           %J & , &#           ( ) %/ + , - ∆ +# , -# 1 (# ) %/ + ∆+# 1 (2 ) %/ - ∆-#
                                                                                   -                       (7)

4. Experiments
The dataset used for our experiments is a real world dataset, the BookCrossing dataset (obtained from
http://www.informatik.uni-freiburg.de/ cziegler/BX/). From this dataset we built a multi
                             freiburg.de/                                                     multi-level transactional
dataset that contains 91,550 user records and 970 leaf items, with 3 concept / hierarchy levels. To discover the
frequent itemsets we use the MLT2 L1 algorithm proposed by J. Han & Y. Fu (1995, 1999) with each concept
level having its own minimum support. From these frequent itemsets we then derive the f        frequent closed itemsets
and generators using the CLOSE+ algorithm proposed by J. Pei et al. From this we then derive the
                                                                                    .
non-redundant association rules using the MinMaxApprox (MMA) rule mining algorithm.
     redundant
      The confidence curve shows that the rules are spread out from 0.5 (which is the minimum confidence
threshold) up to close to 1. The distribution of rules in this area is fairly consistant and even, ranging from as low
as 2,181 rules for 0.95 to 1, to as high as 4,430 rules for 0.85 to 0.9. Using confidence to determine the
                                                                                               dence
interesting rules is more practical than support, but still leaves over 2,000 rules in the top bin.
      The overall diversity curve shows that the majority of rules (23,665) here have an average overall diversity
value of between 0.3 to 0.4. The curve however, also shows that there are some rules which have an overall
diveristy value below the majority, in the range of 0.15 to 0.25 and some that are above the majority, in the range
of 0.45 up to 0.7. The rules located above the majority are different to the rules that make up the majority and
                                                              re
could be of interest as these rules have a high overall diversity.
      The antecedent-consequent diversity curve is similar to that of the overall diversity. It has a similar spread
                      consequent
of rules, but the antecedent-consequent diversity curve peaks earlier at 0.3 to 0.35 (where as the overall diversity
                              consequent
curve peaks at 0.35 to 0.4), with 12,408 rules. The curve then drops down to a low number of rules at 0.45 to 0.5,
before peaking again at 0.5 to 0.55, wih 2,564 rules. The shape of this curve with that of the overall diversity
seems to show that the two diversity approaches are related. Using the antecedent-consequent diversity allows
                                                                                           consequent
rules with differing antecedents and consequents to be discovered when support and confidence will not identify
them.                             Fig 1. Interesting Mesures


                                           Interesting Measures
                       39000
                       36000
                       33000
                       30000
                       27000
        No. of Rules




                       24000                                                                   Support
                       21000
                                                                                               Conf
                       18000
                       15000                                                                   Ove_Diver_Dist
                       12000
                        9000                                                                   A_C_Dist
                        6000                                                                   Dist_Dist
                        3000
                           0
                                  0
                               0.05
                                0.1
                               0.15
                                0.2
                               0.25
                                0.3
                               0.35
                                0.4
                               0.45
                                0.5
                               0.55
                                0.6
                               0.65
                                0.7
                               0.75
                                0.8
                               0.85
                                0.9
                               0.95
                                  1




                                                 Measure Values




                                                              23
Information and Knowledge Management                                                                 www.iiste.org
ISSN 2224-5758 (Paper) ISSN 2224-896X (Online)
                                 896X
Vol 2, No.6, 2012

5. Conclusion
In this paper we proposed two interestingness measures for Multi Level Association rules These proposed
                                                                                          rules.
interestingness measures are diversity and peculiarity respectively. Diversity is a measure that compares items
within a rule and peculiarity compares items in two rules to see how different they are. In our experiments we
have shown how diversity and peculiarity distance can be used to identify potentially interesting rules which
can’t be identified using basic measurements support and confidence.

References
R. Agrawal, T. Imielinski and A. Swami. (1993), “Mining Association Rules between Sets of Items in Large
Databases”. In ACM SIGMOD International Conference on Management of Data (SIGMOD’93), pages
            .
207–216, Washington D.C., USA.
G. Dong and J. Li. (1998), “Interestingness of Discovered Association Rules in terms of Neighbourhood
                               Interestingness                                              Neighbourhood-Based
Unexpectedness”. In Second Pacific
                  .             Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’98),
                                                                         iscovery
pages 72–86, Melbourne, Australia.
L. Geng and H. J. Hamilton. (2006), “Interestingness Measures for Data Mining: A Survey ACM Computing
                                                                                      Survey”.
Surveys (CSUR), Volume 38, pages 9.
J. Han and Y. Fu (1995). “Discovery of Mult
                                Discovery      Multiple-Level Association Rules from Large Databases In 21st
                                                                                             Databases”.
International Conference on Very Large Databases (VLDB’95), pages 420–431, Zurich, Switzerland
                                                                                    ich, Switzerland.
J. Han and Y. Fu (1999). “Mining Multiple
                            Mining Multiple-Level Association Rules in Large Databases” IEEE Transactions on
                                                                                         ”.
Knowledge and Data Engineering, Volume 11, pages 798–805.
S. Lallich, O. Teytaud and E. Prudhomme (2006), “Association rule interestingness: measure and statistical
validation. Quality Measures in Data Mining Volume 43, pages 251–276.
                                        Mining”,
P. Lenca, B. Vaillant, B. Meyer and S. Lallich. (2007) “Association rule interestingness: experimental and
          ,
theoretical studies”. Studies in Computational Intelligence, Volume 43, pages 51–76.
                    .
K. McGarry. (2005), “A Survey of Interestingness Measures for Knowledge Discovery The Knowledge
                          A                                                        Discovery”.
Engineering Review, Volume 20, pages 39
                  ew,                      39–61.
C.-N. Ziegler, S. M. McNee, J. A. Konstan and G. Lausen (2005), “Improving Recommendation Lists Through
   N.                                                                  Improving
Topic Diversification”. In 14th International Conference on World Wide Web (WWW’05 pages 22–32, Chiba,
                        .                                                       (WWW’05),
Japan.
Lenca, P., Meyer, P., Vaillant, B., & Lallich, S. (2008). “On selecting interestingness measures for association
                                                             On
rules: User oriented description and multiple criteria decision aid European Journal of Operations Research,
                                                                 aid”.
184, 610-626.
Li, J., & Zhang, Y. (2003). “Direct Interesting Rule Generation Proceedings of the Third IEEE International
                                Direct                  Generation”.
Conference on Data Mining (ICDM '03) 155-167.
                                       '03),
Hilderman, R. J., & Hamilton, H. J. (2001). “Knowledge Discovery and Measures of In
                                                  Knowledge                               Interest”. Boston, MA:
Kluwer Academic.
Shannon, C. E. (1948). “A mathematical theory of communication The Bell System Technical Journal, 27
                           A                            communication”.                                       27,
379-423.




                                                       24
This academic article was published by The International Institute for Science,
Technology and Education (IISTE). The IISTE is a pioneer in the Open Access
Publishing service based in the U.S. and Europe. The aim of the institute is
Accelerating Global Knowledge Sharing.

More information about the publisher can be found in the IISTE’s homepage:
http://www.iiste.org


The IISTE is currently hosting more than 30 peer-reviewed academic journals and
collaborating with academic institutions around the world. Prospective authors of
IISTE journals can find the submission instruction on the following page:
http://www.iiste.org/Journals/

The IISTE editorial team promises to the review and publish all the qualified
submissions in a fast manner. All the journals articles are available online to the
readers all over the world without financial, legal, or technical barriers other than
those inseparable from gaining access to the internet itself. Printed version of the
journals is also available upon request of readers and authors.

IISTE Knowledge Sharing Partners

EBSCO, Index Copernicus, Ulrich's Periodicals Directory, JournalTOCS, PKP Open
Archives Harvester, Bielefeld Academic Search Engine, Elektronische
Zeitschriftenbibliothek EZB, Open J-Gate, OCLC WorldCat, Universe Digtial
Library , NewJour, Google Scholar

More Related Content

What's hot

Towards A Differential Privacy and Utility Preserving Machine Learning Classi...
Towards A Differential Privacy and Utility Preserving Machine Learning Classi...Towards A Differential Privacy and Utility Preserving Machine Learning Classi...
Towards A Differential Privacy and Utility Preserving Machine Learning Classi...Kato Mivule
 
Applying Data Privacy Techniques on Published Data in Uganda
 Applying Data Privacy Techniques on Published Data in Uganda Applying Data Privacy Techniques on Published Data in Uganda
Applying Data Privacy Techniques on Published Data in UgandaKato Mivule
 
4213ijaia01
4213ijaia014213ijaia01
4213ijaia01ijaia
 
A Comparative Analysis of Data Privacy and Utility Parameter Adjustment, Usin...
A Comparative Analysis of Data Privacy and Utility Parameter Adjustment, Usin...A Comparative Analysis of Data Privacy and Utility Parameter Adjustment, Usin...
A Comparative Analysis of Data Privacy and Utility Parameter Adjustment, Usin...Kato Mivule
 
Prognosticating Autism Spectrum Disorder Using Artificial Neural Network: Lev...
Prognosticating Autism Spectrum Disorder Using Artificial Neural Network: Lev...Prognosticating Autism Spectrum Disorder Using Artificial Neural Network: Lev...
Prognosticating Autism Spectrum Disorder Using Artificial Neural Network: Lev...Avishek Choudhury
 
COMPACT WEIGHTED CLASS ASSOCIATION RULE MINING USING INFORMATION GAIN
COMPACT WEIGHTED CLASS ASSOCIATION RULE MINING USING INFORMATION GAINCOMPACT WEIGHTED CLASS ASSOCIATION RULE MINING USING INFORMATION GAIN
COMPACT WEIGHTED CLASS ASSOCIATION RULE MINING USING INFORMATION GAINIJDKP
 

What's hot (10)

Towards A Differential Privacy and Utility Preserving Machine Learning Classi...
Towards A Differential Privacy and Utility Preserving Machine Learning Classi...Towards A Differential Privacy and Utility Preserving Machine Learning Classi...
Towards A Differential Privacy and Utility Preserving Machine Learning Classi...
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
 
Applying Data Privacy Techniques on Published Data in Uganda
 Applying Data Privacy Techniques on Published Data in Uganda Applying Data Privacy Techniques on Published Data in Uganda
Applying Data Privacy Techniques on Published Data in Uganda
 
Gr2411971203
Gr2411971203Gr2411971203
Gr2411971203
 
4213ijaia01
4213ijaia014213ijaia01
4213ijaia01
 
A Comparative Analysis of Data Privacy and Utility Parameter Adjustment, Usin...
A Comparative Analysis of Data Privacy and Utility Parameter Adjustment, Usin...A Comparative Analysis of Data Privacy and Utility Parameter Adjustment, Usin...
A Comparative Analysis of Data Privacy and Utility Parameter Adjustment, Usin...
 
J046065457
J046065457J046065457
J046065457
 
Exploration versus exploitation information filtering challenges joys and l...
Exploration versus exploitation   information filtering challenges joys and l...Exploration versus exploitation   information filtering challenges joys and l...
Exploration versus exploitation information filtering challenges joys and l...
 
Prognosticating Autism Spectrum Disorder Using Artificial Neural Network: Lev...
Prognosticating Autism Spectrum Disorder Using Artificial Neural Network: Lev...Prognosticating Autism Spectrum Disorder Using Artificial Neural Network: Lev...
Prognosticating Autism Spectrum Disorder Using Artificial Neural Network: Lev...
 
COMPACT WEIGHTED CLASS ASSOCIATION RULE MINING USING INFORMATION GAIN
COMPACT WEIGHTED CLASS ASSOCIATION RULE MINING USING INFORMATION GAINCOMPACT WEIGHTED CLASS ASSOCIATION RULE MINING USING INFORMATION GAIN
COMPACT WEIGHTED CLASS ASSOCIATION RULE MINING USING INFORMATION GAIN
 

Similar to Interestingness measures for multi level association rules

The use of LISREL in validating marketing constructs
The use of LISREL in validating marketing constructsThe use of LISREL in validating marketing constructs
The use of LISREL in validating marketing constructsJan-Benedict Steenkamp
 
The Efficacy of Technology Acceptance Model: A Review of Applicable Theoretic...
The Efficacy of Technology Acceptance Model: A Review of Applicable Theoretic...The Efficacy of Technology Acceptance Model: A Review of Applicable Theoretic...
The Efficacy of Technology Acceptance Model: A Review of Applicable Theoretic...QUESTJOURNAL
 
Interestingness Measures In Rule Mining: A Valuation
Interestingness Measures In Rule Mining: A ValuationInterestingness Measures In Rule Mining: A Valuation
Interestingness Measures In Rule Mining: A ValuationIJERA Editor
 
Why Data Citation Currently Misses the Point
Why Data Citation Currently Misses the PointWhy Data Citation Currently Misses the Point
Why Data Citation Currently Misses the PointMark Parsons
 
History and Overview of the Recommender Systems.pdf
History and Overview of the Recommender Systems.pdfHistory and Overview of the Recommender Systems.pdf
History and Overview of the Recommender Systems.pdfssuser5b0f5e
 
A fuzzy analytical hierarchy model for solid waste
 A fuzzy analytical hierarchy model for solid waste A fuzzy analytical hierarchy model for solid waste
A fuzzy analytical hierarchy model for solid wasteAlexander Decker
 
A fuzzy analytical hierarchy model for solid waste
 A fuzzy analytical hierarchy model for solid waste A fuzzy analytical hierarchy model for solid waste
A fuzzy analytical hierarchy model for solid wasteAlexander Decker
 
A fuzzy analytical hierarchy model for solid waste
 A fuzzy analytical hierarchy model for solid waste A fuzzy analytical hierarchy model for solid waste
A fuzzy analytical hierarchy model for solid wasteAlexander Decker
 
Observational research methods
Observational research methodsObservational research methods
Observational research methodsChrisSwanson37
 
Barki & hartwick, 1994, misq, measuring user participation, user involvement,...
Barki & hartwick, 1994, misq, measuring user participation, user involvement,...Barki & hartwick, 1994, misq, measuring user participation, user involvement,...
Barki & hartwick, 1994, misq, measuring user participation, user involvement,...ilham_ilyas772
 
ROLE OF CERTAINTY FACTOR IN GENERATING ROUGH-FUZZY RULE
ROLE OF CERTAINTY FACTOR IN GENERATING ROUGH-FUZZY RULEROLE OF CERTAINTY FACTOR IN GENERATING ROUGH-FUZZY RULE
ROLE OF CERTAINTY FACTOR IN GENERATING ROUGH-FUZZY RULEIJCSEA Journal
 
A literature review of modern association rule mining techniques
A literature review of modern association rule mining techniquesA literature review of modern association rule mining techniques
A literature review of modern association rule mining techniquesijctet
 

Similar to Interestingness measures for multi level association rules (20)

Hu3414421448
Hu3414421448Hu3414421448
Hu3414421448
 
Ae32208215
Ae32208215Ae32208215
Ae32208215
 
I43055257
I43055257I43055257
I43055257
 
The use of LISREL in validating marketing constructs
The use of LISREL in validating marketing constructsThe use of LISREL in validating marketing constructs
The use of LISREL in validating marketing constructs
 
Petit_Final
Petit_FinalPetit_Final
Petit_Final
 
The Efficacy of Technology Acceptance Model: A Review of Applicable Theoretic...
The Efficacy of Technology Acceptance Model: A Review of Applicable Theoretic...The Efficacy of Technology Acceptance Model: A Review of Applicable Theoretic...
The Efficacy of Technology Acceptance Model: A Review of Applicable Theoretic...
 
Interestingness Measures In Rule Mining: A Valuation
Interestingness Measures In Rule Mining: A ValuationInterestingness Measures In Rule Mining: A Valuation
Interestingness Measures In Rule Mining: A Valuation
 
Why Data Citation Currently Misses the Point
Why Data Citation Currently Misses the PointWhy Data Citation Currently Misses the Point
Why Data Citation Currently Misses the Point
 
System Adoption: Socio-Technical Integration
System Adoption: Socio-Technical IntegrationSystem Adoption: Socio-Technical Integration
System Adoption: Socio-Technical Integration
 
History and Overview of the Recommender Systems.pdf
History and Overview of the Recommender Systems.pdfHistory and Overview of the Recommender Systems.pdf
History and Overview of the Recommender Systems.pdf
 
Hl2513421351
Hl2513421351Hl2513421351
Hl2513421351
 
Hl2513421351
Hl2513421351Hl2513421351
Hl2513421351
 
A fuzzy analytical hierarchy model for solid waste
 A fuzzy analytical hierarchy model for solid waste A fuzzy analytical hierarchy model for solid waste
A fuzzy analytical hierarchy model for solid waste
 
A fuzzy analytical hierarchy model for solid waste
 A fuzzy analytical hierarchy model for solid waste A fuzzy analytical hierarchy model for solid waste
A fuzzy analytical hierarchy model for solid waste
 
A fuzzy analytical hierarchy model for solid waste
 A fuzzy analytical hierarchy model for solid waste A fuzzy analytical hierarchy model for solid waste
A fuzzy analytical hierarchy model for solid waste
 
44 49
44 4944 49
44 49
 
Observational research methods
Observational research methodsObservational research methods
Observational research methods
 
Barki & hartwick, 1994, misq, measuring user participation, user involvement,...
Barki & hartwick, 1994, misq, measuring user participation, user involvement,...Barki & hartwick, 1994, misq, measuring user participation, user involvement,...
Barki & hartwick, 1994, misq, measuring user participation, user involvement,...
 
ROLE OF CERTAINTY FACTOR IN GENERATING ROUGH-FUZZY RULE
ROLE OF CERTAINTY FACTOR IN GENERATING ROUGH-FUZZY RULEROLE OF CERTAINTY FACTOR IN GENERATING ROUGH-FUZZY RULE
ROLE OF CERTAINTY FACTOR IN GENERATING ROUGH-FUZZY RULE
 
A literature review of modern association rule mining techniques
A literature review of modern association rule mining techniquesA literature review of modern association rule mining techniques
A literature review of modern association rule mining techniques
 

More from Alexander Decker

Abnormalities of hormones and inflammatory cytokines in women affected with p...
Abnormalities of hormones and inflammatory cytokines in women affected with p...Abnormalities of hormones and inflammatory cytokines in women affected with p...
Abnormalities of hormones and inflammatory cytokines in women affected with p...Alexander Decker
 
A validation of the adverse childhood experiences scale in
A validation of the adverse childhood experiences scale inA validation of the adverse childhood experiences scale in
A validation of the adverse childhood experiences scale inAlexander Decker
 
A usability evaluation framework for b2 c e commerce websites
A usability evaluation framework for b2 c e commerce websitesA usability evaluation framework for b2 c e commerce websites
A usability evaluation framework for b2 c e commerce websitesAlexander Decker
 
A universal model for managing the marketing executives in nigerian banks
A universal model for managing the marketing executives in nigerian banksA universal model for managing the marketing executives in nigerian banks
A universal model for managing the marketing executives in nigerian banksAlexander Decker
 
A unique common fixed point theorems in generalized d
A unique common fixed point theorems in generalized dA unique common fixed point theorems in generalized d
A unique common fixed point theorems in generalized dAlexander Decker
 
A trends of salmonella and antibiotic resistance
A trends of salmonella and antibiotic resistanceA trends of salmonella and antibiotic resistance
A trends of salmonella and antibiotic resistanceAlexander Decker
 
A transformational generative approach towards understanding al-istifham
A transformational  generative approach towards understanding al-istifhamA transformational  generative approach towards understanding al-istifham
A transformational generative approach towards understanding al-istifhamAlexander Decker
 
A time series analysis of the determinants of savings in namibia
A time series analysis of the determinants of savings in namibiaA time series analysis of the determinants of savings in namibia
A time series analysis of the determinants of savings in namibiaAlexander Decker
 
A therapy for physical and mental fitness of school children
A therapy for physical and mental fitness of school childrenA therapy for physical and mental fitness of school children
A therapy for physical and mental fitness of school childrenAlexander Decker
 
A theory of efficiency for managing the marketing executives in nigerian banks
A theory of efficiency for managing the marketing executives in nigerian banksA theory of efficiency for managing the marketing executives in nigerian banks
A theory of efficiency for managing the marketing executives in nigerian banksAlexander Decker
 
A systematic evaluation of link budget for
A systematic evaluation of link budget forA systematic evaluation of link budget for
A systematic evaluation of link budget forAlexander Decker
 
A synthetic review of contraceptive supplies in punjab
A synthetic review of contraceptive supplies in punjabA synthetic review of contraceptive supplies in punjab
A synthetic review of contraceptive supplies in punjabAlexander Decker
 
A synthesis of taylor’s and fayol’s management approaches for managing market...
A synthesis of taylor’s and fayol’s management approaches for managing market...A synthesis of taylor’s and fayol’s management approaches for managing market...
A synthesis of taylor’s and fayol’s management approaches for managing market...Alexander Decker
 
A survey paper on sequence pattern mining with incremental
A survey paper on sequence pattern mining with incrementalA survey paper on sequence pattern mining with incremental
A survey paper on sequence pattern mining with incrementalAlexander Decker
 
A survey on live virtual machine migrations and its techniques
A survey on live virtual machine migrations and its techniquesA survey on live virtual machine migrations and its techniques
A survey on live virtual machine migrations and its techniquesAlexander Decker
 
A survey on data mining and analysis in hadoop and mongo db
A survey on data mining and analysis in hadoop and mongo dbA survey on data mining and analysis in hadoop and mongo db
A survey on data mining and analysis in hadoop and mongo dbAlexander Decker
 
A survey on challenges to the media cloud
A survey on challenges to the media cloudA survey on challenges to the media cloud
A survey on challenges to the media cloudAlexander Decker
 
A survey of provenance leveraged
A survey of provenance leveragedA survey of provenance leveraged
A survey of provenance leveragedAlexander Decker
 
A survey of private equity investments in kenya
A survey of private equity investments in kenyaA survey of private equity investments in kenya
A survey of private equity investments in kenyaAlexander Decker
 
A study to measures the financial health of
A study to measures the financial health ofA study to measures the financial health of
A study to measures the financial health ofAlexander Decker
 

More from Alexander Decker (20)

Abnormalities of hormones and inflammatory cytokines in women affected with p...
Abnormalities of hormones and inflammatory cytokines in women affected with p...Abnormalities of hormones and inflammatory cytokines in women affected with p...
Abnormalities of hormones and inflammatory cytokines in women affected with p...
 
A validation of the adverse childhood experiences scale in
A validation of the adverse childhood experiences scale inA validation of the adverse childhood experiences scale in
A validation of the adverse childhood experiences scale in
 
A usability evaluation framework for b2 c e commerce websites
A usability evaluation framework for b2 c e commerce websitesA usability evaluation framework for b2 c e commerce websites
A usability evaluation framework for b2 c e commerce websites
 
A universal model for managing the marketing executives in nigerian banks
A universal model for managing the marketing executives in nigerian banksA universal model for managing the marketing executives in nigerian banks
A universal model for managing the marketing executives in nigerian banks
 
A unique common fixed point theorems in generalized d
A unique common fixed point theorems in generalized dA unique common fixed point theorems in generalized d
A unique common fixed point theorems in generalized d
 
A trends of salmonella and antibiotic resistance
A trends of salmonella and antibiotic resistanceA trends of salmonella and antibiotic resistance
A trends of salmonella and antibiotic resistance
 
A transformational generative approach towards understanding al-istifham
A transformational  generative approach towards understanding al-istifhamA transformational  generative approach towards understanding al-istifham
A transformational generative approach towards understanding al-istifham
 
A time series analysis of the determinants of savings in namibia
A time series analysis of the determinants of savings in namibiaA time series analysis of the determinants of savings in namibia
A time series analysis of the determinants of savings in namibia
 
A therapy for physical and mental fitness of school children
A therapy for physical and mental fitness of school childrenA therapy for physical and mental fitness of school children
A therapy for physical and mental fitness of school children
 
A theory of efficiency for managing the marketing executives in nigerian banks
A theory of efficiency for managing the marketing executives in nigerian banksA theory of efficiency for managing the marketing executives in nigerian banks
A theory of efficiency for managing the marketing executives in nigerian banks
 
A systematic evaluation of link budget for
A systematic evaluation of link budget forA systematic evaluation of link budget for
A systematic evaluation of link budget for
 
A synthetic review of contraceptive supplies in punjab
A synthetic review of contraceptive supplies in punjabA synthetic review of contraceptive supplies in punjab
A synthetic review of contraceptive supplies in punjab
 
A synthesis of taylor’s and fayol’s management approaches for managing market...
A synthesis of taylor’s and fayol’s management approaches for managing market...A synthesis of taylor’s and fayol’s management approaches for managing market...
A synthesis of taylor’s and fayol’s management approaches for managing market...
 
A survey paper on sequence pattern mining with incremental
A survey paper on sequence pattern mining with incrementalA survey paper on sequence pattern mining with incremental
A survey paper on sequence pattern mining with incremental
 
A survey on live virtual machine migrations and its techniques
A survey on live virtual machine migrations and its techniquesA survey on live virtual machine migrations and its techniques
A survey on live virtual machine migrations and its techniques
 
A survey on data mining and analysis in hadoop and mongo db
A survey on data mining and analysis in hadoop and mongo dbA survey on data mining and analysis in hadoop and mongo db
A survey on data mining and analysis in hadoop and mongo db
 
A survey on challenges to the media cloud
A survey on challenges to the media cloudA survey on challenges to the media cloud
A survey on challenges to the media cloud
 
A survey of provenance leveraged
A survey of provenance leveragedA survey of provenance leveraged
A survey of provenance leveraged
 
A survey of private equity investments in kenya
A survey of private equity investments in kenyaA survey of private equity investments in kenya
A survey of private equity investments in kenya
 
A study to measures the financial health of
A study to measures the financial health ofA study to measures the financial health of
A study to measures the financial health of
 

Recently uploaded

Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 

Recently uploaded (20)

Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 

Interestingness measures for multi level association rules

  • 1. Information and Knowledge Management www.iiste.org ISSN 2224-5758 (Paper) ISSN 2224-896X (Online) 896X Vol 2, No.6, 2012 Interestingness Measures for Multi Level Association Rules Multi-Level R Vijaya Prakash1*, Dr. A. Govardhan2, Prof. SSVN. Sarma3 1. Dept. of Informatics, Kakatiya University, Warangal, India 2. Dept. Of Computer Science & Engineering, JNT University, Hyderabad, India 3. Dept. Of Computer Science & Engineering, Vaagdevi College of Engineering *E-mail of the corresponding author: vijprak@hotmail.com mail Abstract Association rule mining is one technique that is widely used to obtain useful associations rules among sets of items. Much work has been done focusing on efficiency, effectiveness and redundancy. There has also been a focusing on the quality of rules from single level datasets with many interestingness measures proposed. However, there is a lack of interestingness measures developed for multi multi-level and cross- -level Association rules. Single level measures do not take into account the hierarchy found in a multi multi-level dataset. This leaves the Support-Confidence approach, which does not consider for the hierarchy. In this paper we propose two approaches which measure multi-level association rules to help and evaluate their interestingness. These level measures of diversity and peculiarity can be used to identify those rules from multi- -level datasets that are potentially useful. Keywords: Information Retrieval, Interestingness Measures, Association Rules, Multi-Level Datasets Itemsets Level Datasets, 1. Introduction Interestingness measures are necessary to rank Association rule patterns. Each interestingness measure produces ssociation different results, and experts, Lenca et.al (2008) have different opinions of what constitutes a good rule. The , interestingness of discovered association rules is an important and active area within data mining research (L. Geng & H.J. Hamilton 2006). The primary problem is the selection of interestingness measures for a given . application domain. However, there is no formal agreement on a definition for what makes rules interesting. Association rule algorithms produce thousands of rules, m many of which are redundant (K. McGarry 2005, Li. J. et al 2003). In order to filter the rules, the user generally supplies a minimum threshold for support and . confidence. Support and confidence confidence(R. Agrawal et al 1993, G. Dong & J. Li 1998, S. Lallich et al 2006) are basic measures of association rule interestingness. All of these measures were proposed for association rules derived from single level or flat datasets, which were most commonly transactional datasets. Today multi-level datasets are more common in many domains. With this increase in usage there is a big demand for techniques to discover multi-level and cross-level association rules and also techniques to measure interestingness of rules level derived from multi-level datasets. J. Han & Y. Fu (1995, 1999), C.N. Zeigler et al (2005) proposed some level approaches for multi-level and cross level cross-level frequent itemset discovery have been proposed. However, multi multi-level datasets are often a source of numerous rules and in fact the rules can be so numerous it can be much more difficult to determine which ones are interesting (R. Agrawal et al 1993, G. Dong & J. Li 1998) Moreover, the 1998). existing interestingness measures for single level association rules can not accurately measure the interestingness of multi-level rules since they do not take into consideration the concept of the hierarchical structure that exists in multi-level datasets. In this paper, we propose measures particularly for assessing the interestingness of multilevel association rules by examining the diversity and peculiarity among rules. These measures can be determined at rule discovery phase during post-processing to help users determine the interesting rules. Diversity of a data set is processing defined as when comparing two data sets, the one with more diverse rules is more interesting. Diversity will be rules used to compare two data sets to determine which data set contains rules that are more interesting. The paper is organized as follows. Section 2 discusses related work. The theory, background and assumptions behind our proposed interestingness measures are presented in Section 3. Experiments and results ind are presented in Section 4. Lastly, Section 5 concludes the paper. 2. Related Work For as long as association rule mining has been around, there has been a need to determine which rules are interesting. Originally this started with using the concepts of support and confidence (R. Agrawal et al 1993). R. Since then, many more measures have been proposed (G. Dong & J. Li 1998, L. Geng & H.J. Hamilton 2006, S. G. Lallich et al 2006). The Support-Confidence approach is appealing due to the anti monotonicity property of the Confidence support. However, the support component will ignore itemsets with a low support even though these itemsets may generate rules with a high confidence (S. Lallich et al 2006). Also, the Support-Confidence approach does Confidence 20
  • 2. Information and Knowledge Management www.iiste.org ISSN 2224-5758 (Paper) ISSN 2224-896X (Online) 896X Vol 2, No.6, 2012 not necessarily ensure that the rules are truly interesting, especially when the confidence is equal to the marginal frequency of the consequent (S. Lallich et al 2006). Based on this argument, other measures for determine the S. , interestingness of a rule are needed. All of these existing measures fall into three categories; objective based measures (based on the raw data), ll subjective based (based on the raw data and the user) and semantic based measures (based on the semantic and explanations of the patterns) (L. Geng & H.J. Hamilton 2006) In the survey presented in (L. Geng & H.J. 2006). Hamilton 2006) there are nine criteria listed that can be used to determine if a pattern or rule is interesting. These nine criteria are; conciseness, coverage, reliability, peculiarity, diversity, novelty, surprisingness, utility and actionability or applicability. The first five criteria are considered to be objective, with the next two, novelty and surprisingness being considered to be subjective. The final two criteria are considered to be semantic. Despite all the different measures, studies and works undertaken, there is no widely agreed upon formal definition of what interestingness is in the context of patterns and association rules (L. Geng & H.J. Hamilton 2006). More recently several surveys of interestingness measures have been presented (L. Geng & H.J. Hamilton 2006, S. Lallich et al 2006, P. Lenca et al 2007 & K. McGray 2008). In K. McGray (2008) survey evaluated the strengths and weaknesses of various measures from the point of view of the level or extent of user interaction. P. Lenca et al (2007) survey looked at classifying various interestingness measures into five formal and five experimental classes, along with eight evaluation properties. However, all of these surveys result in different outcomes over how useful, suitable etc., an interestingness measure is. Therefore the usefulness of a measure can be considered to be subjective. All of these measures mentioned above are for rules derived from single level datasets. They work on items s on a single level but do not have the capacity for comparing different levels or rules containing items from multiple levels simultaneously. Hilderman R.J. and Hamilton H.J. (2001) established three primary principles that a good interestingness measure should satisfy : • The minimum value principle, which states that a uniform distribution is the most uninteresting. principle, • The maximum value principle which states the most uneven distribution is the most interesting. principle, • The skewness principle, which states that the interestingness measure for the most uneven distribution , will decrease when then number of classes of tuples increases. • The permutation invariance principle, which states that interestingness for diversity is unrelated to the principle, order of the class and it is only determined by the distribution of counts. • The transfer principle, which states that interestingness increases when a positive transfer is made from , the count of one tuple to another whose count is greater. Here in our work we propose to measure the interestingness of multi-level rules in terms of diversity and level peculiarity (also known as distance). These measures were chosen as they are considered to be objective (rely on just the data). 3. Interestingness Measures for Multi Multi-Level Association Rules 3.1. Diversity-Based Interestingness Measures Based According to L. Geng & H.J. Hamilton 2006, a "pattern is diverse if its elements differ significantly from each 2006, other, while a set of patterns is diverse if the patterns in the set differ significantly from each other" (L. Geng & H.J. Hamilton 2006). Summaries can be measured using diversity diversity-based interestingness measures. While there d has been research on using diversity to measure summaries, there is little, if any, research that focuses on measuring the interestingness of association or classification rules (L. Geng & H.J. Hamilton 2006) Therefore, 2006). this study suggests that diversity can be used to measure association rule interestingness. Diversity generally determined by two factors: 1) the proportional distribution of classes in the population, and 2) the number of classes. Consider two different sets of rules mined from a dataset. We can consider the different rules that are more diverse to be more interesting. Additionally, we can consider a set of rules less interesting if the rules are less diverse. Another way of thinking about this is by considering that too many similar rules will considering convey less knowledge to a user. The equations for variance and the Shannon C.E (1948) measure are shown in the equation (1) and (2) Shannon, These are only two measures based on diversity. There are fourteen other measures based on diversity that are based available. Most measures are designed to be used with a specific application domain. We chose variance and Shannon because of their wide-spread use. spread ∑ (1) ∑!$ ! "# (2) 21
  • 3. Information and Knowledge Management www.iiste.org ISSN 2224-5758 (Paper) ISSN 2224-896X (Online) 896X Vol 2, No.6, 2012 In the above equation for variance, pi is the probability for class i, and is the average probability for all classes. , 3.2. Peculiarity Based Interestingness Measures Peculiarity is an objective measure that determines how far away one association rule is from others. The further away the rule is, the more peculiar. It is usually done through the use of a distance measure to determine how far apart rules are from each other. Peculiar rules are usually few in number (often generated from outlying data) and significantly different from the rest of the rule set. It is also possible that these peculiar rules can be interesting as they may be unknown. G. Dong & J. Li (1998) proposed peculiarity measure, which is neighborhood-based unexpected measure In this proposal it is argued that a rule’s interestingness is influenced measure. by the rules that surround it in its neigh neighborhood. The measure is based on the idea of determining and measuring the symmetric difference between two rules, which forms the basis of the distance between them. From this G. Dong & J. Li (1998) proposed that unexpected confidence (where the confiden of a rule R is far from the average confidence of the rules in R’s confidence neighborhood) and sparsity (where the number of mined rules in a neighborhood is far less than that of all the potential rules for that neighborhood) could be determined, measured and used as interestingness measures (G. ed Dong & J. Li 1998, L. Geng & H.J. Hamilton 2006) 2006). G. Dong & J. Li (1998) measure determines the symmetric difference was developed for single level datasets where each item was equally weighted. Thus the measure is actually a count of the number of items that lly are not common between the two rules. In a multi multi-level dataset, each item cannot be regarded as being equal due to the hierarchy. Thus the G. Dong & J. Li (1998) measure needs to be enhanced to be useful with these datasets. Here we will present an enhancement as part of our proposed work. We believe it is possible to take the distance measure presented in (G. Dong & J. Li 1998) and enhance it for multi-level datasets. The original measure is a syntax syntax-based distance metric in the following form: % & , &# ( ) | + , - ∆ +# , -# | 1 (# ) | + ∆+# | 1 (2 | - ∆-# | (3) The ∆ operator denotes the symmetric difference between two item sets, thus X ∆ Y is equivalent to (X –Y)∪(Y –X) δ1, δ2 and δ3 are the weighting factors to be applied to different parts of the rule. Equation 3 measures the peculiarity of two rules by a weighted sum of the cardinalities of the symmetric difference between the two rule’s antecedents, consequents and the rules themselves. We propose an enhancement to this measure to allow it to handle a hierarchy. Under the existing measure, ement every item is unique and therefore none share any kind of ’syntax’ similarity. However, we argue that the items 1-*-*-*, 1-1-*-*, 1-1-1-* and 1-1-1- (based on Figure 1) all have a relationship with each other. Thus they are -1 not completely different and should have a ’syntax’ similarity due to their relation through the dataset’s hierarchy. The greater the P(R1,R2) value is, lower similarity and so the greater the distance between those two rules. Therefore, the further apart the relation is between two items, the greater the difference and distance. Thus if we have, R1 : 1 − 1 − 1 − * 1 − * − * − * R2 : 1 − 1 − * − * 1−*−*−* R3 : 1 − 1 − 1 − 1 1−*−*−* We believe that the following should hold; P(R1,R3) < P(R2,R3) as 1 1-1-*-* and 1-1-1 are further removed 1-1 from each other than 1-1-1-* and 1 * 1-1-1-1. The difference between any two hierarchically related items nodes 1. items must be less than 1. Thus, for the above rules, 1 > P(R2,R3) > P(R1,R2) > 0. In order to achieve this we modify Equation 3 by calculating the diversity of the symmetric difference between two rules instead of the cardinality of the symmetric difference. The cardinality of the symmetric difference measures the difference between two rules in terms of the number of different items in the rules. The diversity of the symmetric difference takes into consideration the hierarchical difference of the items in the symmetric difference to measure the differ difference of the two rules. We recite Equation 2 in terms of a set of items below, where S is a set containing n ite items: 3 ∑8; ∑8 : 456 !,7 9 = ∑8; ∑8 : >6 !,7 9 %/ 1 (4) < < < < Where HRD is The Hierarchical Relationship Distance between two items is defined as the ratio between the average number of levels between the two items and their common ancestor and the height of the tree. In general HRD is defined as width or horizontal distance, which is defined as ?>6 < ,@A B?>6 < ,@A 0&/ , # (5) #)CDEE4E!FGH 22
  • 4. Information and Knowledge Management www.iiste.org ISSN 2224-5758 (Paper) ISSN 2224-896X (Online) 896X Vol 2, No.6, 2012 LD is the Level Distance which measures the distance between two items in terms of their height. This is also called as height distance or vertical distance, which is defined as ?>6 < ,< I/ , # (6) CDEE4E!FGH NLD is Number of levels NLD(x,y) = |hierarchy level of x| - |hierarchy level y|. N1 and n2 are two items, ca is common ancestor. The α, β are the user threshold values, where α+β=1. In our experiment we set these values as α=β=0.5. Thus the neighbourhood-based distance measure between two rules shown in Equation 3 now becomes; based %J & , &# ( ) %/ + , - ∆ +# , -# 1 (# ) %/ + ∆+# 1 (2 ) %/ - ∆-# - (7) 4. Experiments The dataset used for our experiments is a real world dataset, the BookCrossing dataset (obtained from http://www.informatik.uni-freiburg.de/ cziegler/BX/). From this dataset we built a multi freiburg.de/ multi-level transactional dataset that contains 91,550 user records and 970 leaf items, with 3 concept / hierarchy levels. To discover the frequent itemsets we use the MLT2 L1 algorithm proposed by J. Han & Y. Fu (1995, 1999) with each concept level having its own minimum support. From these frequent itemsets we then derive the f frequent closed itemsets and generators using the CLOSE+ algorithm proposed by J. Pei et al. From this we then derive the . non-redundant association rules using the MinMaxApprox (MMA) rule mining algorithm. redundant The confidence curve shows that the rules are spread out from 0.5 (which is the minimum confidence threshold) up to close to 1. The distribution of rules in this area is fairly consistant and even, ranging from as low as 2,181 rules for 0.95 to 1, to as high as 4,430 rules for 0.85 to 0.9. Using confidence to determine the dence interesting rules is more practical than support, but still leaves over 2,000 rules in the top bin. The overall diversity curve shows that the majority of rules (23,665) here have an average overall diversity value of between 0.3 to 0.4. The curve however, also shows that there are some rules which have an overall diveristy value below the majority, in the range of 0.15 to 0.25 and some that are above the majority, in the range of 0.45 up to 0.7. The rules located above the majority are different to the rules that make up the majority and re could be of interest as these rules have a high overall diversity. The antecedent-consequent diversity curve is similar to that of the overall diversity. It has a similar spread consequent of rules, but the antecedent-consequent diversity curve peaks earlier at 0.3 to 0.35 (where as the overall diversity consequent curve peaks at 0.35 to 0.4), with 12,408 rules. The curve then drops down to a low number of rules at 0.45 to 0.5, before peaking again at 0.5 to 0.55, wih 2,564 rules. The shape of this curve with that of the overall diversity seems to show that the two diversity approaches are related. Using the antecedent-consequent diversity allows consequent rules with differing antecedents and consequents to be discovered when support and confidence will not identify them. Fig 1. Interesting Mesures Interesting Measures 39000 36000 33000 30000 27000 No. of Rules 24000 Support 21000 Conf 18000 15000 Ove_Diver_Dist 12000 9000 A_C_Dist 6000 Dist_Dist 3000 0 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.55 0.6 0.65 0.7 0.75 0.8 0.85 0.9 0.95 1 Measure Values 23
  • 5. Information and Knowledge Management www.iiste.org ISSN 2224-5758 (Paper) ISSN 2224-896X (Online) 896X Vol 2, No.6, 2012 5. Conclusion In this paper we proposed two interestingness measures for Multi Level Association rules These proposed rules. interestingness measures are diversity and peculiarity respectively. Diversity is a measure that compares items within a rule and peculiarity compares items in two rules to see how different they are. In our experiments we have shown how diversity and peculiarity distance can be used to identify potentially interesting rules which can’t be identified using basic measurements support and confidence. References R. Agrawal, T. Imielinski and A. Swami. (1993), “Mining Association Rules between Sets of Items in Large Databases”. In ACM SIGMOD International Conference on Management of Data (SIGMOD’93), pages . 207–216, Washington D.C., USA. G. Dong and J. Li. (1998), “Interestingness of Discovered Association Rules in terms of Neighbourhood Interestingness Neighbourhood-Based Unexpectedness”. In Second Pacific . Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’98), iscovery pages 72–86, Melbourne, Australia. L. Geng and H. J. Hamilton. (2006), “Interestingness Measures for Data Mining: A Survey ACM Computing Survey”. Surveys (CSUR), Volume 38, pages 9. J. Han and Y. Fu (1995). “Discovery of Mult Discovery Multiple-Level Association Rules from Large Databases In 21st Databases”. International Conference on Very Large Databases (VLDB’95), pages 420–431, Zurich, Switzerland ich, Switzerland. J. Han and Y. Fu (1999). “Mining Multiple Mining Multiple-Level Association Rules in Large Databases” IEEE Transactions on ”. Knowledge and Data Engineering, Volume 11, pages 798–805. S. Lallich, O. Teytaud and E. Prudhomme (2006), “Association rule interestingness: measure and statistical validation. Quality Measures in Data Mining Volume 43, pages 251–276. Mining”, P. Lenca, B. Vaillant, B. Meyer and S. Lallich. (2007) “Association rule interestingness: experimental and , theoretical studies”. Studies in Computational Intelligence, Volume 43, pages 51–76. . K. McGarry. (2005), “A Survey of Interestingness Measures for Knowledge Discovery The Knowledge A Discovery”. Engineering Review, Volume 20, pages 39 ew, 39–61. C.-N. Ziegler, S. M. McNee, J. A. Konstan and G. Lausen (2005), “Improving Recommendation Lists Through N. Improving Topic Diversification”. In 14th International Conference on World Wide Web (WWW’05 pages 22–32, Chiba, . (WWW’05), Japan. Lenca, P., Meyer, P., Vaillant, B., & Lallich, S. (2008). “On selecting interestingness measures for association On rules: User oriented description and multiple criteria decision aid European Journal of Operations Research, aid”. 184, 610-626. Li, J., & Zhang, Y. (2003). “Direct Interesting Rule Generation Proceedings of the Third IEEE International Direct Generation”. Conference on Data Mining (ICDM '03) 155-167. '03), Hilderman, R. J., & Hamilton, H. J. (2001). “Knowledge Discovery and Measures of In Knowledge Interest”. Boston, MA: Kluwer Academic. Shannon, C. E. (1948). “A mathematical theory of communication The Bell System Technical Journal, 27 A communication”. 27, 379-423. 24
  • 6. This academic article was published by The International Institute for Science, Technology and Education (IISTE). The IISTE is a pioneer in the Open Access Publishing service based in the U.S. and Europe. The aim of the institute is Accelerating Global Knowledge Sharing. More information about the publisher can be found in the IISTE’s homepage: http://www.iiste.org The IISTE is currently hosting more than 30 peer-reviewed academic journals and collaborating with academic institutions around the world. Prospective authors of IISTE journals can find the submission instruction on the following page: http://www.iiste.org/Journals/ The IISTE editorial team promises to the review and publish all the qualified submissions in a fast manner. All the journals articles are available online to the readers all over the world without financial, legal, or technical barriers other than those inseparable from gaining access to the internet itself. Printed version of the journals is also available upon request of readers and authors. IISTE Knowledge Sharing Partners EBSCO, Index Copernicus, Ulrich's Periodicals Directory, JournalTOCS, PKP Open Archives Harvester, Bielefeld Academic Search Engine, Elektronische Zeitschriftenbibliothek EZB, Open J-Gate, OCLC WorldCat, Universe Digtial Library , NewJour, Google Scholar