SlideShare a Scribd company logo
1 of 10
A Novel Recommender Algorithm for Bipartite
Networks employing Association Rules to Discover
Characteristics Intrinsic to the Content
A.J.M. Korale & N.S.Weeraman
Abstract: Faced with an extremely competitive landscape and low margins, Mobile
Telecommunications Operators are motivated to seek new sources of revenue streams by promoting
Value Added Services (VAS) to their customers. This strategy in effect exploits the novelty and
attractive power of the supplementary service portfolio to draw in new subscribers and retain and
satisfy the existing base. However, delivering satisfying content to a vast and diverse customer base
with widely differing tastes and expectations is a challenge faced by Mobile Telecommunications
Operators. Thus Intelligent Algorithms in the form of Recommender Systems are employed to match
available content with customer tastes and desires. Such algorithms usually require considerable Meta
data on the content and detailed customer information and their consumption patterns to attempt this
match. As the Meta data does not and indeed cannot always accurately characterize the intrinsic
properties of the content, which is the basis of its attraction to a consumer, the match is at best not a
very accurate one.
In light of this we propose an enhancement that operates without recourse to the Meta data of the
content but instead employ the novel idea that certain items of the content itself can be used to
represent the larger set. This result is established through a process of Association Rule discovery
where it is shown that Ring Back Tone Songs (RBTS) that group together do in fact exhibit similarities
at an intrinsic and fundamental level rather than that which would be observed by merely comparing
the Meta data of the content.
We also show that due to this grouping of songs via the estimation of associations we are able to
broadly categorize the available song choices using selected songs as representative categories. It then
allows us to quantify the strength of the relationships between songs by using the rules so arrived at
through performance measures derived from confidence and lift.
Thus the algorithm provides a more accurate match between a customer and his preference for RBTS
content by analysing song content at a fundamental and intrinsic level and recommends song content
using the properties inherent in the songs. It also improves its match by utilizing the similarities
between customers who prefer a certain category of song by analysing the single mode representation
of the customer vs. content Bipartite graph.
Keywords: Recommender Systems, Collaborative Filtering, Content Filtering, Bi-Partite Network
1. Introduction
We live in an era in which mass marketing is
increasingly replaced by mass customisation,
where catering to the individual needs of a
consumer is the desired strategy as opposed to
marketing to large groups of customers. The
thinking has thus shifted from meeting the
requirements of groups of customers or larger
segments with similar characteristics to
satisfying the needs of individuals, or a
segment of size one. This fundamental change
in approach to the customer is evidenced by the
ability to customise a new PC or even an
automobile by selecting combinations of
desired features online before the order is
placed for a fully customised product.
The vast array of choice facing a customer
however makes it practically impossible for a
consumer to make a detailed assessment of
each individual option before making a choice,
Eng. (Dr.) A. J.M. Korale, C. Eng., MIE(SL), B.Sc. Eng.
(Wisconsin), M.Sc. Eng. (Columbia), Ph.D. Eng. (Imperial
College), MBA. (Sri Jayewardenepura), Consultant - Data
Scientist, Millennium Information Technology.
Eng. (Mr.) N. S. Weeraman, AMIE(SL), B.Sc. Eng.
(Moratuwa), MBA. (Sri Jayewardenepura), Unit Manager,
Business Intelligence, Dialog Axiata PLC.
necessitating the use of Recommender Systems.
Thus it is in our everyday experience when
booking a hotel, making an airline reservation
or buying a book on Amazon that we are
guided by intelligent agents in making a
suitable choice.
Mobile Telecommunications Operators (MTO),
in addition to their core business of providing
Mobile Wireless Telecommunications Services
also offer Fixed Wireless Services, Broad Band
Internet, and Direct to Home Broadcast
Television Services, and so have a wide variety
in their product offering. While making the
choice between different product types as
above may be more clear cut for a particular
customer, when faced with the task of selecting
a particular talk time package for the Mobile
Telecommunications Service or the selection of
a particular set of TV channels via a channel
package however, the buying decision is more
complex. Thus, so it is with respect to value
added service (VAS) offerings at Mobile
Telecommunications Operators where there can
be thousands of applications catering to
different tastes and interests. It is particularly
so in the case of the RBTS offering where there
are nearly 10,000 songs to choose from and
nearly one million subscribers to the service.
The main challenge for a Mobile Operator is
then matching this vast array of choice with its
customer base so that the most suitable content
can be directed to the appropriate customer in a
way that tastes and expectations are satisfied.
As it is most often the case that the customers
themselves are unaware of the vast choice in
the service offering, it is up to the Operator to
make the match between a customer and a
product and notify the options or available
choice to the subscriber. In the case of Mobile
Telecommunications Operators this task is
somewhat easier as there is a direct Digital
Marketing Channel to each individual
subscriber through the Short Message Service
(SMS) and other automated means. Thus the
Operators can target each individual subscriber
via "SMS offers" carefully designed to match
the individual requirements of each customer.
2. Current State of the Art in
Recommender Systems
The current state of the art with respect to
Recommender System Algorithms to our
knowledge is broadly based on two concepts;
the ability to discover groups of users who like
products similar to the ones in the portfolio of
the target customer and then suggesting those
products that the customer group have liked
but has not yet been experienced by the target
customer (Collaborative Filtering), and the
ability to find groups of products similar to the
products liked by the target consumer and
recommending those products that have not yet
been experienced by that particular consumer
(Content Filtering) [2]. These algorithms are
then based on making “similarity”
determinations with respect to "similar
customers” or "similar products".
More advanced algorithms factor in the
demographics, geography, Income and other
related characteristics of the subscriber in
addition to the consumption behaviour to
compare between customers and arrive at
better groups of "similar" customers [1].
The vast majority of recommender algorithms
also utilise detailed Meta data on the content
derived from databases and from the content
creator and distributor to characterize each item
of the content and then utilize this information
to find other similar items. This characterization
usually takes the form of classifying the content
in to genre's, performing artists, creative
personnel involved in the direction and
production of the content and production
houses involved, in an effort to finely and
accurately describe as much of its outwardly
visible and easily definable properties [3].
However this type of classification cannot
capture all of the intrinsic qualities inherent in
the content which is more about the feelings it
inspires, images that come to mind and
emotions it stirs. It is only a customer who has
experienced the content that can if at all
describe in what ways a particular piece of
content is appealing and this knowledge is
usually absent in the standard current
techniques used in Recommenders.
In spite of this, a variety of approaches ranging
from correlation based techniques to machine
learning algorithms using decision trees and
neural networks are successfully used in the
these Recommenders.
Other approaches that consider association
rules in recommenders do not in general use
rules to categorise items of content and use
those single items to represent larger groups as
we do in our algorithm [3]. In particular,
association rule based recommender algorithms
that use clustering to identify user groups does
so by utilizing other criteria which then may
not result in grouping the content on its
intrinsic properties [5]. This method also does
not consider the properties of the consumer vs.
consumer network neighbourhood in arriving
at its recommendations.
The proposed association rule generation
algorithm to the best of our knowledge is a
novel scheme that is based on existing
principals. It is designed to efficiently generate
rules utilising components of rules from
previous stages and so uses items that are
already known to have fulfilled constraints on
Confidence and Lift.
The proposed scheme is also able to consider
several layers/levels of neighbours around a
target consumer and this approach combined
with an association rule recommender we
believe is a new configuration for this type of
algorithm.
3. The underlying basis for a Novel
Association Rule based
Recommender
It is usually very difficult to analyse the
underlying characteristics of a product chosen
by a consumer and examine what makes a
particular consumer prefer such a product
merely from the basket of goods chosen by that
consumer or by observing a group of
consumers similar in characteristics to the
customer under study as there usually is no
prior "meta data" of the product or content
readily available. In the case of movie or music
recommender systems for instance, it would be
possible to analyse each film or song by the
performing artists, genre, production houses
and a host of other ancillary information that
could potentially be gathered by a content
provider and used as input to a recommender
algorithm.
In the case of an RBTS Recommender however,
all of the product items are of one type (songs)
and thus we need only find associations and
relationships between songs. While there is
considerable Meta data that is associated with
song content, for the purposes of this paper and
the algorithm described herein, we did not
have access to such attributes. In light of this
we propose an algorithm that operates in an
environment where we do not have ready
access to the Meta data of the content, but
instead employ the novel idea that the content
itself can be used in the characterization of
itself. Thus for the particular case of RBTS
subscriptions by customers of Mobile Telecom
Services, we are able to categorize the RBTS
(music choices) by using a set of song choices.
In other words we categorize and reduce the
number of potential song choices using a subset
of the songs to represent certain groups of
songs.
Thus it is our contention that certain types of
content can be characterized by the properties
inherent in the content itself, particularly when
the content is used for a very personal and
personalized purpose. In our use case, we
contend that the Ring Back Tone Song (RBTS)
chosen by each customer to indicate a ringing
telephone to a caller is one that his highly
personal to the customer as it is chosen with a
view of representing the customer to the
outside world and hence from the point of view
of that customer it is also one that has a certain
appeal and great deal of intrinsic meaning.
In this paper we also study how multiple songs
are selected by a particular customer and
establish that songs that "go together" are songs
that are similar to one another. This similarity
goes beyond the usual commonalties that can
be expected between songs that are "similar" to
one another in the sense of being sung by the
same artist, or belonging to the same genre, or
those using a particular type of instrumentation
for example. Instead we show that the songs
that are found to be "similar “are so on a much
deeper and intrinsic level; like being similar in
theme, meaning, and in concept. We believe
that this result is due to the inherently personal
nature of the choice of a Ring Back Tone Song
that may also be used to gain deeper insights in
to the very personal and personalized
preferences of the customer making the choice.
We establish these results through a process of
Association Rule discovery where it is shown
that songs that group together in fact exhibit
similarities at an intrinsic and fundamental
level rather than that which would be observed
by merely comparing the Meta data of the
content. In fact there is no guarantee that items
of content that exhibit broad similarities with
respect their Meta data would appear similar or
more importantly that it would have similar
appeal to a particular customer. This is the
result of the wide variety in the content that is
available, particularly in music that may share
the same type of Meta data and other ancillary
information but differ in its intrinsic qualities
that appeal to a customer.
We also show that due to this grouping of
songs via the estimation of associations we are
able to broadly categorize the song choices
using the songs themselves as representative
categories. It then allows us to quantify the
strength of the relationships between songs by
using the rules so arrived at and by
performance measures such as confidence and
lift.
We then take the analysis a level deeper by
considering the properties of the customer to
customer network that lies at the heart of the
customer vs. song bipartite graph by translating
it in to its corresponding customer vs. customer
single mode representation.
In this way we build a recommender algorithm
that has the ability to analyse song content at a
fundamental and intrinsic level and suggest
song content using the properties inherent in
the songs itself while also utilising the
similarities between customers who prefer a
certain category of song.
4. Psychological Factors Underlying
the Proposed Recommender
Algorithm for RBTS
We believe that that Ring Back Tone Songs are
an aspect of oneself a subscriber wishes to
project to the outside world and is thus an
insight in to the subjects "personality". We also
believe that the RBTS choice is a deeply
personal choice and is therefore reflection of
some innate quality or characteristic of a
subscriber's personality.
We can show from the results of the association
rule analysis and broad song choices made by
large groups of the subscribers that when more
than one song is selected by a subscriber, the
songs within that group show similarities at an
intrinsic and fundamental level. We believe this
may be reflection or an insight in to some
aspect of the subscriber’s personality. As
"personality" is more or less fixed upon
reaching maturity we believe that the intrinsic
quality of a person’s choice of RBTS is also
unlikely to change much.
In this paper we are not concerned with the
underlying factors that were responsible for the
formation of a particular personality type but
only use the insight that it is unlikely to change
much over time and that it will also govern or
influence personal choices such as selecting a
RBTS.
While there will be exceptions to this line of
reasoning the results of the association rule
analysis do show that songs that show strong
associations with one another are "similar". As
the association rules are also a reflection of the
aggregate choices of a large number of
subscribers, our original line of reasoning
seems to be justified.
5. Main Contributions of the Paper
In this paper we employ the novel idea that
certain items of content can be used to represent
the larger set to which it belongs. We show that
certain RBTS songs selected on the basis of their
strong associations to other songs can be used
as representative categories of the larger song
set.
We also show that the proposed recommender
is able to match consumers taste for content
without recourse to its Meta data as it operates
by discovering and matching the underlying
intrinsic properties of the content such as its
theme, meaning and concept which is the basis
of its attraction to a consumer.
This is facilitated in part by a novel Association
Rule mining algorithm which efficiently
generates rules at each stage by merging the
components (antecedent and consequent terms)
of the rules in the prior stage to form the
antecedent of the subsequent stage. In this way
we generate only the minimum set of rules that
satisfy certain a priori constraints on confidence
and lift.
We improve the match between the
recommended song set and consumer’s own
taste in song choice by considering the song
choices of consumers most like the target
consumer. This is accomplished by considering
the network properties of the consumer’s
immediate neighbours identified from the
consumer vs. song bi-partite network.
We also show that a better match across a wider
variety of song content can be achieved by
combining the results of the suggestions
obtained by considering a consumer’s own
choice of songs with the suggestions obtained
by considering the song choices of his
neighbours that are determined by placing
constraints on the proximity of such individuals
to the target consumer in the consumer vs.
consumer network.
5.1 A novel Association Rule Mining
Algorithm for the purposes of Recommending
RBTS
A novel Association Rule Mining algorithm
that generates all the unique combinations of
the items that comprise the antecedent and
consequent terms of a rule while satisfying
apriori constraints on support for that rule. The
algorithm is formulated in such a way as to
utilize the rules generated in the (N-1)th stage
to form the rules at the Nth stage by selecting
only those rules that meet the minimum
support requirement in the (N-1)th stage. This
is done by forming the antecedent of the rule in
the Nth stage by merging the antecedent and
consequent terms of the rule from the (N-1)th
stage which we know meet the constraints on
support and then computing the support of this
new rule.
In this way we avoid having to generate apriori
all possible combinations of items that statisfy
the minimum support criteria and then test the
support of each newly formed rule. In our
scheme we only need to test rules from the
prior stage that we know have met the
minimum support criteria. Through this
method we are also able to automatically
generate all rules progressively proceeding
down each stage until all rules with all possible
unique combinations of items in the antecedent
and consequent parts have been created.
It is also possible to filter the rules apriori as
they are being generated by enforcing
conditions on confidence and lift if it is so
desired. Thus there is a gain in efficiency
obtained by only utilizing those rules from the
previous stage that meet certain performance
criteria to create the rules for the subsequent
stage by merging the antecedent and
consequent parts of rules that meet the minim
support criteria from the previous stage to form
the antecedent part of the rule at the next stage.
5.2 The use of Association Rules to Categorize
and Classify Ring Back Tone Songs using
Characteristics Intrinsic to the Content.
We establish that in the case of Ring Back Tone
Songs, the songs that are show strong
association with each other are largely those
that are similar in content and character
showing remarkable similarities in its deep and
intrinsic qualities. Through this result we are
able to classify and categorize the songs by
grouping the songs that are strongly associated
in to groups. This is also a key result that helps
in the recommendation of new song content to
a subscriber through song categories that are
taken to represent smaller groups of songs.
We establish this result without recourse to the
Meta data of the songs or any other content
related attributes using only the relationships
estimated from the Association rule mining.
5.3 Ranked Recommendation choices
estimated from Association Rules
The algorithm classifies RBTS according to its
intrinsic properties in to categories derived
from the items of content itself and
recommends items via the association rules
ranked according to confidence in a way that
accounts for the content choices of the
neighbours in the single mode projection of the
customer vs. item bipartite graph.
After the rules are mined as per (1) and
classified as (2) above, the RBTS subscription
portfolio of the subscriber is used to determine
which rules should apply in the
recommendation by selecting those that have
the same antecedent as the song subscriptions.
Next the selected rules are ranked by
confidence and the possible consequent items
determined arriving at a set of choices ordered
by the degree of "correlation" to the existing set.
The neighbourhood of the subscriber that is
found by placing limits on the number of songs
in common with other subscribers and size of
the subscription set of the neighbours in
relation to the customer in question is also used
to determine which of the ranked consequent
choices are actually recommended.
6. Association Rule Discovery
Algorithm for RBTS
The Association Rules are mined from an
Incidence Matrix (IM) by placing apriori limits
on the support for each Item which is the
column sum of IM. IM is formed as a customer
vs. Item matrix, by allocating unity for every
song that has been experienced by each
customer. If a Customer has selected the same
song more than once, the entry in the
corresponding song column in the row
allocated for that customer is still unity.
Once the column sum (IMsum_min) is formed
the particular columns that do not meet the
minimum support criteria (min_supp) are not
considered and eliminated from the analysis as
a rule that incorporates that item cannot in any
case meet the minimum support requirement.
The rule discovery then proceeds to the first
stage by determining all pairs of items that
meet the minimum support criteria. The
support for a pair of items is then simply the
dot product of the corresponding columns of
IM. In the next step the pairs of items that meet
the minimum support criteria are selected and
the support and confidence of the rule are also
determined.
In the second stage only the rules that met the
minimum support criteria from the first stage
are selected to form the rules. The antecedent in
the second stage is then formed by merging
antecedent and consequent terms from the
rules (that met the minimum support criteria)
of the first stage. The three item rules (2 in
items in the antecedent derived from stage 1,
and one new item in the consequent) that can
be formed that meet the minimum support
criteria can only come from the set of terms
selected from stage 1 and the set IMsum_min.
For example, let A, B, C, D be items that meet
the minim support criteria in IM.
In the first stage all pairs {A,B}, {A,C}, {A,D},
{B,C}, {B,D}, {C,D} are tested for minimum
support. (There will be 4C2 = 6 such terms to be
tested.)
A->B is a valid rule, only if the support for AUB
meets min_supp criteria
In the second stage A and B (and all other pairs
that met min_supp) (are merged to form the
antecedent of the rule at stage 2 if they meet the
min_supp criteria. (Let us assume that {A,B},
{A,C} and {C,D} meet the min_supp criteria)
Thus in stage 2
A&B->C is a valid rule only if the support for
AUBUC meets min_supp criteria (and similarly
we test support for rules A&B->D, A&C->B,
A&C->D and C&D->A and C&D->B)
The rule discovery proceeds in this manner
until all unique combinations of items have
been created in the rules.
6.1 Definitions
{Support (A->B)= N(AUB)/N (1)}
Where N(AUB) represents the number of times
items A and B are bought together in the whole
transaction set and N is the number of rows in
IM (corresponds to the number of unique
subscribers or transactions)
{Confidence(A->B)
= Support(AUB)/Support(A) (2)}
= P(EA&EB)/P(EA) (3)}
= P(EB/EA) (4)}
Where P(EX) is the probability of item X being
bought in a particular transaction
Confidence of a rule (A->B) is a measure of the
validity of the rule, it has a conditional
probability interpretation in that, it is the
probability that B occurs when A has already
occurred [4]. In market basket parlance it is the
probability of item B being bought given that A
has been bought in a particular transaction.
Loosely speaking, we use the confidence
measure in our reasoning as an indication of
the degree of correlation between songs when
we have rules of the form song1 -> song2. It is
also used to rank the association rules
particularly when one common antecedent
gives rise to many consequent items.
The Lift is a popular measure used to measure
how likely a rule is if the items are independent
[4].
{Lift(A->B)
= Confidence(A->B)/Support(B) (5)}
{Lift(A->B) = P(EA&EB)/P(EA)P(EB) (6)}
In other words it is the number times how
likely it is that A&B are bought together when
the probability of buying A and the probability
of buying B are independent. (Ie when the
buying decision of item A does not influence
the buying decision of B and vice versa)
In this modelling we generate all combinations
of stage 1 rules that satisfy the min_supp
criteria as we need to match the antecedents of
the rules with the subscription portfolio of a
subscriber in order to generate
recommendation options. We use low
minimum support levels to ensure that we
generate a large number of rules and handle the
filtering and ranking via the categorization of
songs and rule confidence measure.
7. Recommender Algorithm &
Strategy
The Recommender Algorithm determines the
best RBTS suggestions for a particular
subscriber by considering the items currently
subscribed to by the target subscriber and the
subscription pattern of that subscriber’s nearest
neighbours. This neighbourhood can be
determined from the single mode
representation of the Bipartite Network that is
derived by multiplying the incidence matrix by
its transpose IM*IMT
.
The resulting subscriber vs. subscriber network
is similar in form to an Adjacency matrix from
which the neighbours can easily be found. In
essence the neighbours of the target subscriber
are those that have at least some of the same
items in common.
Constraints are set on the size of the
subscription set of each neighbour in relation to
the size of the subscription set of the target
subscriber so that they are approximately of the
same size. Additionally to qualify as a
neighbour the subscriber in question must have
at least a certain minimum number of items in
common with the target subscriber. These
conditions are placed to ensure that the
neighbour set is more or less “similar” to the
target subscriber.
7.1 Determining Nearest Neighbours
1. For each subscriber determine all of the
other subscribers NS = {ns1,ns2,…,nsi,...,nsT}
that share at least one song item in common
2. For each of the neighbours nsi
determine the size of their subscription set nsiT
3. For each of the neighbours nsi
determine the number of songs in common nsci
with the target subscriber Si
4. The neighbour set is then the set of
subscribers that meet an apriori limit on the
nsiT and nsci
5. The limits can be set on the basis of the
size of the subscription set of the target
subscriber Si
6. So we can say a neighbour of the target
subscriber is one who is within (+/-) 50% the
size of the subscription set of Si and who also
matches between (+/-) 20% of the songs in the
subscription set of Si
7. The neighbour item set is then the
union of the set of items (RBTS) found among
the subscribers who meet conditions in 6 above.
7.2 Generate Association Rules
The Association rules for the RBTS items can be
obtained using the algorithm described in
section 6.
In this instance we generate only the level 1
associations with one antecedent and one
consequent term. We use low support and
confidence thresholds so as to capture as many
rules as possible. This is done with the object of
matching the antecedents in the rules to find
consequent terms that have high confidence.
7.3 Hybrid Recommender Algorithm
The hybrid nature of the algorithm arises by
combining the suggestions obtained via the
association rules with those obtained by
considering the song choices of the consumer’s
immediate neighbours that are identified via
the single mode representation of the consumer
vs. song bi-partite network.
In this hybrid approach the final recommended
song set is determined as the intersection of the
song set suggested for the consumer by
considering the consumer’s own song choices
with the choices of his nearest neighbours.
Thus, we endeavour to find better matches for a
consumers taste in song content by using that
consumers own (prior) taste in songs together
with the tastes of consumers most like him.
This neighbourhood of similar consumers is
determined by their preferences for similar
types of content as the target consumer.
However there is sufficient variation across the
content preferences of the neighbours to ensure
that a diverse set of songs are available for
consideration. In essence we allow for a certain
degree of overlap in the tastes between the
target consumer and his neighbours to ensure
that there is sufficient variety in the set of songs
suggested by the neighbouring consumers
which is then observed in the final
recommendation.
1. For each subscriber obtain his
subscription song set S = {s1,s2,…,si,…,sN}
2. For each song si in the subscription set
S, obtain all rules Ri = {r1,r2,…,rj,…,rK} which
have same antecedent as si
3. From this rule set Ri, choose the rule
that has the highest confidence ri_max and the
corresponding consequent term Ci_max of the
rule.
It may be that in some cases more than one rule
from Ri may meet this criteria (if they have the
same max value for instance), in which case all
such consequent terms are extracted.
4. The “available” song set {AS} is then
the set of consequent items which have the
highest confidence from each set of rules Ri
derived for each song si in the subscription set.
5. The recommended song set is then the
intersection between songs that are in the
available song set {AS} and the Neighbour song
set (set composed of the songs found in the
neighbours subscription set, across all of the
neighbours).
6. If the intersection is null or too small
the available song set is used as the
recommended set.
8. Results
We present some example results extracted to
demonstrate the match between the
subscription set of songs (Table 1) and the
corresponding recommended set of songs
(Table 2) for 10 subscribers drawn randomly. It
is evident that the recommendations do match
the subscriber’s song set to a great degree in
terms of the characteristics described
previously.
Table 1. Subscription Song Characteristics
Su
b
No
Subscription
Song Set of
Subscriber
Languag
e
Source Release
Theme
1
Theme
2
Theme
3
1 Raja Malige Sinhala Charts New Other
2
Athi Mawa
Rawatuwa Sinhala Charts New Love Sorrow Losing
2
Kandulu
Hollala Sinhala Charts New Love Sorrow Losing
2
Sithama
Ridawa Sinhala Charts New Love Sorrow Losing
3
Wen Weela
Giyada Sinhala Charts New Love Sorrow Losing
4
Nidi Warapu
As Walin Sinhala Charts New Love Sorrow Losing
4 Obath Giya Sinhala Charts New Love Sorrow Losing
5
Athi Mawa
Rawatuwa Sinhala Charts New Love Sorrow Losing
5
Sithama
Ridawa Sinhala Charts New Love Sorrow Losing
6
Ennai
thalaatta Tamil Movie New Love
7
Dheere
Dheere Hindi Movie New Love
7
Ananthen Aa
Tharu Kumara Sinhala Teledrama New Love
8
Chinna Mani
Kuyile Tamil Movie Old Love
9
Ennodu Nee
Irundhaal Tamil Movie New Love
10
Dukama
Widala Sinhala Charts New Love Sorrow Losing
10 Hithumanapen Sinhala Charts New Love Sorrow Losing
10 Waradi Karala Sinhala Charts New Love Sorrow Losing
As discussed earlier a single song subscription
can give rise to multiple song recommendations
as each song appearing in the antecedent of the
rule be correlated with multiple consequent
songs that have the same maximum confidence
level.
In practice then the unique set of items taken
from all the recommended items for a
particular subscriber will be used (multiple
occurrences of the same consequent item in
Table 2 is left to demonstrate the working of the
algorithm)
If the recommended set is too small the option
of using the consequent items with the next
highest confidence level can be employed.
We also observed several cases where the
recommended matches were not in line with
our hypothesis. This resulted when certain
subscribers had songs that had broad appeal
across many tastes (like national anthem
themed songs).
Table 2. Recommended Song Characteristics
Sub
No
Recomm
endation
Song
Set of
Subscrib
er
Langu
age
Sourc
e
Relea
se
The
me1
The
me2
The
me3
1 Amma Sinhal
a
Kavi
Bana
New Moth
er
Moth
er
2 Ayeth
Warak
Sinhal
a
Charts New Love Sorr
ow
Losin
g
2 Ayeth
Warak
Sinhal
a
Charts New Love Sorr
ow
Losin
g
2 Ayeth
Warak
Sinhal
a
Charts New Love Sorr
ow
Losin
g
3 Awasana
Premayai
Sinhal
a
Charts New Love Sorr
ow
4
Hadawath
a Gahena
Sinhal
a
Charts New Love Sorr
ow
Losin
g
4 Awasana
Premayai
Sinhal
a
Charts New Love Sorr
ow
4 Awasana
Premayai
Sinhal
a
Charts New Love Sorr
ow
5 Ayeth
Warak
Sinhal
a
Charts New Love Sorr
ow
Losin
g
5 Ayeth
Warak
Sinhal
a
Charts New Love Sorr
ow
Losin
g
6
Kannukull
e
Tamil Movie Old Love
7 Awasana
Premayai
Sinhal
a
Charts New Love Sorr
ow
7 Dukama
Vidala
Sinhal
a
Charts New Love Sorr
ow
Losin
g
8 Kanne
En
Kanmaniy
e.
Tamil Movie Old Love
8 Enna Oru
Enna Oru
Tamil Movie New Love
9
Idhayatha
i Yedho
Ondru
Tamil Movie New Love
9 Un Mele
Oru
Kannu
Tamil Movie New Love
10 Ayeth
Warak
Sinhal
a
Charts New Love Sorr
ow
Losin
g
10 Ayeth
Warak
Sinhal
a
Charts New Love Sorr
ow
Losin
g
10 Ayeth
Warak
Sinhal
a
Charts New Love Sorr
ow
Losin
g
In such cases there could be a large number of
subscribers with widely differing tastes who
also have this song in their subscription
portfolio giving rise to spurious associations
/correlations at high confidence levels. These
occurrences are in the minority and should not
noticeably impact the overall trend in the result.
We observed approximately 83% match
between the subscription set and the
recommendations.
8.1 Operationalizing the Recommender
A subscriber may download RBTS using
different channels such as IVR (Interactive
Voice Response), Web, SMS or copy from
another RBTS user. We favour the Out Bound
Dialler (OBD) as the main promotion channel
as the user can hear the recommended songs
before selection and also as it has been
successfully used to promote RBTS in the past.
Also a customized SMS sent to the target user
with recommended songs having a direct dial
short code can be used for users who are using
SMS and who are familiar with downloading
RBTS via direct dial method.
In particular we may target existing subscribers
who have many songs in their subscription and
who have a tendency to change their songs
fairly regularly. Although each subscriber is
different, the Operator has the ability to select
the best individual marketing channel to reach
each one by observing past RBTS downloading
patterns and core Telecommunication Service
usage behaviour of that individual.
9. Conclusion and Future Work
We established that the proposed algorithm
and model successfully matches a subscriber’s
taste for Ring Back Tone Song content by
recommending songs that are very much
similar in its intrinsic qualities such as type,
theme and content.
We also showed that the proposed Association
Rule discovery algorithm that derives rules for
the current level by evaluating rules that meet
certain performance criteria from the previous
level is sufficiently flexible and efficient in
order to generate the large number of rules at
low support constraints with the option of
filtering at apriori confidence levels.
Through our modelling we also established a
fundamental insight in to consumer behaviour
in this particular type of content by observing
that items of content preferred by consumers
are strongly associated / correlated when their
intrinsic properties are very similar. With this
insight we are able to successfully match the
tastes of the subscriber in recommended RBTS
content.
Although we employ only level one rules with
a single antecedent and consequent term in our
model the ability to generate the full set may
prove useful in subsequent models where we
use rule priorities and coverage concepts to
rationalize and derive only those most relevant
to the recommendation.
While the neighbourhood calculation did not
significantly contribute to improving the
recommendations in the current incarnation of
the algorithm, we envisage factoring in
subscriber profiling attributes such as
demographics, language preference and Mobile
Service usage criteria to improve on the ability
to find groups of songs from subscribers that
have more in common with each other.
10. References
1. Wilson, D., Symth, B., & Reily, J., “Case-Studies in
Association Rule Mining for Recommender
Systems”, Proceedings of the 2005 International
Conference on Artificial Intelligence, ICAI, 2005.
2. Lin, W., Alvarez, S., & Ruiz, C., “Collaborative
Recommendation via Association Rule Mining”,
Department of Computer Science, Worcester
Polytechnic Institute, International Workshop on
Web Mining, 2005.
3. Swathi, V., Reddy, S, “Music Recommendation
System Using Association Rules”, International
Journal of Technology Enhancements and
Emerging Engineering Research, vol 2, Issue 7,
2014.
4. Hastie, T., Tibshirani, R., “The Elements of
Statistical Learning”, 2nd
ed., Springer, USA, 2009,
488-499pp.
5. M. Sunitha Reddy, T. Adilakshmi, V. Swathi, A
Novel Association Rule Mining and Clustering
Based Hybrid Method for Music
Recommendation System, International Journal
of Research in Engineering and Technology, vol.
3, issue 5, 2014.

More Related Content

What's hot

IRJET- Review on Marketing Analysis in Social Media
IRJET-  	  Review on Marketing Analysis in Social MediaIRJET-  	  Review on Marketing Analysis in Social Media
IRJET- Review on Marketing Analysis in Social MediaIRJET Journal
 
Customer segmentation for a mobile telecommunications company based on servic...
Customer segmentation for a mobile telecommunications company based on servic...Customer segmentation for a mobile telecommunications company based on servic...
Customer segmentation for a mobile telecommunications company based on servic...Shohin Aheleroff
 
Cross Selling Through Database Marketing
Cross Selling Through Database MarketingCross Selling Through Database Marketing
Cross Selling Through Database MarketingAnkit Gupta
 
Corporate Overview June 2009
Corporate Overview June 2009Corporate Overview June 2009
Corporate Overview June 2009jjmoore21
 
Customer Clustering Based on Customer Purchasing Sequence Data
Customer Clustering Based on Customer Purchasing Sequence DataCustomer Clustering Based on Customer Purchasing Sequence Data
Customer Clustering Based on Customer Purchasing Sequence DataIJERA Editor
 
Independent study thesis
Independent study thesisIndependent study thesis
Independent study thesisRaghuChoudhary2
 
A Model of Personalization
A Model of PersonalizationA Model of Personalization
A Model of PersonalizationGarros Fung
 
IRJET- E-commerce Recommendation System
IRJET- E-commerce Recommendation SystemIRJET- E-commerce Recommendation System
IRJET- E-commerce Recommendation SystemIRJET Journal
 

What's hot (10)

IRJET- Review on Marketing Analysis in Social Media
IRJET-  	  Review on Marketing Analysis in Social MediaIRJET-  	  Review on Marketing Analysis in Social Media
IRJET- Review on Marketing Analysis in Social Media
 
Customer segmentation for a mobile telecommunications company based on servic...
Customer segmentation for a mobile telecommunications company based on servic...Customer segmentation for a mobile telecommunications company based on servic...
Customer segmentation for a mobile telecommunications company based on servic...
 
Cross Selling Through Database Marketing
Cross Selling Through Database MarketingCross Selling Through Database Marketing
Cross Selling Through Database Marketing
 
Corporate Overview June 2009
Corporate Overview June 2009Corporate Overview June 2009
Corporate Overview June 2009
 
Customer Clustering Based on Customer Purchasing Sequence Data
Customer Clustering Based on Customer Purchasing Sequence DataCustomer Clustering Based on Customer Purchasing Sequence Data
Customer Clustering Based on Customer Purchasing Sequence Data
 
Ao33238242
Ao33238242Ao33238242
Ao33238242
 
Independent study thesis
Independent study thesisIndependent study thesis
Independent study thesis
 
Term Paper on WEKA
Term Paper on WEKA Term Paper on WEKA
Term Paper on WEKA
 
A Model of Personalization
A Model of PersonalizationA Model of Personalization
A Model of Personalization
 
IRJET- E-commerce Recommendation System
IRJET- E-commerce Recommendation SystemIRJET- E-commerce Recommendation System
IRJET- E-commerce Recommendation System
 

Viewers also liked

Recommender Algorithm for PRBT BiPartite Networks - IESL 18 Oct 2016_final_us...
Recommender Algorithm for PRBT BiPartite Networks - IESL 18 Oct 2016_final_us...Recommender Algorithm for PRBT BiPartite Networks - IESL 18 Oct 2016_final_us...
Recommender Algorithm for PRBT BiPartite Networks - IESL 18 Oct 2016_final_us...Asoka Korale
 
ATC full paper format-2014 Social Networks in Telecommunications Asoka Korale...
ATC full paper format-2014 Social Networks in Telecommunications Asoka Korale...ATC full paper format-2014 Social Networks in Telecommunications Asoka Korale...
ATC full paper format-2014 Social Networks in Telecommunications Asoka Korale...Asoka Korale
 
Novel_Sentiment_Analysis_and_Classification_Algorithm_ATC_2016_final_publishe...
Novel_Sentiment_Analysis_and_Classification_Algorithm_ATC_2016_final_publishe...Novel_Sentiment_Analysis_and_Classification_Algorithm_ATC_2016_final_publishe...
Novel_Sentiment_Analysis_and_Classification_Algorithm_ATC_2016_final_publishe...Asoka Korale
 
Sentiment Analysis for IET ATC 2016
Sentiment Analysis for IET ATC 2016Sentiment Analysis for IET ATC 2016
Sentiment Analysis for IET ATC 2016Asoka Korale
 
customer_profiling_based_on_fuzzy_principals_linkedin
customer_profiling_based_on_fuzzy_principals_linkedincustomer_profiling_based_on_fuzzy_principals_linkedin
customer_profiling_based_on_fuzzy_principals_linkedinAsoka Korale
 
Libraries transform communities
Libraries transform communitiesLibraries transform communities
Libraries transform communitiesLisa Neal Shaw
 
Sitios web paginas web
Sitios web paginas webSitios web paginas web
Sitios web paginas webcarlosl mora
 
Día de la paz
Día de la pazDía de la paz
Día de la pazianer71
 
El trabajo de infantes
El trabajo de infantesEl trabajo de infantes
El trabajo de infantesCarlita Paucar
 
2 trab de investigacion colegio
2 trab de investigacion colegio2 trab de investigacion colegio
2 trab de investigacion colegiojorgecoco79
 
Diapo de info gemyjuda
Diapo de info gemyjudaDiapo de info gemyjuda
Diapo de info gemyjudajudagema
 
caracteristicas de la misukkkkaaa
caracteristicas de la misukkkkaaacaracteristicas de la misukkkkaaa
caracteristicas de la misukkkkaaaanithadeke
 
高市職業訓練創新發展協會 創造與捕捉顧客價值與消費者分析-詹翔霖教授
高市職業訓練創新發展協會 創造與捕捉顧客價值與消費者分析-詹翔霖教授高市職業訓練創新發展協會 創造與捕捉顧客價值與消費者分析-詹翔霖教授
高市職業訓練創新發展協會 創造與捕捉顧客價值與消費者分析-詹翔霖教授文化大學
 
Proyecto y evaluacion
Proyecto y evaluacionProyecto y evaluacion
Proyecto y evaluacionyamithmora32
 
Aprenentacrèixer Tractament obesitat infantil
Aprenentacrèixer Tractament obesitat infantil Aprenentacrèixer Tractament obesitat infantil
Aprenentacrèixer Tractament obesitat infantil aprenentacreixer
 

Viewers also liked (20)

Recommender Algorithm for PRBT BiPartite Networks - IESL 18 Oct 2016_final_us...
Recommender Algorithm for PRBT BiPartite Networks - IESL 18 Oct 2016_final_us...Recommender Algorithm for PRBT BiPartite Networks - IESL 18 Oct 2016_final_us...
Recommender Algorithm for PRBT BiPartite Networks - IESL 18 Oct 2016_final_us...
 
ATC full paper format-2014 Social Networks in Telecommunications Asoka Korale...
ATC full paper format-2014 Social Networks in Telecommunications Asoka Korale...ATC full paper format-2014 Social Networks in Telecommunications Asoka Korale...
ATC full paper format-2014 Social Networks in Telecommunications Asoka Korale...
 
Novel_Sentiment_Analysis_and_Classification_Algorithm_ATC_2016_final_publishe...
Novel_Sentiment_Analysis_and_Classification_Algorithm_ATC_2016_final_publishe...Novel_Sentiment_Analysis_and_Classification_Algorithm_ATC_2016_final_publishe...
Novel_Sentiment_Analysis_and_Classification_Algorithm_ATC_2016_final_publishe...
 
Sentiment Analysis for IET ATC 2016
Sentiment Analysis for IET ATC 2016Sentiment Analysis for IET ATC 2016
Sentiment Analysis for IET ATC 2016
 
customer_profiling_based_on_fuzzy_principals_linkedin
customer_profiling_based_on_fuzzy_principals_linkedincustomer_profiling_based_on_fuzzy_principals_linkedin
customer_profiling_based_on_fuzzy_principals_linkedin
 
Libraries transform communities
Libraries transform communitiesLibraries transform communities
Libraries transform communities
 
Sitios web paginas web
Sitios web paginas webSitios web paginas web
Sitios web paginas web
 
El ratón45
El ratón45El ratón45
El ratón45
 
Día de la paz
Día de la pazDía de la paz
Día de la paz
 
El trabajo de infantes
El trabajo de infantesEl trabajo de infantes
El trabajo de infantes
 
2 trab de investigacion colegio
2 trab de investigacion colegio2 trab de investigacion colegio
2 trab de investigacion colegio
 
Camtasia
CamtasiaCamtasia
Camtasia
 
Diapo de info gemyjuda
Diapo de info gemyjudaDiapo de info gemyjuda
Diapo de info gemyjuda
 
caracteristicas de la misukkkkaaa
caracteristicas de la misukkkkaaacaracteristicas de la misukkkkaaa
caracteristicas de la misukkkkaaa
 
高市職業訓練創新發展協會 創造與捕捉顧客價值與消費者分析-詹翔霖教授
高市職業訓練創新發展協會 創造與捕捉顧客價值與消費者分析-詹翔霖教授高市職業訓練創新發展協會 創造與捕捉顧客價值與消費者分析-詹翔霖教授
高市職業訓練創新發展協會 創造與捕捉顧客價值與消費者分析-詹翔霖教授
 
LA FUNDACIÓN DE JUJUY
LA FUNDACIÓN DE JUJUYLA FUNDACIÓN DE JUJUY
LA FUNDACIÓN DE JUJUY
 
Proyecto y evaluacion
Proyecto y evaluacionProyecto y evaluacion
Proyecto y evaluacion
 
Aprenentacrèixer Tractament obesitat infantil
Aprenentacrèixer Tractament obesitat infantil Aprenentacrèixer Tractament obesitat infantil
Aprenentacrèixer Tractament obesitat infantil
 
cada loco
cada lococada loco
cada loco
 
YH Teong Bio
YH Teong BioYH Teong Bio
YH Teong Bio
 

Similar to Bipartite Recommender Algorithm for RBTS - IESL 2016 Final Reviewed

Personalization - Whitepaper
Personalization - WhitepaperPersonalization - Whitepaper
Personalization - WhitepaperTony Shrader
 
Simply Data driven behavioural algorithms
Simply Data driven behavioural algorithmsSimply Data driven behavioural algorithms
Simply Data driven behavioural algorithmsNana Bianca
 
IRJET- Rating based Recommedation System for Web Service
IRJET- Rating based Recommedation System for Web ServiceIRJET- Rating based Recommedation System for Web Service
IRJET- Rating based Recommedation System for Web ServiceIRJET Journal
 
ECT 584_Research Paper_JoyceRose_08182015
ECT 584_Research Paper_JoyceRose_08182015ECT 584_Research Paper_JoyceRose_08182015
ECT 584_Research Paper_JoyceRose_08182015Joyce Rose
 
Recommendation Systems Basics
Recommendation Systems BasicsRecommendation Systems Basics
Recommendation Systems BasicsJarin Tasnim Khan
 
Connecting social media to e commerce (2)
Connecting social media to e commerce (2)Connecting social media to e commerce (2)
Connecting social media to e commerce (2)krsenthamizhselvi
 
Consumption capability analysis for Micro-blog users based on data mining
Consumption capability analysis for Micro-blog users based on data miningConsumption capability analysis for Micro-blog users based on data mining
Consumption capability analysis for Micro-blog users based on data miningijaia
 
76 s201921
76 s20192176 s201921
76 s201921IJRAT
 
Social media impact on service brand reinforcement
Social media impact on service brand reinforcementSocial media impact on service brand reinforcement
Social media impact on service brand reinforcementAndrey Markin
 
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...IJTET Journal
 
IMAGE CONTENT IN LOCATION-BASED SHOPPING RECOMMENDER SYSTEMS FOR MOBILE USERS
IMAGE CONTENT IN LOCATION-BASED SHOPPING RECOMMENDER SYSTEMS FOR MOBILE USERSIMAGE CONTENT IN LOCATION-BASED SHOPPING RECOMMENDER SYSTEMS FOR MOBILE USERS
IMAGE CONTENT IN LOCATION-BASED SHOPPING RECOMMENDER SYSTEMS FOR MOBILE USERSacijjournal
 
Behavioral Targeting and Dynamic Content creation
Behavioral Targeting and Dynamic Content creationBehavioral Targeting and Dynamic Content creation
Behavioral Targeting and Dynamic Content creationSimon Hjorth
 
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...IRJET Journal
 
Recommendation engine pratik_kadam
Recommendation engine pratik_kadamRecommendation engine pratik_kadam
Recommendation engine pratik_kadampratikkadam78
 
Telecom analytics brochure
Telecom analytics brochure Telecom analytics brochure
Telecom analytics brochure Daniel Thomas
 
A Novel Jewellery Recommendation System using Machine Learning and Natural La...
A Novel Jewellery Recommendation System using Machine Learning and Natural La...A Novel Jewellery Recommendation System using Machine Learning and Natural La...
A Novel Jewellery Recommendation System using Machine Learning and Natural La...IRJET Journal
 
All That Glitters Is Not Gold Digging Beneath The Surface Of Data Mining
All That Glitters Is Not Gold  Digging Beneath The Surface Of Data MiningAll That Glitters Is Not Gold  Digging Beneath The Surface Of Data Mining
All That Glitters Is Not Gold Digging Beneath The Surface Of Data MiningJim Webb
 
Applied Analytics for Real-Time Decision Support
Applied Analytics for Real-Time Decision SupportApplied Analytics for Real-Time Decision Support
Applied Analytics for Real-Time Decision SupportAnametrix
 

Similar to Bipartite Recommender Algorithm for RBTS - IESL 2016 Final Reviewed (20)

Personalization - Whitepaper
Personalization - WhitepaperPersonalization - Whitepaper
Personalization - Whitepaper
 
Simply Data driven behavioural algorithms
Simply Data driven behavioural algorithmsSimply Data driven behavioural algorithms
Simply Data driven behavioural algorithms
 
IRJET- Rating based Recommedation System for Web Service
IRJET- Rating based Recommedation System for Web ServiceIRJET- Rating based Recommedation System for Web Service
IRJET- Rating based Recommedation System for Web Service
 
ECT 584_Research Paper_JoyceRose_08182015
ECT 584_Research Paper_JoyceRose_08182015ECT 584_Research Paper_JoyceRose_08182015
ECT 584_Research Paper_JoyceRose_08182015
 
Recommendation Systems Basics
Recommendation Systems BasicsRecommendation Systems Basics
Recommendation Systems Basics
 
Connecting social media to e commerce (2)
Connecting social media to e commerce (2)Connecting social media to e commerce (2)
Connecting social media to e commerce (2)
 
Consumption capability analysis for Micro-blog users based on data mining
Consumption capability analysis for Micro-blog users based on data miningConsumption capability analysis for Micro-blog users based on data mining
Consumption capability analysis for Micro-blog users based on data mining
 
76 s201921
76 s20192176 s201921
76 s201921
 
Social media impact on service brand reinforcement
Social media impact on service brand reinforcementSocial media impact on service brand reinforcement
Social media impact on service brand reinforcement
 
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
 
Banking
BankingBanking
Banking
 
IMAGE CONTENT IN LOCATION-BASED SHOPPING RECOMMENDER SYSTEMS FOR MOBILE USERS
IMAGE CONTENT IN LOCATION-BASED SHOPPING RECOMMENDER SYSTEMS FOR MOBILE USERSIMAGE CONTENT IN LOCATION-BASED SHOPPING RECOMMENDER SYSTEMS FOR MOBILE USERS
IMAGE CONTENT IN LOCATION-BASED SHOPPING RECOMMENDER SYSTEMS FOR MOBILE USERS
 
Behavioral Targeting and Dynamic Content creation
Behavioral Targeting and Dynamic Content creationBehavioral Targeting and Dynamic Content creation
Behavioral Targeting and Dynamic Content creation
 
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
 
Recommendation engine pratik_kadam
Recommendation engine pratik_kadamRecommendation engine pratik_kadam
Recommendation engine pratik_kadam
 
Telecom analytics brochure
Telecom analytics brochure Telecom analytics brochure
Telecom analytics brochure
 
AN EFFECTIVE FRAMEWORK FOR GENERATING RECOMMENDATIONS
AN EFFECTIVE FRAMEWORK FOR GENERATING RECOMMENDATIONSAN EFFECTIVE FRAMEWORK FOR GENERATING RECOMMENDATIONS
AN EFFECTIVE FRAMEWORK FOR GENERATING RECOMMENDATIONS
 
A Novel Jewellery Recommendation System using Machine Learning and Natural La...
A Novel Jewellery Recommendation System using Machine Learning and Natural La...A Novel Jewellery Recommendation System using Machine Learning and Natural La...
A Novel Jewellery Recommendation System using Machine Learning and Natural La...
 
All That Glitters Is Not Gold Digging Beneath The Surface Of Data Mining
All That Glitters Is Not Gold  Digging Beneath The Surface Of Data MiningAll That Glitters Is Not Gold  Digging Beneath The Surface Of Data Mining
All That Glitters Is Not Gold Digging Beneath The Surface Of Data Mining
 
Applied Analytics for Real-Time Decision Support
Applied Analytics for Real-Time Decision SupportApplied Analytics for Real-Time Decision Support
Applied Analytics for Real-Time Decision Support
 

More from Asoka Korale

Improving predictability and performance by relating the number of events and...
Improving predictability and performance by relating the number of events and...Improving predictability and performance by relating the number of events and...
Improving predictability and performance by relating the number of events and...Asoka Korale
 
Improving predictability and performance by relating the number of events and...
Improving predictability and performance by relating the number of events and...Improving predictability and performance by relating the number of events and...
Improving predictability and performance by relating the number of events and...Asoka Korale
 
Novel price models in the capital market
Novel price models in the capital marketNovel price models in the capital market
Novel price models in the capital marketAsoka Korale
 
Modeling prices for capital market surveillance
Modeling prices for capital market surveillanceModeling prices for capital market surveillance
Modeling prices for capital market surveillanceAsoka Korale
 
Entity profling and collusion detection
Entity profling and collusion detectionEntity profling and collusion detection
Entity profling and collusion detectionAsoka Korale
 
Entity Profiling and Collusion Detection
Entity Profiling and Collusion DetectionEntity Profiling and Collusion Detection
Entity Profiling and Collusion DetectionAsoka Korale
 
Markov Decision Processes in Market Surveillance
Markov Decision Processes in Market SurveillanceMarkov Decision Processes in Market Surveillance
Markov Decision Processes in Market SurveillanceAsoka Korale
 
A framework for dynamic pricing electricity consumption patterns via time ser...
A framework for dynamic pricing electricity consumption patterns via time ser...A framework for dynamic pricing electricity consumption patterns via time ser...
A framework for dynamic pricing electricity consumption patterns via time ser...Asoka Korale
 
A framework for dynamic pricing electricity consumption patterns via time ser...
A framework for dynamic pricing electricity consumption patterns via time ser...A framework for dynamic pricing electricity consumption patterns via time ser...
A framework for dynamic pricing electricity consumption patterns via time ser...Asoka Korale
 
Customer Lifetime Value Modeling
Customer Lifetime Value ModelingCustomer Lifetime Value Modeling
Customer Lifetime Value ModelingAsoka Korale
 
Forecasting models for Customer Lifetime Value
Forecasting models for Customer Lifetime ValueForecasting models for Customer Lifetime Value
Forecasting models for Customer Lifetime ValueAsoka Korale
 
Capacity and utilization enhancement
Capacity and utilization enhancementCapacity and utilization enhancement
Capacity and utilization enhancementAsoka Korale
 
Cell load KPIs in support of event triggered Cellular Yield Maximization
Cell load KPIs in support of event triggered Cellular Yield MaximizationCell load KPIs in support of event triggered Cellular Yield Maximization
Cell load KPIs in support of event triggered Cellular Yield MaximizationAsoka Korale
 
Vehicular Traffic Monitoring Scenarios
Vehicular Traffic Monitoring ScenariosVehicular Traffic Monitoring Scenarios
Vehicular Traffic Monitoring ScenariosAsoka Korale
 
Mixed Numeric and Categorical Attribute Clustering Algorithm
Mixed Numeric and Categorical Attribute Clustering AlgorithmMixed Numeric and Categorical Attribute Clustering Algorithm
Mixed Numeric and Categorical Attribute Clustering AlgorithmAsoka Korale
 
Introduction to Bit Coin Model
Introduction to Bit Coin ModelIntroduction to Bit Coin Model
Introduction to Bit Coin ModelAsoka Korale
 
Estimating Gaussian Mixture Densities via an implemetation of the Expectaatio...
Estimating Gaussian Mixture Densities via an implemetation of the Expectaatio...Estimating Gaussian Mixture Densities via an implemetation of the Expectaatio...
Estimating Gaussian Mixture Densities via an implemetation of the Expectaatio...Asoka Korale
 
Mapping Mobile Average Revenue per User to Personal Income level via Househol...
Mapping Mobile Average Revenue per User to Personal Income level via Househol...Mapping Mobile Average Revenue per User to Personal Income level via Househol...
Mapping Mobile Average Revenue per User to Personal Income level via Househol...Asoka Korale
 
Asoka_Korale_Event_based_CYM_IET_2013_submitted linkedin
Asoka_Korale_Event_based_CYM_IET_2013_submitted linkedinAsoka_Korale_Event_based_CYM_IET_2013_submitted linkedin
Asoka_Korale_Event_based_CYM_IET_2013_submitted linkedinAsoka Korale
 
event tiggered cellular yield enhancement linkedin
event tiggered cellular yield enhancement linkedinevent tiggered cellular yield enhancement linkedin
event tiggered cellular yield enhancement linkedinAsoka Korale
 

More from Asoka Korale (20)

Improving predictability and performance by relating the number of events and...
Improving predictability and performance by relating the number of events and...Improving predictability and performance by relating the number of events and...
Improving predictability and performance by relating the number of events and...
 
Improving predictability and performance by relating the number of events and...
Improving predictability and performance by relating the number of events and...Improving predictability and performance by relating the number of events and...
Improving predictability and performance by relating the number of events and...
 
Novel price models in the capital market
Novel price models in the capital marketNovel price models in the capital market
Novel price models in the capital market
 
Modeling prices for capital market surveillance
Modeling prices for capital market surveillanceModeling prices for capital market surveillance
Modeling prices for capital market surveillance
 
Entity profling and collusion detection
Entity profling and collusion detectionEntity profling and collusion detection
Entity profling and collusion detection
 
Entity Profiling and Collusion Detection
Entity Profiling and Collusion DetectionEntity Profiling and Collusion Detection
Entity Profiling and Collusion Detection
 
Markov Decision Processes in Market Surveillance
Markov Decision Processes in Market SurveillanceMarkov Decision Processes in Market Surveillance
Markov Decision Processes in Market Surveillance
 
A framework for dynamic pricing electricity consumption patterns via time ser...
A framework for dynamic pricing electricity consumption patterns via time ser...A framework for dynamic pricing electricity consumption patterns via time ser...
A framework for dynamic pricing electricity consumption patterns via time ser...
 
A framework for dynamic pricing electricity consumption patterns via time ser...
A framework for dynamic pricing electricity consumption patterns via time ser...A framework for dynamic pricing electricity consumption patterns via time ser...
A framework for dynamic pricing electricity consumption patterns via time ser...
 
Customer Lifetime Value Modeling
Customer Lifetime Value ModelingCustomer Lifetime Value Modeling
Customer Lifetime Value Modeling
 
Forecasting models for Customer Lifetime Value
Forecasting models for Customer Lifetime ValueForecasting models for Customer Lifetime Value
Forecasting models for Customer Lifetime Value
 
Capacity and utilization enhancement
Capacity and utilization enhancementCapacity and utilization enhancement
Capacity and utilization enhancement
 
Cell load KPIs in support of event triggered Cellular Yield Maximization
Cell load KPIs in support of event triggered Cellular Yield MaximizationCell load KPIs in support of event triggered Cellular Yield Maximization
Cell load KPIs in support of event triggered Cellular Yield Maximization
 
Vehicular Traffic Monitoring Scenarios
Vehicular Traffic Monitoring ScenariosVehicular Traffic Monitoring Scenarios
Vehicular Traffic Monitoring Scenarios
 
Mixed Numeric and Categorical Attribute Clustering Algorithm
Mixed Numeric and Categorical Attribute Clustering AlgorithmMixed Numeric and Categorical Attribute Clustering Algorithm
Mixed Numeric and Categorical Attribute Clustering Algorithm
 
Introduction to Bit Coin Model
Introduction to Bit Coin ModelIntroduction to Bit Coin Model
Introduction to Bit Coin Model
 
Estimating Gaussian Mixture Densities via an implemetation of the Expectaatio...
Estimating Gaussian Mixture Densities via an implemetation of the Expectaatio...Estimating Gaussian Mixture Densities via an implemetation of the Expectaatio...
Estimating Gaussian Mixture Densities via an implemetation of the Expectaatio...
 
Mapping Mobile Average Revenue per User to Personal Income level via Househol...
Mapping Mobile Average Revenue per User to Personal Income level via Househol...Mapping Mobile Average Revenue per User to Personal Income level via Househol...
Mapping Mobile Average Revenue per User to Personal Income level via Househol...
 
Asoka_Korale_Event_based_CYM_IET_2013_submitted linkedin
Asoka_Korale_Event_based_CYM_IET_2013_submitted linkedinAsoka_Korale_Event_based_CYM_IET_2013_submitted linkedin
Asoka_Korale_Event_based_CYM_IET_2013_submitted linkedin
 
event tiggered cellular yield enhancement linkedin
event tiggered cellular yield enhancement linkedinevent tiggered cellular yield enhancement linkedin
event tiggered cellular yield enhancement linkedin
 

Bipartite Recommender Algorithm for RBTS - IESL 2016 Final Reviewed

  • 1. A Novel Recommender Algorithm for Bipartite Networks employing Association Rules to Discover Characteristics Intrinsic to the Content A.J.M. Korale & N.S.Weeraman Abstract: Faced with an extremely competitive landscape and low margins, Mobile Telecommunications Operators are motivated to seek new sources of revenue streams by promoting Value Added Services (VAS) to their customers. This strategy in effect exploits the novelty and attractive power of the supplementary service portfolio to draw in new subscribers and retain and satisfy the existing base. However, delivering satisfying content to a vast and diverse customer base with widely differing tastes and expectations is a challenge faced by Mobile Telecommunications Operators. Thus Intelligent Algorithms in the form of Recommender Systems are employed to match available content with customer tastes and desires. Such algorithms usually require considerable Meta data on the content and detailed customer information and their consumption patterns to attempt this match. As the Meta data does not and indeed cannot always accurately characterize the intrinsic properties of the content, which is the basis of its attraction to a consumer, the match is at best not a very accurate one. In light of this we propose an enhancement that operates without recourse to the Meta data of the content but instead employ the novel idea that certain items of the content itself can be used to represent the larger set. This result is established through a process of Association Rule discovery where it is shown that Ring Back Tone Songs (RBTS) that group together do in fact exhibit similarities at an intrinsic and fundamental level rather than that which would be observed by merely comparing the Meta data of the content. We also show that due to this grouping of songs via the estimation of associations we are able to broadly categorize the available song choices using selected songs as representative categories. It then allows us to quantify the strength of the relationships between songs by using the rules so arrived at through performance measures derived from confidence and lift. Thus the algorithm provides a more accurate match between a customer and his preference for RBTS content by analysing song content at a fundamental and intrinsic level and recommends song content using the properties inherent in the songs. It also improves its match by utilizing the similarities between customers who prefer a certain category of song by analysing the single mode representation of the customer vs. content Bipartite graph. Keywords: Recommender Systems, Collaborative Filtering, Content Filtering, Bi-Partite Network 1. Introduction We live in an era in which mass marketing is increasingly replaced by mass customisation, where catering to the individual needs of a consumer is the desired strategy as opposed to marketing to large groups of customers. The thinking has thus shifted from meeting the requirements of groups of customers or larger segments with similar characteristics to satisfying the needs of individuals, or a segment of size one. This fundamental change in approach to the customer is evidenced by the ability to customise a new PC or even an automobile by selecting combinations of desired features online before the order is placed for a fully customised product. The vast array of choice facing a customer however makes it practically impossible for a consumer to make a detailed assessment of each individual option before making a choice,
  • 2. Eng. (Dr.) A. J.M. Korale, C. Eng., MIE(SL), B.Sc. Eng. (Wisconsin), M.Sc. Eng. (Columbia), Ph.D. Eng. (Imperial College), MBA. (Sri Jayewardenepura), Consultant - Data Scientist, Millennium Information Technology. Eng. (Mr.) N. S. Weeraman, AMIE(SL), B.Sc. Eng. (Moratuwa), MBA. (Sri Jayewardenepura), Unit Manager, Business Intelligence, Dialog Axiata PLC. necessitating the use of Recommender Systems. Thus it is in our everyday experience when booking a hotel, making an airline reservation or buying a book on Amazon that we are guided by intelligent agents in making a suitable choice. Mobile Telecommunications Operators (MTO), in addition to their core business of providing Mobile Wireless Telecommunications Services also offer Fixed Wireless Services, Broad Band Internet, and Direct to Home Broadcast Television Services, and so have a wide variety in their product offering. While making the choice between different product types as above may be more clear cut for a particular customer, when faced with the task of selecting a particular talk time package for the Mobile Telecommunications Service or the selection of a particular set of TV channels via a channel package however, the buying decision is more complex. Thus, so it is with respect to value added service (VAS) offerings at Mobile Telecommunications Operators where there can be thousands of applications catering to different tastes and interests. It is particularly so in the case of the RBTS offering where there are nearly 10,000 songs to choose from and nearly one million subscribers to the service. The main challenge for a Mobile Operator is then matching this vast array of choice with its customer base so that the most suitable content can be directed to the appropriate customer in a way that tastes and expectations are satisfied. As it is most often the case that the customers themselves are unaware of the vast choice in the service offering, it is up to the Operator to make the match between a customer and a product and notify the options or available choice to the subscriber. In the case of Mobile Telecommunications Operators this task is somewhat easier as there is a direct Digital Marketing Channel to each individual subscriber through the Short Message Service (SMS) and other automated means. Thus the Operators can target each individual subscriber via "SMS offers" carefully designed to match the individual requirements of each customer. 2. Current State of the Art in Recommender Systems The current state of the art with respect to Recommender System Algorithms to our knowledge is broadly based on two concepts; the ability to discover groups of users who like products similar to the ones in the portfolio of the target customer and then suggesting those products that the customer group have liked but has not yet been experienced by the target customer (Collaborative Filtering), and the ability to find groups of products similar to the products liked by the target consumer and recommending those products that have not yet been experienced by that particular consumer (Content Filtering) [2]. These algorithms are then based on making “similarity” determinations with respect to "similar customers” or "similar products". More advanced algorithms factor in the demographics, geography, Income and other related characteristics of the subscriber in addition to the consumption behaviour to compare between customers and arrive at better groups of "similar" customers [1]. The vast majority of recommender algorithms also utilise detailed Meta data on the content derived from databases and from the content creator and distributor to characterize each item of the content and then utilize this information to find other similar items. This characterization usually takes the form of classifying the content in to genre's, performing artists, creative personnel involved in the direction and production of the content and production houses involved, in an effort to finely and accurately describe as much of its outwardly visible and easily definable properties [3]. However this type of classification cannot capture all of the intrinsic qualities inherent in the content which is more about the feelings it inspires, images that come to mind and emotions it stirs. It is only a customer who has experienced the content that can if at all describe in what ways a particular piece of content is appealing and this knowledge is usually absent in the standard current techniques used in Recommenders. In spite of this, a variety of approaches ranging from correlation based techniques to machine learning algorithms using decision trees and neural networks are successfully used in the these Recommenders.
  • 3. Other approaches that consider association rules in recommenders do not in general use rules to categorise items of content and use those single items to represent larger groups as we do in our algorithm [3]. In particular, association rule based recommender algorithms that use clustering to identify user groups does so by utilizing other criteria which then may not result in grouping the content on its intrinsic properties [5]. This method also does not consider the properties of the consumer vs. consumer network neighbourhood in arriving at its recommendations. The proposed association rule generation algorithm to the best of our knowledge is a novel scheme that is based on existing principals. It is designed to efficiently generate rules utilising components of rules from previous stages and so uses items that are already known to have fulfilled constraints on Confidence and Lift. The proposed scheme is also able to consider several layers/levels of neighbours around a target consumer and this approach combined with an association rule recommender we believe is a new configuration for this type of algorithm. 3. The underlying basis for a Novel Association Rule based Recommender It is usually very difficult to analyse the underlying characteristics of a product chosen by a consumer and examine what makes a particular consumer prefer such a product merely from the basket of goods chosen by that consumer or by observing a group of consumers similar in characteristics to the customer under study as there usually is no prior "meta data" of the product or content readily available. In the case of movie or music recommender systems for instance, it would be possible to analyse each film or song by the performing artists, genre, production houses and a host of other ancillary information that could potentially be gathered by a content provider and used as input to a recommender algorithm. In the case of an RBTS Recommender however, all of the product items are of one type (songs) and thus we need only find associations and relationships between songs. While there is considerable Meta data that is associated with song content, for the purposes of this paper and the algorithm described herein, we did not have access to such attributes. In light of this we propose an algorithm that operates in an environment where we do not have ready access to the Meta data of the content, but instead employ the novel idea that the content itself can be used in the characterization of itself. Thus for the particular case of RBTS subscriptions by customers of Mobile Telecom Services, we are able to categorize the RBTS (music choices) by using a set of song choices. In other words we categorize and reduce the number of potential song choices using a subset of the songs to represent certain groups of songs. Thus it is our contention that certain types of content can be characterized by the properties inherent in the content itself, particularly when the content is used for a very personal and personalized purpose. In our use case, we contend that the Ring Back Tone Song (RBTS) chosen by each customer to indicate a ringing telephone to a caller is one that his highly personal to the customer as it is chosen with a view of representing the customer to the outside world and hence from the point of view of that customer it is also one that has a certain appeal and great deal of intrinsic meaning. In this paper we also study how multiple songs are selected by a particular customer and establish that songs that "go together" are songs that are similar to one another. This similarity goes beyond the usual commonalties that can be expected between songs that are "similar" to one another in the sense of being sung by the same artist, or belonging to the same genre, or those using a particular type of instrumentation for example. Instead we show that the songs that are found to be "similar “are so on a much deeper and intrinsic level; like being similar in theme, meaning, and in concept. We believe that this result is due to the inherently personal nature of the choice of a Ring Back Tone Song that may also be used to gain deeper insights in to the very personal and personalized preferences of the customer making the choice. We establish these results through a process of Association Rule discovery where it is shown that songs that group together in fact exhibit similarities at an intrinsic and fundamental level rather than that which would be observed by merely comparing the Meta data of the content. In fact there is no guarantee that items of content that exhibit broad similarities with respect their Meta data would appear similar or more importantly that it would have similar
  • 4. appeal to a particular customer. This is the result of the wide variety in the content that is available, particularly in music that may share the same type of Meta data and other ancillary information but differ in its intrinsic qualities that appeal to a customer. We also show that due to this grouping of songs via the estimation of associations we are able to broadly categorize the song choices using the songs themselves as representative categories. It then allows us to quantify the strength of the relationships between songs by using the rules so arrived at and by performance measures such as confidence and lift. We then take the analysis a level deeper by considering the properties of the customer to customer network that lies at the heart of the customer vs. song bipartite graph by translating it in to its corresponding customer vs. customer single mode representation. In this way we build a recommender algorithm that has the ability to analyse song content at a fundamental and intrinsic level and suggest song content using the properties inherent in the songs itself while also utilising the similarities between customers who prefer a certain category of song. 4. Psychological Factors Underlying the Proposed Recommender Algorithm for RBTS We believe that that Ring Back Tone Songs are an aspect of oneself a subscriber wishes to project to the outside world and is thus an insight in to the subjects "personality". We also believe that the RBTS choice is a deeply personal choice and is therefore reflection of some innate quality or characteristic of a subscriber's personality. We can show from the results of the association rule analysis and broad song choices made by large groups of the subscribers that when more than one song is selected by a subscriber, the songs within that group show similarities at an intrinsic and fundamental level. We believe this may be reflection or an insight in to some aspect of the subscriber’s personality. As "personality" is more or less fixed upon reaching maturity we believe that the intrinsic quality of a person’s choice of RBTS is also unlikely to change much. In this paper we are not concerned with the underlying factors that were responsible for the formation of a particular personality type but only use the insight that it is unlikely to change much over time and that it will also govern or influence personal choices such as selecting a RBTS. While there will be exceptions to this line of reasoning the results of the association rule analysis do show that songs that show strong associations with one another are "similar". As the association rules are also a reflection of the aggregate choices of a large number of subscribers, our original line of reasoning seems to be justified. 5. Main Contributions of the Paper In this paper we employ the novel idea that certain items of content can be used to represent the larger set to which it belongs. We show that certain RBTS songs selected on the basis of their strong associations to other songs can be used as representative categories of the larger song set. We also show that the proposed recommender is able to match consumers taste for content without recourse to its Meta data as it operates by discovering and matching the underlying intrinsic properties of the content such as its theme, meaning and concept which is the basis of its attraction to a consumer. This is facilitated in part by a novel Association Rule mining algorithm which efficiently generates rules at each stage by merging the components (antecedent and consequent terms) of the rules in the prior stage to form the antecedent of the subsequent stage. In this way we generate only the minimum set of rules that satisfy certain a priori constraints on confidence and lift. We improve the match between the recommended song set and consumer’s own taste in song choice by considering the song choices of consumers most like the target consumer. This is accomplished by considering the network properties of the consumer’s immediate neighbours identified from the consumer vs. song bi-partite network. We also show that a better match across a wider variety of song content can be achieved by combining the results of the suggestions obtained by considering a consumer’s own choice of songs with the suggestions obtained
  • 5. by considering the song choices of his neighbours that are determined by placing constraints on the proximity of such individuals to the target consumer in the consumer vs. consumer network. 5.1 A novel Association Rule Mining Algorithm for the purposes of Recommending RBTS A novel Association Rule Mining algorithm that generates all the unique combinations of the items that comprise the antecedent and consequent terms of a rule while satisfying apriori constraints on support for that rule. The algorithm is formulated in such a way as to utilize the rules generated in the (N-1)th stage to form the rules at the Nth stage by selecting only those rules that meet the minimum support requirement in the (N-1)th stage. This is done by forming the antecedent of the rule in the Nth stage by merging the antecedent and consequent terms of the rule from the (N-1)th stage which we know meet the constraints on support and then computing the support of this new rule. In this way we avoid having to generate apriori all possible combinations of items that statisfy the minimum support criteria and then test the support of each newly formed rule. In our scheme we only need to test rules from the prior stage that we know have met the minimum support criteria. Through this method we are also able to automatically generate all rules progressively proceeding down each stage until all rules with all possible unique combinations of items in the antecedent and consequent parts have been created. It is also possible to filter the rules apriori as they are being generated by enforcing conditions on confidence and lift if it is so desired. Thus there is a gain in efficiency obtained by only utilizing those rules from the previous stage that meet certain performance criteria to create the rules for the subsequent stage by merging the antecedent and consequent parts of rules that meet the minim support criteria from the previous stage to form the antecedent part of the rule at the next stage. 5.2 The use of Association Rules to Categorize and Classify Ring Back Tone Songs using Characteristics Intrinsic to the Content. We establish that in the case of Ring Back Tone Songs, the songs that are show strong association with each other are largely those that are similar in content and character showing remarkable similarities in its deep and intrinsic qualities. Through this result we are able to classify and categorize the songs by grouping the songs that are strongly associated in to groups. This is also a key result that helps in the recommendation of new song content to a subscriber through song categories that are taken to represent smaller groups of songs. We establish this result without recourse to the Meta data of the songs or any other content related attributes using only the relationships estimated from the Association rule mining. 5.3 Ranked Recommendation choices estimated from Association Rules The algorithm classifies RBTS according to its intrinsic properties in to categories derived from the items of content itself and recommends items via the association rules ranked according to confidence in a way that accounts for the content choices of the neighbours in the single mode projection of the customer vs. item bipartite graph. After the rules are mined as per (1) and classified as (2) above, the RBTS subscription portfolio of the subscriber is used to determine which rules should apply in the recommendation by selecting those that have the same antecedent as the song subscriptions. Next the selected rules are ranked by confidence and the possible consequent items determined arriving at a set of choices ordered by the degree of "correlation" to the existing set. The neighbourhood of the subscriber that is found by placing limits on the number of songs in common with other subscribers and size of the subscription set of the neighbours in relation to the customer in question is also used to determine which of the ranked consequent choices are actually recommended. 6. Association Rule Discovery Algorithm for RBTS The Association Rules are mined from an Incidence Matrix (IM) by placing apriori limits on the support for each Item which is the column sum of IM. IM is formed as a customer vs. Item matrix, by allocating unity for every song that has been experienced by each customer. If a Customer has selected the same song more than once, the entry in the corresponding song column in the row allocated for that customer is still unity. Once the column sum (IMsum_min) is formed the particular columns that do not meet the
  • 6. minimum support criteria (min_supp) are not considered and eliminated from the analysis as a rule that incorporates that item cannot in any case meet the minimum support requirement. The rule discovery then proceeds to the first stage by determining all pairs of items that meet the minimum support criteria. The support for a pair of items is then simply the dot product of the corresponding columns of IM. In the next step the pairs of items that meet the minimum support criteria are selected and the support and confidence of the rule are also determined. In the second stage only the rules that met the minimum support criteria from the first stage are selected to form the rules. The antecedent in the second stage is then formed by merging antecedent and consequent terms from the rules (that met the minimum support criteria) of the first stage. The three item rules (2 in items in the antecedent derived from stage 1, and one new item in the consequent) that can be formed that meet the minimum support criteria can only come from the set of terms selected from stage 1 and the set IMsum_min. For example, let A, B, C, D be items that meet the minim support criteria in IM. In the first stage all pairs {A,B}, {A,C}, {A,D}, {B,C}, {B,D}, {C,D} are tested for minimum support. (There will be 4C2 = 6 such terms to be tested.) A->B is a valid rule, only if the support for AUB meets min_supp criteria In the second stage A and B (and all other pairs that met min_supp) (are merged to form the antecedent of the rule at stage 2 if they meet the min_supp criteria. (Let us assume that {A,B}, {A,C} and {C,D} meet the min_supp criteria) Thus in stage 2 A&B->C is a valid rule only if the support for AUBUC meets min_supp criteria (and similarly we test support for rules A&B->D, A&C->B, A&C->D and C&D->A and C&D->B) The rule discovery proceeds in this manner until all unique combinations of items have been created in the rules. 6.1 Definitions {Support (A->B)= N(AUB)/N (1)} Where N(AUB) represents the number of times items A and B are bought together in the whole transaction set and N is the number of rows in IM (corresponds to the number of unique subscribers or transactions) {Confidence(A->B) = Support(AUB)/Support(A) (2)} = P(EA&EB)/P(EA) (3)} = P(EB/EA) (4)} Where P(EX) is the probability of item X being bought in a particular transaction Confidence of a rule (A->B) is a measure of the validity of the rule, it has a conditional probability interpretation in that, it is the probability that B occurs when A has already occurred [4]. In market basket parlance it is the probability of item B being bought given that A has been bought in a particular transaction. Loosely speaking, we use the confidence measure in our reasoning as an indication of the degree of correlation between songs when we have rules of the form song1 -> song2. It is also used to rank the association rules particularly when one common antecedent gives rise to many consequent items. The Lift is a popular measure used to measure how likely a rule is if the items are independent [4]. {Lift(A->B) = Confidence(A->B)/Support(B) (5)} {Lift(A->B) = P(EA&EB)/P(EA)P(EB) (6)} In other words it is the number times how likely it is that A&B are bought together when the probability of buying A and the probability of buying B are independent. (Ie when the buying decision of item A does not influence the buying decision of B and vice versa) In this modelling we generate all combinations of stage 1 rules that satisfy the min_supp criteria as we need to match the antecedents of the rules with the subscription portfolio of a subscriber in order to generate recommendation options. We use low minimum support levels to ensure that we generate a large number of rules and handle the filtering and ranking via the categorization of songs and rule confidence measure.
  • 7. 7. Recommender Algorithm & Strategy The Recommender Algorithm determines the best RBTS suggestions for a particular subscriber by considering the items currently subscribed to by the target subscriber and the subscription pattern of that subscriber’s nearest neighbours. This neighbourhood can be determined from the single mode representation of the Bipartite Network that is derived by multiplying the incidence matrix by its transpose IM*IMT . The resulting subscriber vs. subscriber network is similar in form to an Adjacency matrix from which the neighbours can easily be found. In essence the neighbours of the target subscriber are those that have at least some of the same items in common. Constraints are set on the size of the subscription set of each neighbour in relation to the size of the subscription set of the target subscriber so that they are approximately of the same size. Additionally to qualify as a neighbour the subscriber in question must have at least a certain minimum number of items in common with the target subscriber. These conditions are placed to ensure that the neighbour set is more or less “similar” to the target subscriber. 7.1 Determining Nearest Neighbours 1. For each subscriber determine all of the other subscribers NS = {ns1,ns2,…,nsi,...,nsT} that share at least one song item in common 2. For each of the neighbours nsi determine the size of their subscription set nsiT 3. For each of the neighbours nsi determine the number of songs in common nsci with the target subscriber Si 4. The neighbour set is then the set of subscribers that meet an apriori limit on the nsiT and nsci 5. The limits can be set on the basis of the size of the subscription set of the target subscriber Si 6. So we can say a neighbour of the target subscriber is one who is within (+/-) 50% the size of the subscription set of Si and who also matches between (+/-) 20% of the songs in the subscription set of Si 7. The neighbour item set is then the union of the set of items (RBTS) found among the subscribers who meet conditions in 6 above. 7.2 Generate Association Rules The Association rules for the RBTS items can be obtained using the algorithm described in section 6. In this instance we generate only the level 1 associations with one antecedent and one consequent term. We use low support and confidence thresholds so as to capture as many rules as possible. This is done with the object of matching the antecedents in the rules to find consequent terms that have high confidence. 7.3 Hybrid Recommender Algorithm The hybrid nature of the algorithm arises by combining the suggestions obtained via the association rules with those obtained by considering the song choices of the consumer’s immediate neighbours that are identified via the single mode representation of the consumer vs. song bi-partite network. In this hybrid approach the final recommended song set is determined as the intersection of the song set suggested for the consumer by considering the consumer’s own song choices with the choices of his nearest neighbours. Thus, we endeavour to find better matches for a consumers taste in song content by using that consumers own (prior) taste in songs together with the tastes of consumers most like him. This neighbourhood of similar consumers is determined by their preferences for similar types of content as the target consumer. However there is sufficient variation across the content preferences of the neighbours to ensure that a diverse set of songs are available for consideration. In essence we allow for a certain degree of overlap in the tastes between the target consumer and his neighbours to ensure that there is sufficient variety in the set of songs suggested by the neighbouring consumers which is then observed in the final recommendation. 1. For each subscriber obtain his subscription song set S = {s1,s2,…,si,…,sN} 2. For each song si in the subscription set S, obtain all rules Ri = {r1,r2,…,rj,…,rK} which have same antecedent as si 3. From this rule set Ri, choose the rule that has the highest confidence ri_max and the corresponding consequent term Ci_max of the rule. It may be that in some cases more than one rule from Ri may meet this criteria (if they have the same max value for instance), in which case all such consequent terms are extracted.
  • 8. 4. The “available” song set {AS} is then the set of consequent items which have the highest confidence from each set of rules Ri derived for each song si in the subscription set. 5. The recommended song set is then the intersection between songs that are in the available song set {AS} and the Neighbour song set (set composed of the songs found in the neighbours subscription set, across all of the neighbours). 6. If the intersection is null or too small the available song set is used as the recommended set. 8. Results We present some example results extracted to demonstrate the match between the subscription set of songs (Table 1) and the corresponding recommended set of songs (Table 2) for 10 subscribers drawn randomly. It is evident that the recommendations do match the subscriber’s song set to a great degree in terms of the characteristics described previously. Table 1. Subscription Song Characteristics Su b No Subscription Song Set of Subscriber Languag e Source Release Theme 1 Theme 2 Theme 3 1 Raja Malige Sinhala Charts New Other 2 Athi Mawa Rawatuwa Sinhala Charts New Love Sorrow Losing 2 Kandulu Hollala Sinhala Charts New Love Sorrow Losing 2 Sithama Ridawa Sinhala Charts New Love Sorrow Losing 3 Wen Weela Giyada Sinhala Charts New Love Sorrow Losing 4 Nidi Warapu As Walin Sinhala Charts New Love Sorrow Losing 4 Obath Giya Sinhala Charts New Love Sorrow Losing 5 Athi Mawa Rawatuwa Sinhala Charts New Love Sorrow Losing 5 Sithama Ridawa Sinhala Charts New Love Sorrow Losing 6 Ennai thalaatta Tamil Movie New Love 7 Dheere Dheere Hindi Movie New Love 7 Ananthen Aa Tharu Kumara Sinhala Teledrama New Love 8 Chinna Mani Kuyile Tamil Movie Old Love 9 Ennodu Nee Irundhaal Tamil Movie New Love 10 Dukama Widala Sinhala Charts New Love Sorrow Losing 10 Hithumanapen Sinhala Charts New Love Sorrow Losing 10 Waradi Karala Sinhala Charts New Love Sorrow Losing As discussed earlier a single song subscription can give rise to multiple song recommendations as each song appearing in the antecedent of the rule be correlated with multiple consequent songs that have the same maximum confidence level. In practice then the unique set of items taken from all the recommended items for a particular subscriber will be used (multiple occurrences of the same consequent item in Table 2 is left to demonstrate the working of the algorithm) If the recommended set is too small the option of using the consequent items with the next highest confidence level can be employed. We also observed several cases where the recommended matches were not in line with our hypothesis. This resulted when certain subscribers had songs that had broad appeal across many tastes (like national anthem themed songs). Table 2. Recommended Song Characteristics Sub No Recomm endation Song Set of Subscrib er Langu age Sourc e Relea se The me1 The me2 The me3 1 Amma Sinhal a Kavi Bana New Moth er Moth er 2 Ayeth Warak Sinhal a Charts New Love Sorr ow Losin g 2 Ayeth Warak Sinhal a Charts New Love Sorr ow Losin g 2 Ayeth Warak Sinhal a Charts New Love Sorr ow Losin g 3 Awasana Premayai Sinhal a Charts New Love Sorr ow 4 Hadawath a Gahena Sinhal a Charts New Love Sorr ow Losin g 4 Awasana Premayai Sinhal a Charts New Love Sorr ow 4 Awasana Premayai Sinhal a Charts New Love Sorr ow 5 Ayeth Warak Sinhal a Charts New Love Sorr ow Losin g 5 Ayeth Warak Sinhal a Charts New Love Sorr ow Losin g 6 Kannukull e Tamil Movie Old Love 7 Awasana Premayai Sinhal a Charts New Love Sorr ow 7 Dukama Vidala Sinhal a Charts New Love Sorr ow Losin g
  • 9. 8 Kanne En Kanmaniy e. Tamil Movie Old Love 8 Enna Oru Enna Oru Tamil Movie New Love 9 Idhayatha i Yedho Ondru Tamil Movie New Love 9 Un Mele Oru Kannu Tamil Movie New Love 10 Ayeth Warak Sinhal a Charts New Love Sorr ow Losin g 10 Ayeth Warak Sinhal a Charts New Love Sorr ow Losin g 10 Ayeth Warak Sinhal a Charts New Love Sorr ow Losin g In such cases there could be a large number of subscribers with widely differing tastes who also have this song in their subscription portfolio giving rise to spurious associations /correlations at high confidence levels. These occurrences are in the minority and should not noticeably impact the overall trend in the result. We observed approximately 83% match between the subscription set and the recommendations. 8.1 Operationalizing the Recommender A subscriber may download RBTS using different channels such as IVR (Interactive Voice Response), Web, SMS or copy from another RBTS user. We favour the Out Bound Dialler (OBD) as the main promotion channel as the user can hear the recommended songs before selection and also as it has been successfully used to promote RBTS in the past. Also a customized SMS sent to the target user with recommended songs having a direct dial short code can be used for users who are using SMS and who are familiar with downloading RBTS via direct dial method. In particular we may target existing subscribers who have many songs in their subscription and who have a tendency to change their songs fairly regularly. Although each subscriber is different, the Operator has the ability to select the best individual marketing channel to reach each one by observing past RBTS downloading patterns and core Telecommunication Service usage behaviour of that individual. 9. Conclusion and Future Work We established that the proposed algorithm and model successfully matches a subscriber’s taste for Ring Back Tone Song content by recommending songs that are very much similar in its intrinsic qualities such as type, theme and content. We also showed that the proposed Association Rule discovery algorithm that derives rules for the current level by evaluating rules that meet certain performance criteria from the previous level is sufficiently flexible and efficient in order to generate the large number of rules at low support constraints with the option of filtering at apriori confidence levels. Through our modelling we also established a fundamental insight in to consumer behaviour in this particular type of content by observing that items of content preferred by consumers are strongly associated / correlated when their intrinsic properties are very similar. With this insight we are able to successfully match the tastes of the subscriber in recommended RBTS content. Although we employ only level one rules with a single antecedent and consequent term in our model the ability to generate the full set may prove useful in subsequent models where we use rule priorities and coverage concepts to rationalize and derive only those most relevant to the recommendation. While the neighbourhood calculation did not significantly contribute to improving the recommendations in the current incarnation of the algorithm, we envisage factoring in subscriber profiling attributes such as demographics, language preference and Mobile Service usage criteria to improve on the ability to find groups of songs from subscribers that have more in common with each other. 10. References 1. Wilson, D., Symth, B., & Reily, J., “Case-Studies in Association Rule Mining for Recommender Systems”, Proceedings of the 2005 International Conference on Artificial Intelligence, ICAI, 2005. 2. Lin, W., Alvarez, S., & Ruiz, C., “Collaborative Recommendation via Association Rule Mining”, Department of Computer Science, Worcester Polytechnic Institute, International Workshop on Web Mining, 2005. 3. Swathi, V., Reddy, S, “Music Recommendation System Using Association Rules”, International
  • 10. Journal of Technology Enhancements and Emerging Engineering Research, vol 2, Issue 7, 2014. 4. Hastie, T., Tibshirani, R., “The Elements of Statistical Learning”, 2nd ed., Springer, USA, 2009, 488-499pp. 5. M. Sunitha Reddy, T. Adilakshmi, V. Swathi, A Novel Association Rule Mining and Clustering Based Hybrid Method for Music Recommendation System, International Journal of Research in Engineering and Technology, vol. 3, issue 5, 2014.